Audio
Upload reference audio
Upload user-provided voice style, which can be in base64 encoding or file format. Refer to (https://docs.siliconflow.cn/capabilities/text-to-speech#2-2)
POST
Authorizations
Use the following format for authentication: Bearer <your api key>
Body
multipart/form-data
Predefined voice style model name
Available options:
FunAudioLLM/CosyVoice2-0.5B
Example:
"FunAudioLLM/CosyVoice2-0.5B"
User-defined voice style name
Example:
"your-voice-name"
Corresponding text content for the audio
Example:
"在一无所知中, 梦里的一天结束了,一个新的轮回便会开始"
Audio file encoded in base64 with the header format of data:audio/mpeg;base64
Example:
"data:audio/mpeg;base64,aGVsbG93b3JsZA=="
Response
200
application/json
Successful response
Example:
"speech:your-voice-name:xxx:xxx"