创建文本对话请求
Creates a model response for the given chat conversation.
Authorizations
Use the following format for authentication: Bearer <your api key>
Body
对应的模型名称。为更好的提升服务质量,我们将不定期对本服务提供的模型做相关变更,包括但不限于模型上下线,模型服务能力调整,我们会在可行的情况下以公告、消息推送等适当的方式进行通知。
deepseek-ai/deepseek-vl2
, Qwen/Qwen2-VL-72B-Instruct
, OpenGVLab/InternVL2-26B
, Pro/Qwen/Qwen2-VL-7B-Instruct
, Pro/OpenGVLab/InternVL2-8B
, TeleAI/TeleMM
A list of messages comprising the conversation so far.
If set, tokens are returned as Server-Sent Events as they are made available. Stream terminates with data: [DONE]
The maximum number of tokens to generate.
1 < x < 4096
Up to 4 sequences where the API will stop generating further tokens. The returned text will not contain the stop sequence.
Determines the degree of randomness in the response.
The top_p
(nucleus) parameter is used to dynamically adjust the number of choices for each predicted token based on the cumulative probabilities.
Number of generations to return
An object specifying the format that the model must output.