创建文本对话请求
Creates a model response for the given chat conversation.
Authorizations
Use the following format for authentication: Bearer <your api key>
Body
A list of messages comprising the conversation so far.
对应的模型名称。为更好的提升服务质量,我们将不定期对本服务提供的模型做相关变更,包括但不限于模型上下线,模型服务能力调整,我们会在可行的情况下以公告、消息推送等适当的方式进行通知。
Qwen/QVQ-72B-Preview
, deepseek-ai/deepseek-vl2
, Qwen/Qwen2-VL-72B-Instruct
, OpenGVLab/InternVL2-26B
, Pro/Qwen/Qwen2-VL-7B-Instruct
, Pro/OpenGVLab/InternVL2-8B
, TeleAI/TeleMM
The maximum number of tokens to generate.
1 < x < 4096
Number of generations to return
An object specifying the format that the model must output.
Up to 4 sequences where the API will stop generating further tokens. The returned text will not contain the stop sequence.
If set, tokens are returned as Server-Sent Events as they are made available. Stream terminates with data: [DONE]
Determines the degree of randomness in the response.
The top_p
(nucleus) parameter is used to dynamically adjust the number of choices for each predicted token based on the cumulative probabilities.