Creates a model response for the given chat conversation.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Model ID
gpt-4o "gpt-4o"
A list of messages comprising the conversation so far.
1[{ "role": "user", "content": "Hello!" }]
The maximum number of tokens to generate.
x >= 1Sampling temperature. Higher values make output more random.
0 <= x <= 21
Nucleus sampling threshold.
0 <= x <= 1Penalizes repeated tokens based on their frequency in the text so far.
-2 <= x <= 2Penalizes tokens that have already appeared in the text.
-2 <= x <= 2If true, stream partial message deltas using SSE.
Sequences where the model will stop generating further tokens.
How many chat completion choices to generate for each input message.
x >= 1An object specifying the format that the model must output. Setting to {"type": "json_object"} enables JSON mode.
A list of tools the model may call. Currently, only functions are supported.
Controls which (if any) tool is called by the model.
A unique identifier representing your end-user.