Creates a model response for the given chat conversation.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
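As a minimal sketch, the header described above could be built like this in Python. The function name `build_auth_headers` and the `Content-Type` entry are illustrative assumptions, not part of this reference.

```python
def build_auth_headers(token: str) -> dict[str, str]:
    """Return HTTP headers carrying a bearer token.

    Assumption: the request body is JSON, hence the Content-Type entry.
    """
    if not token:
        raise ValueError("token must be a non-empty string")
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }


# Replace the placeholder with your real auth token.
headers = build_auth_headers("YOUR_TOKEN")
```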
Model ID
Allowed value: claude-opus-4-20250514-thinking
Example: "claude-opus-4-20250514-thinking"
A list of messages comprising the conversation so far.
Minimum length: 1
Example: [{ "role": "user", "content": "Hello!" }]
The maximum number of tokens to generate.
Range: 1 <= x <= 8192
Sampling temperature. Higher values make output more random.
Range: 0 <= x <= 2
Example: 1
Nucleus sampling threshold.
Range: 0 <= x <= 1
Penalizes repeated tokens based on their frequency in the text so far.
Range: -2 <= x <= 2
Penalizes tokens that have already appeared in the text.
Range: -2 <= x <= 2
If true, stream partial message deltas using SSE.
Sequences where the model will stop generating further tokens.
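The constraints listed above can be checked client-side before a request is sent. The sketch below assumes OpenAI-style field names (`model`, `messages`, `max_tokens`, `temperature`, `top_p`, `frequency_penalty`, `presence_penalty`, `stream`, `stop`); this reference gives only descriptions and ranges, so treat the names, and the helper `build_chat_request`, as hypothetical and confirm them against the actual schema.

```python
import json

# Inclusive ranges taken from the parameter list above.
# The field names are assumptions (OpenAI-style), not confirmed by this reference.
RANGES = {
    "max_tokens": (1, 8192),
    "temperature": (0, 2),
    "top_p": (0, 1),
    "frequency_penalty": (-2, 2),
    "presence_penalty": (-2, 2),
}


def build_chat_request(model, messages, **options):
    """Build a request body, rejecting values outside the documented ranges."""
    # Assumption: the conversation must contain at least one message.
    if not messages:
        raise ValueError("messages must contain at least one message")
    for name, value in options.items():
        if name in RANGES:
            low, high = RANGES[name]
            if not low <= value <= high:
                raise ValueError(f"{name}={value} is outside [{low}, {high}]")
    return {"model": model, "messages": messages, **options}


body = build_chat_request(
    "claude-opus-4-20250514-thinking",
    [{"role": "user", "content": "Hello!"}],
    max_tokens=256,
    temperature=1,
    stream=False,
    stop=["\n\n"],
)
payload = json.dumps(body)  # JSON text to send as the request body
```

Validating locally only saves a round trip; the server remains the authority on what it accepts.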