Key capabilities
- OpenAI-compatible — Works as a drop-in replacement with the OpenAI SDK
- Best coding model — Top performance on code generation, debugging, and refactoring
- Fast and efficient — Lower latency and cost compared to Opus models
- Strong reasoning — Excellent at multi-step problem solving and analysis
- Long context — Handles large documents and multi-turn conversations
- Streaming — Supports real-time token streaming via SSE
Quick example
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Must be claude-sonnet-4-20250514 |
messages | array | Yes | List of { role, content } objects |
max_tokens | integer | No | Maximum tokens to generate |
temperature | float | No | 0–2. Controls randomness. Default: 1 |
stream | boolean | No | Enable SSE streaming. Default: false |
top_p | float | No | Nucleus sampling threshold. Default: 1 |
stop | string / array | No | Sequences that stop generation |
API Reference
View the interactive API playground for Claude Sonnet 4.