Key capabilities
- OpenAI-compatible — Works as a drop-in replacement with the OpenAI SDK
- 1M token context — Handles massive documents and conversations with up to 66K output tokens
- Multi-modal input — Supports text, image, and video inputs
- Agentic coding — Substantially improved coding agent benchmarks
- Reasoning — Built-in thinking support via
enable_thinkingparameter - Built-in tools — Web search, code interpreter, web scraping, image search via Responses API
Quick example
Note:image_urlandvideo_urlsupport both remote URLs (https://...) and base64 data URIs (data:image/png;base64,.../data:video/mp4;base64,...). Image tokens and video tokens are counted inusage.prompt_tokens_details.
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Must be qwen3.6-flash |
messages | array | Yes | List of { role, content } objects. Supports image_url and video_url for multi-modal |
max_completion_tokens | integer | No | Maximum tokens to generate |
temperature | float | No | 0–2. Controls randomness. Default: 1 |
stream | boolean | No | Enable SSE streaming. Default: false |
top_p | float | No | Nucleus sampling threshold. Default: 1 |
stop | array | No | Stop sequences. Must be an array. Default: null |
enable_thinking | boolean | No | Enable reasoning via extra_body. Default: false |
API Reference
View the interactive API playground for Qwen3.6-Flash.