Doubao Seed 2.1 Pro

Doubao-Seed-2.1-Pro is ByteDance’s flagship model for the coding and agent era, available through Anyfast via an OpenAI-compatible interface. It is the strongest variant of the Seed 2.1 family — built for high-complexity work such as complex coding, long-horizon agents, and multi-step engineering delivery, with a 256K context window and further upgraded visual understanding.

Key capabilities

OpenAI-compatible — Works as a drop-in replacement with the OpenAI SDK
256K context window — Handles project-scale context, with up to 256K tokens of output in a single response
Deep thinking — Chain-of-thought reasoning, on by default, with adjustable reasoning_effort
Strong coding & agents — Requirement understanding, long-horizon planning, continuous repair, and engineering delivery
Multimodal understanding — Text, image, video, and document understanding (audio not supported)
Function calling, structured output & context caching — Robust tool use, JSON output (beta), and prompt caching
Streaming — Real-time token streaming via SSE

Output specifications

Property	Value
Input modality	Text, Image, Video, Document
Output modality	Text
Context window	256K tokens
Max output tokens	256K

Quick example

curl https://www.anyfast.ai/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seed-2.1-pro",
    "messages": [
      { "role": "user", "content": "Explain quantum entanglement in simple terms." }
    ]
  }'

Thinking mode

Doubao-Seed-2.1-Pro reasons before answering by default. thinking.type is enabled unless you set it to disabled to skip reasoning for lightweight tasks. Use reasoning_effort to tune how long the model thinks.

Python

response = client.chat.completions.create(
    model="doubao-seed-2.1-pro",
    messages=[
        {"role": "user", "content": "Design a REST API for a blogging platform."}
    ],
    extra_body={
        "thinking": {"type": "enabled"},
        "reasoning_effort": "high"
    }
)

print(response.choices[0].message.content)

reasoning_effort controls reasoning length (effective only when thinking is enabled) and accepts minimal, low, medium, and high. Default: high.

Parameters

Parameter	Type	Required	Description
`model`	string	Yes	Must be `doubao-seed-2.1-pro`
`messages`	array	Yes	List of `{ role, content }` objects
`thinking`	object	No	`{ "type": "enabled" \| "disabled" }`. Controls deep thinking. Default: `enabled`
`reasoning_effort`	string	No	`minimal`, `low`, `medium`, `high`. Effective when thinking is enabled. Default: `high`
`max_tokens`	integer	No	Maximum tokens to generate (up to 262144)
`temperature`	float	No	`0`–`2`. Controls randomness. Default: `1`
`top_p`	float	No	Nucleus sampling threshold. Default: `1`
`stream`	boolean	No	Enable SSE streaming. Default: `false`
`tools`	array	No	Function tool definitions for tool use
`response_format`	object	No	`{ "type": "json_object" }` for structured JSON output (beta)
`stop`	string / array	No	Sequences that stop generation

API Reference

View the interactive API playground for Doubao Seed 2.1 Pro.

​Key capabilities

​Output specifications

​Quick example

​Thinking mode

​Parameters