Claude Haiku 4.5

Claude Haiku 4.5 is Anthropic’s fastest and most cost-effective model, available through Anyfast via an OpenAI-compatible interface. It delivers near-instant responses with strong performance across everyday tasks, making it ideal for high-throughput and latency-sensitive applications.

Key capabilities

OpenAI-compatible — Works as a drop-in replacement with the OpenAI SDK
Ultra-fast — Optimized for low-latency responses
Cost-effective — Significantly lower cost per token compared to larger models
Streaming — Supports real-time token streaming via SSE
Capable — Strong performance on coding, writing, and analysis tasks

Quick example

curl https://www.anyfast.ai/v1/messages \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-haiku-4-5",
    "messages": [
      { "role": "user", "content": "Explain quantum entanglement in simple terms." }
    ]
  }'

Parameters

Parameter	Type	Required	Description
`model`	string	Yes	Must be `claude-haiku-4-5`
`messages`	array	Yes	List of `{ role, content }` objects
`max_tokens`	integer	No	Maximum tokens to generate
`temperature`	float	No	`0`–`2`. Controls randomness. Default: `1`
`stream`	boolean	No	Enable SSE streaming. Default: `false`
`top_p`	float	No	Nucleus sampling threshold. Default: `1`
`stop`	string / array	No	Sequences that stop generation

API Reference

View the interactive API playground for Claude Haiku 4.5.

Claude Sonnet 4 Thinking Claude Haiku 4.5 Thinking

​Key capabilities

​Quick example

​Parameters

API Reference

Key capabilities

Quick example

Parameters