Key capabilities
- Text-to-Video — Generate videos from natural language descriptions
- Image-to-Video — Use a reference image as the first frame
- Video Remix — Reuse structure, motion, and framing from a previous video
- Flexible Duration — 4, 8, or 12 seconds
- Multiple Resolutions — Portrait (720x1280), Landscape (1280x720), and more
Workflow
The Sora 2 API is asynchronous. Follow these steps:- Create task —
POST /v1/videos - Query status —
GET /v1/videos/{id}(poll untilstatusiscompleted) - Download video —
GET /v1/videos/{id}/content
Quick example
Step 1: Create task
Step 2: Query status
Step 3: Download video
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Must be sora-2 |
prompt | string | Yes | Natural language description of the video. Include shot type, subject, action, setting, lighting. Keep single-purpose for best results. |
seconds | string | No | 4, 8, 12. Default: 4 |
size | string | No | 720x1280, 1280x720, 1024x1792, 1792x1024. Default: 720x1280 |
input_reference | file | No | Reference image for the first frame. Accepts image/jpeg, image/png, image/webp. |
remix_video_id | string | No | ID of a completed video to reuse its structure, motion, and framing. |
API Reference
View the interactive API playground for Sora 2.