Key capabilities
- Text-to-image: Generate images from text descriptions
- Image editing: Pass a reference image in `inline_data` alongside your text instruction
- Aspect ratio control: `1:1`, `4:3`, `3:4`, `16:9`, `9:16`
- Resolution control: `1K` (~1024px), `2K` (~2048px), `4K` (~4096px) on the long edge
- Multi-modal output: Return both the image and a text caption with `responseModalities: ["TEXT", "IMAGE"]`
Text-to-image example
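A minimal Python sketch of a text-to-image request, built only from the fields listed in the Parameters table. The endpoint URL (including its model path) is a placeholder assumption, and `build_text_to_image_body` and `generate` are hypothetical helper names rather than part of any SDK. The response format is not described here, so the sketch returns the parsed JSON without interpreting it.

```python
import json
import urllib.parse
import urllib.request

# Assumption: placeholder endpoint. Substitute the URL from the API reference.
ENDPOINT = "https://api.example.com/v1beta/models/gemini-2.5-flash-image:generateContent"


def build_text_to_image_body(prompt, aspect_ratio="1:1", image_size="1K"):
    """Build the JSON request body for a text-only prompt."""
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            "responseModalities": ["IMAGE"],
            "imageConfig": {"aspectRatio": aspect_ratio, "imageSize": image_size},
        },
    }


def generate(body, api_key, endpoint=ENDPOINT):
    """POST the body, passing the API key as the `key` query parameter."""
    url = endpoint + "?" + urllib.parse.urlencode({"key": api_key})
    request = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Build and inspect the body; no network call is made here.
body = build_text_to_image_body("A watercolor fox in a snowy forest", aspect_ratio="16:9")
print(json.dumps(body, indent=2))
```

To send the request, call `generate(body, api_key)` with a real key and endpoint.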
Image editing example (with reference image)
Include both a text instruction and an `inline_data` reference image in the same `parts` array.
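A sketch of how such a request body might be assembled in Python, assuming the field names from the Parameters table. `build_edit_body` is a hypothetical helper, and the sample bytes stand in for a real image file that you would normally read from disk.

```python
import base64
import json

# MIME types listed for inline_data.mime_type in the Parameters table.
ALLOWED_MIME_TYPES = {"image/jpeg", "image/png", "image/webp"}


def build_edit_body(instruction, image_bytes, mime_type="image/png"):
    """Put the text instruction and the reference image in one parts array."""
    if mime_type not in ALLOWED_MIME_TYPES:
        raise ValueError(f"unsupported reference image type: {mime_type}")
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "contents": [
            {
                "parts": [
                    {"text": instruction},
                    {"inline_data": {"mime_type": mime_type, "data": encoded}},
                ]
            }
        ],
        "generationConfig": {"responseModalities": ["TEXT", "IMAGE"]},
    }


# Stand-in bytes (the PNG file signature); use open(path, "rb").read() in practice.
sample_bytes = b"\x89PNG\r\n\x1a\n"
body = build_edit_body("Replace the background with a beach", sample_bytes)
print(json.dumps(body, indent=2))
```

Keeping both parts in a single `contents` entry is what ties the instruction to the image it should modify.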
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| `key` | string | Yes | API key (query parameter) |
| `contents[].parts[].text` | string | Yes | Text prompt or instruction |
| `contents[].parts[].inline_data.mime_type` | string | No | Reference image type: `image/jpeg`, `image/png`, `image/webp` |
| `contents[].parts[].inline_data.data` | string | No | Base64-encoded reference image |
| `generationConfig.responseModalities` | array | Yes | `["IMAGE"]` or `["TEXT", "IMAGE"]` |
| `generationConfig.imageConfig.aspectRatio` | string | No | `1:1` / `4:3` / `3:4` / `16:9` / `9:16` |
| `generationConfig.imageConfig.imageSize` | string | No | `1K` / `2K` / `4K` (default: `1K`) |
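
The allowed values in the table can be checked on the client before a request is sent. The following is a small sketch of that idea; `validate_generation_config` is a hypothetical helper, and the server remains the authority on what it accepts.

```python
ASPECT_RATIOS = {"1:1", "4:3", "3:4", "16:9", "9:16"}
IMAGE_SIZES = {"1K", "2K", "4K"}
MODALITIES = (["IMAGE"], ["TEXT", "IMAGE"])


def validate_generation_config(config):
    """Return a list of problems; an empty list means the config matches the table."""
    problems = []
    # responseModalities is required and must be one of the two documented values.
    if config.get("responseModalities") not in MODALITIES:
        problems.append('responseModalities must be ["IMAGE"] or ["TEXT", "IMAGE"]')
    image_config = config.get("imageConfig", {})
    # aspectRatio is optional, so it is only checked when present.
    ratio = image_config.get("aspectRatio")
    if ratio is not None and ratio not in ASPECT_RATIOS:
        problems.append(f"unsupported aspectRatio: {ratio}")
    # imageSize falls back to the documented default of 1K.
    size = image_config.get("imageSize", "1K")
    if size not in IMAGE_SIZES:
        problems.append(f"unsupported imageSize: {size}")
    return problems


ok = validate_generation_config(
    {"responseModalities": ["IMAGE"], "imageConfig": {"aspectRatio": "16:9"}}
)
bad = validate_generation_config(
    {"responseModalities": ["TEXT"], "imageConfig": {"imageSize": "8K"}}
)
print(ok)
print(bad)
```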
API Reference
View the interactive API playground for Gemini 2.5 Flash Image.