Key capabilities
- Image editing — Modify existing images with natural language instructions
- Multi-modal input — Accept both text instructions and source images
- Text generation — Also supports standard text generation tasks
- Versatile — Fast and efficient for various multimodal tasks
Quick example
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
key | string | Yes | API key (query parameter) |
contents | array | Yes | Array of { role, parts } with text and inline_data |
generationConfig.responseModalities | array | Yes | Must include IMAGE |
API Reference
View the interactive API playground for Gemini 2.0 Flash.