Gemini 2.0 Flash is Google’s multimodal model with image editing capabilities, available through AnyFast via the native Gemini API. It can edit existing images based on text instructions.

Key capabilities

  • Image editing: modify existing images with natural-language instructions
  • Multimodal input: accepts both text instructions and source images
  • Text generation: also supports standard text generation tasks
  • Versatile: fast and efficient across a range of multimodal tasks

Quick example

curl "https://www.anyfast.ai/v1beta/models/gemini-2.0-flash:generateContent?key=YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [
      {
        "role": "user",
        "parts": [
          { "text": "Change the background to a beach scene" },
          { "inline_data": { "mime_type": "image/png", "data": "BASE64_DATA" } }
        ]
      }
    ],
    "generationConfig": {
      "responseModalities": ["IMAGE"]
    }
  }'
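
The same request can be assembled programmatically. The following is a minimal Python sketch using only the standard library; it mirrors the curl example above rather than any official AnyFast SDK. The helper names `build_edit_request` and `send_edit_request` are hypothetical, and the 60-second timeout is an arbitrary choice, not a documented limit.

```python
import base64
import json
import urllib.parse
import urllib.request

# Endpoint copied from the curl example above.
ENDPOINT = "https://www.anyfast.ai/v1beta/models/gemini-2.0-flash:generateContent"


def build_edit_request(instruction: str, image_bytes: bytes,
                       mime_type: str = "image/png") -> dict:
    """Build the JSON body for an image-editing call (hypothetical helper)."""
    # The API expects the image as a base64 string, not raw bytes.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "contents": [
            {
                "role": "user",
                "parts": [
                    {"text": instruction},
                    {"inline_data": {"mime_type": mime_type, "data": encoded}},
                ],
            }
        ],
        "generationConfig": {"responseModalities": ["IMAGE"]},
    }


def send_edit_request(api_key: str, body: dict, timeout: float = 60.0) -> dict:
    """POST the body to the endpoint and return the decoded JSON response."""
    # The key travels as a query parameter, as in the curl example.
    query = urllib.parse.urlencode({"key": api_key})
    request = urllib.request.Request(
        f"{ENDPOINT}?{query}",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A typical call reads the source image from disk, for example `build_edit_request("Change the background to a beach scene", open("photo.png", "rb").read())`, and passes the result to `send_edit_request` together with your API key. Keeping payload construction separate from the network call makes the body easy to inspect before sending.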

Parameters

Parameter                            Type    Required  Description
key                                  string  Yes       API key (query parameter)
contents                             array   Yes       Array of { role, parts } with text and inline_data
generationConfig.responseModalities  array   Yes       Must include IMAGE
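
This page does not describe the response body. Assuming AnyFast returns the same shape as Google's native Gemini API (a `candidates` array whose `content.parts` entries carry base64 image data under `inlineData`), the edited image could be extracted roughly as follows. The function name `extract_images` is hypothetical, and the snake_case fallback keys are a defensive guess rather than documented behavior.

```python
import base64


def extract_images(response: dict) -> list[tuple[str, bytes]]:
    """Return (mime_type, raw_bytes) for every inline image in a response."""
    images = []
    for candidate in response.get("candidates", []):
        parts = candidate.get("content", {}).get("parts", [])
        for part in parts:
            # Accept either key spelling; which one AnyFast returns is an assumption.
            inline = part.get("inlineData") or part.get("inline_data")
            if not inline or "data" not in inline:
                continue  # text-only part, nothing to decode
            mime = (inline.get("mimeType") or inline.get("mime_type")
                    or "application/octet-stream")
            images.append((mime, base64.b64decode(inline["data"])))
    return images
```

Each returned tuple can be written straight to disk, for example with `open("edited.png", "wb").write(data)`. An empty list means the response contained no image parts, which is worth checking before assuming the edit succeeded.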

API Reference

View the interactive API playground for Gemini 2.0 Flash.