Gemini 2.0 Flash-Lite

Gemini 2.0 Flash-Lite is our fastest Gemini 2.0 model, optimized for cost efficiency and low latency.

Try in Agent Studio View in Model Garden Deploy example app

Note: "Deploy example app" requires a Google Cloud project with billing and Agent Platform API enabled.
Model ID
gemini-2.0-flash-lite
Modalities
Text Input and output
Image Input only
Audio Input only
Video Input only
Token limits
Context window
1,048,576
Maximum output tokens
8,192 (default)
See Consumption options for more information.
Input size limit
500 MB
Technical specifications
Text
  • Maximum number of files per prompt: 3,000
  • Maximum number of pages per file: 1,000
  • Maximum file size per file for the API or Cloud Storage imports: 50 MB(application/pdf) or 7 MB(text/plain)
  • Maximum file size per file for direct uploads through the console: 7 MB
  • Maximum tokens per minute (TPM) per project1:
    • US/Asia: 3.4 M
    • EU: 3.4 M
Image
  • Maximum images per prompt: 3,000
  • Maximum file size per file for inline data or direct uploads through the console: 7 MB
  • Maximum file size per file from Google Cloud Storage: 30 MB
  • Maximum tokens per minute (TPM):
    • High/Medium/Default media resolution:
      • US/Asia: 6.7 M
      • EU: 2.6 M
    • Low media resolution:
      • US/Asia: 2.6 M
      • EU: 2.6 M
  • Supported MIME types:
    image/png , image/jpeg , image/webp , image/heic , image/heif
Audio
  • Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
  • Maximum number of audio files per prompt: 1
  • Speech understanding for: Audio summarization, transcription, and translation
  • Maximum tokens per minute (TPM):
    • US/Asia: 3.5 M
    • EU: 3.5 M
  • Supported MIME types:
    audio/x-aac , audio/flac , audio/mp3 , audio/m4a , audio/mpeg , audio/mpga , audio/mp4 , audio/ogg , audio/pcm , audio/wav , audio/webm
Video
  • Maximum video length (with audio): Approximately 45 minutes
  • Maximum video length (without audio): Approximately 1 hour
  • Maximum number of videos per prompt: 10
  • Maximum tokens per minute (TPM):
    • High/Medium/Default media resolution:
      • US/Asia: 6.3 M
      • EU: 3.2 M
    • Low media resolution:
      • US/Asia: 3.2 M
      • EU: 3.2 M
  • Supported MIME types:
    video/x-flv , video/quicktime , video/mpeg , video/mpegs , video/mpg , video/mp4 , video/webm , video/wmv , video/3gpp
Parameter defaults
  • Temperature: 0.0-2.0 (default 1.0)
  • topP: 0.0-1.0 (default 0.95)
  • topK: 64 (fixed)
  • candidateCount: 1–8 (default 1)
Supported regions

Model availability

  • Global
    • global
  • United States
    • us-central1
    • us-east1
    • us-east4
    • us-east5
    • us-south1
    • us-west1
    • us-west4
  • Europe
    • europe-central2
    • europe-north1
    • europe-southwest1
    • europe-west1
    • europe-west4
    • europe-west8
    • europe-west9

ML processing

  • United States
    • Multi-region
  • Europe
    • Multi-region
See Deployments and endpoints for more information.
Knowledge cutoff date
June 2024
Versions
  • gemini-2.0-flash-lite-001
    • Launch stage: Discontinued
    • Release date: February 25, 2025
    • Discontinuation date: June 1, 2026
Pricing
See Pricing .
Create a Mobile Website
View Site in Mobile | Classic
Share by: