Gemini 2.0 Flash-Lite

Caution: As of June 1, 2026, gemini-2.0-flash-001 and gemini-2.0-flash-lite-001 are discontinued and are no longer available. This includes both model serving and Provisioned Throughput. Use Gemini 3.1 Flash-Lite, Gemma 4, or more recent Gemini releases.

Gemini 2.0 Flash-Lite is our fastest Gemini 2.0 model, optimized for cost efficiency and low latency.

Try in Agent Studio View in Model Garden Deploy example app

Note: "Deploy example app" requires a Google Cloud project with billing and Agent Platform API enabled.

Model ID

gemini-2.0-flash-lite

Modalities

Text Input and output

Image Input only

Audio Input only

Video Input only

Token limits

Context window

1,048,576

Maximum output tokens

8,192 (default)

Capabilities

Supported

Not supported

Consumption options

Supported

Not supported

See Consumption options for more information.

Input size limit

500 MB

Technical specifications

Text

Maximum number of files per prompt: 3,000
Maximum number of pages per file: 1,000
Maximum file size per file for the API or Cloud Storage imports: 50 MB(application/pdf) or 7 MB(text/plain)
Maximum file size per file for direct uploads through the console: 7 MB
Maximum tokens per minute (TPM) per project1:
- US/Asia: 3.4 M
- EU: 3.4 M

Image

Maximum images per prompt: 3,000
Maximum file size per file for inline data or direct uploads through the console: 7 MB
Maximum file size per file from Google Cloud Storage: 30 MB
Maximum tokens per minute (TPM):
- High/Medium/Default media resolution:
  - US/Asia: 6.7 M
  - EU: 2.6 M
- Low media resolution:
  - US/Asia: 2.6 M
  - EU: 2.6 M
Supported MIME types:
image/png , image/jpeg , image/webp , image/heic , image/heif

Audio

Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
Maximum number of audio files per prompt: 1
Speech understanding for: Audio summarization, transcription, and translation
Maximum tokens per minute (TPM):
- US/Asia: 3.5 M
- EU: 3.5 M
Supported MIME types:
audio/x-aac , audio/flac , audio/mp3 , audio/m4a , audio/mpeg , audio/mpga , audio/mp4 , audio/ogg , audio/pcm , audio/wav , audio/webm

Video

Maximum video length (with audio): Approximately 45 minutes
Maximum video length (without audio): Approximately 1 hour
Maximum number of videos per prompt: 10
Maximum tokens per minute (TPM):
- High/Medium/Default media resolution:
  - US/Asia: 6.3 M
  - EU: 3.2 M
- Low media resolution:
  - US/Asia: 3.2 M
  - EU: 3.2 M
Supported MIME types:
video/x-flv , video/quicktime , video/mpeg , video/mpegs , video/mpg , video/mp4 , video/webm , video/wmv , video/3gpp

Parameter defaults

Temperature: 0.0-2.0 (default 1.0)
topP: 0.0-1.0 (default 0.95)
topK: 64 (fixed)
candidateCount: 1–8 (default 1)

Supported regions

Model availability

Global

global

United States

us-central1
us-east1
us-east4
us-east5
us-south1
us-west1
us-west4

Europe

europe-central2
europe-north1
europe-southwest1
europe-west1
europe-west4
europe-west8
europe-west9

ML processing

United States

Multi-region

Europe

Multi-region

See Deployments and endpoints for more information.

Knowledge cutoff date

June 2024

Versions

gemini-2.0-flash-lite-001

Launch stage: Discontinued
Release date: February 25, 2025
Discontinuation date: June 1, 2026

Pricing

See Pricing .

Gemini 2.0 Flash-Lite Stay organized with collections Save and categorize content based on your preferences.

Gemini 2.0 Flash-Lite