Gemini 2.5 Flash is our best model in terms of price and performance, and offers well-rounded capabilities. Gemini 2.5 Flash is our first Flash model that features thinking capabilities , which lets you see the thinking process that the model goes through when generating its response.
For even more detailed technical information on Gemini 2.5 Flash (such as performance benchmarks, information on our training datasets, efforts on sustainability, intended usage and limitations, and our approach to ethics and safety), see our technical report on our Gemini 2.5 models and the model card for Gemini 2.5 Flash .
2.5 Flash
Try in Vertex AI View in Model Garden (Preview) Deploy example app
gemini-2.5-flash
- Inputs:
Text , Code , Images , Audio , Video - Outputs:
Text
- Maximum input tokens: 1,048,576
- Maximum output tokens: 65,535 (default)
- Supported
- Not supported
- Supported
- Not supported
- Maximum images per prompt: 3,000
- Maximum image size: 7 MB
- Supported MIME types:
image/png
,image/jpeg
,image/webp
- Maximum number of files per prompt: 3,000
- Maximum number of pages per file: 1,000
- Maximum file size per file for the API or Cloud Storage imports: 50 MB
- Maximum file size per file for direct uploads through the console: 7 MB
- Supported MIME types:
application/pdf
,text/plain
- Maximum video length (with audio): Approximately 45 minutes
- Maximum video length (without audio): Approximately 1 hour
- Maximum number of videos per prompt: 10
- Supported MIME types:
video/x-flv
,video/quicktime
,video/mpeg
,video/mpegs
,video/mpg
,video/mp4
,video/webm
,video/wmv
,video/3gpp
- Maximum audio length per prompt: Appropximately 8.4 hours, or up to 1 million tokens
- Maximum number of audio files per prompt: 1
- Speech understanding for: Audio summarization, transcription, and translation
- Supported MIME types:
audio/x-aac
,audio/flac
,audio/mp3
,audio/m4a
,audio/mpeg
,audio/mpga
,audio/mp4
,audio/opus
,audio/pcm
,audio/wav
,audio/webm
- Temperature: 0.0-2.0 (default 1.0)
- topP: 0.0-1.0 (default 0.95)
- topK: 64 (fixed)
- candidateCount: 1–8 (default 1)
Model availability
(Includes dynamic shared quota & Provisioned Throughput)
- Global
- global
- United States
- us-central1
- us-east1
- us-east4
- us-east5
- us-south1
- us-west1
- us-west4
- Europe
- europe-central2
- europe-north1
- europe-southwest1
- europe-west1
- europe-west4
- europe-west8
- europe-west9 +
ML processing
- United States
- Multi-region
- Canada
- northamerica-northeast1
- Europe
- Multi-region
- europe-west2 *
- Asia Pacific
- asia-northeast1 *
- asia-northeast3 *
- asia-south1 *
- asia-southeast1
- australia-southeast1 *
-
gemini-2.5-flash
- Launch stage: GA
- Release date: June 17, 2025
- Discontinuation date: June 17, 2026
-
gemini-live-2.5-flash
- Launch stage: Private GA
- Release date: June 17, 2025
-
gemini-2.5-flash-preview-05-20
- Launch stage: Public preview
- Release date: May 20, 2025
- Discontinuation date: July 15, 2025
-
gemini-2.5-flash-preview-04-17
- Launch stage: Public preview
- Release date: April 17, 2025
- Discontinuation date: July 15, 2025
* Available for 128K context window only
Image
Try in Vertex AI (Preview) Deploy example app
gemini-2.5-flash-image-preview
- Inputs:
Text , Images - Outputs:
Text and image
- Maximum input tokens: 32,768
- Maximum output tokens: 32,768
- Supported
- Not supported
- Supported
- Not supported
- Maximum images per prompt: 3
- Maximum image size: 7 MB
- Maximum number of output images per prompt: 10
- Supported MIME types:
image/png
,image/jpeg
,image/webp
- Maximum number of files per prompt: 3
- Maximum number of pages per file: 3
- Maximum file size per file: 50 MB
- Supported MIME types:
application/pdf
,text/plain
- Temperature: 0.0-2.0 (default 1.0)
- topP: 0.0-1.0 (default 0.95)
- topK: 64 (fixed)
- candidateCount: 1–8 (default 1)
Model availability
- Global
- global
-
gemini-2.5-flash-image-preview
- Launch stage: Public preview
- Release date: August 26, 2025
Live API native audio
Gemini 2.5 Flash with Live API native audio features our cutting-edge native audio functionality for Live API . In addition to the standard Live API features, this preview model includes:
- Enhanced voice quality and adaptability: Live API native audio provides richer, more natural voice interactions with 30 HD voices in 24 languages .
- Introducing Proactive Audio: When Proactive Audio is enabled, the model only responds when it's relevant. The model generates text transcripts and audio responses proactively only for queries directed to the device, and does not respond to non-device directed queries.
- Introducing Affective Dialog: Models using Live API native audio can understand and respond appropriately to users' emotional expressions for more nuanced conversations.
For more information on Live API, see our standalone Live API documentation .
gemini-live-2.5-flash-preview-native-audio
- Inputs:
Audio , Video - Outputs:
Audio
- Maximum input tokens: 1,048,576
- Maximum output tokens: 128K (default)
- Supported
- Grounding with Google Search
- Function calling
- Live API Preview feature
- Not supported
- Supported
- Not supported
- Maximum screenshare length: Approximately 10 minutes
- Supported MIME types:
video/x-flv
,video/quicktime
,video/mpeg
,video/mpegs
,video/mpg
,video/mp4
,video/webm
,video/wmv
,video/3gpp
- Maximum conversation length: Approximately 10 minutes
- Speech understanding for: Audio summarization, transcription, and translation
- Supported MIME types:
audio/x-aac
,audio/flac
,audio/mp3
,audio/m4a
,audio/mpeg
,audio/mpga
,audio/mp4
,audio/opus
,audio/pcm
,audio/wav
,audio/webm
- Temperature: 0.0-2.0 (default 1.0)
- topP: 0.0-1.0 (default 0.95)
- topK: 64 (fixed)
- candidateCount: 1–8 (default 1)
Model availability
- United States
- us-central1
-
gemini-live-2.5-flash-preview-native-audio
- Launch stage: Public preview
- Release date: June 17, 2025