Gemini 2.5 Flash

Caution: The gemini-2.0-flash-preview-image-generation and gemini-2.5-flash-image-preview models will be retired on October 31, 2025. Migrate any workflows to gemini-2.5-flash-image before that date to avoid service disruption.
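
One way to stage this migration in client code is a small lookup that rewrites the retiring model IDs before a request is sent. The sketch below is illustrative only: the model IDs come from the caution above, while the names `RETIRED_IMAGE_MODELS` and `resolve_model_id` are hypothetical and not part of any SDK.

```python
# Hypothetical helper. The IDs come from the caution above;
# the dictionary and function names are illustrative assumptions.
RETIRED_IMAGE_MODELS = {
    "gemini-2.0-flash-preview-image-generation": "gemini-2.5-flash-image",
    "gemini-2.5-flash-image-preview": "gemini-2.5-flash-image",
}


def resolve_model_id(model_id: str) -> str:
    """Return the replacement ID for a retiring model, or the ID unchanged."""
    return RETIRED_IMAGE_MODELS.get(model_id, model_id)
```

Model IDs that are not in the table, such as `gemini-2.5-flash`, pass through untouched, so the helper can wrap every call site.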

Gemini 2.5 Flash is our best model in terms of price and performance, and offers well-rounded capabilities. It is also our first Flash model with thinking capabilities, which let you see the thinking process that the model goes through when generating its response.

For even more detailed technical information on Gemini 2.5 Flash (such as performance benchmarks, information on our training datasets, efforts on sustainability, intended usage and limitations, and our approach to ethics and safety), see our technical report on our Gemini 2.5 models.

2.5 Flash

Try in Vertex AI View in Model Garden (Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and the Vertex AI API enabled.

Model ID

gemini-2.5-flash

Supported inputs & outputs

Inputs:
Text, Code, Images, Audio, Video
Outputs:
Text

Token limits

Maximum input tokens: 1,048,576
Maximum output tokens: 65,535 (default)
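
A client-side pre-check against these limits might look like the following sketch. It assumes the caller already has a token count for the prompt (for example, from a token-counting call); the function name `check_token_budget` is hypothetical.

```python
# Limits copied from the "Token limits" list above.
MAX_INPUT_TOKENS = 1_048_576
MAX_OUTPUT_TOKENS = 65_535


def check_token_budget(input_tokens: int,
                       requested_output_tokens: int = MAX_OUTPUT_TOKENS) -> int:
    """Reject oversized prompts and clamp the requested output-token cap.

    `input_tokens` is assumed to come from the caller's own token count.
    """
    if input_tokens < 0 or requested_output_tokens < 1:
        raise ValueError("token counts must be positive")
    if input_tokens > MAX_INPUT_TOKENS:
        raise ValueError(
            f"prompt has {input_tokens} tokens; limit is {MAX_INPUT_TOKENS}"
        )
    return min(requested_output_tokens, MAX_OUTPUT_TOKENS)
```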

Capabilities

Supported

Not supported

Live API Preview feature

Usage types

Supported

Not supported

Fixed quota

Input size limit

500 MB

Technical specifications

Images

Maximum images per prompt: 3,000
Maximum image size: 7 MB
Supported MIME types:
image/png, image/jpeg, image/webp
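
The image limits above can be checked before a request is built. This is a minimal sketch, not an official validator: `ImagePart` and `image_problems` are hypothetical names, and "7 MB" is assumed to mean 7 × 1024 × 1024 bytes.

```python
from dataclasses import dataclass

# Limits copied from the "Images" list above.
MAX_IMAGES_PER_PROMPT = 3_000
MAX_IMAGE_BYTES = 7 * 1024 * 1024  # assumption: "7 MB" read as 7 MiB
IMAGE_MIME_TYPES = {"image/png", "image/jpeg", "image/webp"}


@dataclass
class ImagePart:
    """Hypothetical description of one image attached to a prompt."""
    mime_type: str
    size_bytes: int


def image_problems(images: list[ImagePart]) -> list[str]:
    """Return a human-readable list of limit violations (empty if none)."""
    problems = []
    if len(images) > MAX_IMAGES_PER_PROMPT:
        problems.append(f"{len(images)} images exceeds {MAX_IMAGES_PER_PROMPT}")
    for index, image in enumerate(images):
        if image.mime_type not in IMAGE_MIME_TYPES:
            problems.append(f"image {index}: unsupported type {image.mime_type}")
        if image.size_bytes > MAX_IMAGE_BYTES:
            problems.append(f"image {index}: {image.size_bytes} bytes is too large")
    return problems
```

Returning a list rather than raising on the first violation lets a caller report every problem in one pass.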

Documents

Maximum number of files per prompt: 3,000
Maximum number of pages per file: 1,000
Maximum file size per file for the API or Cloud Storage imports: 50 MB
Maximum file size per file for direct uploads through the console: 7 MB
Supported MIME types:
application/pdf, text/plain

Video

Maximum video length (with audio): Approximately 45 minutes
Maximum video length (without audio): Approximately 1 hour
Maximum number of videos per prompt: 10
Supported MIME types:
video/x-flv, video/quicktime, video/mpeg, video/mpegs, video/mpg, video/mp4, video/webm, video/wmv, video/3gpp

Audio

Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
Maximum number of audio files per prompt: 1
Speech understanding for: Audio summarization, transcription, and translation
Supported MIME types:
audio/x-aac, audio/flac, audio/mp3, audio/m4a, audio/mpeg, audio/mpga, audio/mp4, audio/ogg, audio/pcm, audio/wav, audio/webm
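
The two audio limits (about 8.4 hours, or up to 1 million tokens) imply a rough token rate that can be used for back-of-the-envelope planning. The sketch below only divides one stated limit by the other; the actual tokenization rate is not given here, so the result is an estimate, and the function names are hypothetical.

```python
# Both figures come from the "Audio" list above.
AUDIO_LIMIT_HOURS = 8.4
AUDIO_LIMIT_TOKENS = 1_000_000  # "up to 1 million tokens"


def implied_tokens_per_second() -> float:
    """Rate implied by dividing the token limit by the duration limit."""
    return AUDIO_LIMIT_TOKENS / (AUDIO_LIMIT_HOURS * 3600)


def estimate_audio_tokens(duration_seconds: float) -> int:
    """Rough token estimate for a clip; an approximation, not a quota check."""
    return round(duration_seconds * implied_tokens_per_second())
```

Under this assumption the implied rate is roughly 33 tokens per second of audio, so a one-hour recording would use on the order of 119,000 tokens.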

Parameter defaults

Temperature: 0.0–2.0 (default 1.0)
topP: 0.0–1.0 (default 0.95)
topK: 64 (fixed)
candidateCount: 1–8 (default 1)
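
These ranges can be enforced before a request is sent. The following is a minimal sketch under the ranges listed above; `validate_generation_params` is a hypothetical helper, and the returned dictionary simply reuses the parameter names from this page rather than any specific SDK type.

```python
TOP_K_FIXED = 64  # topK is listed as fixed, so it is not a caller option


def validate_generation_params(temperature: float = 1.0,
                               top_p: float = 0.95,
                               candidate_count: int = 1) -> dict:
    """Validate sampling parameters against the documented ranges."""
    if not 0.0 <= temperature <= 2.0:
        raise ValueError("temperature must be between 0.0 and 2.0")
    if not 0.0 <= top_p <= 1.0:
        raise ValueError("topP must be between 0.0 and 1.0")
    if not 1 <= candidate_count <= 8:
        raise ValueError("candidateCount must be between 1 and 8")
    return {
        "temperature": temperature,
        "topP": top_p,
        "topK": TOP_K_FIXED,
        "candidateCount": candidate_count,
    }
```

Calling the helper with no arguments returns the documented defaults.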

Supported regions

Model availability

(Includes dynamic shared quota & Provisioned Throughput)

Global

global

United States

us-central1
us-east1
us-east4
us-east5
us-south1
us-west1
us-west4

Europe

europe-central2
europe-north1
europe-southwest1
europe-west1
europe-west4
europe-west8

ML processing

United States

Multi-region

Canada

northamerica-northeast1 +

Europe

Multi-region
europe-west2 * +
europe-west3 * +
europe-west9 * +

Asia Pacific

asia-northeast1 * +
asia-northeast3 * +
asia-south1 * +
asia-southeast1 +
australia-southeast1 * +

See Data residency for more information.

Knowledge cutoff date

January 2025

Versions

gemini-2.5-flash

Launch stage: GA
Release date: June 17, 2025
Discontinuation date: June 17, 2026
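
Because `gemini-2.5-flash` has a published discontinuation date, a deployment can track how much time remains. This is a small sketch using only the date above; the function name is hypothetical.

```python
from datetime import date

# Date copied from the gemini-2.5-flash version entry above.
DISCONTINUATION_DATE = date(2026, 6, 17)


def days_until_discontinuation(today: date) -> int:
    """Days remaining before discontinuation (negative once it has passed)."""
    return (DISCONTINUATION_DATE - today).days
```

For example, `days_until_discontinuation(date(2026, 6, 1))` returns 16.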

gemini-live-2.5-flash

Launch stage: Private GA
Release date: June 17, 2025

Security controls

See Security controls for more information.

Supported languages

See Supported languages.

Pricing

See Pricing.

+ Supervised fine-tuning is not supported.
* Available for the 128K context window only; supervised fine-tuning is not supported.
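
The footnote markers can be turned into a lookup so that code selecting a region knows which restrictions apply. The sketch below copies the marked regions from the list above. It assumes "128K" means 128 × 1024 tokens and treats any region not in the table as unrestricted; it does not check that a region is supported at all. All names are hypothetical.

```python
FULL_CONTEXT_TOKENS = 1_048_576
LIMITED_CONTEXT_TOKENS = 128 * 1024  # assumption: "128K" read as 131,072 tokens

# Region -> footnote markers, as listed in the regions section above.
REGION_FOOTNOTES = {
    "northamerica-northeast1": "+",
    "europe-west2": "*+",
    "europe-west3": "*+",
    "europe-west9": "*+",
    "asia-northeast1": "*+",
    "asia-northeast3": "*+",
    "asia-south1": "*+",
    "asia-southeast1": "+",
    "australia-southeast1": "*+",
}


def region_limits(region: str) -> dict:
    """Return the context window and tuning availability for a region."""
    marks = REGION_FOOTNOTES.get(region, "")
    return {
        "max_input_tokens": (
            LIMITED_CONTEXT_TOKENS if "*" in marks else FULL_CONTEXT_TOKENS
        ),
        # Both footnotes state that supervised fine-tuning is not supported.
        "supervised_fine_tuning": not marks,
    }
```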

2.5 Flash

Try in Vertex AI (Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and the Vertex AI API enabled.

Model ID

gemini-2.5-flash-preview-09-2025

Supported inputs & outputs

Inputs:
Text, Code, Images, Audio, Video
Outputs:
Text

Token limits

Maximum input tokens: 1,048,576
Maximum output tokens: 65,535 (default)

Capabilities

Supported

Not supported

Tuning
Live API Preview feature

Usage types

Supported

Not supported

Technical specifications

Images

Maximum images per prompt: 3,000
Maximum image size: 7 MB
Supported MIME types:
image/png, image/jpeg, image/webp

Documents

Maximum number of files per prompt: 3,000
Maximum number of pages per file: 1,000
Maximum file size per file for the API or Cloud Storage imports: 50 MB
Maximum file size per file for direct uploads through the console: 7 MB
Supported MIME types:
application/pdf, text/plain

Video

Maximum video length (with audio): Approximately 45 minutes
Maximum video length (without audio): Approximately 1 hour
Maximum number of videos per prompt: 10
Supported MIME types:
video/x-flv, video/quicktime, video/mpeg, video/mpegs, video/mpg, video/mp4, video/webm, video/wmv, video/3gpp

Audio

Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
Maximum number of audio files per prompt: 1
Speech understanding for: Audio summarization, transcription, and translation
Supported MIME types:
audio/x-aac, audio/flac, audio/mp3, audio/m4a, audio/mpeg, audio/mpga, audio/mp4, audio/ogg, audio/pcm, audio/wav, audio/webm

Parameter defaults

Temperature: 0.0–2.0 (default 1.0)
topP: 0.0–1.0 (default 0.95)
topK: 64 (fixed)
candidateCount: 1–8 (default 1)

Supported regions

Model availability

(Includes dynamic shared quota & Provisioned Throughput)

Global

global

See Data residency for more information.

Knowledge cutoff date

January 2025

Versions

gemini-2.5-flash-preview-09-2025

Launch stage: Public preview
Release date: September 25, 2025

Security controls

See Security controls for more information.

Supported languages

See Supported languages.

Pricing

See Pricing.
