Gemini 3 Flash combines Gemini 3 Pro's reasoning capabilities with the latency, efficiency, and cost of the Flash line. It handles everyday tasks with improved reasoning and is also designed to tackle the most complex agentic workflows.
Gemini 3 Flash uses several new features to improve performance, control, and multimodal fidelity:
- Thinking level: Use the thinking_level parameter to control the amount of internal reasoning the model performs (minimal, low, medium, or high) to balance response quality, reasoning complexity, latency, and cost. The thinking_level parameter replaces thinking_budget for Gemini 3 models. For details on the different thinking levels, see Thinking.
- Thought signatures: Stricter validation of thought signatures improves reliability in multi-turn function calling.
- Media resolution: Use the media_resolution parameter (low, medium, high, or ultra high) to control vision processing for multimodal inputs, impacting token usage and latency. See Get started with Gemini 3 for default resolution settings.
  - The ultra high media resolution level is only available for the IMAGE modality.
  - PDF token counts will be listed under the IMAGE modality instead of the DOCUMENT modality in usage_metadata.
- Multimodal function responses: Function responses can now include multimodal objects like images and PDFs in addition to text.
- Streaming function calling: Stream partial function call arguments to improve user experience during tool use.
For more information on using these features, see Get started with Gemini 3 .
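The allowed values for thinking_level and media_resolution listed above can be checked before a request is built. The following is a minimal sketch, not an SDK call: build_feature_settings is a hypothetical helper, and the dictionary it returns is only an illustration, not the request format of any client library. Only the value names and the IMAGE-only rule for ultra high come from the list above.

```python
from typing import Optional

# Allowed values, taken from the feature list above.
THINKING_LEVELS = ("minimal", "low", "medium", "high")
MEDIA_RESOLUTIONS = ("low", "medium", "high", "ultra high")


def build_feature_settings(thinking_level: str,
                           media_resolution: Optional[str] = None,
                           modality: Optional[str] = None) -> dict:
    """Validate the two settings and return them as a plain dict.

    Hypothetical helper: the dict layout is for illustration only and is
    not the wire format of any API or SDK.
    """
    if thinking_level not in THINKING_LEVELS:
        raise ValueError(f"unknown thinking_level: {thinking_level!r}")
    settings = {"thinking_level": thinking_level}

    if media_resolution is not None:
        if media_resolution not in MEDIA_RESOLUTIONS:
            raise ValueError(f"unknown media_resolution: {media_resolution!r}")
        # "ultra high" is documented as available only for the IMAGE modality.
        if media_resolution == "ultra high" and modality != "IMAGE":
            raise ValueError("ultra high media resolution requires the IMAGE modality")
        settings["media_resolution"] = media_resolution

    return settings
```

For example, build_feature_settings("low", "high") passes validation, while requesting "ultra high" for a video input raises a ValueError in this sketch.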
gemini-3-flash-preview
- Inputs: Text, Code, Images, Audio, Video, PDF
- Outputs: Text
- Maximum input tokens: 1,048,576
- Maximum output tokens: 65,536
Images
- Maximum images per prompt: 900
- Maximum file size per file for inline data or direct uploads through the console: 7 MB
- Maximum file size per file from Google Cloud Storage: 30 MB
- Default resolution tokens: 1120
- Supported MIME types: image/png, image/jpeg, image/webp, image/heic, image/heif
Documents
- Maximum number of files per prompt: 900
- Maximum number of pages per file: 900
- Maximum file size per file for the API or Cloud Storage imports: 50 MB
- Maximum file size per file for direct uploads through the console: 7 MB
- Default resolution tokens: 560
- OCR for scanned PDFs: Not used by default
- Supported MIME types: application/pdf, text/plain
Video
- Maximum video length (with audio): Approximately 45 minutes
- Maximum video length (without audio): Approximately 1 hour
- Maximum number of videos per prompt: 10
- Default resolution tokens per frame: 70
- Supported MIME types: video/x-flv, video/quicktime, video/mpeg, video/mpegs, video/mpg, video/mp4, video/webm, video/wmv, video/3gpp
Audio
- Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
- Maximum number of audio files per prompt: 1
- Speech understanding for: Audio summarization, transcription, and translation
- Supported MIME types: audio/x-aac, audio/flac, audio/mp3, audio/m4a, audio/mpeg, audio/mpga, audio/mp4, audio/ogg, audio/pcm, audio/wav, audio/webm
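The MIME types and per-file size limits above lend themselves to a client-side check before upload. The following is a rough sketch under stated assumptions: check_file and modality_of are hypothetical helpers, the source labels ("inline", "console", "cloud_storage", "api") are names chosen for this example, and MB is assumed to mean 1,000,000 bytes, which the limits above do not specify.

```python
from typing import List, Optional

MB = 1_000_000  # assumption: decimal megabytes; the limits do not say which unit applies

# Supported MIME types per modality, copied from the lists above.
MIME_TYPES = {
    "image": {"image/png", "image/jpeg", "image/webp", "image/heic", "image/heif"},
    "document": {"application/pdf", "text/plain"},
    "video": {"video/x-flv", "video/quicktime", "video/mpeg", "video/mpegs",
              "video/mpg", "video/mp4", "video/webm", "video/wmv", "video/3gpp"},
    "audio": {"audio/x-aac", "audio/flac", "audio/mp3", "audio/m4a", "audio/mpeg",
              "audio/mpga", "audio/mp4", "audio/ogg", "audio/pcm", "audio/wav",
              "audio/webm"},
}

# Per-file size limits stated above, keyed by (modality, source).
# The source labels are hypothetical names used only in this sketch.
SIZE_LIMITS = {
    ("image", "inline"): 7 * MB,
    ("image", "console"): 7 * MB,
    ("image", "cloud_storage"): 30 * MB,
    ("document", "api"): 50 * MB,
    ("document", "cloud_storage"): 50 * MB,
    ("document", "console"): 7 * MB,
}


def modality_of(mime_type: str) -> Optional[str]:
    """Return the modality whose MIME list contains mime_type, or None."""
    for modality, types in MIME_TYPES.items():
        if mime_type in types:
            return modality
    return None


def check_file(mime_type: str, size_bytes: int, source: str) -> List[str]:
    """Return a list of problems found; an empty list means none were found."""
    modality = modality_of(mime_type)
    if modality is None:
        return [f"unsupported MIME type: {mime_type}"]
    limit = SIZE_LIMITS.get((modality, source))
    # Combinations with no stated size limit (for example video) are not checked.
    if limit is not None and size_bytes > limit:
        return [f"{modality} file exceeds the {limit // MB} MB limit for source {source!r}"]
    return []
```

In this sketch an 8 MB PNG is flagged for the "console" source but accepted for "cloud_storage", which mirrors the 7 MB and 30 MB image limits listed above. Count limits such as images per prompt are left out to keep the example short.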
Parameter defaults
- Temperature: 0.0–2.0 (default 1.0)
- topP: 0.0–1.0 (default 0.95)
- topK: 64 (fixed)
- candidateCount: 1–8 (default 1)
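The ranges and defaults above can be enforced when assembling sampling settings. The following is a minimal sketch: build_generation_config is a hypothetical helper, and the returned dictionary reuses the parameter names from the list above for illustration rather than claiming to match any particular SDK's request type. Because topK is fixed at 64, the sketch does not accept it as an argument.

```python
def build_generation_config(temperature: float = 1.0,
                            top_p: float = 0.95,
                            candidate_count: int = 1) -> dict:
    """Validate sampling settings against the documented ranges.

    Hypothetical helper; defaults mirror the values listed above.
    """
    if not 0.0 <= temperature <= 2.0:
        raise ValueError("temperature must be between 0.0 and 2.0")
    if not 0.0 <= top_p <= 1.0:
        raise ValueError("topP must be between 0.0 and 1.0")
    if not isinstance(candidate_count, int) or not 1 <= candidate_count <= 8:
        raise ValueError("candidateCount must be an integer from 1 to 8")
    # topK is fixed at 64 for this model, so it is intentionally omitted.
    return {
        "temperature": temperature,
        "topP": top_p,
        "candidateCount": candidate_count,
    }
```

Calling build_generation_config() with no arguments yields the documented defaults, and out-of-range values raise a ValueError before any request is sent.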
Model availability
(Includes Standard PayGo and Provisioned Throughput)
- Global
  - global
- gemini-3-flash-preview
  - Launch stage: Public preview
  - Release date: December 17, 2025

