Google and Partner models and generative AI features on Vertex AI are exposed as specific regional endpoints and a global endpoint. Global endpoints cover the entire world and provide higher availability and reliability than single regions.
Note that model endpoints don't guarantee region availability or in-region ML processing. For information about data residency, see Data residency .
Global endpoint
Selecting a global endpoint for your requests can improve overall availability while reducing resource exhausted (429) errors. Don't use the global endpoint if you have ML processing requirements, because you can't control or know which region your ML processing requests are sent to when a request is made.
Supported models
Usage of the global endpoint is supported for the following Google models in specified regions. For details about which regions support the global endpoint, see the Globaltab in the Google model endpoint locations table .
- Gemini 2.5 Flash Image Preview
- Gemini 2.5 Flash-Lite
- Gemini 2.5 Pro
- Gemini 2.5 Flash
- Gemini 2.0 Flash
- Gemini 2.0 Flash-Lite
For information about global endpoint availability for partner models, see the Globaltab in the Google Cloud partner model endpoint locations table .
Use the global endpoint
To use the global endpoint, exclude the location from the endpoint name and
configure the location of the resource to global
. For example, the following
is global endpoint URL:
https:// aiplatform.googleapis.com/v1/projects/test-project/locations/ global/publishers/google/models/gemini-2.0-flash-001:generateContent
For the Google Gen AI SDK
, create a client that uses the global
location:
client
=
genai
.
Client
(
vertexai
=
True
,
project
=
' PROJECT_ID
'
,
location
=
'global'
)
For the Vertex AI SDK for Python
,
initialize the SDK using the global
location:
import
vertexai
from
vertexai.generative_models
import
GenerativeModel
vertexai
.
init
(
project
=
' PROJECT_ID
'
,
location
=
'global'
)
Limitations
The following capabilities are not available when using the global endpoint:
- Tuning
- Batch prediction
- Retrieval-augmented generation (RAG) corpus (RAG requests are supported)
Usage of the global endpoint with Provisioned Throughput is available only for the following models:
Model | Latest supported model version |
---|---|
Gemini 2.5 Flash Image Preview ( preview ) | gemini-2.5-flash-image-preview
|
Gemini 2.5 Flash-Lite | gemini-2.5-flash-lite
|
Gemini 2.5 Pro | gemini-2.5-pro
|
Gemini 2.5 Flash | gemini-2.5-flash
|
Gemini 2.0 Flash | gemini-2.0-flash-001
|
Gemini 2.0 Flash-Lite | gemini-2.0-flash-lite-001
|
Google model endpoint locations
Google model endpoints for Generative AI on Vertex AI are available in the following regions.
United States
Columbus, Ohio (us-east5) | Dallas, Texas (us-south1) | Iowa (us-central1) | Las Vegas, Nevada (us-west4) | Moncks Corner, South Carolina (us-east1) | Northern Virginia (us-east4) | Oregon (us-west1) | |
---|---|---|---|---|---|---|---|
Gemini 2.5 Flash
( gemini-2.5-flash
) |
|||||||
Gemini 2.5 Pro
( gemini-2.5-pro
) |
|||||||
Gemini 2.5 Flash-Lite
( gemini-2.5-flash-lite
) |
|||||||
Gemini 2.0 Flash
( gemini-2.0-flash-001
) |
|||||||
Gemini 2.0 Flash
( gemini-2.0-flash-001
) |
|||||||
Gemini 2.0 Flash-Lite
( gemini-2.0-flash-lite-001
) |
|||||||
Gemini 1.5 Pro
( gemini-1.5-pro-002
) |
|||||||
Gemini 1.5 Flash
( gemini-1.5-flash-002
) |
|||||||
Gemini Embeddings
( gemini-embedding-001
) |
|||||||
Embeddings for Text
|
|||||||
Embeddings for Multimodal
|
|||||||
Imagen for Captioning & VQA
|
|||||||
Imagen
( imagegeneration@002
) |
|||||||
Imagen 2
( imagegeneration@005
) |
|||||||
Imagen 2
( imagegeneration@006
) |
|||||||
Imagen 3
( imagen-3.0-generate-001
) |
|||||||
Imagen 3 Fast
( imagen-3.0-fast-generate-001
) |
|||||||
Imagen 3 Editing and Customization
( imagen-3.0-capability-001
) |
|||||||
Imagen 3
( imagen-3.0-generate-002
) |
|||||||
Imagen 4
( imagen-4.0-generate-001
) |
|||||||
Imagen 4
( imagen-4.0-fast-generate-001
) |
|||||||
Imagen 4 Ultra Generate experimental
( imagen-4.0-ultra-generate-001
) |
|||||||
Veo 2
( veo-2.0-generate-001
) |
|||||||
Veo 3
( veo-3.0-generate-001
) |
|||||||
Veo 3 Fast
( veo-3.0-fast-generate-001
) |
|||||||
Veo 3 (Preview)
( veo-3.0-generate-preview
) |
|||||||
Veo 3 Fast (Preview)
( veo-3.0-fast-generate-preview
) |
Canada
* | Montréal (northamerica-northeast1) |
---|---|
Gemini 2.5 Flash ( gemini-2.5-flash
) |
* |
Gemini 2.5 Pro ( gemini-2.5-pro
) |
* |
Gemini 2.5 Flash-Lite ( gemini-2.5-flash-lite
) |
|
Gemini 2.0 Flash ( gemini-2.0-flash-001
) |
* |
Gemini 2.0 Flash-Lite ( gemini-2.0-flash-lite-001
) |
* |
Gemini 1.5 Pro ( gemini-1.5-pro-002
) |
* |
Gemini 1.5 Flash ( gemini-1.5-flash-002
) |
* |
Gemini Embeddings ( gemini-embedding-001
) |
* |
Embeddings for Text | * |
Embeddings for Multimodal | * |
Imagen for Captioning & VQA | * |
Imagen ( imagegeneration@002
) |
* |
Imagen 2 ( imagegeneration@005
) |
* |
Imagen 2 ( imagegeneration@006
) |
* |
Imagen 3 ( imagen-3.0-generate-001
) |
* |
Imagen 3 Fast ( imagen-3.0-fast-generate-001
) |
* |
Imagen 3 Editing and Customization ( imagen-3.0-capability-001
) |
* |
Imagen 3 ( imagen-3.0-generate-002
) |
* |
Imagen 4 ( imagen-4.0-generate-001
) |
* |
Imagen 4 ( imagen-4.0-fast-generate-001
) |
* |
Imagen 4 Ultra Generate experimental ( imagen-4.0-ultra-generate-001
) |
* |
South America
* | São Paulo, Brazil (southamerica-east1) |
---|---|
Gemini 2.5 Flash ( gemini-2.5-flash
) |
* |
Gemini 2.5 Pro ( gemini-2.5-pro
) |
* |
Gemini 2.5 Flash-Lite ( gemini-2.5-flash-lite
) |
|
Gemini 2.0 Flash ( gemini-2.0-flash-001
) |
* |
Gemini 2.0 Flash-Lite ( gemini-2.0-flash-lite-001
) |
* |
Gemini 1.5 Pro ( gemini-1.5-pro-002
) |
* |
Gemini 1.5 Flash ( gemini-1.5-flash-002
) |
* |
Gemini Embeddings ( gemini-embedding-001
) |
* |
Embeddings for Text | * |
Embeddings for Multimodal | * |
Imagen for Captioning & VQA | * |
Imagen ( imagegeneration@002
) |
* |
Imagen 2 ( imagegeneration@005
) |
* |
Imagen 2 ( imagegeneration@006
) |
* |
Imagen 3 ( imagen-3.0-generate-001
) |
* |
Imagen 3 Fast ( imagen-3.0-fast-generate-001
) |
* |
Imagen 3 Editing and Customization ( imagen-3.0-capability-001
) |
* |
Imagen 3 ( imagen-3.0-generate-002
) |
* |
Imagen 4 ( imagen-4.0-generate-001
) |
* |
Imagen 4 ( imagen-4.0-fast-generate-001
) |
* |
Imagen 4 Ultra Generate experimental ( imagen-4.0-ultra-generate-001
) |
* |
Europe
* | Netherlands (europe-west4) | Paris, France (europe-west9) | London, United Kingdom (europe-west2) | Frankfurt, Germany (europe-west3) | Belgium (europe-west1) | Zürich, Switzerland (europe-west6) | Madrid, Spain (europe-southwest1) | Milan, Italy (europe-west8) | Finland (europe-north1) | Warsaw, Poland (europe-central2) |
---|---|---|---|---|---|---|---|---|---|---|
Gemini 2.5 Flash ( gemini-2.5-flash
) |
* | + | * | * | * | * | * | * | * | * |
Gemini 2.5 Pro ( gemini-2.5-pro
) |
* | * | * | * | * | * | * | * | * | * |
Gemini 2.5 Flash-Lite ( gemini-2.5-flash-lite
) |
* | * | * | * | * | * | * | * | * | * |
Gemini 2.0 Flash ( gemini-2.0-flash-001
) |
* | * | * | * | * | * | * | * | * | * |
Gemini 2.0 Flash-Lite ( gemini-2.0-flash-lite-001
) |
* | * | * | * | * | * | * | * | * | * |
Gemini 1.5 Pro ( gemini-1.5-pro-002
) |
* | * | * | * | * | * | * | * | * | * |
Gemini 1.5 Flash ( gemini-1.5-flash-002
) |
* | * | * | * | * | * | * | * | * | * |
Gemini Embeddings ( gemini-embedding-001
) |
* | * | * | * | * | * | * | * | * | * |
Embeddings for Text | * | * | * | * | * | * | * | * | * | * |
Embeddings for Multimodal | * | * | * | * | * | * | * | * | * | * |
Imagen for Captioning & VQA | * | * | * | * | * | |||||
Imagen ( imagegeneration@002
) |
* | * | * | * | * | |||||
Imagen 2 ( imagegeneration@005
) |
* | * | * | * | * | * | * | * | * | * |
Imagen 2 ( imagegeneration@006
) |
* | * | * | * | * | |||||
Imagen 3 ( imagen-3.0-generate-001
) |
* | * | * | * | * | * | * | * | * | * |
Imagen 3 Fast ( imagen-3.0-fast-generate-001
) |
* | * | * | * | * | * | * | * | * | * |
Imagen 3 Editing and Customization ( imagen-3.0-capability-001
) |
* | * | * | * | * | * | * | * | * | * |
Imagen 3imagen-3.0-generate-002
|
* | * | * | * | * | * | * | * | * | * |
Imagen 4 ( imagen-4.0-generate-001
) |
* | * | * | * | * | * | * | * | * | * |
Imagen 4 ( imagen-4.0-fast-generate-001
) |
* | * | * | * | * | * | * | * | * | * |
Imagen 4 Ultra Generate experimental ( imagen-4.0-ultra-generate-001
) |
* | * | * | * | * | * | * | * | * | * |
Asia Pacific
* | Tokyo, Japan (asia-northeast1) | Sydney, Australia (australia-southeast1) | Singapore (asia-southeast1) | Seoul, Korea (asia-northeast3) | Taiwan (asia-east1) | Hong Kong, China (asia-east2) | Mumbai, India (asia-south1) |
---|---|---|---|---|---|---|---|
Gemini 2.5 Flash ( gemini-2.5-flash
) |
* | * | * | * | * | * | * |
Gemini 2.5 Pro ( gemini-2.5-pro
) |
* | * | * | * | * | * | * |
Gemini 2.5 Flash-Lite ( gemini-2.5-flash-lite
) |
|||||||
Gemini 2.0 Flash ( gemini-2.0-flash-001
) |
* | * | * | * | * | * | * |
Gemini 2.0 Flash-Lite ( gemini-2.0-flash-lite-001
) |
* | * | * | * | * | * | * |
Gemini 1.5 Pro ( gemini-1.5-pro-002
) |
* | * | * | * | * | * | * |
Gemini 1.5 Flash ( gemini-1.5-flash-002
) |
* | * | * | * | * | * | * |
Gemini Embeddings ( gemini-embedding-001
) |
* | * | * | * | * | * | * |
Embeddings for Text | * | * | * | * | * | * | * |
Embeddings for Multimodal | * | * | * | * | * | * | * |
Imagen for Captioning & VQA | * | * | * | * | |||
Imagen ( imagegeneration@002
) |
* | * | * | * | |||
Imagen 2 ( imagegeneration@005
) |
* | * | * | * | * | * | * |
Imagen 2 ( imagegeneration@006
) |
* | * | * | * | |||
Imagen 3 ( imagen-3.0-generate-001
) |
* | * | * | * | * | * | * |
Imagen 3 Fast ( imagen-3.0-fast-generate-001
) |
* | * | * | * | * | * | * |
Imagen 3 Editing and Customization ( imagen-3.0-capability-001
) |
* | * | * | * | * | * | * |
Imagen 3 ( imagen-3.0-generate-002
) |
* | * | * | * | * | * | * |
Imagen 4 ( imagen-4.0-generate-001
) |
* | * | * | * | * | * | * |
Imagen 4 ( imagen-4.0-fast-generate-001
) |
* | * | * | * | * | * | * |
Imagen 4 Ultra Generate experimental ( imagen-4.0-ultra-generate-001
) |
* | * | * | * | * | * | * |
Middle East
* | Dammam, Saudi Arabia (me-central2) | Doha, Qatar (me-central1) | Tel Aviv, Israel (me-west1) |
---|---|---|---|
Gemini 2.5 Flash ( gemini-2.5-flash
) |
* | * | * |
Gemini 2.5 Pro ( gemini-2.5-pro
) |
* | * | * |
Gemini 2.5 Flash-Lite ( gemini-2.5-flash-lite
) |
|||
Gemini 2.0 Flash ( gemini-2.0-flash-001
) |
* | * | * |
Gemini 2.0 Flash-Lite ( gemini-2.0-flash-lite-001
) |
* | * | * |
Gemini 1.5 Pro ( gemini-1.5-pro-002
) |
* | * | * |
Gemini 1.5 Flash ( gemini-1.5-flash-002
) |
* | * | * |
Gemini Embeddings ( gemini-embedding-001
) |
* | * | * |
Embeddings for Text | * | * | * |
Embeddings for Multimodal | * | * | * |
Imagen for Captioning & VQA | |||
Imagen ( imagegeneration@002
) |
|||
Imagen 2 ( imagegeneration@005
) |
* | * | * |
Imagen 2 ( imagegeneration@006
) |
|||
Imagen 3 ( imagen-3.0-generate-001
) |
* | * | * |
Imagen 3 Fast ( imagen-3.0-fast-generate-001
) |
* | * | * |
Imagen 3 Editing and Customization ( imagen-3.0-capability-001
) |
* | * | * |
Imagen 3 ( imagen-3.0-generate-002
) |
* | * | * |
Imagen 4 ( imagen-4.0-generate-001
) |
* | * | * |
Imagen 4 ( imagen-4.0-fast-generate-001
) |
* | * | * |
Imagen 4 Ultra Generate experimental ( imagen-4.0-ultra-generate-001
) |
* | * | * |
Global
Global (global) | |
---|---|
Gemini 2.5 Flash ( gemini-2.5-flash
) |
|
Gemini 2.5 Pro ( gemini-2.5-pro
) |
|
Gemini 2.5 Flash-Lite ( gemini-2.5-flash-lite
) |
|
Gemini 2.0 Flash ( gemini-2.0-flash-001
) |
|
Gemini 2.0 Flash-Lite ( gemini-2.0-flash-lite-001
) |
|
Gemini 1.5 Pro ( gemini-1.5-pro-002
) |
|
Gemini 1.5 Flash ( gemini-1.5-flash-002
) |
|
Gemini Embeddings ( gemini-embedding-001
) |
|
Embeddings for Text | |
Embeddings for Multimodal | |
Imagen for Captioning & VQA | |
Imagen ( imagegeneration@002
) |
|
Imagen 2 ( imagegeneration@005
) |
|
Imagen 2 ( imagegeneration@006
) |
|
Imagen 3 ( imagen-3.0-generate-001
) |
|
Imagen 3 Fast ( imagen-3.0-fast-generate-001
) |
|
Imagen 3 Editing and Customization ( imagen-3.0-capability-001
) |
|
Imagen 3 ( imagen-3.0-generate-002
) |
|
Imagen 4 ( imagen-4.0-generate-001
) |
|
Imagen 4 ( imagen-4.0-fast-generate-001
) |
|
Imagen 4 Ultra Generate experimental ( imagen-4.0-ultra-generate-001
) |
* Region is available only while using Single Zone Provisioned Throughput
+ Supervised fine tuning isn't supported in this region.
Google Cloud partner model endpoint locations
Google serves requests from the region that you specified. For some models, Google also offers a global endpoint to improve overall availability and reduce error rates. The global endpoint can have a separate set of quotas from the regional endpoint and doesn't support data residency requirements. For more information, see the "Regional and global endpoint" section in Vertex AI partner models for MaaS .
Partner model endpoints for Generative AI on Vertex AI are available in the following regions:
United States
* | Columbus, Ohio (us-east5) | Dallas, Texas (us-south1) | Iowa (us-central1) | Las Vegas, Nevada (us-west4) | Moncks Corner, South Carolina (us-east1) | Northern Virginia (us-east4) | Oregon (us-west1) |
---|---|---|---|---|---|---|---|
Anthropic's Claude Opus 4.1 | * | ||||||
Anthropic's Claude Opus 4 | * | ||||||
Anthropic's Claude Sonnet 4 | * | ||||||
Anthropic's Claude 3.7 Sonnet | * | ||||||
Anthropic's Claude 3.5 Sonnet v2 (deprecated) | * | ||||||
Anthropic's Claude 3.5 Sonnet (deprecated) | * | ||||||
Anthropic's Claude 3.5 Haiku | * | ||||||
Anthropic's Claude 3 Haiku | * | ||||||
DeepSeek R1 (0528) | * | * | * | ||||
Llama 4 Maverick 17B-128E (Preview) | * | ||||||
Llama 4 Scout 17B-16E (Preview) | * | ||||||
Llama 3.3 70B (Preview) | * | * | * | * | * | * | * |
Llama 3.2 90B (Preview) | * | * | * | * | * | * | * |
Llama 3.1 405B | * | * | * | * | * | * | * |
Llama 3.1 70B (Preview) | * | * | * | * | * | * | * |
Llama 3.1 8B (Preview) | * | * | * | * | * | * | * |
Mistral OCR (25.05) | * | * | * | ||||
Mistral Small 3.1 (25.03) | * | * | * | ||||
Mistral Large | * | * | * | ||||
Codestral | * | * | * | ||||
Jamba 1.5 Large (Deprecated) | * | * | * | ||||
Jamba 1.5 Mini (Deprecated) | * | * | * |
Europe
* | Netherlands (europe-west4) | Belgium (europe-west1) | |
---|---|---|---|
Anthropic's Claude Opus 4.1 | * | * | |
Anthropic's Claude Opus 4 | * | * | |
Anthropic's Claude Sonnet 4 | * | * | |
Anthropic's Claude 3.7 Sonnet | * | * | |
Anthropic's Claude 3.5 Sonnet v2 (deprecated) | * | * | |
Anthropic's Claude 3.5 Sonnet (deprecated) | * | * | |
Anthropic's Claude 3.5 Haiku | * | * | |
Anthropic's Claude 3 Haiku | * | * | |
DeepSeek R1 (0528) | * | ||
Llama 4 Maverick 17B-128E (Preview) | * | ||
Llama 4 Scout 17B-16E (Preview) | * | ||
Llama 3.3 70B (Preview) | * | * | * |
Llama 3.2 90B (Preview) | * | * | * |
Llama 3.1 405B | * | * | * |
Llama 3.1 70B (Preview) | * | * | * |
Llama 3.1 8B (Preview) | * | * | * |
Mistral OCR (25.05) | * | * | |
Mistral Small 3.1 (25.03) | * | * | |
Mistral Large | * | * | |
Codestral | * | * | |
Jamba 1.5 Large (Deprecated) | * | * | |
Jamba 1.5 Mini (Deprecated) | * | * |
Asia Pacific
Singapore (asia-southeast1) | Taiwan (asia-east1) | |
---|---|---|
Anthropic's Claude Opus 4.1
|
||
Anthropic's Claude Opus 4
|
||
Anthropic's Claude Sonnet 4
|
||
Anthropic's Claude 3.7 Sonnet
|
||
Anthropic's Claude 3.5 Sonnet v2 (deprecated)
|
||
Anthropic's Claude 3.5 Sonnet (deprecated)
|
||
Anthropic's Claude 3.5 Haiku
|
||
Anthropic's Claude 3 Haiku
|
||
DeepSeek R1 (0528)
|
||
Llama 4 Maverick 17B-128E (Preview)
|
||
Llama 4 Scout 17B-16E (Preview)
|
||
Llama 3.3 70B (Preview)
|
||
Llama 3.2 90B (Preview)
|
||
Llama 3.1 405B
|
||
Llama 3.1 70B (Preview)
|
||
Llama 3.1 8B (Preview)
|
||
Mistral OCR (25.05)
|
||
Mistral Small 3.1 (25.03)
|
||
Mistral Large
|
||
Codestral
|
||
Jamba 1.5 Large (Deprecated)
|
||
Jamba 1.5 Mini (Deprecated)
|
Global
* | Global (global) | |
---|---|---|
Anthropic's Claude Opus 4.1 | * | * |
Anthropic's Claude Opus 4 | * | * |
Anthropic's Claude Sonnet 4 | * | |
Anthropic's Claude 3.7 Sonnet | * | |
Anthropic's Claude 3.5 Sonnet v2 (deprecated) | * | |
Anthropic's Claude 3.5 Sonnet (deprecated) | * | |
Anthropic's Claude 3.5 Haiku | * | |
Anthropic's Claude 3 Haiku | * | |
DeepSeek R1 (0528) | * | |
Llama 4 Maverick 17B-128E (Preview) | * | |
Llama 4 Scout 17B-16E (Preview) | * | |
Llama 3.3 70B (Preview) | * | |
Llama 3.2 90B (Preview) | * | |
Llama 3.1 405B | * | |
Llama 3.1 70B (Preview) | * | |
Llama 3.1 8B (Preview) | * | |
Mistral OCR (25.05) | * | |
Mistral Small 3.1 (25.03) | * | |
Mistral Large | * | |
Codestral | * | |
Jamba 1.5 Large (Deprecated) | * | |
Jamba 1.5 Mini (Deprecated) | * |
What's next
- For a notebook tutorial that demonstrates the global endpoint, see Intro to the Vertex AI global endpoint .
- Learn more about Generative AI on Vertex AI data residency .
- Learn about Google Cloud regions .
- Learn more about security controls by feature .
- Learn about the models that provide Generative AI on Vertex AI support. See Generative AI foundational model reference .
- Learn about Vertex AI locations .