Anthropic's mid-size model with superior intelligence for high-volume uses, such as coding, in-depth research, and agents.
Try in Vertex AI View model card in Model Garden
Model ID
 
 claude-sonnet-4@20250514 
Launch stage
 
 GA
 
Supported inputs & outputs
 
 - Inputs: Text , Code , Images 
- Outputs: Text 
Token limits
 
 - Maximum input tokens: 1M (Preview)
 200,000 (GA)
- Maximum output tokens: 64,000
Capabilities
 
 - Supported
- Not supported
Usage types
 
 - Supported
- Not supported
Technical specifications
 
Images
 
 - Limitation and specifications: See Vision in Anthropic's documentation
Documents
 
 - Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
 
 March 2025
 
Versions
 
 -  claude-sonnet-4@20250514
- Launch stage: Generally available
- Release date: May 22, 2025
Supported regions
 
Model availability
(Includes fixed quota & Provisioned Throughput)
- United States
-  us-east5
- Europe
-  europe-west1
- Asia pacific
-  asia-east1
- Global
-  global endpoint
ML processing
- United States
-  Multi-region
- Europe
-  Multi-region
- Asia pacific
-  asia-east1
Quota limits
 
 us-east5:
- QPM: 35
- Context length: 1,000,000
europe-west1:
- QPM: 25
- Context length: 1,000,000
asia-east1:
- QPM: 70
- Context length: 1,000,000
global endpoint:
- QPM: 35
- Context length: 1,000,000
Pricing
 
  

