Anthropic's mid-size model with superior intelligence for high-volume uses, such as coding, in-depth research, and agents.
Try in Vertex AI View model card in Model Garden
claude-sonnet-4
- Inputs:
Text , Code , Images - Outputs:
Text
- Maximum input tokens: 1M (Preview)
200,000 (GA) - Maximum output tokens: 64,000
- Supported
- Not supported
- Supported
- Not supported
- Limitation and specifications: See Vision in Anthropic's documentation
- Limitation and specifications: See PDF support in Anthropic's documentation
-
claude-sonnet-4 - Launch stage: Generally available
- Release date: May 22, 2025
- Retirement date not sooner than: May 14, 2026
Model availability
(Includes fixed quota & Provisioned Throughput)
- United States
-
us-east5 - Europe
-
europe-west1 - Global
-
global endpoint
ML processing
- United States
-
Multi-region - Europe
-
Multi-region
us-east5:
- QPM: 35
- Input TPM: 280,000 uncached and cache write
- Output TPM: 20,000
- Context length: 1,000,000
europe-west1:
- QPM: 25
- Input TPM: 180,000 uncached and cache write
- Output TPM: 20,000
- Context length: 1,000,000
global endpoint:
- QPM: 35
- Input TPM: 276,000 uncached and cache write
- Output TPM: 24,000
- Context length: 1,000,000

