Claude 3.5 Haiku is optimized for use cases where speed and affordability matter.
Try in Vertex AI View model card in Model Garden
Model ID
claude-3-5-haiku@20241022
Launch stage
GA
Supported inputs & outputs
- Inputs:
Text , Code , Images - Outputs:
Text
Token limits
- Maximum input tokens: 200,000
- Maximum output tokens: 8,000
Capabilities
- Supported
- Not supported
Usage types
- Supported
- Not supported
Technical specifications
Images
- Limitation and specifications: See Vision in Anthropic's documentation
Documents
- Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
July 2024
Versions
-
claude-3-5-haiku@20241022 - Launch stage: Generally available
- Release date: October 22, 2024
Supported regions
Model availability
(Includes fixed quota & Provisioned Throughput)
- United States
-
us-east5 - Europe
-
europe-west1
ML processing
- United States
-
Multi-region - Europe
-
Multi-region
Quota limits
us-east5:
- QPM: 80
- TPM: 350,000 (input and output)
- Context length: 200,000
europe-west1:
- QPM: 90
- TPM: 400,000 (input and output)
- Context length: 200,000
Pricing

