Claude 3.5 Haiku, the next generation of Anthropic's fastest and most cost-effective model, is optimal for use cases where speed and affordability matter.
Try in Vertex AI View model card in Model Garden
Property
Description
Model ID
claude-3-5-haiku@20241022
Token limits
Capabilities
- Batch predictions Supported
- Prompt caching Supported
- Function calling Supported
- Extended thinking Not supported
- Count tokens Supported
Technical specifications
Images
- Limitation and specifications: See Vision in Anthropic's documentation
Documents
- Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
July 2024
Versions
-
claude-3-5-haiku@20241022
- Launch stage: Generally available
- Release date: October 22, 2024
Supported regions
Model availability
(Includes fixed quota & Provisioned Throughput)
United States
-
us-east5
Europe
-
europe-west1
ML processing
United States
-
Multi-region
Europe
-
Multi-region
Quota limits
us-east5:
- QPM: 80
- TPM: 350,000 (input and output)
- Context length: 200,000
europe-west1:
- QPM: 90
- TPM: 400,000 (input and output)
- Context length: 200,000
Pricing