Claude 3.5 Haiku is optimized for use cases where speed and affordability matter.
Try in Vertex AI View model card in Model Garden
Model ID
 
 claude-3-5-haiku@20241022 
Launch stage
 
 GA
 
Supported inputs & outputs
 
 - Inputs: Text , Code , Images 
- Outputs: Text 
Token limits
 
 - Maximum input tokens: 200,000
- Maximum output tokens: 8,000
Capabilities
 
 - Supported
- Not supported
Usage types
 
 - Supported
- Not supported
Technical specifications
 
Images
 
 - Limitation and specifications: See Vision in Anthropic's documentation
Documents
 
 - Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
 
 July 2024
 
Versions
 
 -  claude-3-5-haiku@20241022
- Launch stage: Generally available
- Release date: October 22, 2024
Supported regions
 
Model availability
(Includes fixed quota & Provisioned Throughput)
- United States
-  us-east5
- Europe
-  europe-west1
ML processing
- United States
-  Multi-region
- Europe
-  Multi-region
Quota limits
 
 us-east5:
- QPM: 80
- TPM: 350,000 (input and output)
- Context length: 200,000
europe-west1:
- QPM: 90
- TPM: 400,000 (input and output)
- Context length: 200,000
Pricing
 
  

