Anthropic's fastest vision and text model for near-instant responses to basic queries, meant for seamless AI experiences mimicking human interactions.
View model card in Model Garden
Model ID
 
 claude-3-haiku@20240307 
Launch stage
 
 GA
 
Supported inputs & outputs
 
 - Inputs: Text , Code , Images 
- Outputs: Text 
Token limits
 
 - Maximum input tokens: 200,000
- Maximum output tokens: 8,000
Capabilities
 
 - Supported
- Not supported
Usage types
 
 - Supported
- Not supported
Technical specifications
 
Images
 
 - Limitation and specifications: See Vision in Anthropic's documentation
Documents
 
 - Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
 
 August 2023
 
Versions
 
 -  claude-3-haiku@20240307
- Launch stage: Generally available
- Release date: March 19, 2024
Supported regions
 
Model availability
(Includes fixed quota & Provisioned Throughput)
- United States
-  us-east5
- Europe
-  europe-west1
- Asia pacific
-  asia-southeast1
ML processing
- United States
-  Multi-region
- Europe
-  Multi-region
- Asia pacific
-  asia-southeast1
Quota limits
 
 us-east5:
- QPM: 245
- TPM: 600,000 (input and output)
- Context length: 200,000
europe-west1:
- QPM: 75
- TPM: 181,000 (input and output)
- Context length: 200,000
asia-southeast1:
- QPM: 70
- TPM: 174,000 (input and output)
- Context length: 200,000
Pricing
 
  

