Anthropic's fastest vision and text model for near-instant responses to basic queries, meant for seamless AI experiences mimicking human interactions.
View model card in Model Garden
Property
Description
Model ID
claude-3-haiku@20240307
Token limits
Capabilities
- Batch predictions Not supported
- Prompt caching Supported
- Function calling Supported
- Extended thinking Not supported
- Count tokens Supported
Technical specifications
Images
- Limitation and specifications: See Vision in Anthropic's documentation
Documents
- Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
August 2023
Versions
-
claude-3-haiku@20240307
- Launch stage: Generally available
- Release date: March 19, 2024
Supported regions
Model availability
(Includes fixed quota & Provisioned Throughput)
United States
-
us-east5
Europe
-
europe-west1
Asia pacific
-
asia-southeast1
ML processing
United States
-
Multi-region
Europe
-
Multi-region
Asia pacific
-
asia-southeast1
Quota limits
us-east5:
- QPM: 245
- TPM: 600,000 (input and output)
- Context length: 200,000
europe-west1:
- QPM: 75
- TPM: 181,000 (input and output)
- Context length: 200,000
asia-southeast1:
- QPM: 70
- TPM: 174,000 (input and output)
- Context length: 200,000
Pricing