Model deprecations (MaaS)

After a period of time, MaaS models are deprecated and typically replaced with newer model versions. To provide you with time to test and migrate to newer models, this page lists all models that are deprecated along with their shutdown date.
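As a rough aid for planning migrations, the shutdown dates on this page can be collected into a lookup table and compared against a given date. The following is a minimal sketch, not an official tool: the `SHUTDOWN_DATES` table and the `days_until_shutdown` helper are hypothetical names introduced here for illustration, and the dates are copied by hand from the entries on this page.

```python
from datetime import date

# Shutdown dates copied from this page, keyed by model ID.
# This table is illustrative and must be kept in sync manually.
SHUTDOWN_DATES = {
    "claude-3-5-sonnet-v2@20241022": date(2026, 2, 19),
    "claude-3-5-sonnet@20240620": date(2026, 2, 19),
    "jamba-1.5-large": date(2026, 2, 27),
    "jamba-1.5-mini": date(2026, 2, 27),
    "mistral-nemo": date(2025, 8, 20),
    "claude-3-opus@20240229": date(2025, 8, 1),
}


def days_until_shutdown(model_id, today):
    """Return the days left before shutdown (negative once past).

    Returns None when the model ID is not in the table above.
    """
    shutdown = SHUTDOWN_DATES.get(model_id)
    if shutdown is None:
        return None
    return (shutdown - today).days
```

For example, `days_until_shutdown("claude-3-opus@20240229", date(2025, 7, 1))` gives the number of days between July 1, 2025 and that model's shutdown date, and an unlisted model ID returns `None`.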

Claude 3.5 Sonnet v2

Claude 3.5 Sonnet v2 is deprecated as of August 20, 2025 and will be shut down on February 19, 2026. Claude 3.5 Sonnet v2 is available to existing customers only.

Claude 3.5 Sonnet v2 is a state-of-the-art model for real-world software engineering tasks and agentic capabilities.

Try in Vertex AI
View model card in Model Garden

Property
Description
Model ID
claude-3-5-sonnet-v2@20241022
Token limits
Maximum input tokens: 200,000
Maximum output tokens: 8,000
Capabilities
Technical specifications
Images
  • Limitation and specifications: See Vision in Anthropic's documentation
Documents
  • Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
August 2024
Versions
  • claude-3-5-sonnet-v2@20241022
    • Launch stage: Generally available
    • Release date: October 22, 2024
Supported regions

Model availability

(Includes fixed quota & Provisioned Throughput)

United States

  • us-east5

Europe

  • europe-west1

Global

  • global endpoint

ML processing

United States

  • Multi-region

Europe

  • Multi-region
Quota limits

us-east5:

  • QPM: 90
  • TPM: 540,000 (input and output)
  • Context length: 200,000

europe-west1:

  • QPM: 55
  • TPM: 330,000 (input and output)
  • Context length: 200,000

global endpoint:

  • QPM: 25
  • TPM: 140,000 (input and output)
  • Context length: 200,000
Pricing
See Pricing.
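
The QPM (queries per minute) and TPM (tokens per minute) quota limits listed above can be used for a quick capacity estimate before sending traffic to a region. The sketch below is an assumption-laden approximation: `fits_quota` is a hypothetical helper, and it assumes that a steady workload is within quota when both its request rate and its combined input and output token rate stay at or below the listed values. How the service actually measures and enforces quota is not described on this page.

```python
def fits_quota(requests_per_minute, avg_tokens_per_request, qpm_limit, tpm_limit):
    """Estimate whether a steady workload stays within QPM and TPM limits.

    avg_tokens_per_request is assumed to count input plus output tokens,
    matching the "(input and output)" note on the TPM values.
    """
    tokens_per_minute = requests_per_minute * avg_tokens_per_request
    return requests_per_minute <= qpm_limit and tokens_per_minute <= tpm_limit


# Limits listed above for Claude 3.5 Sonnet v2 in us-east5.
US_EAST5_QPM = 90
US_EAST5_TPM = 540_000

# 60 requests per minute at 8,000 tokens each is 480,000 tokens per minute.
within_limits = fits_quota(60, 8_000, US_EAST5_QPM, US_EAST5_TPM)

# 90 requests per minute at 8,000 tokens each is 720,000 tokens per minute.
over_tpm = fits_quota(90, 8_000, US_EAST5_QPM, US_EAST5_TPM)
```

With these numbers, the first workload fits under both limits, while the second is within QPM but exceeds TPM. The same check applies to the other regions and models on this page by substituting their listed values.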

Claude 3.5 Sonnet

Claude 3.5 Sonnet is deprecated as of August 20, 2025 and will be shut down on February 19, 2026. Claude 3.5 Sonnet is available to existing customers only.

Claude 3.5 Sonnet outperforms Anthropic's Claude 3 Opus on a wide range of Anthropic's evaluations with the speed and cost of Anthropic's mid-tier model, Claude 3 Sonnet.

View model card in Model Garden

Property
Description
Model ID
claude-3-5-sonnet@20240620
Token limits
Maximum input tokens: 200,000
Maximum output tokens: 8,000
Capabilities
Technical specifications
Images
  • Limitation and specifications: See Vision in Anthropic's documentation
Documents
  • Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
April 2024
Versions
  • claude-3-5-sonnet@20240620
    • Launch stage: Generally available
    • Release date: June 20, 2024
Supported regions

Model availability

(Includes fixed quota & Provisioned Throughput)

United States

  • us-east5

Europe

  • europe-west1

Asia Pacific

  • asia-southeast1

ML processing

United States

  • Multi-region

Europe

  • Multi-region

Asia Pacific

  • asia-southeast1
Quota limits

us-east5:

  • QPM: 80
  • TPM: 350,000 (input and output)
  • Context length: 200,000

europe-west1:

  • QPM: 130
  • TPM: 600,000 (input and output)
  • Context length: 200,000

asia-southeast1:

  • QPM: 35
  • TPM: 150,000 (input and output)
  • Context length: 200,000
Pricing
See Pricing.

Jamba 1.5 Large

Jamba 1.5 Large is deprecated as of August 27, 2025 and will be shut down on February 27, 2026. Jamba 1.5 Large is available to existing customers only.

AI21 Labs's Jamba 1.5 Large is well balanced across quality, throughput, and low cost.

View model card in Model Garden

Property
Description
Model ID
jamba-1.5-large
Knowledge cutoff date
March 2024
Versions
  • jamba-1.5-large
    • Launch stage: Preview
    • Release date: August 22, 2024
Supported regions

Model availability

United States

  • us-central1

Europe

  • europe-west4

ML processing

United States

  • Multi-region
Quota limits

us-central1:

  • QPM: 20
  • TPM: 20,000
  • Context length: 256,000

europe-west4:

  • QPM: 20
  • TPM: 20,000
  • Context length: 256,000
Pricing
See Pricing.

Jamba 1.5 Mini

Jamba 1.5 Mini is deprecated as of August 27, 2025 and will be shut down on February 27, 2026. Jamba 1.5 Mini is available to existing customers only.

AI21 Labs's Jamba 1.5 Mini is well balanced across quality, throughput, and low cost.

View model card in Model Garden

Property
Description
Model ID
jamba-1.5-mini
Knowledge cutoff date
March 2024
Versions
  • jamba-1.5-mini
    • Launch stage: Preview
    • Release date: August 22, 2024
Supported regions

Model availability

United States

  • us-central1

Europe

  • europe-west4

ML processing

United States

  • Multi-region
Quota limits

us-central1:

  • QPM: 50
  • TPM: 60,000
  • Context length: 256,000

europe-west4:

  • QPM: 50
  • TPM: 60,000
  • Context length: 256,000
Pricing
See Pricing.

Mistral Nemo

Mistral Nemo is deprecated as of June 30, 2025 and will be shut down on August 20, 2025. Mistral Nemo is available to existing customers only.

Mistral Nemo is Mistral AI's most cost-efficient proprietary model. Use Mistral Nemo for low-latency workloads and basic tasks that can be done in bulk, such as classification, customer support, and text generation.

View model card in Model Garden

Property
Description
Model ID
mistral-nemo
Versions
  • mistral-nemo
    • Launch stage: Deprecated
    • Release date: July 24, 2024
Supported regions

Model availability

United States

  • us-central1

Europe

  • europe-west4

ML processing

United States

  • Multi-region

Europe

  • Multi-region
Quota limits

us-central1:

  • QPM: 60
  • TPM: 400,000
  • Context length: 128,000

europe-west4:

  • QPM: 60
  • TPM: 400,000
  • Context length: 128,000
Pricing
See Pricing.

Claude 3 Opus

Anthropic's Claude 3 Opus is deprecated as of June 30, 2025 and will be shut down on August 1, 2025. Claude 3 Opus is available to existing customers only.

Anthropic's Claude 3 Opus is a powerful AI model with top-level performance on highly complex tasks. It can navigate open-ended prompts and sight-unseen scenarios with remarkable fluency and human-like understanding. Claude 3 Opus is optimized for the following use cases:

  • Task automation, such as interactive coding and planning, or running complex actions across APIs and databases.

  • Research and development tasks, such as research review, brainstorming and hypothesis generation, and product testing.

  • Strategy tasks, such as advanced analysis of charts and graphs, financials and market trends, and forecasting.

  • Vision tasks, such as processing images to return text output and analyzing charts, graphs, technical diagrams, reports, and other visual content.

View model card in Model Garden

Property
Description
Model ID
claude-3-opus@20240229
Token limits
Maximum input tokens: 200,000
Maximum output tokens: 8,000
Capabilities
Technical specifications
Images
  • Limitation and specifications: See Vision in Anthropic's documentation
Documents
  • Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date
August 2023
Versions
  • claude-3-opus@20240229
    • Launch stage: Deprecated
    • Release date: May 31, 2024
Supported regions

Model availability

(Includes fixed quota & Provisioned Throughput)

United States

  • us-east5

ML processing

United States

  • Multi-region
Quota limits

us-east5:

  • QPM: 20
  • TPM: 105,000 (input and output)
  • Context length: 200,000
Pricing
See Pricing.