OpenAI gpt-oss 120B

OpenAI gpt-oss 120B is a 120B open-weight language model released under the Apache 2.0 license. It is well-suited for reasoning and function calling use cases. The model is optimized for deployment on consumer hardware.

The 120B model achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running on a single 80GB GPU.

Managed API (MaaS) specifications

View model card in Model Garden

Model ID

gpt-oss-120b-maas

Launch stage

Supported inputs & outputs

Inputs:
Text
Outputs:
Text

Capabilities

Supported

Not supported

Usage types

Supported

Not supported

Fixed quota

Versions

gpt-oss-120b-maas

Launch stage: GA
Release date: August 13, 2025

Supported regions

Model availability

Global

global endpoint

United States

us-central1

ML processing

United States

Multi-region

Limits

global endpoint:

Max output: 131,072
Context length: 131,072

us-central1:

Max output: 131,072
Context length: 131,072

Pricing

See Pricing .

Deploy as a self-deployed model

To self-deploy the model, navigate to the gpt-oss 120B model card in the Model Garden console and click Deploy model. For more information about deploying and using partner models, see Deploy a partner model and make prediction requests .

OpenAI gpt-oss 120B Stay organized with collections Save and categorize content based on your preferences.

Managed API (MaaS) specifications

Deploy as a self-deployed model

OpenAI gpt-oss 120B