OpenAI gpt-oss 120B is a 120B open-weight language model released under the Apache 2.0 license. It is well-suited for reasoning and function calling use cases. The model is optimized for deployment on consumer hardware.
The 120B model achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running on a single 80GB GPU.
Managed API (MaaS) specifications
View model card in Model Garden
gpt-oss-120b-maas
- Inputs:
Text - Outputs:
Text
- Supported
- Not supported
- Supported
- Not supported
-
gpt-oss-120b-maas - Launch stage: GA
- Release date: August 13, 2025
Model availability
- Global
-
global endpoint - United States
-
us-central1
ML processing
- United States
-
Multi-region
global endpoint:
- Max output: 131,072
- Context length: 131,072
us-central1:
- Max output: 131,072
- Context length: 131,072
Deploy as a self-deployed model
To self-deploy the model, navigate to the gpt-oss 120B model card in the Model Garden console and click Deploy model. For more information about deploying and using partner models, see Deploy a partner model and make prediction requests .

