OpenAI OSS

gpt-oss-120b

GPT-OSS-120B

GPT-OSS-120B is available after commercial review or workspace setup.

Params: 120B
Context: 128K
Max Output: 32K
License: Apache-2.0
API Surface: /v1/chat/completions
Access: Access review

Why pick it

  • Access is enabled only after workspace setup or commercial review.
  • Pricing appears only when verified by the live service catalog.

Pricing

Tier      Public            Cached  Note
Realtime  $0.021 / $0.091   $0.007  Current published price
Batch     $0.021 / $0.091   $0.007  Published batch rates appear here when available

Pricing is synced from the live BatchIn catalog.
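
The rates above can be turned into a rough per-request estimate. The sketch below is illustrative only and rests on assumptions the page does not state: that the two "Public" figures are input and output prices, that "Cached" applies to cached input tokens, and that all prices are in USD per 1M tokens. The names `Rates`, `REALTIME`, and `estimate_cost` are hypothetical helpers, not part of any BatchIn SDK.

```python
from dataclasses import dataclass

# Assumption: listed prices are USD per 1,000,000 tokens (the page gives no unit).
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class Rates:
    """Hypothetical container for one pricing tier."""
    input_usd: float         # assumed: first "Public" figure
    output_usd: float        # assumed: second "Public" figure
    cached_input_usd: float  # assumed: "Cached" applies to cached input tokens


# Values copied from the Realtime row of the pricing table.
REALTIME = Rates(input_usd=0.021, output_usd=0.091, cached_input_usd=0.007)


def estimate_cost(rates: Rates, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Return an estimated request cost in USD under the assumptions above."""
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * rates.input_usd
             + cached_tokens * rates.cached_input_usd
             + output_tokens * rates.output_usd)
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # 200K input tokens (half served from cache) and 20K output tokens.
    print(f"${estimate_cost(REALTIME, 200_000, 20_000, cached_tokens=100_000):.6f}")
```

Confirm the units and the meaning of each column against the live catalog before relying on these numbers.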

Quick start

This model is not yet open for public testing. Contact the team for the right access path.

Python
import os

from openai import OpenAI

# Read the key from the environment instead of hard-coding it.
client = OpenAI(
    base_url="https://api.batchin.tech/v1",
    api_key=os.environ["BATCHIN_API_KEY"],
)

resp = client.chat.completions.create(
    model="gpt-oss-120b",
    messages=[{"role": "user", "content": "Summarize why this model is a fit for my workload"}]
)

print(resp.choices[0].message.content)

JavaScript
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.batchin.tech/v1",
  apiKey: process.env.BATCHIN_API_KEY,
});

const resp = await client.chat.completions.create({
  model: "gpt-oss-120b",
  messages: [{ role: "user", content: "Summarize why this model is a fit for my workload" }],
});

console.log(resp.choices[0]?.message?.content);

cURL
curl https://api.batchin.tech/v1/chat/completions \
  -H "Authorization: Bearer $BATCHIN_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-oss-120b",
    "messages": [{"role":"user","content":"Summarize why this model is a fit for my workload"}]
  }'

Specs

Architecture: Mixture-of-Experts (MoE) Transformer
Vendor: OpenAI OSS
Context window: 128K
Max output: 32K
Best for: oss, featured
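
The context window and max output limits interact when choosing an output cap for a request. The following is a minimal sketch, assuming that "128K" and "32K" mean 131,072 and 32,768 tokens and that prompt and output tokens share the same context window; neither detail is confirmed by this page. `clamp_max_tokens` is a hypothetical helper, and the prompt token count must come from your own tokenizer.

```python
# Assumption: "128K" and "32K" are binary thousands (K = 1024 tokens).
CONTEXT_WINDOW = 128 * 1024  # 131,072 tokens
MAX_OUTPUT = 32 * 1024       # 32,768 tokens


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return the largest output cap that fits both listed limits.

    Assumes prompt and output tokens count against the same context window.
    """
    if prompt_tokens < 0 or requested <= 0:
        raise ValueError("prompt_tokens must be >= 0 and requested must be > 0")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt already fills the context window")
    return min(requested, MAX_OUTPUT, remaining)


if __name__ == "__main__":
    print(clamp_max_tokens(1_000, 50_000))    # capped by the max output limit
    print(clamp_max_tokens(120_000, 32_768))  # capped by the remaining context
```

The returned value could then be passed as the output cap on a chat completion request; the exact parameter the service honors is not stated here.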

Related models

  • GPT-OSS-20B (gpt-oss-20b), OpenAI OSS: available after commercial review or workspace setup.
  • Qwen3-Next-80B-A3B (qwen3-next-80b-a3b), Qwen / Alibaba: available after commercial review or workspace setup.
  • Llama 3.3 70B (llama-3-3-70b), Meta: proven Meta route for steady general-purpose inference.
  • DeepSeek V4 Flash (deepseek-v4-flash), DeepSeek: available after commercial review or workspace setup.