Skip to content
Cerebras

Gpt Oss 120b

Gpt Oss 120b is available via Cerebras with a 131K context window and up to 32,768 output tokens. Pricing: $0.3500/1M input tokens, $0.7500/1M output tokens.

Gpt Oss 120b Pricing & Specifications

Input Price$0.35 per 1M tokens
Output Price$0.75 per 1M tokens
Context Window131,072 tokens (131K)
Max Output32,768 tokens
ProviderCerebras

What is Gpt Oss 120b?

Gpt Oss 120b is a large language model by Cerebras with a 131K context window and up to 32,768 output tokens. It costs $0.35 per 1M input tokens and $0.75 per 1M output tokens. Gpt Oss 120b is available via Cerebras with a 131K context window and up to 32,768 output tokens. Pricing: $0.3500/1M input tokens, $0.7500/1M output tokens.

Capabilities

text function calling reasoning json mode

Gpt Oss 120b Cost Examples

Short prompt (500 tokens)

$0.000175

Medium prompt (2K tokens)

$0.00070

Long output (4K tokens)

$0.00300

Count tokens for Gpt Oss 120b

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Gpt Oss 120b

Cerebras

Qwen 3 32b

$0.40/1M in 128K ctx

Cerebras

Llama3.1 8b

$0.10/1M in 128K ctx

Cerebras

Llama3.1 70b

$0.60/1M in 128K ctx

Cerebras

Llama 3.3 70b

$0.85/1M in 128K ctx

Frequently Asked Questions

How much does Gpt Oss 120b cost per token? +
Gpt Oss 120b costs $0.35 per 1M input tokens and $0.75 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000725.
What is the context window for Gpt Oss 120b? +
Gpt Oss 120b supports a context window of 131,072 tokens (131K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Gpt Oss 120b? +
Gpt Oss 120b can generate up to 32,768 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Gpt Oss 120b good for coding tasks? +
Yes, Gpt Oss 120b supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Cerebras Models