Skip to content
Lemonade

Gpt Oss 120b Mxfp GGUF

Gpt Oss 120b Mxfp GGUF is available via Lemonade with a 131K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

Gpt Oss 120b Mxfp GGUF Pricing & Specifications

Input Price$0.000 per 1M tokens
Output Price$0.000 per 1M tokens
Context Window131,072 tokens (131K)
Max Output32,768 tokens
ProviderLemonade

What is Gpt Oss 120b Mxfp GGUF?

Gpt Oss 120b Mxfp GGUF is a large language model by Lemonade with a 131K context window and up to 32,768 output tokens. It costs $0.000 per 1M input tokens and $0.000 per 1M output tokens. Gpt Oss 120b Mxfp GGUF is available via Lemonade with a 131K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

Capabilities

text function calling json mode

Gpt Oss 120b Mxfp GGUF Cost Examples

Short prompt (500 tokens)

$0.000000

Medium prompt (2K tokens)

$0.00000

Long output (4K tokens)

$0.00000

Count tokens for Gpt Oss 120b Mxfp GGUF

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Gpt Oss 120b Mxfp GGUF

Lemonade

Qwen3 Coder 30B A3B Instruct GGUF

$0.000/1M in 262K ctx

Lemonade

Gpt Oss 20b Mxfp4 GGUF

$0.000/1M in 131K ctx

Lemonade

Gemma 3 4b It GGUF

$0.000/1M in 128K ctx

Lemonade

Qwen3 4B Instruct 2507 GGUF

$0.000/1M in 262K ctx

Frequently Asked Questions

How much does Gpt Oss 120b Mxfp GGUF cost per token? +
Gpt Oss 120b Mxfp GGUF costs $0.000 per 1M input tokens and $0.000 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000000.
What is the context window for Gpt Oss 120b Mxfp GGUF? +
Gpt Oss 120b Mxfp GGUF supports a context window of 131,072 tokens (131K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Gpt Oss 120b Mxfp GGUF? +
Gpt Oss 120b Mxfp GGUF can generate up to 32,768 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Gpt Oss 120b Mxfp GGUF good for coding tasks? +
Yes, Gpt Oss 120b Mxfp GGUF supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Lemonade Models