Skip to content

Cerebras Models

Cerebras provides 7 AI models accessible via API.

Visit Cerebras →

7

Models Available

$0.10

Cheapest Input / 1M

131K

Largest Context

What is Cerebras?

Cerebras is an AI model provider offering 7 large language models for developers. Their cheapest model starts at $0.10 per 1M input tokens, and their largest context window reaches 131K. Cerebras provides 7 AI models accessible via API.

Cerebras Strengths

All Cerebras Models

Model Input $/1M Output $/1M Context Max Output Released
Llama3.1 8b $0.10 $0.10 128K 128,000
Gpt Oss 120b $0.35 $0.75 131K 32,768
Qwen 3 32b $0.40 $0.80 128K 128,000
Llama3.1 70b $0.60 $0.60 128K 128,000
Llama 3.3 70b $0.85 $1.20 128K 128,000
Zai Glm 4.6 $2.25 $2.75 128K 128,000
Zai Glm 4.7 $2.25 $2.75 128K 128,000

Model Details

Llama3.1 8b

Llama3.1 8b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 128K
text function calling

Gpt Oss 120b

Gpt Oss 120b is available via Cerebras with a 131K context window and up to 32,768 output tokens. Pricing: $0.3500/1M input tokens, $0.7500/1M output tokens.

Input: $0.35/1M Output: $0.75/1M Context: 131K
text function calling reasoning json mode

Qwen 3 32b

Qwen 3 32b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.4000/1M input tokens, $0.8000/1M output tokens.

Input: $0.40/1M Output: $0.80/1M Context: 128K
text function calling reasoning

Llama3.1 70b

Llama3.1 70b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.6000/1M input tokens, $0.6000/1M output tokens.

Input: $0.60/1M Output: $0.60/1M Context: 128K
text function calling

Llama 3.3 70b

Llama 3.3 70b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.8500/1M input tokens, $1.20/1M output tokens.

Input: $0.85/1M Output: $1.20/1M Context: 128K
text function calling

Zai Glm 4.6

Zai Glm 4.6 is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $2.25/1M input tokens, $2.75/1M output tokens.

Input: $2.25/1M Output: $2.75/1M Context: 128K
text function calling reasoning

Zai Glm 4.7

Zai Glm 4.7 is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $2.25/1M input tokens, $2.75/1M output tokens.

Input: $2.25/1M Output: $2.75/1M Context: 128K
text function calling reasoning

Compare Cerebras model pricing

Use our pricing calculator to find the cheapest Cerebras model for your workload.

Pricing Calculator Compare Models All Models Directory

Related Reading

OpenAI vs Anthropic vs Google: Which AI API Should You Choose? → Cheapest LLM API in 2026: Complete Pricing Comparison → OpenAI API Pricing Guide 2026 →