Skip to content

SambaNova Models

SambaNova provides 16 AI models accessible via API.

Visit SambaNova →

16

Models Available

$0.040

Cheapest Input / 1M

131K

Largest Context

What is SambaNova?

SambaNova is an AI model provider offering 16 large language models for developers. Their cheapest model starts at $0.040 per 1M input tokens, and their largest context window reaches 131K. SambaNova provides 16 AI models accessible via API.

SambaNova Strengths

All SambaNova Models

Model Input $/1M Output $/1M Context Max Output Released
Meta Llama 3.2 1B Instruct $0.040 $0.080 16K 16,384
Meta Llama 3.2 3B Instruct $0.080 $0.16 4K 4,096
Meta Llama 3.1 8B Instruct $0.10 $0.20 16K 16,384
Meta Llama Guard 3 8B $0.30 $0.30 16K 16,384
Llama 4 Scout 17B 16E Instruct $0.40 $0.70 8K 8,192
Qwen3 32B $0.40 $0.80 8K 8,192
QwQ 32B $0.50 $1.00 16K 16,384
Qwen2 Audio 7B Instruct $0.50 $100.00 4K 4,096
Meta Llama 3.3 70B Instruct $0.60 $1.20 131K 131,072
Llama 4 Maverick 17B 128E Instruct $0.63 $1.80 131K 131,072
DeepSeek R1 Distill Llama 70B $0.70 $1.40 131K 131,072
DeepSeek V3 0324 $3.00 $4.50 33K 32,768
DeepSeek V3.1 $3.00 $4.50 33K 32,768
Gpt Oss 120b $3.00 $4.50 131K 131,072
DeepSeek R1 $5.00 $7.00 33K 32,768
Meta Llama 3.1 405B Instruct $5.00 $10.00 16K 16,384

Model Details

Meta Llama 3.2 1B Instruct

Meta Llama 3.2 1B Instruct is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $0.0400/1M input tokens, $0.0800/1M output tokens.

Input: $0.040/1M Output: $0.080/1M Context: 16K
text

Meta Llama 3.2 3B Instruct

Meta Llama 3.2 3B Instruct is available via SambaNova with a 4K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.1600/1M output tokens.

Input: $0.080/1M Output: $0.16/1M Context: 4K
text

Meta Llama 3.1 8B Instruct

Meta Llama 3.1 8B Instruct is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.2000/1M output tokens.

Input: $0.10/1M Output: $0.20/1M Context: 16K
text function calling json mode

Meta Llama Guard 3 8B

Meta Llama Guard 3 8B is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $0.3000/1M input tokens, $0.3000/1M output tokens.

Input: $0.30/1M Output: $0.30/1M Context: 16K
text

Llama 4 Scout 17B 16E Instruct

Llama 4 Scout 17B 16E Instruct is available via SambaNova with a 8K context window and up to 8,192 output tokens. Pricing: $0.4000/1M input tokens, $0.7000/1M output tokens.

Input: $0.40/1M Output: $0.70/1M Context: 8K
text function calling json mode

Qwen3 32B

Qwen3 32B is available via SambaNova with a 8K context window and up to 8,192 output tokens. Pricing: $0.4000/1M input tokens, $0.8000/1M output tokens.

Input: $0.40/1M Output: $0.80/1M Context: 8K
text function calling reasoning

QwQ 32B

QwQ 32B is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $0.5000/1M input tokens, $1.00/1M output tokens.

Input: $0.50/1M Output: $1.00/1M Context: 16K
text

Qwen2 Audio 7B Instruct

Qwen2 Audio 7B Instruct is available via SambaNova with a 4K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $100.00/1M output tokens.

Input: $0.50/1M Output: $100.00/1M Context: 4K
text audio

Meta Llama 3.3 70B Instruct

Meta Llama 3.3 70B Instruct is available via SambaNova with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $1.20/1M output tokens.

Input: $0.60/1M Output: $1.20/1M Context: 131K
text function calling json mode

Llama 4 Maverick 17B 128E Instruct

Llama 4 Maverick 17B 128E Instruct is available via SambaNova with a 131K context window and up to 131,072 output tokens. Pricing: $0.6300/1M input tokens, $1.80/1M output tokens.

Input: $0.63/1M Output: $1.80/1M Context: 131K
text vision function calling json mode

DeepSeek R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is available via SambaNova with a 131K context window and up to 131,072 output tokens. Pricing: $0.7000/1M input tokens, $1.40/1M output tokens.

Input: $0.70/1M Output: $1.40/1M Context: 131K
text

DeepSeek V3 0324

DeepSeek V3 0324 is available via SambaNova with a 33K context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $4.50/1M output tokens.

Input: $3.00/1M Output: $4.50/1M Context: 33K
text function calling reasoning

DeepSeek V3.1

DeepSeek V3.1 is available via SambaNova with a 33K context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $4.50/1M output tokens.

Input: $3.00/1M Output: $4.50/1M Context: 33K
text function calling reasoning

Gpt Oss 120b

Gpt Oss 120b is available via SambaNova with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $4.50/1M output tokens.

Input: $3.00/1M Output: $4.50/1M Context: 131K
text function calling reasoning

DeepSeek R1

DeepSeek R1 is available via SambaNova with a 33K context window and up to 32,768 output tokens. Pricing: $5.00/1M input tokens, $7.00/1M output tokens.

Input: $5.00/1M Output: $7.00/1M Context: 33K
text

Meta Llama 3.1 405B Instruct

Meta Llama 3.1 405B Instruct is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $5.00/1M input tokens, $10.00/1M output tokens.

Input: $5.00/1M Output: $10.00/1M Context: 16K
text function calling json mode

Compare SambaNova model pricing

Use our pricing calculator to find the cheapest SambaNova model for your workload.

Pricing Calculator Compare Models All Models Directory

Related Reading

OpenAI vs Anthropic vs Google: Which AI API Should You Choose? → Cheapest LLM API in 2026: Complete Pricing Comparison → OpenAI API Pricing Guide 2026 →