Llama 3.3 70b Versatile

Llama 3.3 70b Versatile is available via Groq with a 128K context window and up to 32,768 output tokens. Pricing: $0.59 per 1M input tokens and $0.79 per 1M output tokens.

Llama 3.3 70b Versatile Pricing & Specifications

Input Price: $0.59 per 1M tokens
Output Price: $0.79 per 1M tokens
Context Window: 128,000 tokens (128K)
Max Output: 32,768 tokens
Provider: Groq

What is Llama 3.3 70b Versatile?

Llama 3.3 70b Versatile is a large language model developed by Meta and served via Groq, with a 128K context window and up to 32,768 output tokens. It costs $0.59 per 1M input tokens and $0.79 per 1M output tokens.

Capabilities

Text, function calling

Llama 3.3 70b Versatile Cost Examples

Short prompt (500 tokens)

$0.000295

Medium prompt (2K tokens)

$0.00118

Long output (4K tokens)

$0.00316
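
The figures above follow directly from the per-token prices. As a minimal sketch (the function name `estimate_cost` and the constant names are illustrative, not part of any Groq SDK), the arithmetic can be reproduced like this:

```python
# Prices taken from the specification table, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.59
OUTPUT_PRICE_PER_M = 0.79


def estimate_cost(input_tokens: int, output_tokens: int = 0) -> float:
    """Return the estimated USD cost of a single request."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    print(f"Short prompt (500 input tokens):  ${estimate_cost(500):.6f}")
    print(f"Medium prompt (2K input tokens):  ${estimate_cost(2_000):.6f}")
    print(f"Long output (4K output tokens):   ${estimate_cost(0, 4_000):.6f}")
```

Here "2K" and "4K" are taken to mean 2,000 and 4,000 tokens, which is what the listed dollar amounts imply.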

Similar Models to Llama 3.3 70b Versatile

Qwen/Qwen3 32b (Groq): $0.29/1M input, 131K context

Meta Llama/Llama Guard 4 12b (Groq): $0.20/1M input, 8K context

Meta Llama/Llama 4 Maverick 17b 128e Instruct (Groq): $0.20/1M input, 131K context

Moonshotai/Kimi K2 Instruct 0905 (Groq): $1.00/1M input, 262K context

Frequently Asked Questions

How much does Llama 3.3 70b Versatile cost per token?
Llama 3.3 70b Versatile costs $0.59 per 1M input tokens and $0.79 per 1M output tokens. For a typical request with 1,000 input tokens and a 500-token response, that works out to $0.00059 + $0.000395 = $0.000985.
What is the context window for Llama 3.3 70b Versatile?
Llama 3.3 70b Versatile supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Llama 3.3 70b Versatile?
Llama 3.3 70b Versatile can generate up to 32,768 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Llama 3.3 70b Versatile good for coding tasks?
Yes. Its listed capabilities (text and function calling) are suited to coding tasks such as code generation, debugging, and refactoring.
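
The context-window and max-output limits from the answers above can be checked before a request is sent. The sketch below is illustrative only: `fits_limits` is a hypothetical helper, and it assumes that requested output tokens count toward the same 128,000-token window as the prompt, which is common for LLM APIs but is not stated on this page.

```python
# Limits taken from the specification table.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 32_768


def fits_limits(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Return True if a request stays within both limits.

    Assumption: prompt and output tokens share one context window.
    """
    if max_output_tokens > MAX_OUTPUT:
        return False
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    print(fits_limits(100_000, 20_000))  # within both limits
    print(fits_limits(100_000, 32_768))  # exceeds the combined window
    print(fits_limits(1_000, 40_000))    # exceeds the max output
```

If the check fails, the usual options are to shorten the prompt or history, or to lower the requested output length.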