Question 1

How much does Llama 3.3 70b Versatile cost per token?

Accepted Answer

Llama 3.3 70b Versatile costs $0.59 per 1M input tokens and $0.79 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000985.

Question 2

What is the context window for Llama 3.3 70b Versatile?

Accepted Answer

Llama 3.3 70b Versatile supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.

Question 3

What is the maximum output length for Llama 3.3 70b Versatile?

Accepted Answer

Llama 3.3 70b Versatile can generate up to 32,768 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.

Question 4

Is Llama 3.3 70b Versatile good for coding tasks?

Accepted Answer

Yes, Llama 3.3 70b Versatile supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.

Input Price	$0.59 per 1M tokens
Output Price	$0.79 per 1M tokens
Context Window	128,000 tokens (128K)
Max Output	32,768 tokens
Provider	Groq

Llama 3.3 70b Versatile

Llama 3.3 70b Versatile Pricing & Specifications

What is Llama 3.3 70b Versatile?

Capabilities

Llama 3.3 70b Versatile Cost Examples

Count tokens for Llama 3.3 70b Versatile

Similar Models to Llama 3.3 70b Versatile

Qwen/Qwen3 32b

Meta Llama/Llama Guard 4 12b

Meta Llama/Llama 4 Maverick 17b 128e Instruct

Moonshotai/Kimi K2 Instruct 0905

Frequently Asked Questions