Question 1

How much does Llama 3.3 70b cost per token?

Accepted Answer

Llama 3.3 70b costs $0.85 per 1M input tokens and $1.20 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.001450.

Question 2

What is the context window for Llama 3.3 70b?

Accepted Answer

Llama 3.3 70b supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.

Question 3

What is the maximum output length for Llama 3.3 70b?

Accepted Answer

Llama 3.3 70b can generate up to 128,000 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.

Question 4

Is Llama 3.3 70b good for coding tasks?

Accepted Answer

Yes, Llama 3.3 70b supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.

Input Price	$0.85 per 1M tokens
Output Price	$1.20 per 1M tokens
Context Window	128,000 tokens (128K)
Max Output	128,000 tokens
Provider	Cerebras

Llama 3.3 70b

Llama 3.3 70b Pricing & Specifications

What is Llama 3.3 70b?

Capabilities

Llama 3.3 70b Cost Examples

Count tokens for Llama 3.3 70b

Similar Models to Llama 3.3 70b

Llama3.1 70b

Qwen 3 32b

Gpt Oss 120b

Llama3.1 8b

Frequently Asked Questions