Skip to content
Google Vertex AI

Meta/Llama 4 Maverick 17b 128e Instruct Maas

Meta/Llama 4 Maverick 17b 128e Instruct Maas is available via Google Vertex AI with a 1M context window and up to 1,000,000 output tokens. Pricing: $0.3500/1M input tokens, $1.15/1M output tokens.

Meta/Llama 4 Maverick 17b 128e Instruct Maas Pricing & Specifications

Input Price$0.35 per 1M tokens
Output Price$1.15 per 1M tokens
Context Window1,000,000 tokens (1M)
Max Output1,000,000 tokens
ProviderGoogle Vertex AI

What is Meta/Llama 4 Maverick 17b 128e Instruct Maas?

Meta/Llama 4 Maverick 17b 128e Instruct Maas is a large language model by Google Vertex AI with a 1M context window and up to 1,000,000 output tokens. It costs $0.35 per 1M input tokens and $1.15 per 1M output tokens. Meta/Llama 4 Maverick 17b 128e Instruct Maas is available via Google Vertex AI with a 1M context window and up to 1,000,000 output tokens. Pricing: $0.3500/1M input tokens, $1.15/1M output tokens.

Capabilities

text function calling

Meta/Llama 4 Maverick 17b 128e Instruct Maas Cost Examples

Short prompt (500 tokens)

$0.000175

Medium prompt (2K tokens)

$0.00070

Long output (4K tokens)

$0.00460

Count tokens for Meta/Llama 4 Maverick 17b 128e Instruct Maas

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Meta/Llama 4 Maverick 17b 128e Instruct Maas

Google Vertex AI

Meta/Llama 4 Maverick 17b 16e Instruct Maas

$0.35/1M in 1M ctx

Google Vertex AI

Gemini 2.5 Flash

$0.30/1M in 1.0M ctx

Google Vertex AI

Gemini 2.5 Flash Preview 09 2025

$0.30/1M in 1.0M ctx

Google Vertex AI

Gemini Robotics Er 1.5 Preview

$0.30/1M in 1.0M ctx

Frequently Asked Questions

How much does Meta/Llama 4 Maverick 17b 128e Instruct Maas cost per token? +
Meta/Llama 4 Maverick 17b 128e Instruct Maas costs $0.35 per 1M input tokens and $1.15 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000925.
What is the context window for Meta/Llama 4 Maverick 17b 128e Instruct Maas? +
Meta/Llama 4 Maverick 17b 128e Instruct Maas supports a context window of 1,000,000 tokens (1M). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Meta/Llama 4 Maverick 17b 128e Instruct Maas? +
Meta/Llama 4 Maverick 17b 128e Instruct Maas can generate up to 1,000,000 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Meta/Llama 4 Maverick 17b 128e Instruct Maas good for coding tasks? +
Yes, Meta/Llama 4 Maverick 17b 128e Instruct Maas supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Google Vertex AI Models