Skip to content
Llamagate

Qwen3 Vl 8b

Qwen3 Vl 8b is available via Llamagate with a 33K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.5500/1M output tokens.

Qwen3 Vl 8b Pricing & Specifications

Input Price$0.15 per 1M tokens
Output Price$0.55 per 1M tokens
Context Window32,768 tokens (33K)
Max Output8,192 tokens
ProviderLlamagate

What is Qwen3 Vl 8b?

Qwen3 Vl 8b is a large language model by Llamagate with a 33K context window and up to 8,192 output tokens. It costs $0.15 per 1M input tokens and $0.55 per 1M output tokens. Qwen3 Vl 8b is available via Llamagate with a 33K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.5500/1M output tokens.

Capabilities

text vision function calling json mode

Qwen3 Vl 8b Cost Examples

Short prompt (500 tokens)

$0.000075

Medium prompt (2K tokens)

$0.00030

Long output (4K tokens)

$0.00220

Count tokens for Qwen3 Vl 8b

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Qwen3 Vl 8b

Llamagate

Mistral 7b V0.3

$0.10/1M in 33K ctx

Llamagate

Deepseek R1 8b

$0.10/1M in 66K ctx

Llamagate

Llava 7b

$0.10/1M in 4K ctx

Llamagate

Dolphin3 8b

$0.080/1M in 128K ctx

Frequently Asked Questions

How much does Qwen3 Vl 8b cost per token? +
Qwen3 Vl 8b costs $0.15 per 1M input tokens and $0.55 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000425.
What is the context window for Qwen3 Vl 8b? +
Qwen3 Vl 8b supports a context window of 32,768 tokens (33K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Qwen3 Vl 8b? +
Qwen3 Vl 8b can generate up to 8,192 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Qwen3 Vl 8b good for coding tasks? +
Yes, Qwen3 Vl 8b supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Llamagate Models