Skip to content

Fireworks AI Models

Fireworks AI provides 244 AI models accessible via API.

Visit Fireworks AI →

244

Models Available

$0.001

Cheapest Input / 1M

262K

Largest Context

What is Fireworks AI?

Fireworks AI is an AI model provider offering 244 large language models for developers. Their cheapest model starts at $0.001 per 1M input tokens, and their largest context window reaches 262K. Fireworks AI provides 244 AI models accessible via API.

Fireworks AI Strengths

All Fireworks AI Models

Model Input $/1M Output $/1M Context Max Output Released
Accounts/Fireworks/Models/Flux 1 Dev Controlnet Union $0.001 $0.001 4K 4,096
Accounts/Fireworks/Models/Gpt Oss 20b $0.050 $0.20 131K 131,072
Accounts/Fireworks/Models/Llama V3p1 8b Instruct $0.10 $0.10 16K 16,384
Accounts/Fireworks/Models/Llama V3p2 1b Instruct $0.10 $0.10 16K 16,384
Accounts/Fireworks/Models/Llama V3p2 3b Instruct $0.10 $0.10 16K 16,384
Accounts/Fireworks/Models/Codegemma 2b $0.10 $0.10 8K 8,192
Accounts/Fireworks/Models/Cogito V1 Preview Llama 3b $0.10 $0.10 131K 131,072
Accounts/Fireworks/Models/Deepseek Coder 1b Base $0.10 $0.10 16K 16,384
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 1p5b $0.10 $0.10 131K 131,072
Accounts/Fireworks/Models/Ernie 4p5 21b A3b Pt $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Ernie 4p5 300b A47b Pt $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Flux 1 Dev $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Flux 1 Schnell $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Gemma 2b It $0.10 $0.10 8K 8,192
Accounts/Fireworks/Models/Llama Guard 3 1b $0.10 $0.10 131K 131,072
Accounts/Fireworks/Models/Llama V2 70b $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Llama V3p1 405b Instruct Long $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Llama V3p1 70b Instruct 1b $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Llama V3p2 1b $0.10 $0.10 131K 131,072
Accounts/Fireworks/Models/Llama V3p2 3b $0.10 $0.10 131K 131,072
Accounts/Fireworks/Models/Minimax M1 80k $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Ministral 3 3b Instruct 2512 $0.10 $0.10 256K 256,000
Accounts/Fireworks/Models/Nemotron Nano V2 12b Vl $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Phi 2 3b $0.10 $0.10 2K 2,048
Accounts/Fireworks/Models/Phi 3 Mini 128k Instruct $0.10 $0.10 131K 131,072
Accounts/Fireworks/Models/Qwen2 Vl 2b Instruct $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 0p5b Instruct $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 1p5b Instruct $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Instruct $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Instruct $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 3b $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 3b Instruct $0.10 $0.10 33K 32,768
Accounts/Fireworks/Models/Qwen3 0p6b $0.10 $0.10 41K 40,960
Accounts/Fireworks/Models/Qwen3 1p7b $0.10 $0.10 131K 131,072
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft $0.10 $0.10 262K 262,144
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 131072 $0.10 $0.10 131K 131,072
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 40960 $0.10 $0.10 41K 40,960
Accounts/Fireworks/Models/Stablecode 3b $0.10 $0.10 4K 4,096
Accounts/Fireworks/Models/Starcoder2 3b $0.10 $0.10 16K 16,384
Accounts/Fireworks/Models/Gpt Oss 120b $0.15 $0.60 131K 131,072
Accounts/Fireworks/Models/Llama4 Scout Instruct Basic $0.15 $0.60 131K 131,072
Accounts/Fireworks/Models/Qwen3 30b A3b $0.15 $0.60 131K 131,072
Accounts/Fireworks/Models/Qwen3 Coder 30b A3b Instruct $0.15 $0.60 262K 262,144
Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Instruct $0.15 $0.60 262K 262,144
Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Thinking $0.15 $0.60 262K 262,144
Accounts/Fireworks/Models/Llama V3p2 11b Vision Instruct $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Chronos Hermes 13b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Code Llama 13b $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Code Llama 13b Instruct $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Code Llama 13b Python $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Code Llama 7b $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Code Llama 7b Instruct $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Code Llama 7b Python $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Code Qwen 1p5 7b $0.20 $0.20 66K 65,536
Accounts/Fireworks/Models/Codegemma 7b $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Cogito V1 Preview Llama 8b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Cogito V1 Preview Qwen 14b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Deepseek Coder 7b Base $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Deepseek Coder 7b Base V1p5 $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Deepseek Coder 7b Instruct V1p5 $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Deepseek R1 0528 Distill Qwen3 8b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Deepseek R1 Distill Llama 8b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 14b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 7b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Dobby Mini Unhinged Plus Llama 3 1 8b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Firellava 13b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Firesearch Ocr $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Gemma 7b $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Gemma 7b It $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Gemma2 9b It $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Hermes 2 Pro Mistral 7b $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Internvl3 8b $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Llama Guard 2 8b $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Llama Guard 3 8b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Llama V2 13b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Llama V2 13b Chat $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Llama V2 7b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Llama V2 7b Chat $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Llama V3 8b $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Llama V3 8b Instruct Hf $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Llamaguard 7b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Ministral 3 14b Instruct 2512 $0.20 $0.20 256K 256,000
Accounts/Fireworks/Models/Ministral 3 8b Instruct 2512 $0.20 $0.20 256K 256,000
Accounts/Fireworks/Models/Mistral 7b $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Mistral 7b Instruct 4k $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Mistral 7b Instruct V0p2 $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Mistral 7b Instruct $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Mistral 7b V0p2 $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Mistral Nemo Base 2407 $0.20 $0.20 128K 128,000
Accounts/Fireworks/Models/Mistral Nemo Instruct 2407 $0.20 $0.20 128K 128,000
Accounts/Fireworks/Models/Mythomax L2 13b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Nous Capybara 7b V1p9 $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Nous Hermes Llama2 13b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Nous Hermes Llama2 7b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Nvidia Nemotron Nano 12b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Nvidia Nemotron Nano 9b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Openchat 3p5 0106 7b $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Openhermes 2 Mistral 7b $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Openhermes 2p5 Mistral 7b $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Openorca 7b $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Phi 3 Vision 128k Instruct $0.20 $0.20 32K 32,064
Accounts/Fireworks/Models/Pythia 12b $0.20 $0.20 2K 2,048
Accounts/Fireworks/Models/Qwen V2p5 14b Instruct $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Qwen V2p5 7b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Qwen2 7b Instruct $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Qwen2 Vl 7b Instruct $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 14b $0.20 $0.20 131K 131,072
Accounts/Fireworks/Models/Qwen2p5 7b Instruct $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 14b $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 14b Instruct $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 7b $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 7b Instruct $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Vl 3b Instruct $0.20 $0.20 128K 128,000
Accounts/Fireworks/Models/Qwen2p5 Vl 7b Instruct $0.20 $0.20 128K 128,000
Accounts/Fireworks/Models/Qwen3 14b $0.20 $0.20 41K 40,960
Accounts/Fireworks/Models/Qwen3 4b $0.20 $0.20 41K 40,960
Accounts/Fireworks/Models/Qwen3 4b Instruct 2507 $0.20 $0.20 262K 262,144
Accounts/Fireworks/Models/Qwen3 8b $0.20 $0.20 41K 40,960
Accounts/Fireworks/Models/Qwen3 Vl 8b Instruct $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Rolm Ocr $0.20 $0.20 128K 128,000
Accounts/Fireworks/Models/Snorkel Mistral 7b Pairrm Dpo $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Starcoder 16b $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Starcoder 7b $0.20 $0.20 8K 8,192
Accounts/Fireworks/Models/Starcoder2 15b $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Starcoder2 7b $0.20 $0.20 16K 16,384
Accounts/Fireworks/Models/Toppy M 7b $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Yi 6b $0.20 $0.20 4K 4,096
Accounts/Fireworks/Models/Zephyr 7b Beta $0.20 $0.20 33K 32,768
Accounts/Fireworks/Models/Glm 4p5 Air $0.22 $0.88 128K 96,000
Accounts/Fireworks/Models/Llama4 Maverick Instruct Basic $0.22 $0.88 131K 131,072
Accounts/Fireworks/Models/Qwen3 235b A22b $0.22 $0.88 131K 131,072
Accounts/Fireworks/Models/Qwen3 235b A22b Instruct 2507 $0.22 $0.88 262K 262,144
Accounts/Fireworks/Models/Qwen3 235b A22b Thinking 2507 $0.22 $0.88 262K 262,144
Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Instruct $0.22 $0.88 262K 262,144
Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Thinking $0.22 $0.88 262K 262,144
Accounts/Fireworks/Models/Minimax M2p1 $0.30 $1.20 205K 204,800
Minimax M2p1 $0.30 $1.20 205K 204,800
Accounts/Fireworks/Models/Minimax M2 $0.30 $1.20 4K 4,096
Accounts/Fireworks/Models/Qwen3 Coder 480b A35b Instruct $0.45 $1.80 262K 262,144
Accounts/Fireworks/Models/Deepseek Coder V2 Lite Base $0.50 $0.50 164K 163,840
Accounts/Fireworks/Models/Deepseek Coder V2 Lite Instruct $0.50 $0.50 164K 163,840
Accounts/Fireworks/Models/Deepseek V2 Lite Chat $0.50 $0.50 164K 163,840
Accounts/Fireworks/Models/Dolphin 2p6 Mixtral 8x7b $0.50 $0.50 33K 32,768
Accounts/Fireworks/Models/Firefunction $0.50 $0.50 33K 32,768
Accounts/Fireworks/Models/Gpt Oss Safeguard 20b $0.50 $0.50 131K 131,072
Accounts/Fireworks/Models/Mixtral 8x7b $0.50 $0.50 33K 32,768
Accounts/Fireworks/Models/Mixtral 8x7b Instruct $0.50 $0.50 33K 32,768
Accounts/Fireworks/Models/Mixtral 8x7b Instruct Hf $0.50 $0.50 33K 32,768
Accounts/Fireworks/Models/Nous Hermes 2 Mixtral 8x7b Dpo $0.50 $0.50 33K 32,768
Accounts/Fireworks/Models/Qwen3 30b A3b Instruct 2507 $0.50 $0.50 262K 262,144
Accounts/Fireworks/Models/Deepseek R1 Basic $0.55 $2.19 128K 20,480
Accounts/Fireworks/Models/Glm 4p5 $0.55 $2.19 128K 96,000
Accounts/Fireworks/Models/Glm 4p6 $0.55 $2.19 203K 202,800
Accounts/Fireworks/Models/Deepseek V3p1 $0.56 $1.68 128K 8,192
Accounts/Fireworks/Models/Deepseek V3p1 Terminus $0.56 $1.68 128K 8,192
Accounts/Fireworks/Models/Deepseek V3p2 $0.56 $1.68 164K 163,840
Accounts/Fireworks/Models/Glm 4p7 $0.60 $2.20 203K 202,800
Accounts/Fireworks/Models/Kimi K2 Instruct $0.60 $2.50 131K 16,384
Accounts/Fireworks/Models/Kimi K2 Instruct 0905 $0.60 $2.50 262K 32,768
Accounts/Fireworks/Models/Kimi K2 Thinking $0.60 $2.50 262K 262,144
Accounts/Fireworks/Models/Kimi K2p5 $0.60 $3.00 262K 262,144
Glm 4p7 $0.60 $2.20 203K 202,800
Kimi K2p5 $0.60 $3.00 262K 262,144
Accounts/Fireworks/Models/Deepseek $0.90 $0.90 128K 8,192
Accounts/Fireworks/Models/Deepseek V3 0324 $0.90 $0.90 164K 163,840
Accounts/Fireworks/Models/Firefunction $0.90 $0.90 8K 8,192
Accounts/Fireworks/Models/Llama V3p2 90b Vision Instruct $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Qwen2 72b Instruct $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Code Llama 34b $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Code Llama 34b Instruct $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Code Llama 34b Python $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Code Llama 70b $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Code Llama 70b Instruct $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Code Llama 70b Python $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Cogito V1 Preview Llama 70b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Cogito V1 Preview Qwen 32b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Deepseek Coder 33b Instruct $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Deepseek R1 Distill Llama 70b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 32b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Devstral Small 2505 $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Dobby Unhinged Llama 3 3 70b New $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Dolphin 2 9 2 Qwen2 72b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Fare 20b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Gemma 3 27b It $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Internvl3 38b $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Internvl3 78b $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Kat Coder $0.90 $0.90 262K 262,144
Accounts/Fireworks/Models/Kat Dev 32b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Kat Dev 72b Exp $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Llama V2 70b Chat $0.90 $0.90 2K 2,048
Accounts/Fireworks/Models/Llama V3 70b Instruct $0.90 $0.90 8K 8,192
Accounts/Fireworks/Models/Llama V3 70b Instruct Hf $0.90 $0.90 8K 8,192
Accounts/Fireworks/Models/Llama V3p1 70b Instruct $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Llama V3p1 Nemotron 70b Instruct $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Llama V3p3 70b Instruct $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Llava Yi 34b $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Mistral Small 24b Instruct 2501 $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Nous Hermes 2 Yi 34b $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Nous Hermes Llama2 70b $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Phind Code Llama 34b Python $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Phind Code Llama 34b $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Phind Code Llama 34b $0.90 $0.90 16K 16,384
Accounts/Fireworks/Models/Qwen Qwq 32b Preview $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Qwen1p5 72b Chat $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Qwen2 Vl 72b Instruct $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 32b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Qwen2p5 32b Instruct $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 72b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Qwen2p5 72b Instruct $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 32b $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 128k $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 32k Rope $0.90 $0.90 33K 32,768
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 64k $0.90 $0.90 66K 65,536
Accounts/Fireworks/Models/Qwen2p5 Math 72b Instruct $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Qwen2p5 Vl 32b Instruct $0.90 $0.90 128K 128,000
Accounts/Fireworks/Models/Qwen2p5 Vl 72b Instruct $0.90 $0.90 128K 128,000
Accounts/Fireworks/Models/Qwen3 30b A3b Thinking 2507 $0.90 $0.90 262K 262,144
Accounts/Fireworks/Models/Qwen3 32b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Qwen3 Coder 480b Instruct Bf16 $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Qwen3 Next 80b A3b Instruct $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Qwen3 Next 80b A3b Thinking $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Qwen3 Vl 32b Instruct $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Qwq 32b $0.90 $0.90 131K 131,072
Accounts/Fireworks/Models/Yi 34b $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Yi 34b 200k Capybara $0.90 $0.90 200K 200,000
Accounts/Fireworks/Models/Yi 34b Chat $0.90 $0.90 4K 4,096
Accounts/Fireworks/Models/Deepseek Coder V2 Instruct $1.20 $1.20 66K 65,536
Accounts/Fireworks/Models/Mixtral 8x22b Instruct Hf $1.20 $1.20 66K 65,536
Accounts/Fireworks/Models/Cogito 671b V2 P1 $1.20 $1.20 164K 163,840
Accounts/Fireworks/Models/Dbrx Instruct $1.20 $1.20 33K 32,768
Accounts/Fireworks/Models/Deepseek Prover $1.20 $1.20 164K 163,840
Accounts/Fireworks/Models/Deepseek V2p5 $1.20 $1.20 33K 32,768
Accounts/Fireworks/Models/Glm 4p5v $1.20 $1.20 131K 131,072
Accounts/Fireworks/Models/Gpt Oss Safeguard 120b $1.20 $1.20 131K 131,072
Accounts/Fireworks/Models/Mistral Large 3 Fp8 $1.20 $1.20 256K 256,000
Accounts/Fireworks/Models/Mixtral 8x22b $1.20 $1.20 66K 65,536
Accounts/Fireworks/Models/Mixtral 8x22b Instruct $1.20 $1.20 66K 65,536
Accounts/Fireworks/Models/Deepseek R1 $3.00 $8.00 128K 20,480
Accounts/Fireworks/Models/Deepseek R1 0528 $3.00 $8.00 160K 160,000
Accounts/Fireworks/Models/Llama V3p1 405b Instruct $3.00 $3.00 128K 16,384
Accounts/Fireworks/Models/Yi Large $3.00 $3.00 33K 32,768

Model Details

Accounts/Fireworks/Models/Flux 1 Dev Controlnet Union

Accounts/Fireworks/Models/Flux 1 Dev Controlnet Union is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.001000/1M input tokens, $0.001000/1M output tokens.

Input: $0.001/1M Output: $0.001/1M Context: 4K
text

Accounts/Fireworks/Models/Gpt Oss 20b

Accounts/Fireworks/Models/Gpt Oss 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.0500/1M input tokens, $0.2000/1M output tokens.

Input: $0.050/1M Output: $0.20/1M Context: 131K
text function calling reasoning json mode

Accounts/Fireworks/Models/Llama V3p1 8b Instruct

Accounts/Fireworks/Models/Llama V3p1 8b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 16K
text json mode

Accounts/Fireworks/Models/Llama V3p2 1b Instruct

Accounts/Fireworks/Models/Llama V3p2 1b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 16K
text json mode

Accounts/Fireworks/Models/Llama V3p2 3b Instruct

Accounts/Fireworks/Models/Llama V3p2 3b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 16K
text json mode

Accounts/Fireworks/Models/Codegemma 2b

Accounts/Fireworks/Models/Codegemma 2b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 8K
text

Accounts/Fireworks/Models/Cogito V1 Preview Llama 3b

Accounts/Fireworks/Models/Cogito V1 Preview Llama 3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text

Accounts/Fireworks/Models/Deepseek Coder 1b Base

Accounts/Fireworks/Models/Deepseek Coder 1b Base is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 16K
text

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 1p5b

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 1p5b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text

Accounts/Fireworks/Models/Ernie 4p5 21b A3b Pt

Accounts/Fireworks/Models/Ernie 4p5 21b A3b Pt is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Ernie 4p5 300b A47b Pt

Accounts/Fireworks/Models/Ernie 4p5 300b A47b Pt is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Flux 1 Dev

Accounts/Fireworks/Models/Flux 1 Dev is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Flux 1 Schnell

Accounts/Fireworks/Models/Flux 1 Schnell is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Gemma 2b It

Accounts/Fireworks/Models/Gemma 2b It is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 8K
text

Accounts/Fireworks/Models/Llama Guard 3 1b

Accounts/Fireworks/Models/Llama Guard 3 1b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text

Accounts/Fireworks/Models/Llama V2 70b

Accounts/Fireworks/Models/Llama V2 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Llama V3p1 405b Instruct Long

Accounts/Fireworks/Models/Llama V3p1 405b Instruct Long is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Llama V3p1 70b Instruct 1b

Accounts/Fireworks/Models/Llama V3p1 70b Instruct 1b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Llama V3p2 1b

Accounts/Fireworks/Models/Llama V3p2 1b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text

Accounts/Fireworks/Models/Llama V3p2 3b

Accounts/Fireworks/Models/Llama V3p2 3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text

Accounts/Fireworks/Models/Minimax M1 80k

Accounts/Fireworks/Models/Minimax M1 80k is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Ministral 3 3b Instruct 2512

Accounts/Fireworks/Models/Ministral 3 3b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 256K
text

Accounts/Fireworks/Models/Nemotron Nano V2 12b Vl

Accounts/Fireworks/Models/Nemotron Nano V2 12b Vl is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Phi 2 3b

Accounts/Fireworks/Models/Phi 2 3b is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 2K
text

Accounts/Fireworks/Models/Phi 3 Mini 128k Instruct

Accounts/Fireworks/Models/Phi 3 Mini 128k Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen2 Vl 2b Instruct

Accounts/Fireworks/Models/Qwen2 Vl 2b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 0p5b Instruct

Accounts/Fireworks/Models/Qwen2p5 0p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 1p5b Instruct

Accounts/Fireworks/Models/Qwen2p5 1p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b

Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b

Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 3b

Accounts/Fireworks/Models/Qwen2p5 Coder 3b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 3b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 3b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen3 0p6b

Accounts/Fireworks/Models/Qwen3 0p6b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 41K
text

Accounts/Fireworks/Models/Qwen3 1p7b

Accounts/Fireworks/Models/Qwen3 1p7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 262K
text

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 131072

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 131072 is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 40960

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 40960 is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 41K
text

Accounts/Fireworks/Models/Stablecode 3b

Accounts/Fireworks/Models/Stablecode 3b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 4K
text

Accounts/Fireworks/Models/Starcoder2 3b

Accounts/Fireworks/Models/Starcoder2 3b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 16K
text

Accounts/Fireworks/Models/Gpt Oss 120b

Accounts/Fireworks/Models/Gpt Oss 120b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 131K
text function calling reasoning json mode

Accounts/Fireworks/Models/Llama4 Scout Instruct Basic

Accounts/Fireworks/Models/Llama4 Scout Instruct Basic is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 131K
text json mode

Accounts/Fireworks/Models/Qwen3 30b A3b

Accounts/Fireworks/Models/Qwen3 30b A3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen3 Coder 30b A3b Instruct

Accounts/Fireworks/Models/Qwen3 Coder 30b A3b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 262K
text

Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Instruct

Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 262K
text

Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Thinking

Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 262K
text

Accounts/Fireworks/Models/Llama V3p2 11b Vision Instruct

Accounts/Fireworks/Models/Llama V3p2 11b Vision Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text vision json mode

Accounts/Fireworks/Models/Chronos Hermes 13b

Accounts/Fireworks/Models/Chronos Hermes 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Code Llama 13b

Accounts/Fireworks/Models/Code Llama 13b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Code Llama 13b Instruct

Accounts/Fireworks/Models/Code Llama 13b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Code Llama 13b Python

Accounts/Fireworks/Models/Code Llama 13b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Code Llama 7b

Accounts/Fireworks/Models/Code Llama 7b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Code Llama 7b Instruct

Accounts/Fireworks/Models/Code Llama 7b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Code Llama 7b Python

Accounts/Fireworks/Models/Code Llama 7b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Code Qwen 1p5 7b

Accounts/Fireworks/Models/Code Qwen 1p5 7b is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 66K
text

Accounts/Fireworks/Models/Codegemma 7b

Accounts/Fireworks/Models/Codegemma 7b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Cogito V1 Preview Llama 8b

Accounts/Fireworks/Models/Cogito V1 Preview Llama 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Cogito V1 Preview Qwen 14b

Accounts/Fireworks/Models/Cogito V1 Preview Qwen 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Deepseek Coder 7b Base

Accounts/Fireworks/Models/Deepseek Coder 7b Base is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Deepseek Coder 7b Base V1p5

Accounts/Fireworks/Models/Deepseek Coder 7b Base V1p5 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Deepseek Coder 7b Instruct V1p5

Accounts/Fireworks/Models/Deepseek Coder 7b Instruct V1p5 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Deepseek R1 0528 Distill Qwen3 8b

Accounts/Fireworks/Models/Deepseek R1 0528 Distill Qwen3 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Deepseek R1 Distill Llama 8b

Accounts/Fireworks/Models/Deepseek R1 Distill Llama 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 14b

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 7b

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Dobby Mini Unhinged Plus Llama 3 1 8b

Accounts/Fireworks/Models/Dobby Mini Unhinged Plus Llama 3 1 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Firellava 13b

Accounts/Fireworks/Models/Firellava 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Firesearch Ocr

Accounts/Fireworks/Models/Firesearch Ocr is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Gemma 7b

Accounts/Fireworks/Models/Gemma 7b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Gemma 7b It

Accounts/Fireworks/Models/Gemma 7b It is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Gemma2 9b It

Accounts/Fireworks/Models/Gemma2 9b It is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Hermes 2 Pro Mistral 7b

Accounts/Fireworks/Models/Hermes 2 Pro Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Internvl3 8b

Accounts/Fireworks/Models/Internvl3 8b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Llama Guard 2 8b

Accounts/Fireworks/Models/Llama Guard 2 8b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Llama Guard 3 8b

Accounts/Fireworks/Models/Llama Guard 3 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Llama V2 13b

Accounts/Fireworks/Models/Llama V2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Llama V2 13b Chat

Accounts/Fireworks/Models/Llama V2 13b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Llama V2 7b

Accounts/Fireworks/Models/Llama V2 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Llama V2 7b Chat

Accounts/Fireworks/Models/Llama V2 7b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Llama V3 8b

Accounts/Fireworks/Models/Llama V3 8b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Llama V3 8b Instruct Hf

Accounts/Fireworks/Models/Llama V3 8b Instruct Hf is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Llamaguard 7b

Accounts/Fireworks/Models/Llamaguard 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Ministral 3 14b Instruct 2512

Accounts/Fireworks/Models/Ministral 3 14b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 256K
text

Accounts/Fireworks/Models/Ministral 3 8b Instruct 2512

Accounts/Fireworks/Models/Ministral 3 8b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 256K
text

Accounts/Fireworks/Models/Mistral 7b

Accounts/Fireworks/Models/Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Mistral 7b Instruct 4k

Accounts/Fireworks/Models/Mistral 7b Instruct 4k is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Mistral 7b Instruct V0p2

Accounts/Fireworks/Models/Mistral 7b Instruct V0p2 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Mistral 7b Instruct

Accounts/Fireworks/Models/Mistral 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Mistral 7b V0p2

Accounts/Fireworks/Models/Mistral 7b V0p2 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Mistral Nemo Base 2407

Accounts/Fireworks/Models/Mistral Nemo Base 2407 is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 128K
text

Accounts/Fireworks/Models/Mistral Nemo Instruct 2407

Accounts/Fireworks/Models/Mistral Nemo Instruct 2407 is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 128K
text

Accounts/Fireworks/Models/Mythomax L2 13b

Accounts/Fireworks/Models/Mythomax L2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Nous Capybara 7b V1p9

Accounts/Fireworks/Models/Nous Capybara 7b V1p9 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Nous Hermes Llama2 13b

Accounts/Fireworks/Models/Nous Hermes Llama2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Nous Hermes Llama2 7b

Accounts/Fireworks/Models/Nous Hermes Llama2 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Nvidia Nemotron Nano 12b

Accounts/Fireworks/Models/Nvidia Nemotron Nano 12b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Nvidia Nemotron Nano 9b

Accounts/Fireworks/Models/Nvidia Nemotron Nano 9b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Openchat 3p5 0106 7b

Accounts/Fireworks/Models/Openchat 3p5 0106 7b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Openhermes 2 Mistral 7b

Accounts/Fireworks/Models/Openhermes 2 Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Openhermes 2p5 Mistral 7b

Accounts/Fireworks/Models/Openhermes 2p5 Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Openorca 7b

Accounts/Fireworks/Models/Openorca 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Phi 3 Vision 128k Instruct

Accounts/Fireworks/Models/Phi 3 Vision 128k Instruct is available via Fireworks AI with a 32K context window and up to 32,064 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 32K
text

Accounts/Fireworks/Models/Pythia 12b

Accounts/Fireworks/Models/Pythia 12b is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 2K
text

Accounts/Fireworks/Models/Qwen V2p5 14b Instruct

Accounts/Fireworks/Models/Qwen V2p5 14b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen V2p5 7b

Accounts/Fireworks/Models/Qwen V2p5 7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen2 7b Instruct

Accounts/Fireworks/Models/Qwen2 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2 Vl 7b Instruct

Accounts/Fireworks/Models/Qwen2 Vl 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 14b

Accounts/Fireworks/Models/Qwen2p5 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen2p5 7b Instruct

Accounts/Fireworks/Models/Qwen2p5 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 14b

Accounts/Fireworks/Models/Qwen2p5 Coder 14b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 14b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 14b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 7b

Accounts/Fireworks/Models/Qwen2p5 Coder 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 7b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Vl 3b Instruct

Accounts/Fireworks/Models/Qwen2p5 Vl 3b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 128K
text

Accounts/Fireworks/Models/Qwen2p5 Vl 7b Instruct

Accounts/Fireworks/Models/Qwen2p5 Vl 7b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 128K
text

Accounts/Fireworks/Models/Qwen3 14b

Accounts/Fireworks/Models/Qwen3 14b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 41K
text

Accounts/Fireworks/Models/Qwen3 4b

Accounts/Fireworks/Models/Qwen3 4b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 41K
text

Accounts/Fireworks/Models/Qwen3 4b Instruct 2507

Accounts/Fireworks/Models/Qwen3 4b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 262K
text

Accounts/Fireworks/Models/Qwen3 8b

Accounts/Fireworks/Models/Qwen3 8b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 41K
text reasoning

Accounts/Fireworks/Models/Qwen3 Vl 8b Instruct

Accounts/Fireworks/Models/Qwen3 Vl 8b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Rolm Ocr

Accounts/Fireworks/Models/Rolm Ocr is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 128K
text

Accounts/Fireworks/Models/Snorkel Mistral 7b Pairrm Dpo

Accounts/Fireworks/Models/Snorkel Mistral 7b Pairrm Dpo is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Starcoder 16b

Accounts/Fireworks/Models/Starcoder 16b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Starcoder 7b

Accounts/Fireworks/Models/Starcoder 7b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 8K
text

Accounts/Fireworks/Models/Starcoder2 15b

Accounts/Fireworks/Models/Starcoder2 15b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Starcoder2 7b

Accounts/Fireworks/Models/Starcoder2 7b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 16K
text

Accounts/Fireworks/Models/Toppy M 7b

Accounts/Fireworks/Models/Toppy M 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Yi 6b

Accounts/Fireworks/Models/Yi 6b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 4K
text

Accounts/Fireworks/Models/Zephyr 7b Beta

Accounts/Fireworks/Models/Zephyr 7b Beta is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

Input: $0.20/1M Output: $0.20/1M Context: 33K
text

Accounts/Fireworks/Models/Glm 4p5 Air

Accounts/Fireworks/Models/Glm 4p5 Air is available via Fireworks AI with a 128K context window and up to 96,000 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

Input: $0.22/1M Output: $0.88/1M Context: 128K
text function calling reasoning json mode

Accounts/Fireworks/Models/Llama4 Maverick Instruct Basic

Accounts/Fireworks/Models/Llama4 Maverick Instruct Basic is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

Input: $0.22/1M Output: $0.88/1M Context: 131K
text json mode

Accounts/Fireworks/Models/Qwen3 235b A22b

Accounts/Fireworks/Models/Qwen3 235b A22b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

Input: $0.22/1M Output: $0.88/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen3 235b A22b Instruct 2507

Accounts/Fireworks/Models/Qwen3 235b A22b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

Input: $0.22/1M Output: $0.88/1M Context: 262K
text

Accounts/Fireworks/Models/Qwen3 235b A22b Thinking 2507

Accounts/Fireworks/Models/Qwen3 235b A22b Thinking 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

Input: $0.22/1M Output: $0.88/1M Context: 262K
text

Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Instruct

Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

Input: $0.22/1M Output: $0.88/1M Context: 262K
text

Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Thinking

Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

Input: $0.22/1M Output: $0.88/1M Context: 262K
text

Accounts/Fireworks/Models/Minimax M2p1

Accounts/Fireworks/Models/Minimax M2p1 is available via Fireworks AI with a 205K context window and up to 204,800 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

Input: $0.30/1M Output: $1.20/1M Context: 205K
text function calling json mode

Minimax M2p1

Minimax M2p1 is available via Fireworks AI with a 205K context window and up to 204,800 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

Input: $0.30/1M Output: $1.20/1M Context: 205K
text function calling json mode

Accounts/Fireworks/Models/Minimax M2

Accounts/Fireworks/Models/Minimax M2 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

Input: $0.30/1M Output: $1.20/1M Context: 4K
text

Accounts/Fireworks/Models/Qwen3 Coder 480b A35b Instruct

Accounts/Fireworks/Models/Qwen3 Coder 480b A35b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.4500/1M input tokens, $1.80/1M output tokens.

Input: $0.45/1M Output: $1.80/1M Context: 262K
text reasoning

Accounts/Fireworks/Models/Deepseek Coder V2 Lite Base

Accounts/Fireworks/Models/Deepseek Coder V2 Lite Base is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 164K
text

Accounts/Fireworks/Models/Deepseek Coder V2 Lite Instruct

Accounts/Fireworks/Models/Deepseek Coder V2 Lite Instruct is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 164K
text

Accounts/Fireworks/Models/Deepseek V2 Lite Chat

Accounts/Fireworks/Models/Deepseek V2 Lite Chat is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 164K
text

Accounts/Fireworks/Models/Dolphin 2p6 Mixtral 8x7b

Accounts/Fireworks/Models/Dolphin 2p6 Mixtral 8x7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 33K
text

Accounts/Fireworks/Models/Firefunction

Accounts/Fireworks/Models/Firefunction is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 33K
text

Accounts/Fireworks/Models/Gpt Oss Safeguard 20b

Accounts/Fireworks/Models/Gpt Oss Safeguard 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 131K
text

Accounts/Fireworks/Models/Mixtral 8x7b

Accounts/Fireworks/Models/Mixtral 8x7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 33K
text

Accounts/Fireworks/Models/Mixtral 8x7b Instruct

Accounts/Fireworks/Models/Mixtral 8x7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 33K
text

Accounts/Fireworks/Models/Mixtral 8x7b Instruct Hf

Accounts/Fireworks/Models/Mixtral 8x7b Instruct Hf is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 33K
text

Accounts/Fireworks/Models/Nous Hermes 2 Mixtral 8x7b Dpo

Accounts/Fireworks/Models/Nous Hermes 2 Mixtral 8x7b Dpo is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen3 30b A3b Instruct 2507

Accounts/Fireworks/Models/Qwen3 30b A3b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

Input: $0.50/1M Output: $0.50/1M Context: 262K
text

Accounts/Fireworks/Models/Deepseek R1 Basic

Accounts/Fireworks/Models/Deepseek R1 Basic is available via Fireworks AI with a 128K context window and up to 20,480 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

Input: $0.55/1M Output: $2.19/1M Context: 128K
text json mode

Accounts/Fireworks/Models/Glm 4p5

Accounts/Fireworks/Models/Glm 4p5 is available via Fireworks AI with a 128K context window and up to 96,000 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

Input: $0.55/1M Output: $2.19/1M Context: 128K
text function calling reasoning json mode

Accounts/Fireworks/Models/Glm 4p6

Accounts/Fireworks/Models/Glm 4p6 is available via Fireworks AI with a 203K context window and up to 202,800 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

Input: $0.55/1M Output: $2.19/1M Context: 203K
text function calling reasoning json mode

Accounts/Fireworks/Models/Deepseek V3p1

Accounts/Fireworks/Models/Deepseek V3p1 is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.

Input: $0.56/1M Output: $1.68/1M Context: 128K
text reasoning json mode

Accounts/Fireworks/Models/Deepseek V3p1 Terminus

Accounts/Fireworks/Models/Deepseek V3p1 Terminus is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.

Input: $0.56/1M Output: $1.68/1M Context: 128K
text reasoning json mode

Accounts/Fireworks/Models/Deepseek V3p2

Accounts/Fireworks/Models/Deepseek V3p2 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.

Input: $0.56/1M Output: $1.68/1M Context: 164K
text function calling reasoning json mode

Accounts/Fireworks/Models/Glm 4p7

Accounts/Fireworks/Models/Glm 4p7 is available via Fireworks AI with a 203K context window and up to 202,800 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

Input: $0.60/1M Output: $2.20/1M Context: 203K
text function calling reasoning json mode

Accounts/Fireworks/Models/Kimi K2 Instruct

Accounts/Fireworks/Models/Kimi K2 Instruct is available via Fireworks AI with a 131K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

Input: $0.60/1M Output: $2.50/1M Context: 131K
text function calling json mode

Accounts/Fireworks/Models/Kimi K2 Instruct 0905

Accounts/Fireworks/Models/Kimi K2 Instruct 0905 is available via Fireworks AI with a 262K context window and up to 32,768 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

Input: $0.60/1M Output: $2.50/1M Context: 262K
text function calling json mode

Accounts/Fireworks/Models/Kimi K2 Thinking

Accounts/Fireworks/Models/Kimi K2 Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

Input: $0.60/1M Output: $2.50/1M Context: 262K
text function calling web search json mode

Accounts/Fireworks/Models/Kimi K2p5

Accounts/Fireworks/Models/Kimi K2p5 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

Input: $0.60/1M Output: $3.00/1M Context: 262K
text function calling json mode

Glm 4p7

Glm 4p7 is available via Fireworks AI with a 203K context window and up to 202,800 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

Input: $0.60/1M Output: $2.20/1M Context: 203K
text function calling reasoning json mode

Kimi K2p5

Kimi K2p5 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

Input: $0.60/1M Output: $3.00/1M Context: 262K
text function calling json mode

Accounts/Fireworks/Models/Deepseek

Accounts/Fireworks/Models/Deepseek is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 128K
text json mode

Accounts/Fireworks/Models/Deepseek V3 0324

Accounts/Fireworks/Models/Deepseek V3 0324 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 164K
text json mode

Accounts/Fireworks/Models/Firefunction

Accounts/Fireworks/Models/Firefunction is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 8K
text function calling json mode

Accounts/Fireworks/Models/Llama V3p2 90b Vision Instruct

Accounts/Fireworks/Models/Llama V3p2 90b Vision Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text vision json mode

Accounts/Fireworks/Models/Qwen2 72b Instruct

Accounts/Fireworks/Models/Qwen2 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text json mode

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text json mode

Accounts/Fireworks/Models/Code Llama 34b

Accounts/Fireworks/Models/Code Llama 34b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Code Llama 34b Instruct

Accounts/Fireworks/Models/Code Llama 34b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Code Llama 34b Python

Accounts/Fireworks/Models/Code Llama 34b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Code Llama 70b

Accounts/Fireworks/Models/Code Llama 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Code Llama 70b Instruct

Accounts/Fireworks/Models/Code Llama 70b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Code Llama 70b Python

Accounts/Fireworks/Models/Code Llama 70b Python is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Cogito V1 Preview Llama 70b

Accounts/Fireworks/Models/Cogito V1 Preview Llama 70b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Cogito V1 Preview Qwen 32b

Accounts/Fireworks/Models/Cogito V1 Preview Qwen 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Deepseek Coder 33b Instruct

Accounts/Fireworks/Models/Deepseek Coder 33b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Deepseek R1 Distill Llama 70b

Accounts/Fireworks/Models/Deepseek R1 Distill Llama 70b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 32b

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Devstral Small 2505

Accounts/Fireworks/Models/Devstral Small 2505 is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Dobby Unhinged Llama 3 3 70b New

Accounts/Fireworks/Models/Dobby Unhinged Llama 3 3 70b New is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Dolphin 2 9 2 Qwen2 72b

Accounts/Fireworks/Models/Dolphin 2 9 2 Qwen2 72b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Fare 20b

Accounts/Fireworks/Models/Fare 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Gemma 3 27b It

Accounts/Fireworks/Models/Gemma 3 27b It is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Internvl3 38b

Accounts/Fireworks/Models/Internvl3 38b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Internvl3 78b

Accounts/Fireworks/Models/Internvl3 78b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Kat Coder

Accounts/Fireworks/Models/Kat Coder is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 262K
text

Accounts/Fireworks/Models/Kat Dev 32b

Accounts/Fireworks/Models/Kat Dev 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Kat Dev 72b Exp

Accounts/Fireworks/Models/Kat Dev 72b Exp is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Llama V2 70b Chat

Accounts/Fireworks/Models/Llama V2 70b Chat is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 2K
text

Accounts/Fireworks/Models/Llama V3 70b Instruct

Accounts/Fireworks/Models/Llama V3 70b Instruct is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 8K
text

Accounts/Fireworks/Models/Llama V3 70b Instruct Hf

Accounts/Fireworks/Models/Llama V3 70b Instruct Hf is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 8K
text

Accounts/Fireworks/Models/Llama V3p1 70b Instruct

Accounts/Fireworks/Models/Llama V3p1 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Llama V3p1 Nemotron 70b Instruct

Accounts/Fireworks/Models/Llama V3p1 Nemotron 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Llama V3p3 70b Instruct

Accounts/Fireworks/Models/Llama V3p3 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Llava Yi 34b

Accounts/Fireworks/Models/Llava Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Mistral Small 24b Instruct 2501

Accounts/Fireworks/Models/Mistral Small 24b Instruct 2501 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text

Accounts/Fireworks/Models/Nous Hermes 2 Yi 34b

Accounts/Fireworks/Models/Nous Hermes 2 Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Nous Hermes Llama2 70b

Accounts/Fireworks/Models/Nous Hermes Llama2 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Phind Code Llama 34b Python

Accounts/Fireworks/Models/Phind Code Llama 34b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Phind Code Llama 34b

Accounts/Fireworks/Models/Phind Code Llama 34b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Phind Code Llama 34b

Accounts/Fireworks/Models/Phind Code Llama 34b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 16K
text

Accounts/Fireworks/Models/Qwen Qwq 32b Preview

Accounts/Fireworks/Models/Qwen Qwq 32b Preview is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen1p5 72b Chat

Accounts/Fireworks/Models/Qwen1p5 72b Chat is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2 Vl 72b Instruct

Accounts/Fireworks/Models/Qwen2 Vl 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 32b

Accounts/Fireworks/Models/Qwen2p5 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen2p5 32b Instruct

Accounts/Fireworks/Models/Qwen2p5 32b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 72b

Accounts/Fireworks/Models/Qwen2p5 72b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen2p5 72b Instruct

Accounts/Fireworks/Models/Qwen2p5 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 32b

Accounts/Fireworks/Models/Qwen2p5 Coder 32b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 128k

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 128k is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 32k Rope

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 32k Rope is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 33K
text

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 64k

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 64k is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 66K
text

Accounts/Fireworks/Models/Qwen2p5 Math 72b Instruct

Accounts/Fireworks/Models/Qwen2p5 Math 72b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Qwen2p5 Vl 32b Instruct

Accounts/Fireworks/Models/Qwen2p5 Vl 32b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 128K
text

Accounts/Fireworks/Models/Qwen2p5 Vl 72b Instruct

Accounts/Fireworks/Models/Qwen2p5 Vl 72b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 128K
text

Accounts/Fireworks/Models/Qwen3 30b A3b Thinking 2507

Accounts/Fireworks/Models/Qwen3 30b A3b Thinking 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 262K
text

Accounts/Fireworks/Models/Qwen3 32b

Accounts/Fireworks/Models/Qwen3 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text reasoning

Accounts/Fireworks/Models/Qwen3 Coder 480b Instruct Bf16

Accounts/Fireworks/Models/Qwen3 Coder 480b Instruct Bf16 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Qwen3 Next 80b A3b Instruct

Accounts/Fireworks/Models/Qwen3 Next 80b A3b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Qwen3 Next 80b A3b Thinking

Accounts/Fireworks/Models/Qwen3 Next 80b A3b Thinking is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Qwen3 Vl 32b Instruct

Accounts/Fireworks/Models/Qwen3 Vl 32b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Qwq 32b

Accounts/Fireworks/Models/Qwq 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 131K
text

Accounts/Fireworks/Models/Yi 34b

Accounts/Fireworks/Models/Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Yi 34b 200k Capybara

Accounts/Fireworks/Models/Yi 34b 200k Capybara is available via Fireworks AI with a 200K context window and up to 200,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 200K
text

Accounts/Fireworks/Models/Yi 34b Chat

Accounts/Fireworks/Models/Yi 34b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 4K
text

Accounts/Fireworks/Models/Deepseek Coder V2 Instruct

Accounts/Fireworks/Models/Deepseek Coder V2 Instruct is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 66K
text json mode

Accounts/Fireworks/Models/Mixtral 8x22b Instruct Hf

Accounts/Fireworks/Models/Mixtral 8x22b Instruct Hf is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 66K
text function calling json mode

Accounts/Fireworks/Models/Cogito 671b V2 P1

Accounts/Fireworks/Models/Cogito 671b V2 P1 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 164K
text

Accounts/Fireworks/Models/Dbrx Instruct

Accounts/Fireworks/Models/Dbrx Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 33K
text

Accounts/Fireworks/Models/Deepseek Prover

Accounts/Fireworks/Models/Deepseek Prover is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 164K
text

Accounts/Fireworks/Models/Deepseek V2p5

Accounts/Fireworks/Models/Deepseek V2p5 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 33K
text

Accounts/Fireworks/Models/Glm 4p5v

Accounts/Fireworks/Models/Glm 4p5v is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 131K
text reasoning

Accounts/Fireworks/Models/Gpt Oss Safeguard 120b

Accounts/Fireworks/Models/Gpt Oss Safeguard 120b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 131K
text

Accounts/Fireworks/Models/Mistral Large 3 Fp8

Accounts/Fireworks/Models/Mistral Large 3 Fp8 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 256K
text

Accounts/Fireworks/Models/Mixtral 8x22b

Accounts/Fireworks/Models/Mixtral 8x22b is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 66K
text

Accounts/Fireworks/Models/Mixtral 8x22b Instruct

Accounts/Fireworks/Models/Mixtral 8x22b Instruct is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

Input: $1.20/1M Output: $1.20/1M Context: 66K
text

Accounts/Fireworks/Models/Deepseek R1

Accounts/Fireworks/Models/Deepseek R1 is available via Fireworks AI with a 128K context window and up to 20,480 output tokens. Pricing: $3.00/1M input tokens, $8.00/1M output tokens.

Input: $3.00/1M Output: $8.00/1M Context: 128K
text json mode

Accounts/Fireworks/Models/Deepseek R1 0528

Accounts/Fireworks/Models/Deepseek R1 0528 is available via Fireworks AI with a 160K context window and up to 160,000 output tokens. Pricing: $3.00/1M input tokens, $8.00/1M output tokens.

Input: $3.00/1M Output: $8.00/1M Context: 160K
text json mode

Accounts/Fireworks/Models/Llama V3p1 405b Instruct

Accounts/Fireworks/Models/Llama V3p1 405b Instruct is available via Fireworks AI with a 128K context window and up to 16,384 output tokens. Pricing: $3.00/1M input tokens, $3.00/1M output tokens.

Input: $3.00/1M Output: $3.00/1M Context: 128K
text function calling json mode

Accounts/Fireworks/Models/Yi Large

Accounts/Fireworks/Models/Yi Large is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $3.00/1M output tokens.

Input: $3.00/1M Output: $3.00/1M Context: 33K
text json mode

Compare Fireworks AI model pricing

Use our pricing calculator to find the cheapest Fireworks AI model for your workload.

Pricing Calculator Compare Models All Models Directory

Related Reading

OpenAI vs Anthropic vs Google: Which AI API Should You Choose? → Cheapest LLM API in 2026: Complete Pricing Comparison → OpenAI API Pricing Guide 2026 →