Skip to content

LLM Model Directory

Compare pricing, context windows, and capabilities for 1813+ AI models from 53 providers. Prices shown per 1M tokens.

1813

Models

53

Providers

$0.000

Cheapest Input

10M

Largest Context

All Models by Price

Model Provider Input $/1M Output $/1M Context
Gemini Exp 1114 Google Gemini $0.000 $0.000 1.0M
Gemini Exp 1206 Google Gemini $0.000 $0.000 2.1M
Gemma 3 27b It Google Gemini $0.000 $0.000 131K
Learnlm 1.5 Pro Experimental Google Gemini $0.000 $0.000 33K
Lyria 3 Clip Preview Google Gemini $0.000 $0.000 131K
Lyria 3 Pro Preview Google Gemini $0.000 $0.000 131K
GigaChat 2 Lite Gigachat $0.000 $0.000 128K
GigaChat 2 Max Gigachat $0.000 $0.000 128K
GigaChat 2 Pro Gigachat $0.000 $0.000 128K
Qwen3 Coder 30B A3B Instruct GGUF Lemonade $0.000 $0.000 262K
Gpt Oss 20b Mxfp4 GGUF Lemonade $0.000 $0.000 131K
Gpt Oss 120b Mxfp GGUF Lemonade $0.000 $0.000 131K
Gemma 3 4b It GGUF Lemonade $0.000 $0.000 128K
Qwen3 4B Instruct 2507 GGUF Lemonade $0.000 $0.000 262K
Codestral 2405 Mistral $0.000 $0.000 32K
Codestral Latest Mistral $0.000 $0.000 32K
Codegeex4 Ollama $0.000 $0.000 33K
Deepseek Coder V2 Instruct Ollama $0.000 $0.000 33K
Deepseek Coder V2 Lite Instruct Ollama $0.000 $0.000 33K
Deepseek V3.1:671b Cloud Ollama $0.000 $0.000 164K
Gpt Oss:120b Cloud Ollama $0.000 $0.000 131K
Gpt Oss:20b Cloud Ollama $0.000 $0.000 131K
Internlm2 5 20b Chat Ollama $0.000 $0.000 33K
Llama2 Ollama $0.000 $0.000 4K
Llama2:13b Ollama $0.000 $0.000 4K
Llama2:70b Ollama $0.000 $0.000 4K
Llama2:7b Ollama $0.000 $0.000 4K
Llama3 Ollama $0.000 $0.000 8K
Llama3.1 Ollama $0.000 $0.000 8K
Llama3:70b Ollama $0.000 $0.000 8K
Llama3:8b Ollama $0.000 $0.000 8K
Mistral 7B Instruct V0.1 Ollama $0.000 $0.000 8K
Mistral 7B Instruct V0.2 Ollama $0.000 $0.000 33K
Mistral Large Instruct 2407 Ollama $0.000 $0.000 66K
Mixtral 8x22B Instruct V0.1 Ollama $0.000 $0.000 66K
Mixtral 8x7B Instruct V0.1 Ollama $0.000 $0.000 33K
Qwen3 Coder:480b Cloud Ollama $0.000 $0.000 262K
Openrouter/Auto OpenRouter $0.000 $0.000 2M
Openrouter/Free OpenRouter $0.000 $0.000 200K
Openrouter/Bodybuilder OpenRouter $0.000 $0.000 128K
Pplx 70b Online Perplexity $0.000 $2.80 4K
Pplx 7b Online Perplexity $0.000 $0.28 4K
Sonar Medium Online Perplexity $0.000 $1.80 12K
Sonar Small Online Perplexity $0.000 $0.28 12K
Swiss Ai/Apertus 8b Instruct Publicai $0.000 $0.000 8K
Swiss Ai/Apertus 70b Instruct Publicai $0.000 $0.000 8K
Aisingapore/Gemma SEA LION V4 27B IT Publicai $0.000 $0.000 8K
BSC LT/Salamandra 7b Instruct Tools 16k Publicai $0.000 $0.000 16K
BSC LT/ALIA 40b Instruct Q8 0 Publicai $0.000 $0.000 8K
Allenai/Olmo 3 7B Instruct Publicai $0.000 $0.000 33K
Aisingapore/Qwen SEA LION V4 32B IT Publicai $0.000 $0.000 33K
Allenai/Olmo 3 7B Think Publicai $0.000 $0.000 33K
Allenai/Olmo 3 32B Think Publicai $0.000 $0.000 33K
Meta Textgeneration Llama 2 13b F AWS SageMaker $0.000 $0.000 4K
Meta Textgeneration Llama 2 70b B F AWS SageMaker $0.000 $0.000 4K
Meta Textgeneration Llama 2 7b F AWS SageMaker $0.000 $0.000 4K
Sarvam M Sarvam $0.000 $0.000 8K
Meta/Llama 3.1 70b Instruct Maas Google Vertex AI $0.000 $0.000 128K
Meta/Llama 3.1 8b Instruct Maas Google Vertex AI $0.000 $0.000 128K
Meta/Llama 3.2 90b Vision Instruct Maas Google Vertex AI $0.000 $0.000 128K
Meta/Llama3 405b Instruct Maas Google Vertex AI $0.000 $0.000 32K
Meta/Llama3 70b Instruct Maas Google Vertex AI $0.000 $0.000 32K
Meta/Llama3 8b Instruct Maas Google Vertex AI $0.000 $0.000 32K
Deepseek V3 2 251201 Volcengine $0.000 $0.000 98K
Glm 4 7 251222 Volcengine $0.000 $0.000 205K
Kimi K2 Thinking 251104 Volcengine $0.000 $0.000 229K
Glm 4.5 Flash Zai $0.000 $0.000 128K
Accounts/Fireworks/Models/Flux 1 Dev Controlnet Union Fireworks AI $0.001 $0.001 4K
Qwen/Qwen2.5 Coder 7B Nebius $0.010 $0.030 33K
Llama3.2 11b Vision Instruct Lambda Ai $0.015 $0.025 131K
Llama3.2 3b Instruct Lambda Ai $0.015 $0.025 131K
Meta Llama/Llama 3.2 3B Instruct DeepInfra $0.020 $0.020 131K
Meta Llama/Meta Llama 3.1 8B Instruct Turbo DeepInfra $0.020 $0.030 131K
Mistralai/Mistral Nemo Instruct 2407 DeepInfra $0.020 $0.040 131K
Meta Llama/Llama Guard 3 8B Nebius $0.020 $0.060 128K
Meta Llama/Meta Llama 3.1 8B Instruct Nebius $0.020 $0.060 128K
Qwen/Qwen2 VL 7B Instruct Nebius $0.020 $0.060 131K
Paddlepaddle/Paddleocr Vl Novita AI $0.020 $0.020 16K
Meta Llama/Llama 3.1 8b Instruct Novita AI $0.020 $0.050 16K
Openai/Gpt Oss 20b OpenRouter $0.020 $0.10 131K
Hermes3 8b Lambda Ai $0.025 $0.040 131K
Lfm 7b Lambda Ai $0.025 $0.040 131K
Llama3.1 8b Instruct Lambda Ai $0.025 $0.040 131K
Meta Llama/Meta Llama 3 8B Instruct DeepInfra $0.030 $0.060 8K
Meta Llama/Meta Llama 3.1 8B Instruct DeepInfra $0.030 $0.050 131K
Llama 3.1 8b Llamagate $0.030 $0.050 131K
Gemma3 4b Llamagate $0.030 $0.080 128K
Deepseek/Deepseek Ocr Novita AI $0.030 $0.030 8K
Qwen/Qwen3 4b Fp8 Novita AI $0.030 $0.030 128K
Meta Llama/Llama 3.2 3b Instruct Novita AI $0.030 $0.050 33K
Nova Micro Amazon Nova $0.035 $0.14 128K
Amazon.Nova Micro AWS Bedrock $0.035 $0.14 128K
Us.Amazon.Nova Micro AWS Bedrock $0.035 $0.14 128K
Zai Org/Autoglm Phone 9b Multilingual Novita AI $0.035 $0.14 66K
Qwen/Qwen3 8b Fp8 Novita AI $0.035 $0.14 128K
Amazon/Nova Micro Vercel AI Gateway $0.035 $0.14 128K
Apac.Amazon.Nova Micro AWS Bedrock $0.037 $0.15 128K
Ministral 3b Azure AI $0.040 $0.040 128K
Google.Gemma 3 4b It AWS Bedrock $0.040 $0.080 128K
Mistral.Voxtral Mini 3b 2507 AWS Bedrock $0.040 $0.040 128K
Qwen/Qwen2.5 7B Instruct DeepInfra $0.040 $0.10 33K
Sao10K/L3 8B Lunaris V1 Turbo DeepInfra $0.040 $0.050 8K
Google/Gemma 3 4b It DeepInfra $0.040 $0.080 131K
Nvidia/NVIDIA Nemotron Nano 9B DeepInfra $0.040 $0.16 131K
Openai/Gpt Oss 20b DeepInfra $0.040 $0.15 131K
Llama 3.2 3b Llamagate $0.040 $0.080 131K
Qwen3 8b Llamagate $0.040 $0.14 33K
Mistralai/Mistral Nemo Instruct 2407 Nebius $0.040 $0.12 128K
Openai/Gpt Oss 20b Novita AI $0.040 $0.15 131K
Mistralai/Mistral Nemo Novita AI $0.040 $0.17 60K
Meta Llama/Llama 3 8b Instruct Novita AI $0.040 $0.040 8K
Gpt Oss 20b Ovhcloud $0.040 $0.15 131K
Meta Llama 3.2 1B Instruct SambaNova $0.040 $0.080 16K
Mistral/Ministral 3b Vercel AI Gateway $0.040 $0.040 128K
Eu.Amazon.Nova Micro AWS Bedrock $0.046 $0.18 128K
Meta Llama/Llama 3.2 11B Vision Instruct DeepInfra $0.049 $0.049 131K
Databricks Gpt 5 Nano Databricks $0.050 $0.40 272K
Gpt 5 Nano Azure OpenAI $0.050 $0.40 272K
Gpt 5 Nano 2025 08 07 Azure OpenAI $0.050 $0.40 272K
Qwen Turbo Dashscope $0.050 $0.20 129K
Qwen Turbo 2024 11 01 Dashscope $0.050 $0.20 1M
Qwen Turbo 2025 04 28 Dashscope $0.050 $0.20 1M
Qwen Turbo Latest Dashscope $0.050 $0.20 1M
Google/Gemma 3 12b It DeepInfra $0.050 $0.10 131K
Mistralai/Mistral Small 24B Instruct 2501 DeepInfra $0.050 $0.080 33K
Openai/Gpt Oss 120b DeepInfra $0.050 $0.45 131K
Accounts/Fireworks/Models/Gpt Oss 20b Fireworks AI $0.050 $0.20 131K
Llama 3.1 8b Instant Groq $0.050 $0.080 128K
Gemma 7b It Groq $0.050 $0.080 8K
Llama 4 Maverick 17b 128e Instruct Fp8 Lambda Ai $0.050 $0.10 131K
Llama 4 Scout 17b 16e Instruct Lambda Ai $0.050 $0.10 16K
Qwen25 Coder 32b Instruct Lambda Ai $0.050 $0.10 131K
Qwen3 32b Fp8 Lambda Ai $0.050 $0.10 131K
Openai/Gpt Oss 120b Novita AI $0.050 $0.25 131K
Google/Gemma 3 12b It Novita AI $0.050 $0.10 131K
Sao10k/L3 8b Lunaris Novita AI $0.050 $0.050 8K
Sao10K/L3 8B Stheno V3.2 Novita AI $0.050 $0.050 8K
Gpt 5 Nano OpenAI $0.050 $0.40 272K
Gpt 5 Nano 2025 08 07 OpenAI $0.050 $0.40 272K
Openai/Gpt 5 Nano OpenRouter $0.050 $0.40 272K
Meta/Llama 2 7b Replicate $0.050 $0.25 4K
Meta/Llama 2 7b Chat Replicate $0.050 $0.25 4K
Meta/Llama 3 8b Replicate $0.050 $0.25 8K
Meta/Llama 3 8b Instruct Replicate $0.050 $0.25 8K
Mistralai/Mistral 7b Instruct V0.2 Replicate $0.050 $0.25 4K
Mistralai/Mistral 7b V0.1 Replicate $0.050 $0.25 4K
Openai/Gpt Oss 20b Together AI $0.050 $0.20 128K
Meta/Llama 3 8b Vercel AI Gateway $0.050 $0.080 8K
Meta/Llama 3.1 8b Vercel AI Gateway $0.050 $0.080 131K
Eu/Gpt 5 Nano 2025 08 07 Azure OpenAI $0.055 $0.44 272K
Us/Gpt 5 Nano 2025 08 07 Azure OpenAI $0.055 $0.44 272K
Meta Llama/Llama Guard 3 8B DeepInfra $0.055 $0.055 131K
Nova Lite Amazon Nova $0.060 $0.24 300K
Amazon.Nova Lite AWS Bedrock $0.060 $0.24 300K
Nvidia.Nemotron Nano 9b AWS Bedrock $0.060 $0.23 128K
Nvidia.Nemotron Nano 3 30b AWS Bedrock $0.060 $0.24 262K
Us.Amazon.Nova Lite AWS Bedrock $0.060 $0.24 300K
Qwen/Qwen3 14B DeepInfra $0.060 $0.24 41K
Qwen2.5 Coder 7b Llamagate $0.060 $0.12 33K
Deepseek Coder 6.7b Llamagate $0.060 $0.12 16K
Codellama 7b Llamagate $0.060 $0.12 16K
Mistral Small Latest Mistral $0.060 $0.18 131K
Mistral Small 3 2 2506 Mistral $0.060 $0.18 131K
Google/Gemma 3 27b It Nebius $0.060 $0.20 128K
Qwen/Qwen2.5 32B Instruct Nebius $0.060 $0.20 128K
Deepseek/Deepseek R1 0528 Qwen3 8b Novita AI $0.060 $0.090 128K
Amazon/Nova Lite Vercel AI Gateway $0.060 $0.24 300K
Ibm/Granite 4 H Small IBM Watsonx $0.060 $0.25 20K
Apac.Amazon.Nova Lite AWS Bedrock $0.063 $0.25 300K
Openai.Gpt Oss 20b 1 AWS Bedrock $0.070 $0.30 128K
Openai.Gpt Oss Safeguard 20b AWS Bedrock $0.070 $0.20 128K
Zai.Glm 4.7 Flash AWS Bedrock $0.070 $0.40 200K
Databricks Gpt Oss 20b Databricks $0.070 $0.30 131K
Microsoft/Phi 4 DeepInfra $0.070 $0.14 16K
Qwen/Qwen3 Coder 30b A3b Instruct Novita AI $0.070 $0.27 160K
Baidu/Ernie 4.5 21B A3b Thinking Novita AI $0.070 $0.28 131K
Baichuan/Baichuan M2 32b Novita AI $0.070 $0.070 131K
Baidu/Ernie 4.5 21B A3b Novita AI $0.070 $0.28 120K
Qwen/Qwen2.5 7b Instruct Novita AI $0.070 $0.070 32K
Z Ai/Glm 4.7 Flash OpenRouter $0.070 $0.40 200K
Mistral 7b Instruct Perplexity $0.070 $0.28 4K
Mixtral 8x7b Instruct Perplexity $0.070 $0.28 4K
Pplx 7b Chat Perplexity $0.070 $0.28 8K
Sonar Small Chat Perplexity $0.070 $0.28 16K
Mistral/Devstral Small Vercel AI Gateway $0.070 $0.28 128K
Qwen/Qwen3 235b A22b 2507 OpenRouter $0.071 $0.10 262K
Phi 4 Mini Instruct Azure AI $0.075 $0.30 131K
Openai.Gpt Oss 20b Bedrock Mantle $0.075 $0.30 131K
Openai.Gpt Oss Safeguard 20b Bedrock Mantle $0.075 $0.30 131K
Mistralai/Mistral Small 3.2 24B Instruct 2506 DeepInfra $0.075 $0.20 128K
Gemini 2.0 Flash Lite Google Gemini $0.075 $0.30 1.0M
Gemini 2.0 Flash Lite 001 Google Gemini $0.075 $0.30 1.0M
Openai/Gpt Oss 20b Groq $0.075 $0.30 131K
Openai/Gpt Oss Safeguard 20b Groq $0.075 $0.30 131K
Google.Gemini 2.5 Flash Lite Oci $0.075 $0.30 1.0M
Google/Gemini 2.0 Flash Lite Vercel AI Gateway $0.075 $0.30 1.0M
Gemini 2.0 Flash Lite Google Vertex AI $0.075 $0.30 1.0M
Gemini 2.0 Flash Lite 001 Google Vertex AI $0.075 $0.30 1.0M
Openai/Gpt Oss 20b Maas Google Vertex AI $0.075 $0.30 131K
Eu.Amazon.Nova Lite AWS Bedrock $0.078 $0.31 300K
Phi 4 Multimodal Instruct Azure AI $0.080 $0.32 131K
Phi 4 Mini Reasoning Azure AI $0.080 $0.32 131K
Gryphe/MythoMax L2 13b DeepInfra $0.080 $0.090 4K
Qwen/Qwen3 30B A3B DeepInfra $0.080 $0.29 41K
Meta Llama/Llama 4 Scout 17B 16E Instruct DeepInfra $0.080 $0.30 328K
Dolphin3 8b Llamagate $0.080 $0.15 128K
Deepseek R1 7b Qwen Llamagate $0.080 $0.15 131K
Openthinker 7b Llamagate $0.080 $0.15 33K
Qwen/Qwen3 14B Nebius $0.080 $0.24 33K
Qwen/Qwen3 4B Nebius $0.080 $0.24 33K
Qwen/Qwen3 Vl 8b Instruct Novita AI $0.080 $0.50 131K
Qwen3 32B Ovhcloud $0.080 $0.23 32K
Gpt Oss 120b Ovhcloud $0.080 $0.40 131K
Meta Llama 3.2 3B Instruct SambaNova $0.080 $0.16 4K
Alibaba/Qwen 3 14b Vercel AI Gateway $0.080 $0.24 41K
Google.Gemma 3 12b It AWS Bedrock $0.090 $0.29 128K
Qwen/Qwen3 235B A22B Instruct 2507 DeepInfra $0.090 $0.60 262K
Google/Gemma 3 27b It DeepInfra $0.090 $0.16 131K
Qwen/Qwen3 235b A22b Instruct 2507 Novita AI $0.090 $0.58 131K
Qwen/Qwen3 30b A3b Fp8 Novita AI $0.090 $0.45 41K
Gryphe/Mythomax L2 13b Novita AI $0.090 $0.090 4K
Cohere.Command A Translate 08 2025 Oci $0.090 $0.090 256K
Xiaomi/Mimo V2 Flash OpenRouter $0.090 $0.29 262K
Mistral Small 3.2 24B Instruct 2506 Ovhcloud $0.090 $0.28 128K
Gpt 4.1 Nano Azure OpenAI $0.10 $0.40 1.0M
Gpt 4.1 Nano 2025 04 14 Azure OpenAI $0.10 $0.40 1.0M
Mistral Small 2503 Azure AI $0.10 $0.30 128K
Meta.Llama3 2 1b Instruct AWS Bedrock $0.10 $0.10 128K
Mistral.Ministral 3 3b Instruct AWS Bedrock $0.10 $0.10 128K
Mistral.Voxtral Small 24b 2507 AWS Bedrock $0.10 $0.30 128K
Us.Meta.Llama3 2 1b Instruct AWS Bedrock $0.10 $0.10 128K
Llama3.1 8b Cerebras $0.10 $0.10 128K
Qwen/Qwen3 32B DeepInfra $0.10 $0.28 41K
Google/Gemini 2.0 Flash 001 DeepInfra $0.10 $0.40 1M
Meta Llama/Meta Llama 3.1 70B Instruct Turbo DeepInfra $0.10 $0.28 131K
Nvidia/Llama 3.3 Nemotron Super 49B V1.5 DeepInfra $0.10 $0.40 131K
Accounts/Fireworks/Models/Llama V3p1 8b Instruct Fireworks AI $0.10 $0.10 16K
Accounts/Fireworks/Models/Llama V3p2 1b Instruct Fireworks AI $0.10 $0.10 16K
Accounts/Fireworks/Models/Llama V3p2 3b Instruct Fireworks AI $0.10 $0.10 16K
Accounts/Fireworks/Models/Codegemma 2b Fireworks AI $0.10 $0.10 8K
Accounts/Fireworks/Models/Cogito V1 Preview Llama 3b Fireworks AI $0.10 $0.10 131K
Accounts/Fireworks/Models/Deepseek Coder 1b Base Fireworks AI $0.10 $0.10 16K
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 1p5b Fireworks AI $0.10 $0.10 131K
Accounts/Fireworks/Models/Ernie 4p5 21b A3b Pt Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Ernie 4p5 300b A47b Pt Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Flux 1 Dev Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Flux 1 Schnell Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Gemma 2b It Fireworks AI $0.10 $0.10 8K
Accounts/Fireworks/Models/Llama Guard 3 1b Fireworks AI $0.10 $0.10 131K
Accounts/Fireworks/Models/Llama V2 70b Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Llama V3p1 405b Instruct Long Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Llama V3p1 70b Instruct 1b Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Llama V3p2 1b Fireworks AI $0.10 $0.10 131K
Accounts/Fireworks/Models/Llama V3p2 3b Fireworks AI $0.10 $0.10 131K
Accounts/Fireworks/Models/Minimax M1 80k Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Ministral 3 3b Instruct 2512 Fireworks AI $0.10 $0.10 256K
Accounts/Fireworks/Models/Nemotron Nano V2 12b Vl Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Phi 2 3b Fireworks AI $0.10 $0.10 2K
Accounts/Fireworks/Models/Phi 3 Mini 128k Instruct Fireworks AI $0.10 $0.10 131K
Accounts/Fireworks/Models/Qwen2 Vl 2b Instruct Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen2p5 0p5b Instruct Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen2p5 1p5b Instruct Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Instruct Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Instruct Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 3b Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 3b Instruct Fireworks AI $0.10 $0.10 33K
Accounts/Fireworks/Models/Qwen3 0p6b Fireworks AI $0.10 $0.10 41K
Accounts/Fireworks/Models/Qwen3 1p7b Fireworks AI $0.10 $0.10 131K
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft Fireworks AI $0.10 $0.10 262K
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 131072 Fireworks AI $0.10 $0.10 131K
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 40960 Fireworks AI $0.10 $0.10 41K
Accounts/Fireworks/Models/Stablecode 3b Fireworks AI $0.10 $0.10 4K
Accounts/Fireworks/Models/Starcoder2 3b Fireworks AI $0.10 $0.10 16K
Meta Llama 3.1 8b Instruct FriendliAI $0.10 $0.10 8K
Gemini 2.0 Flash NEW Google Gemini $0.10 $0.40 1.0M
Gemini 2.0 Flash 001 Google Gemini $0.10 $0.40 1.0M
Gemini 2.5 Flash Lite Google Gemini $0.10 $0.40 1.0M
Gemini 2.5 Flash Lite Preview 09 2025 Google Gemini $0.10 $0.40 1.0M
Gemini Flash Lite Latest Google Gemini $0.10 $0.40 1.0M
Gemini 2.5 Flash Lite Preview 06 17 Google Gemini $0.10 $0.40 1.0M
Gemini Flash Lite Latest Google Gemini $0.10 $0.40 1.0M
Lfm 40b Lambda Ai $0.10 $0.20 131K
Mistral 7b V0.3 Llamagate $0.10 $0.15 33K
Deepseek R1 8b Llamagate $0.10 $0.20 66K
Llava 7b Llamagate $0.10 $0.20 4K
Devstral Small 2505 Mistral $0.10 $0.30 128K
Devstral Small 2507 Mistral $0.10 $0.30 128K
Devstral Small Latest Mistral $0.10 $0.30 256K
Labs Devstral Small 2512 Mistral $0.10 $0.30 256K
Mistral Small Mistral $0.10 $0.30 32K
Ministral 3 3b 2512 Mistral $0.10 $0.10 131K
Nvidia/Llama 3.3 Nemotron Super 49B Nebius $0.10 $0.40 131K
Qwen/Qwen3 32B Nebius $0.10 $0.30 33K
Qwen/Qwen3 30B A3B Nebius $0.10 $0.30 33K
Xiaomimimo/Mimo V2 Flash Novita AI $0.10 $0.30 262K
Qwen/Qwen3 32b Fp8 Novita AI $0.10 $0.45 41K
Gpt 4.1 Nano NEW OpenAI $0.10 $0.40 1.0M
Gpt 4.1 Nano 2025 04 14 OpenAI $0.10 $0.40 1.0M
Bytedance/Ui Tars 1.5 7b OpenRouter $0.10 $0.20 131K
Google/Gemini 2.0 Flash 001 OpenRouter $0.10 $0.40 1.0M
Mistralai/Ministral 3b 2512 OpenRouter $0.10 $0.10 131K
Openai/Gpt 4.1 Nano OpenRouter $0.10 $0.40 1.0M
Qwen/Qwen3.5 Flash 02 23 OpenRouter $0.10 $0.40 1M
Llama 3.1 8B Instruct Ovhcloud $0.10 $0.10 131K
Mistral 7B Instruct V0.3 Ovhcloud $0.10 $0.10 127K
Meta/Llama 2 13b Replicate $0.10 $0.50 4K
Meta/Llama 2 13b Chat Replicate $0.10 $0.50 4K
Meta Llama 3.1 8B Instruct SambaNova $0.10 $0.20 16K
Alibaba/Qwen 3 30b Vercel AI Gateway $0.10 $0.30 41K
Alibaba/Qwen 3 32b Vercel AI Gateway $0.10 $0.30 41K
Meta/Llama 3.2 1b Vercel AI Gateway $0.10 $0.10 128K
Meta/Llama 4 Scout Vercel AI Gateway $0.10 $0.30 131K
Mistral/Ministral 8b Vercel AI Gateway $0.10 $0.10 128K
Mistral/Mistral Small Vercel AI Gateway $0.10 $0.30 32K
Openai/Gpt 4.1 Nano Vercel AI Gateway $0.10 $0.40 1.0M
Gemini 2.0 Flash Google Vertex AI $0.10 $0.40 1.0M
Gemini 2.5 Flash Lite Google Vertex AI $0.10 $0.40 1.0M
Gemini 2.5 Flash Lite Preview 09 2025 Google Vertex AI $0.10 $0.40 1.0M
Gemini 2.5 Flash Lite Preview 06 17 Google Vertex AI $0.10 $0.40 1.0M
Ibm/Granite Guardian 3 2 2b IBM Watsonx $0.10 $0.10 8K
Ibm/Granite Vision 3 2 2b IBM Watsonx $0.10 $0.10 8K
Meta Llama/Llama 3 2 1b Instruct IBM Watsonx $0.10 $0.10 128K
Mistralai/Mistral Small 2503 IBM Watsonx $0.10 $0.30 32K
Mistralai/Mistral Small 3 1 24b Instruct 2503 IBM Watsonx $0.10 $0.30 32K
Glm 4 32b 0414 128k Zai $0.10 $0.10 128K
Us/Gpt 4.1 Nano 2025 04 14 Azure OpenAI $0.11 $0.44 1.0M
Meta Llama/Llama 4 Scout 17b 16e Instruct Groq $0.11 $0.34 131K
Qwen/Qwen3 235b A22b Thinking 2507 OpenRouter $0.11 $0.60 262K
Google/Gemma 3 27b It Novita AI $0.12 $0.20 98K
Qwen/Qwen2.5 72B Instruct DeepInfra $0.12 $0.39 33K
NousResearch/Hermes 3 Llama 3.1 70B Hyperbolic $0.12 $0.30 33K
Qwen/Qwen2.5 72B Instruct Hyperbolic $0.12 $0.30 131K
Qwen/Qwen2.5 Coder 32B Instruct Hyperbolic $0.12 $0.30 33K
Meta Llama/Llama 3.2 3B Instruct Hyperbolic $0.12 $0.30 33K
Meta Llama/Llama 3.3 70B Instruct Hyperbolic $0.12 $0.30 131K
Meta Llama/Meta Llama 3 70B Instruct Hyperbolic $0.12 $0.30 131K
Meta Llama/Meta Llama 3.1 405B Instruct Hyperbolic $0.12 $0.30 33K
Meta Llama/Meta Llama 3.1 70B Instruct Hyperbolic $0.12 $0.30 33K
Meta Llama/Meta Llama 3.1 8B Instruct Hyperbolic $0.12 $0.30 33K
Hermes3 70b Lambda Ai $0.12 $0.30 131K
Llama3.1 70b Instruct Fp8 Lambda Ai $0.12 $0.30 131K
Llama3.1 Nemotron 70b Instruct Fp8 Lambda Ai $0.12 $0.30 131K
Llama3.3 70b Instruct Fp8 Lambda Ai $0.12 $0.30 131K
Phi 4 Azure AI $0.13 $0.50 16K
Phi 4 Reasoning Azure AI $0.13 $0.50 33K
Chat Bison Palm $0.13 $0.13 8K
Chat Bison 001 Palm $0.13 $0.13 8K
Phi 3 Mini 128k Instruct Azure AI $0.13 $0.52 128K
Phi 3 Mini 4k Instruct Azure AI $0.13 $0.52 4K
Phi 3.5 Mini Instruct Azure AI $0.13 $0.52 128K
Phi 3.5 Vision Instruct Azure AI $0.13 $0.52 128K
Eu.Meta.Llama3 2 1b Instruct AWS Bedrock $0.13 $0.13 128K
Meta Llama/Llama 3.3 70B Instruct Turbo DeepInfra $0.13 $0.39 131K
Meta Llama/Llama 3.3 70B Instruct Nebius $0.13 $0.40 128K
Meta Llama/Meta Llama 3.1 70B Instruct Nebius $0.13 $0.40 128K
Qwen/Qwen2.5 72B Instruct Nebius $0.13 $0.40 128K
Qwen/Qwen2.5 VL 72B Instruct Nebius $0.13 $0.40 131K
Qwen/Qwen2 VL 72B Instruct Nebius $0.13 $0.40 131K
Zai Org/Glm 4.5 Air Novita AI $0.13 $0.85 131K
Mistral Nemo Instruct 2407 Ovhcloud $0.13 $0.13 118K
Meta Llama/Llama 3.3 70b Instruct Novita AI $0.14 $0.40 131K
Qwen/Qwen3 Next 80B A3B Instruct DeepInfra $0.14 $1.40 262K
Qwen/Qwen3 Next 80B A3B Thinking DeepInfra $0.14 $1.40 262K
Deepseek Coder DeepSeek $0.14 $0.28 128K
Nousresearch/Hermes 2 Pro Llama 3 8b Novita AI $0.14 $0.14 8K
Baidu/Ernie 4.5 Vl 28b A3b Novita AI $0.14 $0.56 30K
Deepseek/Deepseek Chat OpenRouter $0.14 $0.28 66K
Deepseek/Deepseek Chat V3 0324 OpenRouter $0.14 $0.28 66K
HuggingFaceH4/Zephyr 7b Beta Anyscale $0.15 $0.15 16K
Google/Gemma 7b It Anyscale $0.15 $0.15 8K
Meta Llama/Llama 2 7b Chat Hf Anyscale $0.15 $0.15 4K
Meta Llama/Meta Llama 3 8B Instruct Anyscale $0.15 $0.15 8K
Mistralai/Mistral 7B Instruct V0.1 Anyscale $0.15 $0.15 16K
Mistralai/Mixtral 8x7B Instruct V0.1 Anyscale $0.15 $0.15 16K
Global Standard/Gpt 4o Mini Azure OpenAI $0.15 $0.60 128K
Gpt Oss 120b Azure AI $0.15 $0.60 131K
Phi 3 Small 128k Instruct Azure AI $0.15 $0.60 128K
Phi 3 Small 8k Instruct Azure AI $0.15 $0.60 8K
Mistral Nemo Azure AI $0.15 $0.15 131K
Us East 1/Mistral.Mistral 7b Instruct AWS Bedrock $0.15 $0.20 32K
Us West 2/Mistral.Mistral 7b Instruct AWS Bedrock $0.15 $0.20 32K
Meta.Llama3 2 3b Instruct AWS Bedrock $0.15 $0.15 128K
Mistral.Ministral 3 8b Instruct AWS Bedrock $0.15 $0.15 128K
Mistral.Mistral 7b Instruct AWS Bedrock $0.15 $0.20 32K
Nvidia.Nemotron Super 3 120b AWS Bedrock $0.15 $0.65 256K
Openai.Gpt Oss 120b 1 AWS Bedrock $0.15 $0.60 128K
Openai.Gpt Oss Safeguard 120b AWS Bedrock $0.15 $0.60 128K
Qwen.Qwen3 Coder 30b A3b AWS Bedrock $0.15 $0.60 262K
Qwen.Qwen3 32b AWS Bedrock $0.15 $0.60 131K
Qwen.Qwen3 Next 80b A3b AWS Bedrock $0.15 $1.20 128K
Us.Meta.Llama3 2 3b Instruct AWS Bedrock $0.15 $0.15 128K
Openai.Gpt Oss 120b Bedrock Mantle $0.15 $0.60 131K
Openai.Gpt Oss Safeguard 120b Bedrock Mantle $0.15 $0.60 131K
Command R Cohere $0.15 $0.60 128K
Command R 08 2024 Cohere $0.15 $0.60 128K
Command R7b 12 2024 Cohere $0.15 $0.037 128K
Qwen3 Next 80b A3b Instruct Dashscope $0.15 $1.20 262K
Qwen3 Next 80b A3b Thinking Dashscope $0.15 $1.20 262K
Qwen/QwQ 32B DeepInfra $0.15 $0.40 131K
Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 DeepInfra $0.15 $0.60 1.0M
Accounts/Fireworks/Models/Gpt Oss 120b Fireworks AI $0.15 $0.60 131K
Accounts/Fireworks/Models/Llama4 Scout Instruct Basic Fireworks AI $0.15 $0.60 131K
Accounts/Fireworks/Models/Qwen3 30b A3b Fireworks AI $0.15 $0.60 131K
Accounts/Fireworks/Models/Qwen3 Coder 30b A3b Instruct Fireworks AI $0.15 $0.60 262K
Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Instruct Fireworks AI $0.15 $0.60 262K
Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Thinking Fireworks AI $0.15 $0.60 262K
Openai/Gpt 4o Mini Gmi $0.15 $0.60 131K
Openai/Gpt Oss 120b Groq $0.15 $0.60 131K
Qwen3 Vl 8b Llamagate $0.15 $0.55 33K
Ministral 3 8b 2512 Mistral $0.15 $0.15 262K
Pixtral 12b 2409 Mistral $0.15 $0.15 128K
Qwen/QwQ 32B Nebius $0.15 $0.45 33K
Qwen/Qwen3 Next 80b A3b Instruct Novita AI $0.15 $1.50 131K
Qwen/Qwen3 Next 80b A3b Thinking Novita AI $0.15 $1.50 131K
Deepseek/Deepseek R1 Distill Qwen 14b Novita AI $0.15 $0.15 33K
Cohere.Command R 08 2024 Oci $0.15 $0.15 128K
Google.Gemini 2.5 Flash Oci $0.15 $0.60 1.0M
Gpt 4o Mini OpenAI $0.15 $0.60 128K
Gpt 4o Mini 2024 07 18 OpenAI $0.15 $0.60 128K
Gpt 4o Mini Audio Preview OpenAI $0.15 $0.60 128K
Gpt 4o Mini Audio Preview 2024 12 17 OpenAI $0.15 $0.60 128K
Gpt 4o Mini Search Preview OpenAI $0.15 $0.60 128K
Gpt 4o Mini Search Preview 2025 03 11 OpenAI $0.15 $0.60 128K
Mistralai/Devstral 2512 OpenRouter $0.15 $0.60 262K
Mistralai/Ministral 8b 2512 OpenRouter $0.15 $0.15 262K
Openai/Gpt Oss 120b Together AI $0.15 $0.60 128K
Qwen/Qwen3 Next 80B A3B Instruct Together AI $0.15 $1.50 262K
Qwen/Qwen3 Next 80B A3B Thinking Together AI $0.15 $1.50 262K
Cohere/Command R Vercel AI Gateway $0.15 $0.60 128K
Google/Gemini 2.0 Flash Vercel AI Gateway $0.15 $0.60 1.0M
Meta/Llama 3.2 3b Vercel AI Gateway $0.15 $0.15 128K
Mistral/Pixtral 12b Vercel AI Gateway $0.15 $0.15 128K
Openai/Gpt 4o Mini Vercel AI Gateway $0.15 $0.60 128K
Gemini 2.0 Flash 001 Google Vertex AI $0.15 $0.60 1.0M
Mistral Nemo@Latest Google Vertex AI $0.15 $0.15 128K
Openai/Gpt Oss 120b Maas Google Vertex AI $0.15 $0.60 131K
Qwen/Qwen3 Next 80b A3b Instruct Maas Google Vertex AI $0.15 $1.20 262K
Qwen/Qwen3 Next 80b A3b Thinking Maas Google Vertex AI $0.15 $1.20 262K
Meta Llama/Llama 3 2 3b Instruct IBM Watsonx $0.15 $0.15 128K
Openai/Gpt Oss 120b IBM Watsonx $0.15 $0.60 8K
Databricks Gemma 3 12b Databricks $0.15 $0.50 128K
Databricks Gpt Oss 120b Databricks $0.15 $0.60 131K
Databricks Meta Llama 3 1 8b Instruct Databricks $0.15 $0.45 200K
Phi 3.5 MoE Instruct Azure AI $0.16 $0.64 128K
Qwen3 Vl 32b Instruct Dashscope $0.16 $0.64 131K
Qwen3 Vl 32b Thinking Dashscope $0.16 $2.87 131K
Meta/Llama 3.2 11b Vercel AI Gateway $0.16 $0.16 128K
Eu/Gpt 4o Mini 2024 07 18 Azure OpenAI $0.17 $0.66 128K
Gpt 4o Mini Azure OpenAI $0.17 $0.66 128K
Gpt 4o Mini 2024 07 18 Azure OpenAI $0.17 $0.66 128K
Us/Gpt 4o Mini 2024 07 18 Azure OpenAI $0.17 $0.66 128K
Phi 3 Medium 128k Instruct Azure AI $0.17 $0.68 128K
Phi 3 Medium 4k Instruct Azure AI $0.17 $0.68 4K
Meta.Llama4 Scout 17b Instruct AWS Bedrock $0.17 $0.66 128K
Us.Meta.Llama4 Scout 17b Instruct AWS Bedrock $0.17 $0.66 128K
Qwen/Qwen3 235B A22B DeepInfra $0.18 $0.54 41K
Meta Llama/Llama Guard 4 12B DeepInfra $0.18 $0.18 164K
Meta Llama/Llama 4 Scout 17b 16e Instruct Novita AI $0.18 $0.59 131K
Openai/Gpt Oss 120b OpenRouter $0.18 $0.80 131K
Qwen/Qwen 2.5 Coder 32b Instruct OpenRouter $0.18 $0.18 34K
Eu.Meta.Llama3 2 3b Instruct AWS Bedrock $0.19 $0.19 128K
Mamba Codestral 7B V0.1 Ovhcloud $0.19 $0.19 256K
Jamba 1.5 AI21 $0.20 $0.40 256K
Jamba 1.5 Mini AI21 $0.20 $0.40 256K
Jamba 1.5 Mini AI21 $0.20 $0.40 256K
Jamba Mini 1.6 AI21 $0.20 $0.40 256K
Jamba Mini 1.7 AI21 $0.20 $0.40 256K
Gpt 5.4 Nano Azure OpenAI $0.20 $1.25 1.1M
Llama 4 Scout 17B 16E Instruct Azure AI $0.20 $0.78 10M
Grok 4 Fast Non Reasoning Azure AI $0.20 $0.50 131K
Grok 4 Fast Reasoning Azure AI $0.20 $0.50 131K
Grok 4 1 Fast Non Reasoning Azure AI $0.20 $0.50 131K
Grok 4 1 Fast Reasoning Azure AI $0.20 $0.50 131K
Grok Code Fast 1 Azure AI $0.20 $1.50 131K
Ai21.Jamba 1 5 Mini AWS Bedrock $0.20 $0.40 256K
Eu West 3/Mistral.Mistral 7b Instruct AWS Bedrock $0.20 $0.26 32K
Mistral.Ministral 3 14b Instruct AWS Bedrock $0.20 $0.20 128K
Nvidia.Nemotron Nano 12b AWS Bedrock $0.20 $0.60 128K
Qwen/Qwen2.5 VL 32B Instruct DeepInfra $0.20 $0.60 128K
Deepseek Ai/DeepSeek R1 Distill Llama 70B DeepInfra $0.20 $0.60 131K
Accounts/Fireworks/Models/Llama V3p2 11b Vision Instruct Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Chronos Hermes 13b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Code Llama 13b Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Code Llama 13b Instruct Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Code Llama 13b Python Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Code Llama 7b Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Code Llama 7b Instruct Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Code Llama 7b Python Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Code Qwen 1p5 7b Fireworks AI $0.20 $0.20 66K
Accounts/Fireworks/Models/Codegemma 7b Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Cogito V1 Preview Llama 8b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Cogito V1 Preview Qwen 14b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Deepseek Coder 7b Base Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Deepseek Coder 7b Base V1p5 Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Deepseek Coder 7b Instruct V1p5 Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Deepseek R1 0528 Distill Qwen3 8b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Deepseek R1 Distill Llama 8b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 14b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 7b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Dobby Mini Unhinged Plus Llama 3 1 8b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Firellava 13b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Firesearch Ocr Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Gemma 7b Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Gemma 7b It Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Gemma2 9b It Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Hermes 2 Pro Mistral 7b Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Internvl3 8b Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Llama Guard 2 8b Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Llama Guard 3 8b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Llama V2 13b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Llama V2 13b Chat Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Llama V2 7b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Llama V2 7b Chat Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Llama V3 8b Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Llama V3 8b Instruct Hf Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Llamaguard 7b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Ministral 3 14b Instruct 2512 Fireworks AI $0.20 $0.20 256K
Accounts/Fireworks/Models/Ministral 3 8b Instruct 2512 Fireworks AI $0.20 $0.20 256K
Accounts/Fireworks/Models/Mistral 7b Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Mistral 7b Instruct 4k Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Mistral 7b Instruct V0p2 Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Mistral 7b Instruct Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Mistral 7b V0p2 Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Mistral Nemo Base 2407 Fireworks AI $0.20 $0.20 128K
Accounts/Fireworks/Models/Mistral Nemo Instruct 2407 Fireworks AI $0.20 $0.20 128K
Accounts/Fireworks/Models/Mythomax L2 13b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Nous Capybara 7b V1p9 Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Nous Hermes Llama2 13b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Nous Hermes Llama2 7b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Nvidia Nemotron Nano 12b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Nvidia Nemotron Nano 9b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Openchat 3p5 0106 7b Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Openhermes 2 Mistral 7b Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Openhermes 2p5 Mistral 7b Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Openorca 7b Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Phi 3 Vision 128k Instruct Fireworks AI $0.20 $0.20 32K
Accounts/Fireworks/Models/Pythia 12b Fireworks AI $0.20 $0.20 2K
Accounts/Fireworks/Models/Qwen V2p5 14b Instruct Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Qwen V2p5 7b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Qwen2 7b Instruct Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Qwen2 Vl 7b Instruct Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Qwen2p5 14b Fireworks AI $0.20 $0.20 131K
Accounts/Fireworks/Models/Qwen2p5 7b Instruct Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 14b Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 14b Instruct Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 7b Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 7b Instruct Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Qwen2p5 Vl 3b Instruct Fireworks AI $0.20 $0.20 128K
Accounts/Fireworks/Models/Qwen2p5 Vl 7b Instruct Fireworks AI $0.20 $0.20 128K
Accounts/Fireworks/Models/Qwen3 14b Fireworks AI $0.20 $0.20 41K
Accounts/Fireworks/Models/Qwen3 4b Fireworks AI $0.20 $0.20 41K
Accounts/Fireworks/Models/Qwen3 4b Instruct 2507 Fireworks AI $0.20 $0.20 262K
Accounts/Fireworks/Models/Qwen3 8b Fireworks AI $0.20 $0.20 41K
Accounts/Fireworks/Models/Qwen3 Vl 8b Instruct Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Rolm Ocr Fireworks AI $0.20 $0.20 128K
Accounts/Fireworks/Models/Snorkel Mistral 7b Pairrm Dpo Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Starcoder 16b Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Starcoder 7b Fireworks AI $0.20 $0.20 8K
Accounts/Fireworks/Models/Starcoder2 15b Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Starcoder2 7b Fireworks AI $0.20 $0.20 16K
Accounts/Fireworks/Models/Toppy M 7b Fireworks AI $0.20 $0.20 33K
Accounts/Fireworks/Models/Yi 6b Fireworks AI $0.20 $0.20 4K
Accounts/Fireworks/Models/Zephyr 7b Beta Fireworks AI $0.20 $0.20 33K
Meta Llama/Llama Guard 4 12b Groq $0.20 $0.20 8K
Meta Llama/Llama 4 Maverick 17b 128e Instruct Groq $0.20 $0.60 131K
Qwen/QwQ 32B Hyperbolic $0.20 $0.20 131K
Deepseek Ai/DeepSeek V3 Hyperbolic $0.20 $0.20 33K
Deepseek Llama3.3 70b Lambda Ai $0.20 $0.60 131K
Deepseek R1 0528 Lambda Ai $0.20 $0.60 131K
Deepseek V3 0324 Lambda Ai $0.20 $0.60 131K
Ministral 3 14b 2512 Mistral $0.20 $0.20 262K
Kimi Latest 8k Moonshot $0.20 $2.00 8K
Moonshot V1 8k Moonshot $0.20 $2.00 8K
Moonshot V1 8k 0430 Moonshot $0.20 $2.00 8K
Moonshot V1 8k Vision Preview Moonshot $0.20 $2.00 8K
Qwen/Qwen3 235B A22B Nebius $0.20 $0.60 262K
Skywork/R1v4 Lite Novita AI $0.20 $0.60 262K
Qwen/Qwen3 235b A22b Fp8 Novita AI $0.20 $0.80 41K
Qwen/Qwen3 Vl 30b A3b Instruct Novita AI $0.20 $0.70 131K
Qwen/Qwen3 Vl 30b A3b Thinking Novita AI $0.20 $1.00 131K
Ft:Gpt 4.1 Nano 2025 04 14 OpenAI $0.20 $0.80 1.0M
Gpt 5.4 Nano OpenAI $0.20 $1.25 272K
Deepseek/Deepseek Chat V3.1 OpenRouter $0.20 $0.80 164K
Deepseek/Deepseek V3.2 Exp OpenRouter $0.20 $0.40 164K
Mistralai/Ministral 14b 2512 OpenRouter $0.20 $0.20 262K
Llama 3.1 8b Instruct Perplexity $0.20 $0.20 131K
Qwen/Qwen3 235B A22B Instruct 2507 Tput Together AI $0.20 $6.00 262K
Qwen/Qwen3 235B A22B Fp8 Tput Together AI $0.20 $0.60 40K
Zai Org/GLM 4.5 Air FP8 Together AI $0.20 $1.10 128K
Alibaba/Qwen 3 235b Vercel AI Gateway $0.20 $0.60 41K
Google/Gemma 2 9b Vercel AI Gateway $0.20 $0.20 8K
Meta/Llama 4 Maverick Vercel AI Gateway $0.20 $0.60 131K
Zai/Glm 4.5 Air Vercel AI Gateway $0.20 $1.10 128K
Codestral 2501 Google Vertex AI $0.20 $0.60 128K
Codestral Google Vertex AI $0.20 $0.60 128K
Codestral@Latest Google Vertex AI $0.20 $0.60 128K
Jamba 1.5 Google Vertex AI $0.20 $0.40 256K
Jamba 1.5 Mini Google Vertex AI $0.20 $0.40 256K
Jamba 1.5 Mini Google Vertex AI $0.20 $0.40 256K
Ibm/Granite 3 8b Instruct IBM Watsonx $0.20 $0.20 8K
Ibm/Granite 3 3 8b Instruct IBM Watsonx $0.20 $0.20 8K
Ibm/Granite Guardian 3 3 8b IBM Watsonx $0.20 $0.20 8K
Grok 4 Fast Reasoning Xai $0.20 $0.50 2M
Grok 4 Fast Non Reasoning Xai $0.20 $0.50 2M
Grok 4 1 Fast Xai $0.20 $0.50 2M
Grok 4 1 Fast Reasoning Xai $0.20 $0.50 2M
Grok 4 1 Fast Reasoning Latest Xai $0.20 $0.50 2M
Grok 4 1 Fast Non Reasoning Xai $0.20 $0.50 2M
Grok 4 1 Fast Non Reasoning Latest Xai $0.20 $0.50 2M
Grok Code Fast Xai $0.20 $1.50 256K
Grok Code Fast 1 Xai $0.20 $1.50 256K
Grok Code Fast 1 0825 Xai $0.20 $1.50 256K
Glm 4.5 Air Zai $0.20 $1.10 128K
Qwen/Qwen Vl Plus OpenRouter $0.21 $0.63 8K
Meta.Llama3 1 8b Instruct AWS Bedrock $0.22 $0.22 128K
Qwen.Qwen3 Coder 480b A35b AWS Bedrock $0.22 $1.80 262K
Qwen.Qwen3 235b A22b 2507 AWS Bedrock $0.22 $0.88 262K
Us.Meta.Llama3 1 8b Instruct AWS Bedrock $0.22 $0.22 128K
Accounts/Fireworks/Models/Glm 4p5 Air Fireworks AI $0.22 $0.88 128K
Accounts/Fireworks/Models/Llama4 Maverick Instruct Basic Fireworks AI $0.22 $0.88 131K
Accounts/Fireworks/Models/Qwen3 235b A22b Fireworks AI $0.22 $0.88 131K
Accounts/Fireworks/Models/Qwen3 235b A22b Instruct 2507 Fireworks AI $0.22 $0.88 262K
Accounts/Fireworks/Models/Qwen3 235b A22b Thinking 2507 Fireworks AI $0.22 $0.88 262K
Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Instruct Fireworks AI $0.22 $0.88 262K
Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Thinking Fireworks AI $0.22 $0.88 262K
Qwen/Qwen3 Coder OpenRouter $0.22 $0.95 262K
Google.Gemma 3 27b It AWS Bedrock $0.23 $0.38 128K
Meta Llama/Llama 3.3 70B Instruct DeepInfra $0.23 $0.40 131K
Meta.Llama4 Maverick 17b Instruct AWS Bedrock $0.24 $0.97 128K
Us.Meta.Llama4 Maverick 17b Instruct AWS Bedrock $0.24 $0.97 128K
Databricks Gpt 5 Mini Databricks $0.25 $2.00 272K
Claude 3 Haiku Anthropic $0.25 $1.25 200K
Meta Llama/Llama 2 13b Chat Hf Anyscale $0.25 $0.25 4K
Gpt 5 Mini Azure OpenAI $0.25 $2.00 272K
Gpt 5 Mini 2025 08 07 Azure OpenAI $0.25 $2.00 272K
Global/Grok 3 Mini Azure AI $0.25 $1.27 131K
Grok 3 Mini Azure AI $0.25 $1.27 131K
Anthropic.Claude 3 Haiku 20240307 AWS Bedrock $0.25 $1.25 200K
Apac.Anthropic.Claude 3 Haiku 20240307 AWS Bedrock $0.25 $1.25 200K
Eu.Anthropic.Claude 3 5 Haiku 20241022 AWS Bedrock $0.25 $1.25 200K
Eu.Anthropic.Claude 3 Haiku 20240307 AWS Bedrock $0.25 $1.25 200K
Us.Anthropic.Claude 3 Haiku 20240307 AWS Bedrock $0.25 $1.25 200K
Deepseek Ai/DeepSeek V3 0324 DeepInfra $0.25 $0.88 164K
Gemini 3.1 Flash Lite Preview Google Gemini $0.25 $1.50 1.0M
Deepseek Ai/DeepSeek R1 0528 Hyperbolic $0.25 $0.25 131K
Codestral Mamba Latest Mistral $0.25 $0.25 256K
Mistral Tiny Mistral $0.25 $0.25 32K
Open Codestral Mamba Mistral $0.25 $0.25 256K
Open Mistral 7b Mistral $0.25 $0.25 32K
Deepseek Ai/DeepSeek R1 Distill Llama 70B Nebius $0.25 $0.75 128K
Qwen/Qwen3 Omni 30b A3b Thinking Novita AI $0.25 $0.97 66K
Qwen/Qwen3 Omni 30b A3b Instruct Novita AI $0.25 $0.97 66K
Qwen/Qwen Mt Plus Novita AI $0.25 $0.75 16K
Gpt 5 Mini OpenAI $0.25 $2.00 272K
Gpt 5 Mini 2025 08 07 OpenAI $0.25 $2.00 272K
Openai/Gpt 5 Mini OpenRouter $0.25 $2.00 272K
Qwen/Qwen3.5 35b A3b OpenRouter $0.25 $2.00 262K
Anthropic/Claude 3 Haiku Vercel AI Gateway $0.25 $1.25 200K
Inception/Mercury Coder Small Vercel AI Gateway $0.25 $1.00 32K
Gemini 3.1 Flash Lite Preview Google Vertex AI $0.25 $1.50 1.0M
Claude 3 Haiku Google Vertex AI $0.25 $1.25 200K
Claude 3 Haiku Google Vertex AI $0.25 $1.25 200K
Gemini 3.1 Flash Lite Preview Google Vertex AI $0.25 $1.50 1.0M
Meta/Llama 4 Scout 17b 128e Instruct Maas Google Vertex AI $0.25 $0.70 10M
Meta/Llama 4 Scout 17b 16e Instruct Maas Google Vertex AI $0.25 $0.70 10M
Qwen/Qwen3 235b A22b Instruct 2507 Maas Google Vertex AI $0.25 $1.00 262K
Minimax/Minimax M2 OpenRouter $0.26 $1.02 205K
Deepseek/Deepseek V3.2 Novita AI $0.27 $0.40 164K
Allenai/OlmOCR 7B 0725 FP8 DeepInfra $0.27 $1.50 16K
Deepseek Ai/DeepSeek R1 Distill Qwen 32B DeepInfra $0.27 $0.27 131K
Deepseek Ai/DeepSeek V3.1 DeepInfra $0.27 $1.00 164K
Deepseek Ai/DeepSeek V3.1 Terminus DeepInfra $0.27 $1.00 164K
Deepseek DeepSeek $0.27 $1.10 66K
Deepseek/Deepseek V3.2 Exp Novita AI $0.27 $0.41 164K
Deepseek/Deepseek V3.1 Terminus Novita AI $0.27 $1.00 131K
Deepseek/Deepseek V3.1 Novita AI $0.27 $1.00 131K
Deepseek/Deepseek V3 0324 Novita AI $0.27 $1.12 164K
Meta Llama/Llama 4 Maverick 17b 128e Instruct Fp8 Novita AI $0.27 $0.85 1.0M
Minimax/Minimax M2.1 OpenRouter $0.27 $1.20 204K
Eu/Gpt 5 Mini 2025 08 07 Azure OpenAI $0.28 $2.20 272K
Us/Gpt 5 Mini 2025 08 07 Azure OpenAI $0.28 $2.20 272K
Deepseek Chat NEW DeepSeek $0.28 $0.42 131K
Deepseek Reasoner NEW DeepSeek $0.28 $0.42 131K
Deepseek Chat DeepSeek $0.28 $0.42 131K
Deepseek Reasoner DeepSeek $0.28 $0.42 131K
Deepseek V3.2 DeepSeek $0.28 $0.40 164K
Deepseek Ai/DeepSeek V3.2 Gmi $0.28 $0.40 164K
Deepseek Ai/DeepSeek V3 0324 Gmi $0.28 $0.88 164K
Baidu/Ernie 4.5 300b A47b Paddle Novita AI $0.28 $1.10 123K
Deepseek/Deepseek V3.2 OpenRouter $0.28 $0.40 164K
Qwen/Qwen3 Coder 480B A35B Instruct Turbo DeepInfra $0.29 $1.20 262K
Qwen/Qwen3 32b Groq $0.29 $0.59 131K
Llava V1.6 Mistral 7b Hf Ovhcloud $0.29 $0.29 32K
Meta Llama 3.1 8B Instruct Azure AI $0.30 $0.61 128K
Amazon.Nova 2 Lite AWS Bedrock $0.30 $2.50 1M
Amazon.Titan Text Lite AWS Bedrock $0.30 $0.40 42K
Us East 1/Meta.Llama3 8b Instruct AWS Bedrock $0.30 $0.60 8K
Us East 1/Minimax.Minimax M2.1 AWS Bedrock $0.30 $1.20 196K
Us East 1/Minimax.Minimax M2.5 AWS Bedrock $0.30 $1.20 1M
Us East 2/Minimax.Minimax M2.1 AWS Bedrock $0.30 $1.20 196K
Us East 2/Minimax.Minimax M2.5 AWS Bedrock $0.30 $1.20 1M
Us Gov East 1/Amazon.Titan Text Lite AWS Bedrock $0.30 $0.40 42K
Us Gov East 1/Anthropic.Claude 3 Haiku 20240307 AWS Bedrock $0.30 $1.50 200K
Us Gov East 1/Meta.Llama3 8b Instruct AWS Bedrock $0.30 $2.65 8K
Us Gov West 1/Amazon.Titan Text Lite AWS Bedrock $0.30 $0.40 42K
Us Gov West 1/Anthropic.Claude 3 Haiku 20240307 AWS Bedrock $0.30 $1.50 200K
Us Gov West 1/Meta.Llama3 8b Instruct AWS Bedrock $0.30 $2.65 8K
Us West 1/Meta.Llama3 8b Instruct AWS Bedrock $0.30 $0.60 8K
Us West 2/Minimax.Minimax M2.1 AWS Bedrock $0.30 $1.20 196K
Us West 2/Minimax.Minimax M2.5 AWS Bedrock $0.30 $1.20 1M
Cohere.Command Light Text AWS Bedrock $0.30 $0.60 4K
Global.Amazon.Nova 2 Lite AWS Bedrock $0.30 $2.50 1M
Meta.Llama3 8b Instruct AWS Bedrock $0.30 $0.60 8K
Minimax.Minimax M2 AWS Bedrock $0.30 $1.20 128K
Minimax.Minimax M2.1 AWS Bedrock $0.30 $1.20 196K
Minimax.Minimax M2.5 AWS Bedrock $0.30 $1.20 1M
Command Light Cohere $0.30 $0.60 4K
Qwen Coder Dashscope $0.30 $1.50 1M
NousResearch/Hermes 3 Llama 3.1 70B DeepInfra $0.30 $0.30 131K
Qwen/Qwen3 235B A22B Thinking 2507 DeepInfra $0.30 $2.90 262K
Google/Gemini 2.5 Flash DeepInfra $0.30 $2.50 1M
Accounts/Fireworks/Models/Minimax M2p1 Fireworks AI $0.30 $1.20 205K
Minimax M2p1 Fireworks AI $0.30 $1.20 205K
Accounts/Fireworks/Models/Minimax M2 Fireworks AI $0.30 $1.20 4K
Gemini Robotics Er 1.5 Preview Google Gemini $0.30 $2.50 1.0M
Gemini 2.5 Flash Google Gemini $0.30 $2.50 1.0M
Gemini 2.5 Flash Preview 09 2025 Google Gemini $0.30 $2.50 1.0M
Gemini Flash Latest Google Gemini $0.30 $2.50 1.0M
Gemini 2.5 Flash Native Audio Latest Google Gemini $0.30 $2.50 1.0M
Gemini 2.5 Flash Native Audio Preview 09 2025 Google Gemini $0.30 $2.50 1.0M
Gemini 2.5 Flash Native Audio Preview 12 2025 Google Gemini $0.30 $2.50 1.0M
Gemini 2.5 Flash Native Audio Latest Google Gemini $0.30 $2.50 1.0M
Gemini 2.5 Flash Native Audio Preview 09 2025 Google Gemini $0.30 $2.50 1.0M
Gemini 2.5 Flash Native Audio Preview 12 2025 Google Gemini $0.30 $2.50 1.0M
Gemini Flash Latest Google Gemini $0.30 $2.50 1.0M
Gemini Exp 1206 Google Gemini $0.30 $2.50 1.0M
MiniMaxAI/MiniMax M2.1 Gmi $0.30 $1.20 197K
Qwen/Qwen3 VL 235B A22B Instruct FP8 Gmi $0.30 $1.40 262K
MiniMax M2.1 Minimax $0.30 $1.20 1M
MiniMax M2.1 Lightning Minimax $0.30 $2.40 1M
MiniMax M2.5 Minimax $0.30 $1.20 1M
MiniMax M2.5 Lightning Minimax $0.30 $2.40 1M
MiniMax M2 Minimax $0.30 $1.20 200K
Codestral 2508 Mistral $0.30 $0.90 256K
Open Mistral Nemo Mistral $0.30 $0.30 128K
Open Mistral Nemo 2407 Mistral $0.30 $0.30 128K
Minimax/Minimax M2.1 Novita AI $0.30 $1.20 205K
Minimax/Minimax M2 Novita AI $0.30 $1.20 205K
Zai Org/Glm 4.6v Novita AI $0.30 $0.90 131K
Kwaipilot/Kat Coder Pro Novita AI $0.30 $1.20 256K
Qwen/Qwen3 Vl 235b A22b Instruct Novita AI $0.30 $1.50 131K
Qwen/Qwen3 Coder 480b A35b Instruct Novita AI $0.30 $1.30 262K
Qwen/Qwen3 235b A22b Thinking 2507 Novita AI $0.30 $3.00 131K
Deepseek/Deepseek R1 Distill Qwen 32b Novita AI $0.30 $0.30 64K
Xai.Grok 3 Mini Oci $0.30 $0.50 131K
Ft:Gpt 4o Mini 2024 07 18 OpenAI $0.30 $1.20 128K
Google/Gemini 2.5 Flash OpenRouter $0.30 $2.50 1.0M
Qwen/Qwen3.5 27b OpenRouter $0.30 $2.40 262K
Minimax/Minimax M2.5 OpenRouter $0.30 $1.10 197K
Mistralai/Mixtral 8x7b Instruct V0.1 Replicate $0.30 $1.00 4K
Meta Llama Guard 3 8B SambaNova $0.30 $0.30 16K
Google/Gemini 2.5 Flash Vercel AI Gateway $0.30 $2.50 1M
Mistral/Codestral Vercel AI Gateway $0.30 $0.90 256K
Xai/Grok 3 Mini Vercel AI Gateway $0.30 $0.50 131K
Gemini 2.5 Flash Google Vertex AI $0.30 $2.50 1.0M
Gemini 2.5 Flash Preview 09 2025 Google Vertex AI $0.30 $2.50 1.0M
Gemini Robotics Er 1.5 Preview Google Vertex AI $0.30 $2.50 1.0M
Mistralai/Codestral 2 Google Vertex AI $0.30 $0.90 128K
Codestral 2 Google Vertex AI $0.30 $0.90 128K
Codestral 2 Google Vertex AI $0.30 $0.90 128K
Mistralai/Codestral 2 Google Vertex AI $0.30 $0.90 128K
Minimaxai/Minimax M2 Maas Google Vertex AI $0.30 $1.20 197K
Grok 3 Mini Xai $0.30 $0.50 131K
Grok 3 Mini Beta Xai $0.30 $0.50 131K
Grok 3 Mini Latest Xai $0.30 $0.50 131K
Databricks Gemini 2 5 Flash Databricks $0.30 $2.50 1.0M
Ap Southeast 2/Minimax.Minimax M2.5 AWS Bedrock $0.31 $1.24 1M
Eu West 1/Meta.Llama3 8b Instruct AWS Bedrock $0.32 $0.65 8K
Apac.Amazon.Nova 2 Lite AWS Bedrock $0.33 $2.75 1M
Eu.Amazon.Nova 2 Lite AWS Bedrock $0.33 $2.75 1M
Us.Amazon.Nova 2 Lite AWS Bedrock $0.33 $2.75 1M
Ca Central 1/Meta.Llama3 8b Instruct AWS Bedrock $0.35 $0.69 8K
Meta.Llama3 2 11b Instruct AWS Bedrock $0.35 $0.35 128K
Us.Meta.Llama3 2 11b Instruct AWS Bedrock $0.35 $0.35 128K
Gpt Oss 120b Cerebras $0.35 $0.75 131K
Codellama 34b Instruct Perplexity $0.35 $1.40 16K
Meta/Llama 4 Maverick 17b 128e Instruct Maas Google Vertex AI $0.35 $1.15 1M
Meta/Llama 4 Maverick 17b 16e Instruct Maas Google Vertex AI $0.35 $1.15 1M
Meta Llama/Llama 3 2 11b Vision Instruct IBM Watsonx $0.35 $0.35 128K
Meta Llama/Llama 4 Maverick 17b IBM Watsonx $0.35 $1.40 128K
Meta Llama/Llama Guard 3 11b Vision IBM Watsonx $0.35 $0.35 128K
Mistralai/Pixtral 12b 2409 IBM Watsonx $0.35 $0.35 128K
Ap Northeast 1/Minimax.Minimax M2.1 AWS Bedrock $0.36 $1.44 196K
Ap Northeast 1/Minimax.Minimax M2.5 AWS Bedrock $0.36 $1.44 1M
Ap South 1/Meta.Llama3 8b Instruct AWS Bedrock $0.36 $0.72 8K
Ap South 1/Minimax.Minimax M2.1 AWS Bedrock $0.36 $1.44 196K
Ap South 1/Minimax.Minimax M2.5 AWS Bedrock $0.36 $1.44 1M
Ap Southeast 3/Minimax.Minimax M2.1 AWS Bedrock $0.36 $1.44 196K
Ap Southeast 3/Minimax.Minimax M2.5 AWS Bedrock $0.36 $1.44 1M
Eu North 1/Minimax.Minimax M2.1 AWS Bedrock $0.36 $1.44 196K
Eu North 1/Minimax.Minimax M2.5 AWS Bedrock $0.36 $1.44 1M
Eu Central 1/Minimax.Minimax M2.1 AWS Bedrock $0.36 $1.44 196K
Eu Central 1/Minimax.Minimax M2.5 AWS Bedrock $0.36 $1.44 1M
Eu West 1/Minimax.Minimax M2.1 AWS Bedrock $0.36 $1.44 196K
Eu West 1/Minimax.Minimax M2.5 AWS Bedrock $0.36 $1.44 1M
Eu South 1/Minimax.Minimax M2.1 AWS Bedrock $0.36 $1.44 196K
Eu South 1/Minimax.Minimax M2.5 AWS Bedrock $0.36 $1.44 1M
Sa East 1/Minimax.Minimax M2.1 AWS Bedrock $0.36 $1.44 196K
Sa East 1/Minimax.Minimax M2.5 AWS Bedrock $0.36 $1.44 1M
Llama 3.2 11B Vision Instruct Azure AI $0.37 $0.37 128K
Deepseek Ai/DeepSeek V3 DeepInfra $0.38 $0.89 164K
Qwen/Qwen 2.5 72b Instruct Novita AI $0.38 $0.40 32K
Ibm/Granite Ttm 1024 96 R2 IBM Watsonx $0.38 $0.38 1K
Ibm/Granite Ttm 1536 96 R2 IBM Watsonx $0.38 $0.38 1K
Ibm/Granite Ttm 512 96 R2 IBM Watsonx $0.38 $0.38 1K
Eu West 2/Meta.Llama3 8b Instruct AWS Bedrock $0.39 $0.78 8K
Baidu/Ernie 4.5 Vl 28b A3b Thinking Novita AI $0.39 $0.39 131K
Gpt 4.1 Mini Azure OpenAI $0.40 $1.60 1.0M
Gpt 4.1 Mini 2025 04 14 Azure OpenAI $0.40 $1.60 1.0M
Mistral Medium 2505 Azure AI $0.40 $2.00 131K
Mistral.Devstral 2 123b AWS Bedrock $0.40 $2.00 256K
Qwen 3 32b Cerebras $0.40 $0.80 128K
Qwen Plus Dashscope $0.40 $1.20 129K
Qwen Plus 2025 01 25 Dashscope $0.40 $1.20 129K
Qwen Plus 2025 04 28 Dashscope $0.40 $1.20 129K
Qwen Plus 2025 07 14 Dashscope $0.40 $1.20 129K
Qwen3 Vl 235b A22b Instruct Dashscope $0.40 $1.60 131K
Qwen3 Vl 235b A22b Thinking Dashscope $0.40 $4.00 131K
Qwen/Qwen3 Coder 480B A35B Instruct DeepInfra $0.40 $1.60 262K
Meta Llama/Meta Llama 3.1 70B Instruct DeepInfra $0.40 $0.40 131K
Mistralai/Mixtral 8x7B Instruct V0.1 DeepInfra $0.40 $0.40 33K
Zai Org/GLM 4.5 DeepInfra $0.40 $1.60 131K
Zai Org/GLM 4.7 FP8 Gmi $0.40 $2.00 203K
Deepseek Ai/DeepSeek R1 Hyperbolic $0.40 $0.40 33K
Deepseek Ai/DeepSeek V3 0324 Hyperbolic $0.40 $0.40 33K
Devstral Medium 2507 Mistral $0.40 $2.00 128K
Devstral Latest Mistral $0.40 $2.00 256K
Devstral Medium Latest Mistral $0.40 $2.00 256K
Devstral 2512 Mistral $0.40 $2.00 256K
Mistral Medium 2505 Mistral $0.40 $2.00 131K
Mistral Medium Latest Mistral $0.40 $2.00 131K
Mistral Medium 3 1 2508 Mistral $0.40 $2.00 131K
Deepseek/Deepseek V3 Turbo Novita AI $0.40 $1.30 64K
Gpt 4.1 Mini NEW OpenAI $0.40 $1.60 1.0M
Gpt 4.1 Mini 2025 04 14 OpenAI $0.40 $1.60 1.0M
Openai/Gpt 4.1 Mini OpenRouter $0.40 $1.60 1.0M
Qwen/Qwen3.5 122b A10b OpenRouter $0.40 $2.00 262K
Qwen/Qwen3.5 Plus 02 15 OpenRouter $0.40 $2.40 1M
Z Ai/Glm 4.6 OpenRouter $0.40 $1.75 203K
Z Ai/Glm 4.7 OpenRouter $0.40 $1.50 203K
Llama 4 Scout 17B 16E Instruct SambaNova $0.40 $0.70 8K
Qwen3 32B SambaNova $0.40 $0.80 8K
Alibaba/Qwen3 Coder Vercel AI Gateway $0.40 $1.60 262K
Openai/Gpt 4.1 Mini Vercel AI Gateway $0.40 $1.60 1.0M
Mistral Medium 3 Google Vertex AI $0.40 $2.00 128K
Mistral Medium 3 Google Vertex AI $0.40 $2.00 128K
Mistralai/Mistral Medium 3 Google Vertex AI $0.40 $2.00 128K
Mistralai/Mistral Medium 3 Google Vertex AI $0.40 $2.00 128K
Baidu/Ernie 4.5 Vl 424b A47b Novita AI $0.42 $1.25 123K
Us/Gpt 4.1 Mini 2025 04 14 Azure OpenAI $0.44 $1.76 1.0M
Us East 1/Mistral.Mixtral 8x7b Instruct AWS Bedrock $0.45 $0.70 32K
Us West 2/Mistral.Mixtral 8x7b Instruct AWS Bedrock $0.45 $0.70 32K
Mistral.Mixtral 8x7b Instruct AWS Bedrock $0.45 $0.70 32K
Accounts/Fireworks/Models/Qwen3 Coder 480b A35b Instruct Fireworks AI $0.45 $1.80 262K
Z Ai/Glm 4.6:Exacto OpenRouter $0.45 $1.90 203K
Zai Org/GLM 4.7 Together AI $0.45 $2.00 200K
Zai/Glm 4.6 Vercel AI Gateway $0.45 $1.80 200K
Eu West 2/Minimax.Minimax M2.1 AWS Bedrock $0.47 $1.86 196K
Eu West 2/Minimax.Minimax M2.5 AWS Bedrock $0.47 $1.86 1M
Microsoft/WizardLM 2 8x22B DeepInfra $0.48 $0.48 66K
Gpt 3.5 Turbo Azure OpenAI $0.50 $1.50 4K
Gpt 3.5 Turbo 0125 Azure OpenAI $0.50 $1.50 16K
Gpt 35 Turbo Azure OpenAI $0.50 $1.50 4K
Gpt 35 Turbo 0125 Azure OpenAI $0.50 $1.50 16K
Jamba Instruct Azure AI $0.50 $0.70 70K
Mistral Large 3 Azure AI $0.50 $1.50 256K
Ai21.Jamba Instruct AWS Bedrock $0.50 $0.70 70K
Amazon.Titan Text Premier AWS Bedrock $0.50 $1.50 42K
Sa East 1/Meta.Llama3 8b Instruct AWS Bedrock $0.50 $1.01 8K
Us East 1/Qwen.Qwen3 Coder Next AWS Bedrock $0.50 $1.20 262K
Us East 2/Qwen.Qwen3 Coder Next AWS Bedrock $0.50 $1.20 262K
Us Gov East 1/Amazon.Titan Text Premier AWS Bedrock $0.50 $1.50 42K
Us Gov West 1/Amazon.Titan Text Premier AWS Bedrock $0.50 $1.50 42K
Us West 2/Qwen.Qwen3 Coder Next AWS Bedrock $0.50 $1.20 262K
Cohere.Command R AWS Bedrock $0.50 $1.50 128K
Mistral.Magistral Small 2509 AWS Bedrock $0.50 $1.50 128K
Mistral.Mistral Large 3 675b Instruct AWS Bedrock $0.50 $1.50 128K
Qwen.Qwen3 Coder Next AWS Bedrock $0.50 $1.20 262K
Deepseek Ai/DeepSeek R1 0528 DeepInfra $0.50 $2.15 164K
Moonshotai/Kimi K2 Instruct DeepInfra $0.50 $2.00 131K
Moonshotai/Kimi K2 Instruct 0905 DeepInfra $0.50 $2.00 262K
Accounts/Fireworks/Models/Deepseek Coder V2 Lite Base Fireworks AI $0.50 $0.50 164K
Accounts/Fireworks/Models/Deepseek Coder V2 Lite Instruct Fireworks AI $0.50 $0.50 164K
Accounts/Fireworks/Models/Deepseek V2 Lite Chat Fireworks AI $0.50 $0.50 164K
Accounts/Fireworks/Models/Dolphin 2p6 Mixtral 8x7b Fireworks AI $0.50 $0.50 33K
Accounts/Fireworks/Models/Firefunction Fireworks AI $0.50 $0.50 33K
Accounts/Fireworks/Models/Gpt Oss Safeguard 20b Fireworks AI $0.50 $0.50 131K
Accounts/Fireworks/Models/Mixtral 8x7b Fireworks AI $0.50 $0.50 33K
Accounts/Fireworks/Models/Mixtral 8x7b Instruct Fireworks AI $0.50 $0.50 33K
Accounts/Fireworks/Models/Mixtral 8x7b Instruct Hf Fireworks AI $0.50 $0.50 33K
Accounts/Fireworks/Models/Nous Hermes 2 Mixtral 8x7b Dpo Fireworks AI $0.50 $0.50 33K
Accounts/Fireworks/Models/Qwen3 30b A3b Instruct 2507 Fireworks AI $0.50 $0.50 262K
Gemini 3 Flash Preview Google Gemini $0.50 $3.00 1.0M
Google/Gemini 3 Flash Preview Gmi $0.50 $3.00 1.0M
Magistral Small 2506 Mistral $0.50 $1.50 40K
Magistral Small Latest Mistral $0.50 $1.50 40K
Magistral Small 1 2 2509 Mistral $0.50 $1.50 40K
Mistral Large Latest Mistral $0.50 $1.50 262K
Mistral Large 3 Mistral $0.50 $1.50 262K
Mistral Large 2512 Mistral $0.50 $1.50 262K
Deepseek Ai/DeepSeek V3 Nebius $0.50 $1.50 128K
Deepseek Ai/DeepSeek V3 0324 Nebius $0.50 $1.50 128K
Chatdolphin NLP Cloud $0.50 $0.50 16K
Gpt 3.5 Turbo OpenAI $0.50 $1.50 16K
Gpt 3.5 Turbo 0125 OpenAI $0.50 $1.50 16K
Deepseek/Deepseek R1 0528 OpenRouter $0.50 $2.15 65K
Google/Gemini 3 Flash Preview OpenRouter $0.50 $3.00 1.0M
Mistralai/Mistral Large 2512 OpenRouter $0.50 $1.50 262K
QwQ 32B SambaNova $0.50 $1.00 16K
Qwen2 Audio 7B Instruct SambaNova $0.50 $100.00 4K
Moonshotai/Kimi K2.5 Together AI $0.50 $2.80 256K
Mistral/Magistral Small Vercel AI Gateway $0.50 $1.50 128K
Openai/Gpt 3.5 Turbo Vercel AI Gateway $0.50 $1.50 16K
Gemini 3 Flash Preview Google Vertex AI $0.50 $3.00 1.0M
Gemini 3 Flash Preview Google Vertex AI $0.50 $3.00 1.0M
Databricks Llama 2 70b Chat Databricks $0.50 $1.50 4K
Databricks Llama 4 Maverick Databricks $0.50 $1.50 128K
Databricks Meta Llama 3 3 70b Instruct Databricks $0.50 $1.50 128K
Databricks Mixtral 8x7b Instruct Databricks $0.50 $1.00 4K
Databricks Mpt 7b Instruct Databricks $0.50 $0.000 8K
Meta Llama/Llama 3 70b Instruct Novita AI $0.51 $0.74 8K
Qwen.Qwen3 Vl 235b A22b AWS Bedrock $0.53 $2.66 128K
Deepseek R1 DeepSeek $0.55 $2.19 66K
Accounts/Fireworks/Models/Deepseek R1 Basic Fireworks AI $0.55 $2.19 128K
Accounts/Fireworks/Models/Glm 4p5 Fireworks AI $0.55 $2.19 128K
Accounts/Fireworks/Models/Glm 4p6 Fireworks AI $0.55 $2.19 203K
Zai Org/Glm 4.6 Novita AI $0.55 $2.20 205K
Minimaxai/Minimax M1 80k Novita AI $0.55 $2.20 1M
Deepseek/Deepseek R1 OpenRouter $0.55 $2.19 65K
Deepseek Ai/DeepSeek R1 0528 Tput Together AI $0.55 $2.19 128K
Deepseek/Deepseek R1 Vercel AI Gateway $0.55 $2.19 128K
Moonshotai/Kimi K2 Vercel AI Gateway $0.55 $2.20 131K
Accounts/Fireworks/Models/Deepseek V3p1 Fireworks AI $0.56 $1.68 128K
Accounts/Fireworks/Models/Deepseek V3p1 Terminus Fireworks AI $0.56 $1.68 128K
Accounts/Fireworks/Models/Deepseek V3p2 Fireworks AI $0.56 $1.68 164K
Deepseek Ai/Deepseek V3.2 Maas Google Vertex AI $0.56 $1.68 164K
Moonshotai/Kimi K2 Instruct Novita AI $0.57 $2.30 131K
Deepseek V3.2 Azure AI $0.58 $1.68 164K
Deepseek V3.2 Speciale Azure AI $0.58 $1.68 164K
Deepseek.V3 AWS Bedrock $0.58 $1.68 164K
Eu West 3/Mistral.Mixtral 8x7b Instruct AWS Bedrock $0.59 $0.91 32K
Llama 3.3 70b Versatile Groq $0.59 $0.79 128K
Meta/Llama 3 70b Vercel AI Gateway $0.59 $0.79 8K
Gpt Audio Mini 2025 10 06 Azure OpenAI $0.60 $2.40 128K
Gpt 4o Mini Realtime Preview 2024 12 17 Azure OpenAI $0.60 $2.40 128K
Gpt Realtime Mini 2025 10 06 Azure OpenAI $0.60 $2.40 32K
Kimi K2.5 Azure AI $0.60 $3.00 262K
Us.Writer.Palmyra X5 AWS Bedrock $0.60 $6.00 1M
Writer.Palmyra X5 AWS Bedrock $0.60 $6.00 1M
Ap Northeast 1/Qwen.Qwen3 Coder Next AWS Bedrock $0.60 $1.44 262K
Moonshotai.Kimi K2.5 AWS Bedrock $0.60 $3.03 262K
Ap South 1/Qwen.Qwen3 Coder Next AWS Bedrock $0.60 $1.44 262K
Ap Southeast 3/Qwen.Qwen3 Coder Next AWS Bedrock $0.60 $1.44 262K
Eu Central 1/Qwen.Qwen3 Coder Next AWS Bedrock $0.60 $1.44 262K
Eu West 1/Qwen.Qwen3 Coder Next AWS Bedrock $0.60 $1.44 262K
Eu South 1/Qwen.Qwen3 Coder Next AWS Bedrock $0.60 $1.44 262K
Sa East 1/Qwen.Qwen3 Coder Next AWS Bedrock $0.60 $1.44 262K
Us East 1/Moonshotai.Kimi K2 Thinking AWS Bedrock $0.60 $2.50 262K
Us East 1/Moonshotai.Kimi K2.5 AWS Bedrock $0.60 $3.00 262K
Us East 2/Moonshotai.Kimi K2 Thinking AWS Bedrock $0.60 $2.50 262K
Us East 2/Moonshotai.Kimi K2.5 AWS Bedrock $0.60 $3.00 262K
Us West 2/Moonshotai.Kimi K2 Thinking AWS Bedrock $0.60 $2.50 262K
Us West 2/Moonshotai.Kimi K2.5 AWS Bedrock $0.60 $3.00 262K
Moonshot.Kimi K2 Thinking AWS Bedrock $0.60 $2.50 128K
Moonshotai.Kimi K2.5 AWS Bedrock $0.60 $3.00 262K
Zai.Glm 4.7 AWS Bedrock $0.60 $2.20 200K
Llama3.1 70b Cerebras $0.60 $0.60 128K
Nvidia/Llama 3.1 Nemotron 70B Instruct DeepInfra $0.60 $0.60 131K
Accounts/Fireworks/Models/Glm 4p7 Fireworks AI $0.60 $2.20 203K
Accounts/Fireworks/Models/Kimi K2 Instruct Fireworks AI $0.60 $2.50 131K
Accounts/Fireworks/Models/Kimi K2 Instruct 0905 Fireworks AI $0.60 $2.50 262K
Accounts/Fireworks/Models/Kimi K2 Thinking Fireworks AI $0.60 $2.50 262K
Accounts/Fireworks/Models/Kimi K2p5 Fireworks AI $0.60 $3.00 262K
Glm 4p7 Fireworks AI $0.60 $2.20 203K
Kimi K2p5 Fireworks AI $0.60 $3.00 262K
Meta Llama 3.1 70b Instruct FriendliAI $0.60 $0.60 8K
Kimi K2 0711 Preview Moonshot $0.60 $2.50 131K
Kimi K2 0905 Preview Moonshot $0.60 $2.50 262K
Kimi K2.5 Moonshot $0.60 $3.00 262K
Kimi Thinking Preview Moonshot $0.60 $2.50 131K
Kimi K2 Thinking Moonshot $0.60 $2.50 262K
Nvidia/Llama 3.1 Nemotron Ultra 253B Nebius $0.60 $1.80 128K
Zai Org/Glm 4.7 Novita AI $0.60 $2.20 205K
Moonshotai/Kimi K2 Thinking Novita AI $0.60 $2.50 262K
Moonshotai/Kimi K2 0905 Novita AI $0.60 $2.50 262K
Zai Org/Glm 4.5 Novita AI $0.60 $2.20 131K
Zai Org/Glm 4.5v Novita AI $0.60 $1.80 66K
Xai.Grok 3 Mini Fast Oci $0.60 $4.00 131K
Gpt Audio Mini OpenAI $0.60 $2.40 128K
Gpt Audio Mini 2025 10 06 OpenAI $0.60 $2.40 128K
Gpt Audio Mini 2025 12 15 OpenAI $0.60 $2.40 128K
Gpt 4o Mini Realtime Preview OpenAI $0.60 $2.40 128K
Gpt 4o Mini Realtime Preview 2024 12 17 OpenAI $0.60 $2.40 128K
Gpt Realtime Mini OpenAI $0.60 $2.40 128K
Gpt Realtime Mini 2025 10 06 OpenAI $0.60 $2.40 128K
Gpt Realtime Mini 2025 12 15 OpenAI $0.60 $2.40 128K
Moonshotai/Kimi K2.5 OpenRouter $0.60 $3.00 262K
Qwen/Qwen3.5 397b A17b OpenRouter $0.60 $3.60 262K
Sonar Medium Chat Perplexity $0.60 $1.80 16K
Meta Llama 3.3 70B Instruct SambaNova $0.60 $1.20 131K
Zai Org/GLM 4.6 Together AI $0.60 $2.20 200K
Qwen/Qwen3.5 397B A17B Together AI $0.60 $3.60 262K
Xai/Grok 3 Mini Fast Vercel AI Gateway $0.60 $4.00 131K
Zai/Glm 4.5 Vercel AI Gateway $0.60 $2.20 131K
Moonshotai/Kimi K2 Thinking Maas Google Vertex AI $0.60 $2.50 256K
Zai Org/Glm 4.7 Maas Google Vertex AI $0.60 $2.20 200K
Moonshotai/Kimi K2 Instruct Wandb $0.60 $2.50 128K
Google/Flan T5 Xl 3b IBM Watsonx $0.60 $0.60 8K
Ibm/Granite 13b Chat IBM Watsonx $0.60 $0.60 8K
Ibm/Granite 13b Instruct IBM Watsonx $0.60 $0.60 8K
Grok 3 Mini Fast Xai $0.60 $4.00 131K
Grok 3 Mini Fast Beta Xai $0.60 $4.00 131K
Grok 3 Mini Fast Latest Xai $0.60 $4.00 131K
Glm 4.7 Zai $0.60 $2.20 200K
Glm 4.6 Zai $0.60 $2.20 200K
Glm 4.5 Zai $0.60 $2.20 128K
Glm 4.5v Zai $0.60 $1.80 128K
Us East 1/Deepseek.V3.2 AWS Bedrock $0.62 $1.85 164K
Us East 2/Deepseek.V3.2 AWS Bedrock $0.62 $1.85 164K
Us West 2/Deepseek.V3.2 AWS Bedrock $0.62 $1.85 164K
Deepseek.V3.2 AWS Bedrock $0.62 $1.85 164K
Us.Deepseek.V3.2 AWS Bedrock $0.62 $1.85 164K
Microsoft/Wizardlm 2 8x22b Novita AI $0.62 $0.62 66K
Mixtral 8x7B Instruct V0.1 Ovhcloud $0.63 $0.63 32K
Llama 4 Maverick 17B 128E Instruct SambaNova $0.63 $1.80 131K
Sao10K/L3.1 70B Euryale V2.2 DeepInfra $0.65 $0.75 131K
Sao10K/L3.3 70B Euryale V2.3 DeepInfra $0.65 $0.75 131K
Meta/Llama 2 70b Replicate $0.65 $2.75 4K
Meta/Llama 2 70b Chat Replicate $0.65 $2.75 4K
Meta/Llama 3 70b Replicate $0.65 $2.75 8K
Meta/Llama 3 70b Instruct Replicate $0.65 $2.75 8K
Qwen/Qwen3 235B A22B Thinking 2507 Together AI $0.65 $3.00 256K
Eu/Gpt 4o Mini Realtime Preview 2024 12 17 Azure OpenAI $0.66 $2.64 128K
Us/Gpt 4o Mini Realtime Preview 2024 12 17 Azure OpenAI $0.66 $2.64 128K
DeepSeek R1 Distill Llama 70B Ovhcloud $0.67 $0.67 131K
Meta Llama 3 1 70B Instruct Ovhcloud $0.67 $0.67 131K
Meta Llama 3 3 70B Instruct Ovhcloud $0.67 $0.67 131K
Deepseek Ai/Deepseek V3.1 Replicate $0.67 $2.02 164K
Deepseek Ai/DeepSeek R1 DeepInfra $0.70 $2.40 164K
Open Mixtral 8x7b Mistral $0.70 $0.70 32K
Deepseek/Deepseek R1 0528 Novita AI $0.70 $2.50 164K
Deepseek/Deepseek Prover V2 671b Novita AI $0.70 $2.50 160K
Deepseek/Deepseek R1 Turbo Novita AI $0.70 $2.50 64K
Codellama 70b Instruct Perplexity $0.70 $2.80 16K
Llama 2 70b Chat Perplexity $0.70 $2.80 4K
Pplx 70b Chat Perplexity $0.70 $2.80 4K
DeepSeek R1 Distill Llama 70B SambaNova $0.70 $1.40 131K
Llama 3.3 70B Instruct Azure AI $0.71 $0.71 128K
Ap South 1/Moonshotai.Kimi K2 Thinking AWS Bedrock $0.71 $2.94 262K
Meta Llama/Llama 3 3 70b Instruct IBM Watsonx $0.71 $0.71 128K
Ap Northeast 1/Moonshotai.Kimi K2.5 AWS Bedrock $0.72 $3.60 262K
Ap South 1/Moonshotai.Kimi K2.5 AWS Bedrock $0.72 $3.60 262K
Ap Southeast 3/Moonshotai.Kimi K2.5 AWS Bedrock $0.72 $3.60 262K
Eu North 1/Moonshotai.Kimi K2.5 AWS Bedrock $0.72 $3.60 262K
Sa East 1/Moonshotai.Kimi K2.5 AWS Bedrock $0.72 $3.60 262K
Meta.Llama3 3 70b Instruct AWS Bedrock $0.72 $0.72 128K
Us.Meta.Llama3 3 70b Instruct AWS Bedrock $0.72 $0.72 128K
Meta.Llama 3.3 70b Instruct Oci $0.72 $0.72 128K
Meta.Llama 4 Maverick 17b 128e Instruct Fp8 Oci $0.72 $0.72 512K
Meta.Llama 4 Scout 17b 16e Instruct Oci $0.72 $0.72 192K
Meta.Llama 3.1 70b Instruct Oci $0.72 $0.72 128K
Meta.Llama 3.3 70b Instruct Fp8 Dynamic Oci $0.72 $0.72 128K
Meta/Llama 3.1 70b Vercel AI Gateway $0.72 $0.72 128K
Meta/Llama 3.2 90b Vercel AI Gateway $0.72 $0.72 128K
Meta/Llama 3.3 70b Vercel AI Gateway $0.72 $0.72 128K
Ap Northeast 1/Moonshotai.Kimi K2 Thinking AWS Bedrock $0.73 $3.03 262K
Moonshotai.Kimi K2 Thinking AWS Bedrock $0.73 $3.03 262K
Sa East 1/Moonshotai.Kimi K2 Thinking AWS Bedrock $0.73 $3.03 262K
Ap Northeast 1/Deepseek.V3.2 AWS Bedrock $0.74 $2.22 164K
Ap South 1/Deepseek.V3.2 AWS Bedrock $0.74 $2.22 164K
Ap Southeast 3/Deepseek.V3.2 AWS Bedrock $0.74 $2.22 164K
Eu North 1/Deepseek.V3.2 AWS Bedrock $0.74 $2.22 164K
Sa East 1/Deepseek.V3.2 AWS Bedrock $0.74 $2.22 164K
Eu.Deepseek.V3.2 AWS Bedrock $0.74 $2.22 164K
Gpt 5.4 Mini Azure OpenAI $0.75 $4.50 1.1M
Meta.Llama2 13b Chat AWS Bedrock $0.75 $1.00 4K
Gemini 3.1 Flash Live Preview Google Gemini $0.75 $4.50 131K
Gemini 3.1 Flash Live Preview Google Gemini $0.75 $4.50 131K
Gpt 5.4 Mini OpenAI $0.75 $4.50 272K
Deepseek/Deepseek R1 Distill Llama 70b Vercel AI Gateway $0.75 $0.99 131K
Eu West 2/Qwen.Qwen3 Coder Next AWS Bedrock $0.78 $1.86 262K
Mistral/Mistral Saba 24b Vercel AI Gateway $0.79 $0.79 33K
Nova Pro Amazon Nova $0.80 $3.20 300K
Amazon.Nova Pro AWS Bedrock $0.80 $3.20 300K
Anthropic.Claude 3 5 Haiku 20241022 AWS Bedrock $0.80 $4.00 200K
Anthropic.Claude Instant AWS Bedrock $0.80 $2.40 100K
Us East 1/Anthropic.Claude Instant AWS Bedrock $0.80 $2.40 100K
Us West 2/Anthropic.Claude Instant AWS Bedrock $0.80 $2.40 100K
Us.Anthropic.Claude 3 5 Haiku 20241022 AWS Bedrock $0.80 $4.00 200K
Us.Amazon.Nova Pro AWS Bedrock $0.80 $3.20 300K
Us.Anthropic.Claude 3 5 Haiku 20241022 AWS Bedrock $0.80 $4.00 200K
Qwq Plus Dashscope $0.80 $2.40 98K
Moonshotai/Kimi K2 Thinking Gmi $0.80 $1.20 262K
Deepseek R1 671b Lambda Ai $0.80 $0.80 131K
Hermes3 405b Lambda Ai $0.80 $0.80 131K
Llama3.1 405b Instruct Fp8 Lambda Ai $0.80 $0.80 131K
Morph V3 Fast Morph $0.80 $1.20 16K
Deepseek Ai/DeepSeek R1 Nebius $0.80 $2.40 128K
Deepseek Ai/DeepSeek R1 0528 Nebius $0.80 $2.40 164K
Deepseek/Deepseek R1 Distill Llama 70b Novita AI $0.80 $0.80 8K
Qwen/Qwen2.5 Vl 72b Instruct Novita AI $0.80 $0.80 33K
Ft:Gpt 4.1 Mini 2025 04 14 OpenAI $0.80 $3.20 1.0M
Z Ai/Glm 5 OpenRouter $0.80 $2.56 203K
Amazon/Nova Pro Vercel AI Gateway $0.80 $3.20 300K
Anthropic/Claude 3.5 Haiku Vercel AI Gateway $0.80 $4.00 200K
Morph/Morph V3 Fast Vercel AI Gateway $0.80 $1.20 33K
Apac.Amazon.Nova Pro AWS Bedrock $0.84 $3.36 300K
Llama 3.3 70b Cerebras $0.85 $1.20 128K
Switchpoint/Router OpenRouter $0.85 $3.40 131K
Qwen2.5 Coder 32B Instruct Ovhcloud $0.87 $0.87 32K
Mistralai/Mixtral 8x22B Instruct V0.1 Anyscale $0.90 $0.90 66K
Accounts/Fireworks/Models/Deepseek Fireworks AI $0.90 $0.90 128K
Accounts/Fireworks/Models/Deepseek V3 0324 Fireworks AI $0.90 $0.90 164K
Accounts/Fireworks/Models/Firefunction Fireworks AI $0.90 $0.90 8K
Accounts/Fireworks/Models/Llama V3p2 90b Vision Instruct Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Qwen2 72b Instruct Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Code Llama 34b Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Code Llama 34b Instruct Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Code Llama 34b Python Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Code Llama 70b Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Code Llama 70b Instruct Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Code Llama 70b Python Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Cogito V1 Preview Llama 70b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Cogito V1 Preview Qwen 32b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Deepseek Coder 33b Instruct Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Deepseek R1 Distill Llama 70b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 32b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Devstral Small 2505 Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Dobby Unhinged Llama 3 3 70b New Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Dolphin 2 9 2 Qwen2 72b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Fare 20b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Gemma 3 27b It Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Internvl3 38b Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Internvl3 78b Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Kat Coder Fireworks AI $0.90 $0.90 262K
Accounts/Fireworks/Models/Kat Dev 32b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Kat Dev 72b Exp Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Llama V2 70b Chat Fireworks AI $0.90 $0.90 2K
Accounts/Fireworks/Models/Llama V3 70b Instruct Fireworks AI $0.90 $0.90 8K
Accounts/Fireworks/Models/Llama V3 70b Instruct Hf Fireworks AI $0.90 $0.90 8K
Accounts/Fireworks/Models/Llama V3p1 70b Instruct Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Llama V3p1 Nemotron 70b Instruct Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Llama V3p3 70b Instruct Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Llava Yi 34b Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Mistral Small 24b Instruct 2501 Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Nous Hermes 2 Yi 34b Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Nous Hermes Llama2 70b Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Phind Code Llama 34b Python Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Phind Code Llama 34b Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Phind Code Llama 34b Fireworks AI $0.90 $0.90 16K
Accounts/Fireworks/Models/Qwen Qwq 32b Preview Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Qwen1p5 72b Chat Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Qwen2 Vl 72b Instruct Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Qwen2p5 32b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Qwen2p5 32b Instruct Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Qwen2p5 72b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Qwen2p5 72b Instruct Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 128k Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 32k Rope Fireworks AI $0.90 $0.90 33K
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 64k Fireworks AI $0.90 $0.90 66K
Accounts/Fireworks/Models/Qwen2p5 Math 72b Instruct Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Qwen2p5 Vl 32b Instruct Fireworks AI $0.90 $0.90 128K
Accounts/Fireworks/Models/Qwen2p5 Vl 72b Instruct Fireworks AI $0.90 $0.90 128K
Accounts/Fireworks/Models/Qwen3 30b A3b Thinking 2507 Fireworks AI $0.90 $0.90 262K
Accounts/Fireworks/Models/Qwen3 32b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Qwen3 Coder 480b Instruct Bf16 Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Qwen3 Next 80b A3b Instruct Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Qwen3 Next 80b A3b Thinking Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Qwen3 Vl 32b Instruct Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Qwq 32b Fireworks AI $0.90 $0.90 131K
Accounts/Fireworks/Models/Yi 34b Fireworks AI $0.90 $0.90 4K
Accounts/Fireworks/Models/Yi 34b 200k Capybara Fireworks AI $0.90 $0.90 200K
Accounts/Fireworks/Models/Yi 34b Chat Fireworks AI $0.90 $0.90 4K
Morph V3 Large Morph $0.90 $1.90 16K
Deepseek/Deepseek Vercel AI Gateway $0.90 $0.90 128K
Morph/Morph V3 Large Vercel AI Gateway $0.90 $1.90 33K
Qwen2.5 VL 72B Instruct Ovhcloud $0.91 $0.91 32K
Us Gov East 1/Amazon.Nova Pro AWS Bedrock $0.96 $3.84 300K
Us Gov West 1/Amazon.Nova Pro AWS Bedrock $0.96 $3.84 300K
Qwen/Qwen3 Vl 235b A22b Thinking Novita AI $0.98 $3.95 131K
Meta.Llama3 1 70b Instruct AWS Bedrock $0.99 $0.99 128K
Us.Meta.Llama3 1 70b Instruct AWS Bedrock $0.99 $0.99 128K
Claude Haiku 4 5 Anthropic $1.00 $5.00 200K
Claude Haiku 4 5 Anthropic $1.00 $5.00 200K
Codellama/CodeLlama 34b Instruct Hf Anyscale $1.00 $1.00 4K
Codellama/CodeLlama 70b Instruct Hf Anyscale $1.00 $1.00 4K
Meta Llama/Llama 2 70b Chat Hf Anyscale $1.00 $1.00 4K
Meta Llama/Meta Llama 3 70B Instruct Anyscale $1.00 $1.00 8K
Gpt 35 Turbo 1106 Azure OpenAI $1.00 $2.00 16K
Claude Haiku 4 5 Azure AI $1.00 $5.00 200K
Mistral Small Azure AI $1.00 $3.00 32K
Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.00 $5.00 200K
Anthropic.Claude Haiku 4 5 AWS Bedrock $1.00 $5.00 200K
Global.Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.00 $5.00 200K
Mistral.Mistral Small 2402 AWS Bedrock $1.00 $3.00 32K
Zai.Glm 5 AWS Bedrock $1.00 $3.20 200K
NousResearch/Hermes 3 Llama 3.1 405B DeepInfra $1.00 $1.00 131K
Deepseek Ai/DeepSeek R1 0528 Turbo DeepInfra $1.00 $3.00 33K
Deepseek Ai/DeepSeek R1 Turbo DeepInfra $1.00 $3.00 41K
Moonshotai/Kimi K2 Instruct 0905 Groq $1.00 $3.00 262K
Codestral 2405 Mistral $1.00 $3.00 32K
Codestral Latest Mistral $1.00 $3.00 32K
Kimi Latest 32k Moonshot $1.00 $3.00 33K
Moonshot V1 32k Moonshot $1.00 $3.00 33K
Moonshot V1 32k 0430 Moonshot $1.00 $3.00 33K
Moonshot V1 32k Vision Preview Moonshot $1.00 $3.00 33K
Meta Llama/Meta Llama 3.1 405B Instruct Nebius $1.00 $3.00 128K
NousResearch/Hermes 3 Llama 3.1 405B Nebius $1.00 $3.00 128K
Gpt 3.5 Turbo 1106 OpenAI $1.00 $2.00 16K
Anthropic/Claude Haiku 4.5 OpenRouter $1.00 $5.00 200K
Qwen/Qwen3 Coder Plus OpenRouter $1.00 $5.00 998K
Llama 3.1 70b Instruct Perplexity $1.00 $1.00 131K
Sonar Perplexity $1.00 $1.00 128K
Sonar Reasoning Perplexity $1.00 $5.00 128K
Moonshotai/Kimi K2 Instruct 0905 Together AI $1.00 $3.00 262K
Anthropic/Claude Haiku 4.5 Vercel AI Gateway $1.00 $5.00 200K
Perplexity/Sonar Vercel AI Gateway $1.00 $1.00 127K
Perplexity/Sonar Reasoning Vercel AI Gateway $1.00 $5.00 127K
Claude 3 5 Haiku Google Vertex AI $1.00 $5.00 200K
Claude 3 5 Haiku Google Vertex AI $1.00 $5.00 200K
Claude Haiku 4 5 Google Vertex AI $1.00 $5.00 200K
Claude Haiku 4 5 Google Vertex AI $1.00 $5.00 200K
Zai Org/Glm 5 Maas Google Vertex AI $1.00 $3.20 200K
Mistral Small 2503 Google Vertex AI $1.00 $3.00 128K
Mistral Small 2503 Google Vertex AI $1.00 $3.00 32K
Qwen/Qwen3 Coder 480b A35b Instruct Maas Google Vertex AI $1.00 $4.00 262K
Glm 5 Zai $1.00 $3.20 200K
Databricks Claude Haiku 4 5 Databricks $1.00 $5.00 200K
Databricks Meta Llama 3 70b Instruct Databricks $1.00 $3.00 128K
Databricks Mpt 30b Instruct Databricks $1.00 $1.00 8K
Eu.Amazon.Nova Pro AWS Bedrock $1.05 $4.20 300K
O1 Mini 2024 09 12 Azure OpenAI $1.10 $4.40 128K
O3 Mini Azure OpenAI $1.10 $4.40 200K
O3 Mini 2025 01 31 Azure OpenAI $1.10 $4.40 200K
O4 Mini Azure OpenAI $1.10 $4.40 200K
O4 Mini 2025 04 16 Azure OpenAI $1.10 $4.40 200K
Meta Llama 3 70B Instruct Azure AI $1.10 $0.37 8K
Apac.Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.10 $5.50 200K
Eu.Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.10 $5.50 200K
Jp.Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.10 $5.50 200K
Us.Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.10 $5.50 200K
Au.Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.10 $5.50 200K
O3 Mini OpenAI $1.10 $4.40 200K
O3 Mini 2025 01 31 OpenAI $1.10 $4.40 200K
O4 Mini OpenAI $1.10 $4.40 200K
O4 Mini 2025 04 16 OpenAI $1.10 $4.40 200K
Openai/O3 Mini OpenRouter $1.10 $4.40 128K
Openai/O3 Mini High OpenRouter $1.10 $4.40 128K
Openai/O3 Mini Vercel AI Gateway $1.10 $4.40 200K
Openai/O4 Mini Vercel AI Gateway $1.10 $4.40 200K
Glm 4.5 Airx Zai $1.10 $4.50 128K
Deepseek Azure AI $1.14 $4.56 128K
Deepseek V3 0324 Azure AI $1.14 $4.56 128K
Kimi K2 Turbo Preview Moonshot $1.15 $8.00 262K
Kimi K2 Thinking Turbo Moonshot $1.15 $8.00 262K
Us Gov East 1/Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.20 $6.00 200K
Us Gov West 1/Anthropic.Claude Haiku 4 5 20251001 AWS Bedrock $1.20 $6.00 200K
Accounts/Fireworks/Models/Deepseek Coder V2 Instruct Fireworks AI $1.20 $1.20 66K
Accounts/Fireworks/Models/Mixtral 8x22b Instruct Hf Fireworks AI $1.20 $1.20 66K
Accounts/Fireworks/Models/Cogito 671b V2 P1 Fireworks AI $1.20 $1.20 164K
Accounts/Fireworks/Models/Dbrx Instruct Fireworks AI $1.20 $1.20 33K
Accounts/Fireworks/Models/Deepseek Prover Fireworks AI $1.20 $1.20 164K
Accounts/Fireworks/Models/Deepseek V2p5 Fireworks AI $1.20 $1.20 33K
Accounts/Fireworks/Models/Glm 4p5v Fireworks AI $1.20 $1.20 131K
Accounts/Fireworks/Models/Gpt Oss Safeguard 120b Fireworks AI $1.20 $1.20 131K
Accounts/Fireworks/Models/Mistral Large 3 Fp8 Fireworks AI $1.20 $1.20 256K
Accounts/Fireworks/Models/Mixtral 8x22b Fireworks AI $1.20 $1.20 66K
Accounts/Fireworks/Models/Mixtral 8x22b Instruct Fireworks AI $1.20 $1.20 66K
Mistral/Mixtral 8x22b Instruct Vercel AI Gateway $1.20 $1.20 66K
Glm 5 Code Zai $1.20 $5.00 200K
Eu/O1 Mini 2024 09 12 Azure OpenAI $1.21 $4.84 128K
Eu/O3 Mini 2025 01 31 Azure OpenAI $1.21 $4.84 200K
O1 Mini Azure OpenAI $1.21 $4.84 128K
Us/O1 Mini 2024 09 12 Azure OpenAI $1.21 $4.84 128K
Us/O3 Mini 2025 01 31 Azure OpenAI $1.21 $4.84 200K
Us/O4 Mini 2025 04 16 Azure OpenAI $1.21 $4.84 200K
Databricks Gemini 2 5 Pro Databricks $1.25 $10.00 1.0M
Databricks Gpt 5 Databricks $1.25 $10.00 272K
Databricks Gpt 5 1 Databricks $1.25 $10.00 272K
Global/Gpt 5.1 Azure OpenAI $1.25 $10.00 272K
Global/Gpt 5.1 Chat Azure OpenAI $1.25 $10.00 128K
Gpt 5.1 2025 11 13 Azure OpenAI $1.25 $10.00 272K
Gpt 5.1 Chat 2025 11 13 Azure OpenAI $1.25 $10.00 128K
Gpt 5 Azure OpenAI $1.25 $10.00 272K
Gpt 5 2025 08 07 Azure OpenAI $1.25 $10.00 272K
Gpt 5 Chat Azure OpenAI $1.25 $10.00 128K
Gpt 5 Chat Latest Azure OpenAI $1.25 $10.00 128K
Gpt 5.1 Azure OpenAI $1.25 $10.00 272K
Gpt 5.1 Chat Azure OpenAI $1.25 $10.00 128K
Google/Gemini 2.5 Pro DeepInfra $1.25 $10.00 1M
Gemini 2.5 Pro Google Gemini $1.25 $10.00 1.0M
Gemini 2.5 Computer Use Preview 10 2025 Google Gemini $1.25 $10.00 128K
Gemini 2.5 Pro Preview Tts Google Gemini $1.25 $10.00 1.0M
Gemini Pro Latest Google Gemini $1.25 $10.00 1.0M
Gemini Pro Latest Google Gemini $1.25 $10.00 1.0M
Openai/Gpt 5.1 Gmi $1.25 $10.00 410K
Openai/Gpt 5 Gmi $1.25 $10.00 410K
Google.Gemini 2.5 Pro Oci $1.25 $10.00 1.0M
Gpt 5 OpenAI $1.25 $10.00 272K
Gpt 5.1 OpenAI $1.25 $10.00 272K
Gpt 5.1 2025 11 13 OpenAI $1.25 $10.00 272K
Gpt 5.1 Chat Latest OpenAI $1.25 $10.00 128K
Gpt 5 2025 08 07 OpenAI $1.25 $10.00 272K
Gpt 5 Chat OpenAI $1.25 $10.00 128K
Gpt 5 Chat Latest OpenAI $1.25 $10.00 128K
Gpt 5 Search Api OpenAI $1.25 $10.00 272K
Gpt 5 Search Api 2025 10 14 OpenAI $1.25 $10.00 272K
Google/Gemini 2.5 Pro OpenRouter $1.25 $10.00 1.0M
Openai/Gpt 5 Chat OpenRouter $1.25 $10.00 128K
Openai/Gpt 5 Codex OpenRouter $1.25 $10.00 272K
Openai/Gpt 5 OpenRouter $1.25 $10.00 272K
Openai/Gpt 5.1 Codex Max OpenRouter $1.25 $10.00 400K
Deepseek Ai/DeepSeek V3 Together AI $1.25 $1.25 66K
Gemini 2.5 Pro Google Vertex AI $1.25 $10.00 1.0M
Gemini 2.5 Pro Preview Tts Google Vertex AI $1.25 $10.00 1.0M
Gemini 2.5 Computer Use Preview 10 2025 Google Vertex AI $1.25 $10.00 128K
Amazon.Titan Text Express AWS Bedrock $1.30 $1.70 42K
Us Gov East 1/Amazon.Titan Text Express AWS Bedrock $1.30 $1.70 42K
Us Gov West 1/Amazon.Titan Text Express AWS Bedrock $1.30 $1.70 42K
MAI DS R1 Azure AI $1.35 $5.40 128K
Deepseek R1 Azure AI $1.35 $5.40 128K
Us.Deepseek.R1 AWS Bedrock $1.35 $5.40 128K
Deepseek Ai/Deepseek V3.1 Maas Google Vertex AI $1.35 $5.40 164K
Deepseek Ai/Deepseek R1 0528 Maas Google Vertex AI $1.35 $5.40 65K
Eu/Gpt 5 2025 08 07 Azure OpenAI $1.38 $11.00 272K
Us/Gpt 5 2025 08 07 Azure OpenAI $1.38 $11.00 272K
Eu/Gpt 5.1 Azure OpenAI $1.38 $11.00 272K
Eu/Gpt 5.1 Chat Azure OpenAI $1.38 $11.00 128K
Us/Gpt 5.1 Azure OpenAI $1.38 $11.00 272K
Us/Gpt 5.1 Chat Azure OpenAI $1.38 $11.00 128K
Llama 4 Maverick 17B 128E Instruct FP8 Azure AI $1.41 $0.35 1M
Deepseek Ai/Deepseek Replicate $1.45 $1.45 66K
Sao10k/L3 70b Euryale V2.1 Novita AI $1.48 $1.48 8K
Sao10k/L31 70b Euryale V2.2 Novita AI $1.48 $1.48 8K
Cohere.Command Text AWS Bedrock $1.50 $2.00 4K
Openai/Gpt 3.5 Turbo Instruct Vercel AI Gateway $1.50 $2.00 8K
Cohere.Command Latest Oci $1.56 $1.56 128K
Cohere.Command A 03 2025 Oci $1.56 $1.56 256K
Cohere.Command Plus Latest Oci $1.56 $1.56 128K
Cohere.Command A Reasoning 08 2025 Oci $1.56 $1.56 256K
Cohere.Command A Vision 07 2025 Oci $1.56 $1.56 128K
Cohere.Command R Plus 08 2024 Oci $1.56 $1.56 128K
Qwen Max Dashscope $1.60 $6.40 31K
Gpt 5.2 Azure OpenAI $1.75 $14.00 272K
Gpt 5.2 2025 12 11 Azure OpenAI $1.75 $14.00 272K
Gpt 5.2 Chat Azure OpenAI $1.75 $14.00 128K
Gpt 5.2 Chat 2025 12 11 Azure OpenAI $1.75 $14.00 128K
Gpt 5.3 Chat Azure OpenAI $1.75 $14.00 128K
Openai/Gpt 5.2 Gmi $1.75 $14.00 410K
Gpt 5.2 OpenAI $1.75 $14.00 272K
Gpt 5.2 2025 12 11 OpenAI $1.75 $14.00 272K
Gpt 5.2 Chat Latest OpenAI $1.75 $14.00 128K
Gpt 5.3 Chat Latest OpenAI $1.75 $14.00 128K
Openai/Gpt 5.2 Codex OpenRouter $1.75 $14.00 272K
Openai/Gpt 5.2 OpenRouter $1.75 $14.00 272K
Openai/Gpt 5.2 Chat OpenRouter $1.75 $14.00 128K
Sdaia/Allam 1 13b Instruct IBM Watsonx $1.80 $1.80 8K
@Cf/Meta/Llama 2 7b Chat Fp16 Cloudflare $1.92 $1.92 3K
@Cf/Meta/Llama 2 7b Chat Int8 Cloudflare $1.92 $1.92 2K
@Cf/Mistral/Mistral 7b Instruct V0.1 Cloudflare $1.92 $1.92 8K
@Hf/Thebloke/Codellama 7b Instruct Awq Cloudflare $1.92 $1.92 4K
Meta.Llama2 70b Chat AWS Bedrock $1.95 $2.56 4K
Jamba 1.5 Large AI21 $2.00 $8.00 256K
Jamba 1.5 Large AI21 $2.00 $8.00 256K
Jamba Large 1.6 AI21 $2.00 $8.00 256K
Jamba Large 1.7 AI21 $2.00 $8.00 256K
Gpt 4.1 Azure OpenAI $2.00 $8.00 1.0M
Gpt 4.1 2025 04 14 Azure OpenAI $2.00 $8.00 1.0M
O3 Azure OpenAI $2.00 $8.00 200K
O3 2025 04 16 Azure OpenAI $2.00 $8.00 200K
Mistral Large 2407 Azure AI $2.00 $6.00 128K
Mistral Large Latest Azure AI $2.00 $6.00 128K
Ai21.Jamba 1 5 Large AWS Bedrock $2.00 $8.00 256K
Eu.Mistral.Pixtral Large 2502 AWS Bedrock $2.00 $6.00 128K
Meta.Llama3 2 90b Instruct AWS Bedrock $2.00 $2.00 128K
Us.Meta.Llama3 2 90b Instruct AWS Bedrock $2.00 $2.00 128K
Us.Mistral.Pixtral Large 2502 AWS Bedrock $2.00 $6.00 128K
Gemini 3 Pro Preview Google Gemini $2.00 $12.00 1.0M
Gemini 3.1 Pro Preview Google Gemini $2.00 $12.00 1.0M
Gemini 3.1 Pro Preview Customtools Google Gemini $2.00 $12.00 1.0M
Google/Gemini 3 Pro Preview Gmi $2.00 $12.00 1.0M
Qwen/Qwen3 235B A22B Hyperbolic $2.00 $2.00 131K
Moonshotai/Kimi K2 Instruct Hyperbolic $2.00 $2.00 131K
Magistral Medium 2506 Mistral $2.00 $5.00 40K
Magistral Medium 2509 Mistral $2.00 $5.00 40K
Magistral Medium 1 2 2509 Mistral $2.00 $5.00 40K
Magistral Medium Latest Mistral $2.00 $5.00 40K
Mistral Large 2411 Mistral $2.00 $6.00 128K
Open Mixtral 8x22b Mistral $2.00 $6.00 65K
Pixtral Large 2411 Mistral $2.00 $6.00 128K
Pixtral Large Latest Mistral $2.00 $6.00 128K
Kimi Latest Moonshot $2.00 $5.00 131K
Kimi Latest 128k Moonshot $2.00 $5.00 131K
Moonshot V1 128k Moonshot $2.00 $5.00 131K
Moonshot V1 128k 0430 Moonshot $2.00 $5.00 131K
Moonshot V1 128k Vision Preview Moonshot $2.00 $5.00 131K
Moonshot V1 Auto Moonshot $2.00 $5.00 131K
Meta.Llama 3.2 90b Vision Instruct Oci $2.00 $2.00 128K
Meta.Llama 3.2 11b Vision Instruct Oci $2.00 $2.00 128K
Gpt 4.1 NEW OpenAI $2.00 $8.00 1.0M
Gpt 4.1 2025 04 14 OpenAI $2.00 $8.00 1.0M
O3 NEW OpenAI $2.00 $8.00 200K
O3 2025 04 16 OpenAI $2.00 $8.00 200K
Google/Gemini 3 Pro Preview OpenRouter $2.00 $12.00 1.0M
Google/Gemini 3.1 Pro Preview OpenRouter $2.00 $12.00 1.0M
Openai/Gpt 4.1 OpenRouter $2.00 $8.00 1.0M
Sonar Deep Research Perplexity $2.00 $8.00 128K
Sonar Reasoning Pro Perplexity $2.00 $8.00 128K
Qwen/Qwen3 Coder 480B A35B Instruct FP8 Together AI $2.00 $2.00 256K
Mistral/Magistral Medium Vercel AI Gateway $2.00 $5.00 128K
Mistral/Mistral Large Vercel AI Gateway $2.00 $6.00 32K
Mistral/Pixtral Large Vercel AI Gateway $2.00 $6.00 128K
Openai/Gpt 4.1 Vercel AI Gateway $2.00 $8.00 1.0M
Openai/O3 Vercel AI Gateway $2.00 $8.00 200K
Perplexity/Sonar Reasoning Pro Vercel AI Gateway $2.00 $8.00 127K
Xai/Grok 2 Vercel AI Gateway $2.00 $10.00 131K
Xai/Grok 2 Vision Vercel AI Gateway $2.00 $10.00 33K
Gemini 3 Pro Preview Google Vertex AI $2.00 $12.00 1.0M
Gemini 3.1 Pro Preview Google Vertex AI $2.00 $12.00 1.0M
Gemini 3.1 Pro Preview Customtools Google Vertex AI $2.00 $12.00 1.0M
Gemini 3 Pro Preview Google Vertex AI $2.00 $12.00 1.0M
Gemini 3.1 Pro Preview Google Vertex AI $2.00 $12.00 1.0M
Gemini 3.1 Pro Preview Customtools Google Vertex AI $2.00 $12.00 1.0M
Jamba 1.5 Large Google Vertex AI $2.00 $8.00 256K
Jamba 1.5 Large Google Vertex AI $2.00 $8.00 256K
Mistral Large 2411 Google Vertex AI $2.00 $6.00 128K
Mistral Large Google Vertex AI $2.00 $6.00 128K
Mistral Large@2411 001 Google Vertex AI $2.00 $6.00 128K
Mistral Large@Latest Google Vertex AI $2.00 $6.00 128K
Meta Llama/Llama 3 2 90b Vision Instruct IBM Watsonx $2.00 $2.00 128K
Grok 2 Xai $2.00 $10.00 131K
Grok 2 1212 Xai $2.00 $10.00 131K
Grok 2 Latest Xai $2.00 $10.00 131K
Grok 2 Vision Xai $2.00 $10.00 33K
Grok 2 Vision 1212 Xai $2.00 $10.00 33K
Grok 2 Vision Latest Xai $2.00 $10.00 33K
Grok 4.20 Multi Agent Beta 0309 Xai $2.00 $6.00 2M
Grok 4.20 Beta 0309 Reasoning Xai $2.00 $6.00 2M
Grok 4.20 Beta 0309 Non Reasoning Xai $2.00 $6.00 2M
Llama 3.2 90B Vision Instruct Azure AI $2.04 $2.04 128K
Qwen/Qwen3 Max Novita AI $2.11 $8.45 262K
Amazon.Nova 2 Pro Preview 20251202 AWS Bedrock $2.19 $17.50 1M
Apac.Amazon.Nova 2 Pro Preview 20251202 AWS Bedrock $2.19 $17.50 1M
Eu.Amazon.Nova 2 Pro Preview 20251202 AWS Bedrock $2.19 $17.50 1M
Us.Amazon.Nova 2 Pro Preview 20251202 AWS Bedrock $2.19 $17.50 1M
Us/Gpt 4.1 2025 04 14 Azure OpenAI $2.20 $8.80 1.0M
Us/O3 2025 04 16 Azure OpenAI $2.20 $8.80 200K
Glm 4.5 X Zai $2.20 $8.90 128K
Ap Northeast 1/Anthropic.Claude Instant AWS Bedrock $2.23 $7.55 100K
Zai Glm 4.6 Cerebras $2.25 $2.75 128K
Zai Glm 4.7 Cerebras $2.25 $2.75 128K
Eu Central 1/Anthropic.Claude Instant AWS Bedrock $2.48 $8.38 100K
Nova Premier Amazon Nova $2.50 $12.50 1M
Global Standard/Gpt 4o 2024 08 06 Azure OpenAI $2.50 $10.00 128K
Global Standard/Gpt 4o 2024 11 20 Azure OpenAI $2.50 $10.00 128K
Global/Gpt 4o 2024 08 06 Azure OpenAI $2.50 $10.00 128K
Global/Gpt 4o 2024 11 20 Azure OpenAI $2.50 $10.00 128K
Gpt 4o Azure OpenAI $2.50 $10.00 128K
Gpt 4o 2024 08 06 Azure OpenAI $2.50 $10.00 128K
Gpt Audio 2025 08 28 Azure OpenAI $2.50 $10.00 128K
Gpt Audio 1.5 2026 02 23 Azure OpenAI $2.50 $10.00 128K
Gpt 4o Audio Preview 2024 12 17 Azure OpenAI $2.50 $10.00 128K
Gpt 4o Mini Audio Preview 2024 12 17 Azure OpenAI $2.50 $10.00 128K
Gpt 5.4 Azure OpenAI $2.50 $15.00 1.1M
Gpt 5.4 2026 03 05 Azure OpenAI $2.50 $15.00 1.1M
Us.Writer.Palmyra X4 AWS Bedrock $2.50 $10.00 128K
Writer.Palmyra X4 AWS Bedrock $2.50 $10.00 128K
Us.Amazon.Nova Premier AWS Bedrock $2.50 $12.50 1M
Command A 03 2025 Cohere $2.50 $10.00 256K
Command R Plus Cohere $2.50 $10.00 128K
Command R Plus 08 2024 Cohere $2.50 $10.00 128K
Openai/Gpt 4o Gmi $2.50 $10.00 131K
Gpt 4o OpenAI $2.50 $10.00 128K
Gpt 4o 2024 08 06 OpenAI $2.50 $10.00 128K
Gpt 4o 2024 11 20 OpenAI $2.50 $10.00 128K
Gpt 4o Audio Preview OpenAI $2.50 $10.00 128K
Gpt 4o Audio Preview 2024 12 17 OpenAI $2.50 $10.00 128K
Gpt 4o Audio Preview 2025 06 03 OpenAI $2.50 $10.00 128K
Gpt Audio OpenAI $2.50 $10.00 128K
Gpt Audio 1.5 OpenAI $2.50 $10.00 128K
Gpt Audio 2025 08 28 OpenAI $2.50 $10.00 128K
Gpt 4o Search Preview OpenAI $2.50 $10.00 128K
Gpt 4o Search Preview 2025 03 11 OpenAI $2.50 $10.00 128K
Gpt 5.4 OpenAI $2.50 $15.00 1.1M
Gpt 5.4 2026 03 05 OpenAI $2.50 $15.00 1.1M
Openai/Gpt 4o OpenRouter $2.50 $10.00 128K
Cohere/Command A Vercel AI Gateway $2.50 $10.00 256K
Cohere/Command R Plus Vercel AI Gateway $2.50 $10.00 128K
Google/Gemini 2.5 Pro Vercel AI Gateway $2.50 $10.00 1.0M
Openai/Gpt 4o Vercel AI Gateway $2.50 $10.00 128K
Us East 1/Meta.Llama3 70b Instruct AWS Bedrock $2.65 $3.50 8K
Us Gov East 1/Meta.Llama3 70b Instruct AWS Bedrock $2.65 $3.50 8K
Us Gov West 1/Meta.Llama3 70b Instruct AWS Bedrock $2.65 $3.50 8K
Us West 1/Meta.Llama3 70b Instruct AWS Bedrock $2.65 $3.50 8K
Meta.Llama3 70b Instruct AWS Bedrock $2.65 $3.50 8K
Meta Llama 3.1 70B Instruct Azure AI $2.68 $3.54 128K
Mistral Medium Mistral $2.70 $8.10 32K
Mistral Medium 2312 Mistral $2.70 $8.10 32K
Eu/Gpt 4o 2024 08 06 Azure OpenAI $2.75 $11.00 128K
Eu/Gpt 4o 2024 11 20 Azure OpenAI $2.75 $11.00 128K
Gpt 4o 2024 11 20 Azure OpenAI $2.75 $11.00 128K
Us/Gpt 4o 2024 08 06 Azure OpenAI $2.75 $11.00 128K
Us/Gpt 4o 2024 11 20 Azure OpenAI $2.75 $11.00 128K
Eu West 1/Meta.Llama3 70b Instruct AWS Bedrock $2.86 $3.78 8K
Databricks Claude 3 7 Sonnet Databricks $3.00 $15.00 200K
Databricks Claude Sonnet 4 Databricks $3.00 $15.00 200K
Databricks Claude Sonnet 4 1 Databricks $3.00 $15.00 200K
Databricks Claude Sonnet 4 5 Databricks $3.00 $15.00 200K
Claude 3 7 Sonnet Anthropic $3.00 $15.00 200K
Claude 4 Sonnet Anthropic $3.00 $15.00 1M
Claude Sonnet 4 5 Anthropic $3.00 $15.00 200K
Claude Sonnet 4 5 Anthropic $3.00 $15.00 200K
Claude Sonnet 4 6 Anthropic $3.00 $15.00 1M
Claude Sonnet 4 NEW Anthropic $3.00 $15.00 1M
Command R Plus Azure OpenAI $3.00 $15.00 128K
Computer Use Preview Azure OpenAI $3.00 $12.00 8K
Gpt 35 Turbo 16k Azure OpenAI $3.00 $4.00 16K
Gpt 35 Turbo 16k 0613 Azure OpenAI $3.00 $4.00 16K
Computer Use Preview Azure OpenAI $3.00 $12.00 8K
Claude Sonnet 4 5 Azure AI $3.00 $15.00 200K
Claude Sonnet 4 6 Azure AI $3.00 $15.00 1M
Global/Grok 3 Azure AI $3.00 $15.00 131K
Grok 3 Azure AI $3.00 $15.00 131K
Grok 4 Azure AI $3.00 $15.00 131K
Anthropic.Claude 3 5 Sonnet 20240620 AWS Bedrock $3.00 $15.00 1M
Anthropic.Claude 3 5 Sonnet 20241022 AWS Bedrock $3.00 $15.00 1M
Anthropic.Claude 3 7 Sonnet 20250219 AWS Bedrock $3.00 $15.00 200K
Anthropic.Claude 3 Sonnet 20240229 AWS Bedrock $3.00 $15.00 200K
Anthropic.Claude Sonnet 4 6 AWS Bedrock $3.00 $15.00 1M
Global.Anthropic.Claude Sonnet 4 6 AWS Bedrock $3.00 $15.00 1M
Anthropic.Claude Sonnet 4 20250514 AWS Bedrock $3.00 $15.00 1M
Anthropic.Claude Sonnet 4 5 20250929 AWS Bedrock $3.00 $15.00 200K
Apac.Anthropic.Claude 3 5 Sonnet 20240620 AWS Bedrock $3.00 $15.00 200K
Apac.Anthropic.Claude 3 5 Sonnet 20241022 AWS Bedrock $3.00 $15.00 200K
Apac.Anthropic.Claude 3 Sonnet 20240229 AWS Bedrock $3.00 $15.00 200K
Apac.Anthropic.Claude Sonnet 4 20250514 AWS Bedrock $3.00 $15.00 1M
Invoke/Anthropic.Claude 3 5 Sonnet 20240620 AWS Bedrock $3.00 $15.00 200K
Claude Sonnet 4 5 20250929 AWS Bedrock $3.00 $15.00 200K
Cohere.Command R Plus AWS Bedrock $3.00 $15.00 128K
Eu.Anthropic.Claude 3 5 Sonnet 20240620 AWS Bedrock $3.00 $15.00 200K
Eu.Anthropic.Claude 3 5 Sonnet 20241022 AWS Bedrock $3.00 $15.00 200K
Eu.Anthropic.Claude 3 7 Sonnet 20250219 AWS Bedrock $3.00 $15.00 200K
Eu.Anthropic.Claude 3 Sonnet 20240229 AWS Bedrock $3.00 $15.00 200K
Eu.Anthropic.Claude Sonnet 4 20250514 AWS Bedrock $3.00 $15.00 1M
Global.Anthropic.Claude Sonnet 4 5 20250929 AWS Bedrock $3.00 $15.00 200K
Global.Anthropic.Claude Sonnet 4 20250514 AWS Bedrock $3.00 $15.00 1M
Mistral.Mistral Large 2407 AWS Bedrock $3.00 $9.00 128K
Us.Anthropic.Claude 3 5 Sonnet 20240620 AWS Bedrock $3.00 $15.00 200K
Us.Anthropic.Claude 3 5 Sonnet 20241022 AWS Bedrock $3.00 $15.00 200K
Us.Anthropic.Claude 3 7 Sonnet 20250219 AWS Bedrock $3.00 $15.00 200K
Us.Anthropic.Claude 3 Sonnet 20240229 AWS Bedrock $3.00 $15.00 200K
Us.Anthropic.Claude Sonnet 4 20250514 AWS Bedrock $3.00 $15.00 1M
Accounts/Fireworks/Models/Deepseek R1 Fireworks AI $3.00 $8.00 128K
Accounts/Fireworks/Models/Deepseek R1 0528 Fireworks AI $3.00 $8.00 160K
Accounts/Fireworks/Models/Llama V3p1 405b Instruct Fireworks AI $3.00 $3.00 128K
Accounts/Fireworks/Models/Yi Large Fireworks AI $3.00 $3.00 33K
Anthropic/Claude Sonnet 4.5 Gmi $3.00 $15.00 410K
Anthropic/Claude Sonnet 4 Gmi $3.00 $15.00 410K
Mistral Large 2407 Mistral $3.00 $9.00 128K
Xai.Grok 3 Oci $3.00 $15.00 131K
Xai.Grok 4 Oci $3.00 $15.00 128K
Xai.Grok 4.20 Oci $3.00 $15.00 131K
Xai.Grok 4.20 Multi Agent Oci $3.00 $15.00 131K
Ft:Gpt 3.5 Turbo OpenAI $3.00 $6.00 16K
Ft:Gpt 3.5 Turbo 0125 OpenAI $3.00 $6.00 16K
Ft:Gpt 3.5 Turbo 0613 OpenAI $3.00 $6.00 4K
Ft:Gpt 3.5 Turbo 1106 OpenAI $3.00 $6.00 16K
Ft:Gpt 4.1 2025 04 14 OpenAI $3.00 $12.00 1.0M
Gpt 3.5 Turbo 16k OpenAI $3.00 $4.00 16K
Anthropic/Claude 3.5 Sonnet OpenRouter $3.00 $15.00 200K
Anthropic/Claude 3.7 Sonnet OpenRouter $3.00 $15.00 200K
Anthropic/Claude Sonnet 4 OpenRouter $3.00 $15.00 1M
Anthropic/Claude Sonnet 4.6 OpenRouter $3.00 $15.00 1M
Anthropic/Claude Sonnet 4.5 OpenRouter $3.00 $15.00 1M
X Ai/Grok 4 OpenRouter $3.00 $15.00 256K
Sonar Pro Perplexity $3.00 $15.00 200K
DeepSeek V3 0324 SambaNova $3.00 $4.50 33K
DeepSeek V3.1 SambaNova $3.00 $4.50 33K
Gpt Oss 120b SambaNova $3.00 $4.50 131K
Deepseek Ai/DeepSeek R1 Together AI $3.00 $7.00 128K
V0 1.0 Md V0 $3.00 $15.00 128K
V0 1.5 Md V0 $3.00 $15.00 128K
Anthropic/Claude 3.5 Sonnet Vercel AI Gateway $3.00 $15.00 200K
Anthropic/Claude 3.7 Sonnet Vercel AI Gateway $3.00 $15.00 200K
Anthropic/Claude 4 Sonnet Vercel AI Gateway $3.00 $15.00 200K
Anthropic/Claude 3 5 Sonnet Vercel AI Gateway $3.00 $15.00 200K
Anthropic/Claude 3 5 Sonnet Vercel AI Gateway $3.00 $15.00 200K
Anthropic/Claude 3 7 Sonnet Vercel AI Gateway $3.00 $15.00 200K
Anthropic/Claude Sonnet 4 Vercel AI Gateway $3.00 $15.00 200K
Anthropic/Claude Sonnet 4.5 Vercel AI Gateway $3.00 $15.00 1M
Perplexity/Sonar Pro Vercel AI Gateway $3.00 $15.00 200K
Vercel/V0 1.0 Md Vercel AI Gateway $3.00 $15.00 128K
Vercel/V0 1.5 Md Vercel AI Gateway $3.00 $15.00 128K
Xai/Grok 3 Vercel AI Gateway $3.00 $15.00 131K
Xai/Grok 4 Vercel AI Gateway $3.00 $15.00 256K
Claude 3 5 Sonnet Google Vertex AI $3.00 $15.00 200K
Claude 3 5 Sonnet Google Vertex AI $3.00 $15.00 200K
Claude 3 7 Sonnet Google Vertex AI $3.00 $15.00 200K
Claude 3 Sonnet Google Vertex AI $3.00 $15.00 200K
Claude 3 Sonnet Google Vertex AI $3.00 $15.00 200K
Claude Sonnet 4 5 Google Vertex AI $3.00 $15.00 200K
Claude Sonnet 4 6 Google Vertex AI $3.00 $15.00 1M
Claude Sonnet 4 5 Google Vertex AI $3.00 $15.00 200K
Claude Sonnet 4 Google Vertex AI $3.00 $15.00 1M
Claude Sonnet 4 Google Vertex AI $3.00 $15.00 1M
Mistral Nemo Google Vertex AI $3.00 $3.00 128K
Claude Sonnet 4 6@Default Google Vertex AI $3.00 $15.00 1M
Mistralai/Mistral Large IBM Watsonx $3.00 $10.00 131K
Mistralai/Mistral Medium 2505 IBM Watsonx $3.00 $10.00 128K
Grok 3 Xai $3.00 $15.00 131K
Grok 3 Beta Xai $3.00 $15.00 131K
Grok 3 Latest Xai $3.00 $15.00 131K
Grok 4 Xai $3.00 $15.00 256K
Grok 4 0709 Xai $3.00 $15.00 256K
Grok 4 Latest Xai $3.00 $15.00 256K
Ca Central 1/Meta.Llama3 70b Instruct AWS Bedrock $3.05 $4.03 8K
Ap South 1/Meta.Llama3 70b Instruct AWS Bedrock $3.18 $4.20 8K
Us.Anthropic.Claude Sonnet 4 6 AWS Bedrock $3.30 $16.50 1M
Eu.Anthropic.Claude Sonnet 4 6 AWS Bedrock $3.30 $16.50 1M
Au.Anthropic.Claude Sonnet 4 6 AWS Bedrock $3.30 $16.50 1M
Au.Anthropic.Claude Sonnet 4 5 20250929 AWS Bedrock $3.30 $16.50 200K
Eu.Anthropic.Claude Sonnet 4 5 20250929 AWS Bedrock $3.30 $16.50 200K
Jp.Anthropic.Claude Sonnet 4 5 20250929 AWS Bedrock $3.30 $16.50 200K
Us.Anthropic.Claude Sonnet 4 5 20250929 AWS Bedrock $3.30 $16.50 200K
Anthropic/Claude 3 7 Sonnet Latest DeepInfra $3.30 $16.50 200K
Anthropic/Claude 4 Sonnet DeepInfra $3.30 $16.50 200K
Eu West 2/Meta.Llama3 70b Instruct AWS Bedrock $3.45 $4.55 8K
Anthropic.Claude 3 7 Sonnet 20240620 AWS Bedrock $3.60 $18.00 200K
Us Gov East 1/Anthropic.Claude 3 5 Sonnet 20240620 AWS Bedrock $3.60 $18.00 200K
Us Gov East 1/Claude Sonnet 4 5 20250929 AWS Bedrock $3.60 $18.00 200K
Us Gov West 1/Anthropic.Claude 3 7 Sonnet 20250219 AWS Bedrock $3.60 $18.00 200K
Us Gov West 1/Anthropic.Claude 3 5 Sonnet 20240620 AWS Bedrock $3.60 $18.00 200K
Us Gov West 1/Claude Sonnet 4 5 20250929 AWS Bedrock $3.60 $18.00 200K
Ft:Gpt 4o 2024 08 06 OpenAI $3.75 $15.00 128K
Ft:Gpt 4o 2024 11 20 OpenAI $3.75 $15.00 128K
Deepseek Ai/Deepseek R1 Replicate $3.75 $10.00 66K
Gpt Realtime 2025 08 28 Azure OpenAI $4.00 $16.00 32K
Gpt Realtime 1.5 2026 02 23 Azure OpenAI $4.00 $16.00 32K
Mistral Large Azure AI $4.00 $12.00 32K
Mistral Large 2402 Mistral $4.00 $12.00 32K
Ft:O4 Mini 2025 04 16 OpenAI $4.00 $16.00 200K
Gpt Realtime OpenAI $4.00 $16.00 32K
Gpt Realtime 1.5 OpenAI $4.00 $16.00 32K
Gpt Realtime 2025 08 28 OpenAI $4.00 $16.00 32K
Sa East 1/Meta.Llama3 70b Instruct AWS Bedrock $4.45 $5.88 8K
Claude Opus 4 5 Anthropic $5.00 $25.00 200K
Claude Opus 4 5 Anthropic $5.00 $25.00 200K
Claude Opus 4 6 Anthropic $5.00 $25.00 1M
Claude Opus 4 6 Anthropic $5.00 $25.00 1M
Gpt 4o 2024 05 13 Azure OpenAI $5.00 $15.00 128K
Gpt 4o Realtime Preview 2024 10 01 Azure OpenAI $5.00 $20.00 128K
Gpt 4o Realtime Preview 2024 12 17 Azure OpenAI $5.00 $20.00 128K
Claude Opus 4 5 Azure AI $5.00 $25.00 200K
Claude Opus 4 6 Azure AI $5.00 $25.00 200K
Anthropic.Claude Opus 4 5 20251101 AWS Bedrock $5.00 $25.00 200K
Anthropic.Claude Opus 4 6 AWS Bedrock $5.00 $25.00 1M
Global.Anthropic.Claude Opus 4 6 AWS Bedrock $5.00 $25.00 1M
Global.Anthropic.Claude Opus 4 5 20251101 AWS Bedrock $5.00 $25.00 200K
Eu.Anthropic.Claude Opus 4 5 20251101 AWS Bedrock $5.00 $25.00 200K
Anthropic/Claude Opus 4.5 Gmi $5.00 $25.00 410K
Xai.Grok 3 Fast Oci $5.00 $25.00 131K
Xai.Grok 4 Fast Oci $5.00 $25.00 131K
Xai.Grok 4.1 Fast Oci $5.00 $25.00 131K
Xai.Grok Code Fast 1 Oci $5.00 $25.00 131K
Chatgpt 4o Latest OpenAI $5.00 $15.00 128K
Gpt 4o 2024 05 13 OpenAI $5.00 $15.00 128K
Gpt 4o Realtime Preview OpenAI $5.00 $20.00 128K
Gpt 4o Realtime Preview 2024 12 17 OpenAI $5.00 $20.00 128K
Gpt 4o Realtime Preview 2025 06 03 OpenAI $5.00 $20.00 128K
Anthropic/Claude Opus 4.5 OpenRouter $5.00 $25.00 200K
Anthropic/Claude Opus 4.6 OpenRouter $5.00 $25.00 1M
Openai/Gpt 4o 2024 05 13 OpenRouter $5.00 $15.00 128K
DeepSeek R1 SambaNova $5.00 $7.00 33K
Meta Llama 3.1 405B Instruct SambaNova $5.00 $10.00 16K
Anthropic/Claude Opus 4.5 Vercel AI Gateway $5.00 $25.00 200K
Anthropic/Claude Opus 4.6 Vercel AI Gateway $5.00 $25.00 200K
Xai/Grok 3 Fast Vercel AI Gateway $5.00 $25.00 131K
Claude Opus 4 5 Google Vertex AI $5.00 $25.00 200K
Claude Opus 4 5 Google Vertex AI $5.00 $25.00 200K
Claude Opus 4 6 Google Vertex AI $5.00 $25.00 1M
Claude Opus 4 6@Default Google Vertex AI $5.00 $25.00 1M
Meta/Llama 3.1 405b Instruct Maas Google Vertex AI $5.00 $16.00 128K
Grok 3 Fast Beta Xai $5.00 $25.00 131K
Grok 3 Fast Latest Xai $5.00 $25.00 131K
Grok Beta Xai $5.00 $15.00 131K
Grok Vision Beta Xai $5.00 $15.00 8K
Databricks Claude Opus 4 5 Databricks $5.00 $25.00 200K
Databricks Meta Llama 3 1 405b Instruct Databricks $5.00 $15.00 128K
Meta.Llama3 1 405b Instruct AWS Bedrock $5.32 $16.00 128K
Us.Meta.Llama3 1 405b Instruct AWS Bedrock $5.32 $16.00 128K
Meta Llama 3.1 405B Instruct Azure AI $5.33 $16.00 128K
Eu/Gpt 4o Realtime Preview 2024 10 01 Azure OpenAI $5.50 $22.00 128K
Eu/Gpt 4o Realtime Preview 2024 12 17 Azure OpenAI $5.50 $22.00 128K
Us/Gpt 4o Realtime Preview 2024 10 01 Azure OpenAI $5.50 $22.00 128K
Us/Gpt 4o Realtime Preview 2024 12 17 Azure OpenAI $5.50 $22.00 128K
Us.Anthropic.Claude Opus 4 6 AWS Bedrock $5.50 $27.50 1M
Eu.Anthropic.Claude Opus 4 6 AWS Bedrock $5.50 $27.50 1M
Au.Anthropic.Claude Opus 4 6 AWS Bedrock $5.50 $27.50 1M
Us.Anthropic.Claude Opus 4 5 20251101 AWS Bedrock $5.50 $27.50 200K
Mistral Large 2402 Azure OpenAI $8.00 $24.00 32K
Mistral Large Latest Azure OpenAI $8.00 $24.00 32K
Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Ap Northeast 1/Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Ap Northeast 1/Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Eu Central 1/Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Eu Central 1/Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Us East 1/Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Us East 1/Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Us East 1/Mistral.Mistral Large 2402 AWS Bedrock $8.00 $24.00 32K
Us West 2/Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Us West 2/Anthropic.Claude AWS Bedrock $8.00 $24.00 100K
Us West 2/Mistral.Mistral Large 2402 AWS Bedrock $8.00 $24.00 32K
Mistral.Mistral Large 2402 AWS Bedrock $8.00 $24.00 32K
Gpt 4 0125 Preview Azure OpenAI $10.00 $30.00 128K
Gpt 4 1106 Preview Azure OpenAI $10.00 $30.00 128K
Gpt 4 Turbo Azure OpenAI $10.00 $30.00 128K
Gpt 4 Turbo 2024 04 09 Azure OpenAI $10.00 $30.00 128K
Gpt 4 Turbo Vision Preview Azure OpenAI $10.00 $30.00 128K
Gpt 4 0125 Preview OpenAI $10.00 $30.00 128K
Gpt 4 1106 Preview OpenAI $10.00 $30.00 128K
Gpt 4 Turbo OpenAI $10.00 $30.00 128K
Gpt 4 Turbo 2024 04 09 OpenAI $10.00 $30.00 128K
Gpt 4 Turbo Preview OpenAI $10.00 $30.00 128K
Openai/Gpt 4 Turbo Vercel AI Gateway $10.00 $30.00 128K
Eu West 3/Mistral.Mistral Large 2402 AWS Bedrock $10.40 $31.20 32K
Meta.Llama 3.1 405b Instruct Oci $10.68 $10.68 128K
Ai21.J2 Mid AWS Bedrock $12.50 $12.50 8K
Claude 3 Opus Anthropic $15.00 $75.00 200K
Claude 4 Opus Anthropic $15.00 $75.00 200K
Claude Opus 4 1 Anthropic $15.00 $75.00 200K
Claude Opus 4 1 Anthropic $15.00 $75.00 200K
Claude Opus 4 Anthropic $15.00 $75.00 200K
O1 Azure OpenAI $15.00 $60.00 200K
O1 2024 12 17 Azure OpenAI $15.00 $60.00 200K
O1 Preview Azure OpenAI $15.00 $60.00 128K
O1 Preview 2024 09 12 Azure OpenAI $15.00 $60.00 128K
Claude Opus 4 1 Azure AI $15.00 $75.00 200K
Anthropic.Claude 3 Opus 20240229 AWS Bedrock $15.00 $75.00 200K
Anthropic.Claude Opus 4 1 20250805 AWS Bedrock $15.00 $75.00 200K
Anthropic.Claude Opus 4 20250514 AWS Bedrock $15.00 $75.00 200K
Eu.Anthropic.Claude 3 Opus 20240229 AWS Bedrock $15.00 $75.00 200K
Eu.Anthropic.Claude Opus 4 1 20250805 AWS Bedrock $15.00 $75.00 200K
Eu.Anthropic.Claude Opus 4 20250514 AWS Bedrock $15.00 $75.00 200K
Us.Anthropic.Claude 3 Opus 20240229 AWS Bedrock $15.00 $75.00 200K
Us.Anthropic.Claude Opus 4 1 20250805 AWS Bedrock $15.00 $75.00 200K
Us.Anthropic.Claude Opus 4 20250514 AWS Bedrock $15.00 $75.00 200K
Databricks Claude Opus 4 Databricks $15.00 $75.00 200K
Databricks Claude Opus 4 1 Databricks $15.00 $75.00 200K
Anthropic/Claude Opus 4 Gmi $15.00 $75.00 410K
O1 OpenAI $15.00 $60.00 200K
O1 2024 12 17 OpenAI $15.00 $60.00 200K
Anthropic/Claude Opus 4 OpenRouter $15.00 $75.00 200K
Anthropic/Claude Opus 4.1 OpenRouter $15.00 $75.00 200K
Openai/O1 OpenRouter $15.00 $60.00 200K
V0 1.5 Lg V0 $15.00 $75.00 512K
Anthropic/Claude 3 Opus Vercel AI Gateway $15.00 $75.00 200K
Anthropic/Claude 4 Opus Vercel AI Gateway $15.00 $75.00 200K
Anthropic/Claude Opus 4 Vercel AI Gateway $15.00 $75.00 200K
Anthropic/Claude Opus 4.1 Vercel AI Gateway $15.00 $75.00 200K
Openai/O1 Vercel AI Gateway $15.00 $60.00 200K
Claude 3 Opus Google Vertex AI $15.00 $75.00 200K
Claude 3 Opus Google Vertex AI $15.00 $75.00 200K
Claude Opus 4 Google Vertex AI $15.00 $75.00 200K
Claude Opus 4 1 Google Vertex AI $15.00 $75.00 200K
Claude Opus 4 1 Google Vertex AI $15.00 $75.00 200K
Claude Opus 4 Google Vertex AI $15.00 $75.00 200K
Eu/O1 2024 12 17 Azure OpenAI $16.50 $66.00 200K
Eu/O1 Preview 2024 09 12 Azure OpenAI $16.50 $66.00 128K
Us/O1 2024 12 17 Azure OpenAI $16.50 $66.00 200K
Us/O1 Preview 2024 09 12 Azure OpenAI $16.50 $66.00 128K
Anthropic/Claude 4 Opus DeepInfra $16.50 $82.50 200K
Ai21.J2 Ultra AWS Bedrock $18.80 $18.80 8K
Openai/Gpt 5.2 Pro OpenRouter $21.00 $168.00 272K
Gpt 4 Azure OpenAI $30.00 $60.00 8K
Gpt 4 0613 Azure OpenAI $30.00 $60.00 8K
Ft:Gpt 4 0613 OpenAI $30.00 $60.00 8K
Gpt 4 OpenAI $30.00 $60.00 8K
Gpt 4 0314 OpenAI $30.00 $60.00 8K
Gpt 4 0613 OpenAI $30.00 $60.00 8K
Gpt 4 32k Azure OpenAI $60.00 $120.00 33K
Gpt 4 32k 0613 Azure OpenAI $60.00 $120.00 33K
Gpt 4.5 Preview Azure OpenAI $75.00 $150.00 128K
Bigscience/Mt0 Xxl 13b IBM Watsonx $500.00 $2000.00 8K
Core42/Jais 13b Chat IBM Watsonx $500.00 $2000.00 8K
Jais 30b Chat Azure AI $3200.00 $9710.00 8K
Openai/Gpt Oss 20b Wandb $5000.00 $20000.00 131K
Microsoft/Phi 4 Mini Instruct Wandb $8000.00 $35000.00 128K
Qwen/Qwen3 235B A22B Instruct 2507 Wandb $10000.00 $10000.00 262K
Qwen/Qwen3 235B A22B Thinking 2507 Wandb $10000.00 $10000.00 262K
Openai/Gpt Oss 120b Wandb $15000.00 $60000.00 131K
Meta Llama/Llama 4 Scout 17B 16E Instruct Wandb $17000.00 $66000.00 64K
Meta Llama/Llama 3.1 8B Instruct Wandb $22000.00 $22000.00 128K
Zai Org/GLM 4.5 Wandb $55000.00 $200000.00 131K
Deepseek Ai/DeepSeek V3.1 Wandb $55000.00 $165000.00 128K
Meta Llama/Llama 3.3 70B Instruct Wandb $71000.00 $71000.00 128K
Qwen/Qwen3 Coder 480B A35B Instruct Wandb $100000.00 $150000.00 262K
Deepseek Ai/DeepSeek V3 0324 Wandb $114000.00 $275000.00 161K
Deepseek Ai/DeepSeek R1 0528 Wandb $135000.00 $540000.00 161K

AWS Bedrock Models

View provider details →

Amazon.Nova Micro

Amazon.Nova Micro is available via AWS Bedrock with a 128K context window and up to 10,000 output tokens. Pricing: $0.0350/1M input tokens, $0.1400/1M output tokens.

$0.035 / 1M in 128K context

Us.Amazon.Nova Micro

Us.Amazon.Nova Micro is available via AWS Bedrock with a 128K context window and up to 10,000 output tokens. Pricing: $0.0350/1M input tokens, $0.1400/1M output tokens.

$0.035 / 1M in 128K context

Apac.Amazon.Nova Micro

Apac.Amazon.Nova Micro is available via AWS Bedrock with a 128K context window and up to 10,000 output tokens. Pricing: $0.0370/1M input tokens, $0.1480/1M output tokens.

$0.037 / 1M in 128K context

Google.Gemma 3 4b It

Google.Gemma 3 4b It is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.0400/1M input tokens, $0.0800/1M output tokens.

$0.040 / 1M in 128K context

Mistral.Voxtral Mini 3b 2507

Mistral.Voxtral Mini 3b 2507 is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.0400/1M input tokens, $0.0400/1M output tokens.

$0.040 / 1M in 128K context

Eu.Amazon.Nova Micro

Eu.Amazon.Nova Micro is available via AWS Bedrock with a 128K context window and up to 10,000 output tokens. Pricing: $0.0460/1M input tokens, $0.1840/1M output tokens.

$0.046 / 1M in 128K context

Amazon.Nova Lite

Amazon.Nova Lite is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.0600/1M input tokens, $0.2400/1M output tokens.

$0.060 / 1M in 300K context

Nvidia.Nemotron Nano 9b

Nvidia.Nemotron Nano 9b is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.0600/1M input tokens, $0.2300/1M output tokens.

$0.060 / 1M in 128K context

Nvidia.Nemotron Nano 3 30b

Nvidia.Nemotron Nano 3 30b is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.0600/1M input tokens, $0.2400/1M output tokens.

$0.060 / 1M in 262K context

Us.Amazon.Nova Lite

Us.Amazon.Nova Lite is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.0600/1M input tokens, $0.2400/1M output tokens.

$0.060 / 1M in 300K context

Apac.Amazon.Nova Lite

Apac.Amazon.Nova Lite is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.0630/1M input tokens, $0.2520/1M output tokens.

$0.063 / 1M in 300K context

Openai.Gpt Oss 20b 1

Openai.Gpt Oss 20b 1 is available via AWS Bedrock with a 128K context window and up to 128,000 output tokens. Pricing: $0.0700/1M input tokens, $0.3000/1M output tokens.

$0.070 / 1M in 128K context

Openai.Gpt Oss Safeguard 20b

Openai.Gpt Oss Safeguard 20b is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.0700/1M input tokens, $0.2000/1M output tokens.

$0.070 / 1M in 128K context

Zai.Glm 4.7 Flash

Zai.Glm 4.7 Flash is available via AWS Bedrock with a 200K context window and up to 128,000 output tokens. Pricing: $0.0700/1M input tokens, $0.4000/1M output tokens.

$0.070 / 1M in 200K context

Eu.Amazon.Nova Lite

Eu.Amazon.Nova Lite is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.0780/1M input tokens, $0.3120/1M output tokens.

$0.078 / 1M in 300K context

Google.Gemma 3 12b It

Google.Gemma 3 12b It is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.0900/1M input tokens, $0.2900/1M output tokens.

$0.090 / 1M in 128K context

Meta.Llama3 2 1b Instruct

Meta.Llama3 2 1b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 128K context

Mistral.Ministral 3 3b Instruct

Mistral.Ministral 3 3b Instruct is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 128K context

Mistral.Voxtral Small 24b 2507

Mistral.Voxtral Small 24b 2507 is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 128K context

Us.Meta.Llama3 2 1b Instruct

Us.Meta.Llama3 2 1b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 128K context

Eu.Meta.Llama3 2 1b Instruct

Eu.Meta.Llama3 2 1b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.1300/1M output tokens.

$0.13 / 1M in 128K context

Us East 1/Mistral.Mistral 7b Instruct

Us East 1/Mistral.Mistral 7b Instruct is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $0.1500/1M input tokens, $0.2000/1M output tokens.

$0.15 / 1M in 32K context

Us West 2/Mistral.Mistral 7b Instruct

Us West 2/Mistral.Mistral 7b Instruct is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $0.1500/1M input tokens, $0.2000/1M output tokens.

$0.15 / 1M in 32K context

Meta.Llama3 2 3b Instruct

Meta.Llama3 2 3b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Mistral.Ministral 3 8b Instruct

Mistral.Ministral 3 8b Instruct is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Mistral.Mistral 7b Instruct

Mistral.Mistral 7b Instruct is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $0.1500/1M input tokens, $0.2000/1M output tokens.

$0.15 / 1M in 32K context

Nvidia.Nemotron Super 3 120b

Nvidia.Nemotron Super 3 120b is available via AWS Bedrock with a 256K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $0.6500/1M output tokens.

$0.15 / 1M in 256K context

Openai.Gpt Oss 120b 1

Openai.Gpt Oss 120b 1 is available via AWS Bedrock with a 128K context window and up to 128,000 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Openai.Gpt Oss Safeguard 120b

Openai.Gpt Oss Safeguard 120b is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Qwen.Qwen3 Coder 30b A3b

Qwen.Qwen3 Coder 30b A3b is available via AWS Bedrock with a 262K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 262K context

Qwen.Qwen3 32b

Qwen.Qwen3 32b is available via AWS Bedrock with a 131K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Qwen.Qwen3 Next 80b A3b

Qwen.Qwen3 Next 80b A3b is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $1.20/1M output tokens.

$0.15 / 1M in 128K context

Us.Meta.Llama3 2 3b Instruct

Us.Meta.Llama3 2 3b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Meta.Llama4 Scout 17b Instruct

Meta.Llama4 Scout 17b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.1700/1M input tokens, $0.6600/1M output tokens.

$0.17 / 1M in 128K context

Us.Meta.Llama4 Scout 17b Instruct

Us.Meta.Llama4 Scout 17b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.1700/1M input tokens, $0.6600/1M output tokens.

$0.17 / 1M in 128K context

Eu.Meta.Llama3 2 3b Instruct

Eu.Meta.Llama3 2 3b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.1900/1M input tokens, $0.1900/1M output tokens.

$0.19 / 1M in 128K context

Ai21.Jamba 1 5 Mini

Ai21.Jamba 1 5 Mini is available via AWS Bedrock with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Eu West 3/Mistral.Mistral 7b Instruct

Eu West 3/Mistral.Mistral 7b Instruct is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $0.2000/1M input tokens, $0.2600/1M output tokens.

$0.20 / 1M in 32K context

Mistral.Ministral 3 14b Instruct

Mistral.Ministral 3 14b Instruct is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 128K context

Nvidia.Nemotron Nano 12b

Nvidia.Nemotron Nano 12b is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 128K context

Meta.Llama3 1 8b Instruct

Meta.Llama3 1 8b Instruct is available via AWS Bedrock with a 128K context window and up to 2,048 output tokens. Pricing: $0.2200/1M input tokens, $0.2200/1M output tokens.

$0.22 / 1M in 128K context

Qwen.Qwen3 Coder 480b A35b

Qwen.Qwen3 Coder 480b A35b is available via AWS Bedrock with a 262K context window and up to 65,536 output tokens. Pricing: $0.2200/1M input tokens, $1.80/1M output tokens.

$0.22 / 1M in 262K context

Qwen.Qwen3 235b A22b 2507

Qwen.Qwen3 235b A22b 2507 is available via AWS Bedrock with a 262K context window and up to 131,072 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

$0.22 / 1M in 262K context

Us.Meta.Llama3 1 8b Instruct

Us.Meta.Llama3 1 8b Instruct is available via AWS Bedrock with a 128K context window and up to 2,048 output tokens. Pricing: $0.2200/1M input tokens, $0.2200/1M output tokens.

$0.22 / 1M in 128K context

Google.Gemma 3 27b It

Google.Gemma 3 27b It is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.2300/1M input tokens, $0.3800/1M output tokens.

$0.23 / 1M in 128K context

Meta.Llama4 Maverick 17b Instruct

Meta.Llama4 Maverick 17b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.2400/1M input tokens, $0.9700/1M output tokens.

$0.24 / 1M in 128K context

Us.Meta.Llama4 Maverick 17b Instruct

Us.Meta.Llama4 Maverick 17b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.2400/1M input tokens, $0.9700/1M output tokens.

$0.24 / 1M in 128K context

Anthropic.Claude 3 Haiku 20240307

Anthropic.Claude 3 Haiku 20240307 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Apac.Anthropic.Claude 3 Haiku 20240307

Apac.Anthropic.Claude 3 Haiku 20240307 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Eu.Anthropic.Claude 3 5 Haiku 20241022

Eu.Anthropic.Claude 3 5 Haiku 20241022 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Eu.Anthropic.Claude 3 Haiku 20240307

Eu.Anthropic.Claude 3 Haiku 20240307 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Us.Anthropic.Claude 3 Haiku 20240307

Us.Anthropic.Claude 3 Haiku 20240307 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Amazon.Nova 2 Lite

Amazon.Nova 2 Lite is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1M context

Amazon.Titan Text Lite

Amazon.Titan Text Lite is available via AWS Bedrock with a 42K context window and up to 4,000 output tokens. Pricing: $0.3000/1M input tokens, $0.4000/1M output tokens.

$0.30 / 1M in 42K context

Us East 1/Meta.Llama3 8b Instruct

Us East 1/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $0.6000/1M output tokens.

$0.30 / 1M in 8K context

Us East 1/Minimax.Minimax M2.1

Us East 1/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 196K context

Us East 1/Minimax.Minimax M2.5

Us East 1/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 1M context

Us East 2/Minimax.Minimax M2.1

Us East 2/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 196K context

Us East 2/Minimax.Minimax M2.5

Us East 2/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 1M context

Us Gov East 1/Amazon.Titan Text Lite

Us Gov East 1/Amazon.Titan Text Lite is available via AWS Bedrock with a 42K context window and up to 4,000 output tokens. Pricing: $0.3000/1M input tokens, $0.4000/1M output tokens.

$0.30 / 1M in 42K context

Us Gov East 1/Anthropic.Claude 3 Haiku 20240307

Us Gov East 1/Anthropic.Claude 3 Haiku 20240307 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $0.3000/1M input tokens, $1.50/1M output tokens.

$0.30 / 1M in 200K context

Us Gov East 1/Meta.Llama3 8b Instruct

Us Gov East 1/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 2,048 output tokens. Pricing: $0.3000/1M input tokens, $2.65/1M output tokens.

$0.30 / 1M in 8K context

Us Gov West 1/Amazon.Titan Text Lite

Us Gov West 1/Amazon.Titan Text Lite is available via AWS Bedrock with a 42K context window and up to 4,000 output tokens. Pricing: $0.3000/1M input tokens, $0.4000/1M output tokens.

$0.30 / 1M in 42K context

Us Gov West 1/Anthropic.Claude 3 Haiku 20240307

Us Gov West 1/Anthropic.Claude 3 Haiku 20240307 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $0.3000/1M input tokens, $1.50/1M output tokens.

$0.30 / 1M in 200K context

Us Gov West 1/Meta.Llama3 8b Instruct

Us Gov West 1/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 2,048 output tokens. Pricing: $0.3000/1M input tokens, $2.65/1M output tokens.

$0.30 / 1M in 8K context

Us West 1/Meta.Llama3 8b Instruct

Us West 1/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $0.6000/1M output tokens.

$0.30 / 1M in 8K context

Us West 2/Minimax.Minimax M2.1

Us West 2/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 196K context

Us West 2/Minimax.Minimax M2.5

Us West 2/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 1M context

Cohere.Command Light Text

Cohere.Command Light Text is available via AWS Bedrock with a 4K context window and up to 4,096 output tokens. Pricing: $0.3000/1M input tokens, $0.6000/1M output tokens.

$0.30 / 1M in 4K context

Global.Amazon.Nova 2 Lite

Global.Amazon.Nova 2 Lite is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1M context

Meta.Llama3 8b Instruct

Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $0.6000/1M output tokens.

$0.30 / 1M in 8K context

Minimax.Minimax M2

Minimax.Minimax M2 is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 128K context

Minimax.Minimax M2.1

Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 196K context

Minimax.Minimax M2.5

Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 1M context

Ap Southeast 2/Minimax.Minimax M2.5

Ap Southeast 2/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3090/1M input tokens, $1.24/1M output tokens.

$0.31 / 1M in 1M context

Eu West 1/Meta.Llama3 8b Instruct

Eu West 1/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $0.3200/1M input tokens, $0.6500/1M output tokens.

$0.32 / 1M in 8K context

Apac.Amazon.Nova 2 Lite

Apac.Amazon.Nova 2 Lite is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $0.3300/1M input tokens, $2.75/1M output tokens.

$0.33 / 1M in 1M context

Eu.Amazon.Nova 2 Lite

Eu.Amazon.Nova 2 Lite is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $0.3300/1M input tokens, $2.75/1M output tokens.

$0.33 / 1M in 1M context

Us.Amazon.Nova 2 Lite

Us.Amazon.Nova 2 Lite is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $0.3300/1M input tokens, $2.75/1M output tokens.

$0.33 / 1M in 1M context

Ca Central 1/Meta.Llama3 8b Instruct

Ca Central 1/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $0.3500/1M input tokens, $0.6900/1M output tokens.

$0.35 / 1M in 8K context

Meta.Llama3 2 11b Instruct

Meta.Llama3 2 11b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.3500/1M input tokens, $0.3500/1M output tokens.

$0.35 / 1M in 128K context

Us.Meta.Llama3 2 11b Instruct

Us.Meta.Llama3 2 11b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.3500/1M input tokens, $0.3500/1M output tokens.

$0.35 / 1M in 128K context

Ap Northeast 1/Minimax.Minimax M2.1

Ap Northeast 1/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 196K context

Ap Northeast 1/Minimax.Minimax M2.5

Ap Northeast 1/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 1M context

Ap South 1/Meta.Llama3 8b Instruct

Ap South 1/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $0.7200/1M output tokens.

$0.36 / 1M in 8K context

Ap South 1/Minimax.Minimax M2.1

Ap South 1/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 196K context

Ap South 1/Minimax.Minimax M2.5

Ap South 1/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 1M context

Ap Southeast 3/Minimax.Minimax M2.1

Ap Southeast 3/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 196K context

Ap Southeast 3/Minimax.Minimax M2.5

Ap Southeast 3/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 1M context

Eu North 1/Minimax.Minimax M2.1

Eu North 1/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 196K context

Eu North 1/Minimax.Minimax M2.5

Eu North 1/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 1M context

Eu Central 1/Minimax.Minimax M2.1

Eu Central 1/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 196K context

Eu Central 1/Minimax.Minimax M2.5

Eu Central 1/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 1M context

Eu West 1/Minimax.Minimax M2.1

Eu West 1/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 196K context

Eu West 1/Minimax.Minimax M2.5

Eu West 1/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 1M context

Eu South 1/Minimax.Minimax M2.1

Eu South 1/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 196K context

Eu South 1/Minimax.Minimax M2.5

Eu South 1/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 1M context

Sa East 1/Minimax.Minimax M2.1

Sa East 1/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 196K context

Sa East 1/Minimax.Minimax M2.5

Sa East 1/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.3600/1M input tokens, $1.44/1M output tokens.

$0.36 / 1M in 1M context

Eu West 2/Meta.Llama3 8b Instruct

Eu West 2/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $0.3900/1M input tokens, $0.7800/1M output tokens.

$0.39 / 1M in 8K context

Mistral.Devstral 2 123b

Mistral.Devstral 2 123b is available via AWS Bedrock with a 256K context window and up to 8,192 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 256K context

Us East 1/Mistral.Mixtral 8x7b Instruct

Us East 1/Mistral.Mixtral 8x7b Instruct is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $0.4500/1M input tokens, $0.7000/1M output tokens.

$0.45 / 1M in 32K context

Us West 2/Mistral.Mixtral 8x7b Instruct

Us West 2/Mistral.Mixtral 8x7b Instruct is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $0.4500/1M input tokens, $0.7000/1M output tokens.

$0.45 / 1M in 32K context

Mistral.Mixtral 8x7b Instruct

Mistral.Mixtral 8x7b Instruct is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $0.4500/1M input tokens, $0.7000/1M output tokens.

$0.45 / 1M in 32K context

Eu West 2/Minimax.Minimax M2.1

Eu West 2/Minimax.Minimax M2.1 is available via AWS Bedrock with a 196K context window and up to 8,192 output tokens. Pricing: $0.4700/1M input tokens, $1.86/1M output tokens.

$0.47 / 1M in 196K context

Eu West 2/Minimax.Minimax M2.5

Eu West 2/Minimax.Minimax M2.5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.4700/1M input tokens, $1.86/1M output tokens.

$0.47 / 1M in 1M context

Ai21.Jamba Instruct

Ai21.Jamba Instruct is available via AWS Bedrock with a 70K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $0.7000/1M output tokens.

$0.50 / 1M in 70K context

Amazon.Titan Text Premier

Amazon.Titan Text Premier is available via AWS Bedrock with a 42K context window and up to 32,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 42K context

Sa East 1/Meta.Llama3 8b Instruct

Sa East 1/Meta.Llama3 8b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $1.01/1M output tokens.

$0.50 / 1M in 8K context

Us East 1/Qwen.Qwen3 Coder Next

Us East 1/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $1.20/1M output tokens.

$0.50 / 1M in 262K context

Us East 2/Qwen.Qwen3 Coder Next

Us East 2/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $1.20/1M output tokens.

$0.50 / 1M in 262K context

Us Gov East 1/Amazon.Titan Text Premier

Us Gov East 1/Amazon.Titan Text Premier is available via AWS Bedrock with a 42K context window and up to 32,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 42K context

Us Gov West 1/Amazon.Titan Text Premier

Us Gov West 1/Amazon.Titan Text Premier is available via AWS Bedrock with a 42K context window and up to 32,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 42K context

Us West 2/Qwen.Qwen3 Coder Next

Us West 2/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $1.20/1M output tokens.

$0.50 / 1M in 262K context

Cohere.Command R

Cohere.Command R is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 128K context

Mistral.Magistral Small 2509

Mistral.Magistral Small 2509 is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 128K context

Mistral.Mistral Large 3 675b Instruct

Mistral.Mistral Large 3 675b Instruct is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 128K context

Qwen.Qwen3 Coder Next

Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $1.20/1M output tokens.

$0.50 / 1M in 262K context

Qwen.Qwen3 Vl 235b A22b

Qwen.Qwen3 Vl 235b A22b is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.5300/1M input tokens, $2.66/1M output tokens.

$0.53 / 1M in 128K context

Deepseek.V3

Deepseek.V3 is available via AWS Bedrock with a 164K context window and up to 81,920 output tokens. Pricing: $0.5800/1M input tokens, $1.68/1M output tokens.

$0.58 / 1M in 164K context

Eu West 3/Mistral.Mixtral 8x7b Instruct

Eu West 3/Mistral.Mixtral 8x7b Instruct is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $0.5900/1M input tokens, $0.9100/1M output tokens.

$0.59 / 1M in 32K context

Us.Writer.Palmyra X5

Us.Writer.Palmyra X5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $6.00/1M output tokens.

$0.60 / 1M in 1M context

Writer.Palmyra X5

Writer.Palmyra X5 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $6.00/1M output tokens.

$0.60 / 1M in 1M context

Ap Northeast 1/Qwen.Qwen3 Coder Next

Ap Northeast 1/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $1.44/1M output tokens.

$0.60 / 1M in 262K context

Moonshotai.Kimi K2.5

Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.03/1M output tokens.

$0.60 / 1M in 262K context

Ap South 1/Qwen.Qwen3 Coder Next

Ap South 1/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $1.44/1M output tokens.

$0.60 / 1M in 262K context

Ap Southeast 3/Qwen.Qwen3 Coder Next

Ap Southeast 3/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $1.44/1M output tokens.

$0.60 / 1M in 262K context

Eu Central 1/Qwen.Qwen3 Coder Next

Eu Central 1/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $1.44/1M output tokens.

$0.60 / 1M in 262K context

Eu West 1/Qwen.Qwen3 Coder Next

Eu West 1/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $1.44/1M output tokens.

$0.60 / 1M in 262K context

Eu South 1/Qwen.Qwen3 Coder Next

Eu South 1/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $1.44/1M output tokens.

$0.60 / 1M in 262K context

Sa East 1/Qwen.Qwen3 Coder Next

Sa East 1/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $1.44/1M output tokens.

$0.60 / 1M in 262K context

Us East 1/Moonshotai.Kimi K2 Thinking

Us East 1/Moonshotai.Kimi K2 Thinking is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Us East 1/Moonshotai.Kimi K2.5

Us East 1/Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Us East 2/Moonshotai.Kimi K2 Thinking

Us East 2/Moonshotai.Kimi K2 Thinking is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Us East 2/Moonshotai.Kimi K2.5

Us East 2/Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Us West 2/Moonshotai.Kimi K2 Thinking

Us West 2/Moonshotai.Kimi K2 Thinking is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Us West 2/Moonshotai.Kimi K2.5

Us West 2/Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Moonshot.Kimi K2 Thinking

Moonshot.Kimi K2 Thinking is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 128K context

Moonshotai.Kimi K2.5

Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Zai.Glm 4.7

Zai.Glm 4.7 is available via AWS Bedrock with a 200K context window and up to 128,000 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 200K context

Us East 1/Deepseek.V3.2

Us East 1/Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.6200/1M input tokens, $1.85/1M output tokens.

$0.62 / 1M in 164K context

Us East 2/Deepseek.V3.2

Us East 2/Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.6200/1M input tokens, $1.85/1M output tokens.

$0.62 / 1M in 164K context

Us West 2/Deepseek.V3.2

Us West 2/Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.6200/1M input tokens, $1.85/1M output tokens.

$0.62 / 1M in 164K context

Deepseek.V3.2

Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.6200/1M input tokens, $1.85/1M output tokens.

$0.62 / 1M in 164K context

Us.Deepseek.V3.2

Us.Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.6200/1M input tokens, $1.85/1M output tokens.

$0.62 / 1M in 164K context

Ap South 1/Moonshotai.Kimi K2 Thinking

Ap South 1/Moonshotai.Kimi K2 Thinking is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7100/1M input tokens, $2.94/1M output tokens.

$0.71 / 1M in 262K context

Ap Northeast 1/Moonshotai.Kimi K2.5

Ap Northeast 1/Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7200/1M input tokens, $3.60/1M output tokens.

$0.72 / 1M in 262K context

Ap South 1/Moonshotai.Kimi K2.5

Ap South 1/Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7200/1M input tokens, $3.60/1M output tokens.

$0.72 / 1M in 262K context

Ap Southeast 3/Moonshotai.Kimi K2.5

Ap Southeast 3/Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7200/1M input tokens, $3.60/1M output tokens.

$0.72 / 1M in 262K context

Eu North 1/Moonshotai.Kimi K2.5

Eu North 1/Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7200/1M input tokens, $3.60/1M output tokens.

$0.72 / 1M in 262K context

Sa East 1/Moonshotai.Kimi K2.5

Sa East 1/Moonshotai.Kimi K2.5 is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7200/1M input tokens, $3.60/1M output tokens.

$0.72 / 1M in 262K context

Meta.Llama3 3 70b Instruct

Meta.Llama3 3 70b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 128K context

Us.Meta.Llama3 3 70b Instruct

Us.Meta.Llama3 3 70b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 128K context

Ap Northeast 1/Moonshotai.Kimi K2 Thinking

Ap Northeast 1/Moonshotai.Kimi K2 Thinking is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7300/1M input tokens, $3.03/1M output tokens.

$0.73 / 1M in 262K context

Moonshotai.Kimi K2 Thinking

Moonshotai.Kimi K2 Thinking is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7300/1M input tokens, $3.03/1M output tokens.

$0.73 / 1M in 262K context

Sa East 1/Moonshotai.Kimi K2 Thinking

Sa East 1/Moonshotai.Kimi K2 Thinking is available via AWS Bedrock with a 262K context window and up to 262,144 output tokens. Pricing: $0.7300/1M input tokens, $3.03/1M output tokens.

$0.73 / 1M in 262K context

Ap Northeast 1/Deepseek.V3.2

Ap Northeast 1/Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.7400/1M input tokens, $2.22/1M output tokens.

$0.74 / 1M in 164K context

Ap South 1/Deepseek.V3.2

Ap South 1/Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.7400/1M input tokens, $2.22/1M output tokens.

$0.74 / 1M in 164K context

Ap Southeast 3/Deepseek.V3.2

Ap Southeast 3/Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.7400/1M input tokens, $2.22/1M output tokens.

$0.74 / 1M in 164K context

Eu North 1/Deepseek.V3.2

Eu North 1/Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.7400/1M input tokens, $2.22/1M output tokens.

$0.74 / 1M in 164K context

Sa East 1/Deepseek.V3.2

Sa East 1/Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.7400/1M input tokens, $2.22/1M output tokens.

$0.74 / 1M in 164K context

Eu.Deepseek.V3.2

Eu.Deepseek.V3.2 is available via AWS Bedrock with a 164K context window and up to 163,840 output tokens. Pricing: $0.7400/1M input tokens, $2.22/1M output tokens.

$0.74 / 1M in 164K context

Meta.Llama2 13b Chat

Meta.Llama2 13b Chat is available via AWS Bedrock with a 4K context window and up to 4,096 output tokens. Pricing: $0.7500/1M input tokens, $1.00/1M output tokens.

$0.75 / 1M in 4K context

Eu West 2/Qwen.Qwen3 Coder Next

Eu West 2/Qwen.Qwen3 Coder Next is available via AWS Bedrock with a 262K context window and up to 8,192 output tokens. Pricing: $0.7800/1M input tokens, $1.86/1M output tokens.

$0.78 / 1M in 262K context

Amazon.Nova Pro

Amazon.Nova Pro is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.8000/1M input tokens, $3.20/1M output tokens.

$0.80 / 1M in 300K context

Anthropic.Claude 3 5 Haiku 20241022

Anthropic.Claude 3 5 Haiku 20241022 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $0.8000/1M input tokens, $4.00/1M output tokens.

$0.80 / 1M in 200K context

Anthropic.Claude Instant

Anthropic.Claude Instant is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $0.8000/1M input tokens, $2.40/1M output tokens.

$0.80 / 1M in 100K context

Us East 1/Anthropic.Claude Instant

Us East 1/Anthropic.Claude Instant is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $0.8000/1M input tokens, $2.40/1M output tokens.

$0.80 / 1M in 100K context

Us West 2/Anthropic.Claude Instant

Us West 2/Anthropic.Claude Instant is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $0.8000/1M input tokens, $2.40/1M output tokens.

$0.80 / 1M in 100K context

Us.Anthropic.Claude 3 5 Haiku 20241022

Us.Anthropic.Claude 3 5 Haiku 20241022 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $0.8000/1M input tokens, $4.00/1M output tokens.

$0.80 / 1M in 200K context

Us.Amazon.Nova Pro

Us.Amazon.Nova Pro is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.8000/1M input tokens, $3.20/1M output tokens.

$0.80 / 1M in 300K context

Us.Anthropic.Claude 3 5 Haiku 20241022

Us.Anthropic.Claude 3 5 Haiku 20241022 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $0.8000/1M input tokens, $4.00/1M output tokens.

$0.80 / 1M in 200K context

Apac.Amazon.Nova Pro

Apac.Amazon.Nova Pro is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.8400/1M input tokens, $3.36/1M output tokens.

$0.84 / 1M in 300K context

Us Gov East 1/Amazon.Nova Pro

Us Gov East 1/Amazon.Nova Pro is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.9600/1M input tokens, $3.84/1M output tokens.

$0.96 / 1M in 300K context

Us Gov West 1/Amazon.Nova Pro

Us Gov West 1/Amazon.Nova Pro is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $0.9600/1M input tokens, $3.84/1M output tokens.

$0.96 / 1M in 300K context

Meta.Llama3 1 70b Instruct

Meta.Llama3 1 70b Instruct is available via AWS Bedrock with a 128K context window and up to 2,048 output tokens. Pricing: $0.9900/1M input tokens, $0.9900/1M output tokens.

$0.99 / 1M in 128K context

Us.Meta.Llama3 1 70b Instruct

Us.Meta.Llama3 1 70b Instruct is available via AWS Bedrock with a 128K context window and up to 2,048 output tokens. Pricing: $0.9900/1M input tokens, $0.9900/1M output tokens.

$0.99 / 1M in 128K context

Anthropic.Claude Haiku 4 5 20251001

Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Anthropic.Claude Haiku 4 5

Anthropic.Claude Haiku 4 5 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Global.Anthropic.Claude Haiku 4 5 20251001

Global.Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Mistral.Mistral Small 2402

Mistral.Mistral Small 2402 is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 32K context

Zai.Glm 5

Zai.Glm 5 is available via AWS Bedrock with a 200K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $3.20/1M output tokens.

$1.00 / 1M in 200K context

Eu.Amazon.Nova Pro

Eu.Amazon.Nova Pro is available via AWS Bedrock with a 300K context window and up to 10,000 output tokens. Pricing: $1.05/1M input tokens, $4.20/1M output tokens.

$1.05 / 1M in 300K context

Apac.Anthropic.Claude Haiku 4 5 20251001

Apac.Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.10/1M input tokens, $5.50/1M output tokens.

$1.10 / 1M in 200K context

Eu.Anthropic.Claude Haiku 4 5 20251001

Eu.Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.10/1M input tokens, $5.50/1M output tokens.

$1.10 / 1M in 200K context

Jp.Anthropic.Claude Haiku 4 5 20251001

Jp.Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.10/1M input tokens, $5.50/1M output tokens.

$1.10 / 1M in 200K context

Us.Anthropic.Claude Haiku 4 5 20251001

Us.Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.10/1M input tokens, $5.50/1M output tokens.

$1.10 / 1M in 200K context

Au.Anthropic.Claude Haiku 4 5 20251001

Au.Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.10/1M input tokens, $5.50/1M output tokens.

$1.10 / 1M in 200K context

Us Gov East 1/Anthropic.Claude Haiku 4 5 20251001

Us Gov East 1/Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.20/1M input tokens, $6.00/1M output tokens.

$1.20 / 1M in 200K context

Us Gov West 1/Anthropic.Claude Haiku 4 5 20251001

Us Gov West 1/Anthropic.Claude Haiku 4 5 20251001 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $1.20/1M input tokens, $6.00/1M output tokens.

$1.20 / 1M in 200K context

Amazon.Titan Text Express

Amazon.Titan Text Express is available via AWS Bedrock with a 42K context window and up to 8,000 output tokens. Pricing: $1.30/1M input tokens, $1.70/1M output tokens.

$1.30 / 1M in 42K context

Us Gov East 1/Amazon.Titan Text Express

Us Gov East 1/Amazon.Titan Text Express is available via AWS Bedrock with a 42K context window and up to 8,000 output tokens. Pricing: $1.30/1M input tokens, $1.70/1M output tokens.

$1.30 / 1M in 42K context

Us Gov West 1/Amazon.Titan Text Express

Us Gov West 1/Amazon.Titan Text Express is available via AWS Bedrock with a 42K context window and up to 8,000 output tokens. Pricing: $1.30/1M input tokens, $1.70/1M output tokens.

$1.30 / 1M in 42K context

Us.Deepseek.R1

Us.Deepseek.R1 is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.

$1.35 / 1M in 128K context

Cohere.Command Text

Cohere.Command Text is available via AWS Bedrock with a 4K context window and up to 4,096 output tokens. Pricing: $1.50/1M input tokens, $2.00/1M output tokens.

$1.50 / 1M in 4K context

Meta.Llama2 70b Chat

Meta.Llama2 70b Chat is available via AWS Bedrock with a 4K context window and up to 4,096 output tokens. Pricing: $1.95/1M input tokens, $2.56/1M output tokens.

$1.95 / 1M in 4K context

Ai21.Jamba 1 5 Large

Ai21.Jamba 1 5 Large is available via AWS Bedrock with a 256K context window and up to 256,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 256K context

Eu.Mistral.Pixtral Large 2502

Eu.Mistral.Pixtral Large 2502 is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Meta.Llama3 2 90b Instruct

Meta.Llama3 2 90b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

$2.00 / 1M in 128K context

Us.Meta.Llama3 2 90b Instruct

Us.Meta.Llama3 2 90b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

$2.00 / 1M in 128K context

Us.Mistral.Pixtral Large 2502

Us.Mistral.Pixtral Large 2502 is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Amazon.Nova 2 Pro Preview 20251202

Amazon.Nova 2 Pro Preview 20251202 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $2.19/1M input tokens, $17.50/1M output tokens.

$2.19 / 1M in 1M context

Apac.Amazon.Nova 2 Pro Preview 20251202

Apac.Amazon.Nova 2 Pro Preview 20251202 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $2.19/1M input tokens, $17.50/1M output tokens.

$2.19 / 1M in 1M context

Eu.Amazon.Nova 2 Pro Preview 20251202

Eu.Amazon.Nova 2 Pro Preview 20251202 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $2.19/1M input tokens, $17.50/1M output tokens.

$2.19 / 1M in 1M context

Us.Amazon.Nova 2 Pro Preview 20251202

Us.Amazon.Nova 2 Pro Preview 20251202 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $2.19/1M input tokens, $17.50/1M output tokens.

$2.19 / 1M in 1M context

Ap Northeast 1/Anthropic.Claude Instant

Ap Northeast 1/Anthropic.Claude Instant is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $2.23/1M input tokens, $7.55/1M output tokens.

$2.23 / 1M in 100K context

Eu Central 1/Anthropic.Claude Instant

Eu Central 1/Anthropic.Claude Instant is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $2.48/1M input tokens, $8.38/1M output tokens.

$2.48 / 1M in 100K context

Us.Writer.Palmyra X4

Us.Writer.Palmyra X4 is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Writer.Palmyra X4

Writer.Palmyra X4 is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Us.Amazon.Nova Premier

Us.Amazon.Nova Premier is available via AWS Bedrock with a 1M context window and up to 10,000 output tokens. Pricing: $2.50/1M input tokens, $12.50/1M output tokens.

$2.50 / 1M in 1M context

Us East 1/Meta.Llama3 70b Instruct

Us East 1/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $2.65/1M input tokens, $3.50/1M output tokens.

$2.65 / 1M in 8K context

Us Gov East 1/Meta.Llama3 70b Instruct

Us Gov East 1/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 2,048 output tokens. Pricing: $2.65/1M input tokens, $3.50/1M output tokens.

$2.65 / 1M in 8K context

Us Gov West 1/Meta.Llama3 70b Instruct

Us Gov West 1/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 2,048 output tokens. Pricing: $2.65/1M input tokens, $3.50/1M output tokens.

$2.65 / 1M in 8K context

Us West 1/Meta.Llama3 70b Instruct

Us West 1/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $2.65/1M input tokens, $3.50/1M output tokens.

$2.65 / 1M in 8K context

Meta.Llama3 70b Instruct

Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $2.65/1M input tokens, $3.50/1M output tokens.

$2.65 / 1M in 8K context

Eu West 1/Meta.Llama3 70b Instruct

Eu West 1/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $2.86/1M input tokens, $3.78/1M output tokens.

$2.86 / 1M in 8K context

Anthropic.Claude 3 5 Sonnet 20240620

Anthropic.Claude 3 5 Sonnet 20240620 is available via AWS Bedrock with a 1M context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Anthropic.Claude 3 5 Sonnet 20241022

Anthropic.Claude 3 5 Sonnet 20241022 is available via AWS Bedrock with a 1M context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Anthropic.Claude 3 7 Sonnet 20250219

Anthropic.Claude 3 7 Sonnet 20250219 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic.Claude 3 Sonnet 20240229

Anthropic.Claude 3 Sonnet 20240229 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic.Claude Sonnet 4 6

Anthropic.Claude Sonnet 4 6 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Global.Anthropic.Claude Sonnet 4 6

Global.Anthropic.Claude Sonnet 4 6 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Anthropic.Claude Sonnet 4 20250514

Anthropic.Claude Sonnet 4 20250514 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Anthropic.Claude Sonnet 4 5 20250929

Anthropic.Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Apac.Anthropic.Claude 3 5 Sonnet 20240620

Apac.Anthropic.Claude 3 5 Sonnet 20240620 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Apac.Anthropic.Claude 3 5 Sonnet 20241022

Apac.Anthropic.Claude 3 5 Sonnet 20241022 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Apac.Anthropic.Claude 3 Sonnet 20240229

Apac.Anthropic.Claude 3 Sonnet 20240229 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Apac.Anthropic.Claude Sonnet 4 20250514

Apac.Anthropic.Claude Sonnet 4 20250514 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Invoke/Anthropic.Claude 3 5 Sonnet 20240620

Invoke/Anthropic.Claude 3 5 Sonnet 20240620 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude Sonnet 4 5 20250929

Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Cohere.Command R Plus

Cohere.Command R Plus is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 128K context

Eu.Anthropic.Claude 3 5 Sonnet 20240620

Eu.Anthropic.Claude 3 5 Sonnet 20240620 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Eu.Anthropic.Claude 3 5 Sonnet 20241022

Eu.Anthropic.Claude 3 5 Sonnet 20241022 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Eu.Anthropic.Claude 3 7 Sonnet 20250219

Eu.Anthropic.Claude 3 7 Sonnet 20250219 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Eu.Anthropic.Claude 3 Sonnet 20240229

Eu.Anthropic.Claude 3 Sonnet 20240229 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Eu.Anthropic.Claude Sonnet 4 20250514

Eu.Anthropic.Claude Sonnet 4 20250514 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Global.Anthropic.Claude Sonnet 4 5 20250929

Global.Anthropic.Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Global.Anthropic.Claude Sonnet 4 20250514

Global.Anthropic.Claude Sonnet 4 20250514 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Mistral.Mistral Large 2407

Mistral.Mistral Large 2407 is available via AWS Bedrock with a 128K context window and up to 8,191 output tokens. Pricing: $3.00/1M input tokens, $9.00/1M output tokens.

$3.00 / 1M in 128K context

Us.Anthropic.Claude 3 5 Sonnet 20240620

Us.Anthropic.Claude 3 5 Sonnet 20240620 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Us.Anthropic.Claude 3 5 Sonnet 20241022

Us.Anthropic.Claude 3 5 Sonnet 20241022 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Us.Anthropic.Claude 3 7 Sonnet 20250219

Us.Anthropic.Claude 3 7 Sonnet 20250219 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Us.Anthropic.Claude 3 Sonnet 20240229

Us.Anthropic.Claude 3 Sonnet 20240229 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Us.Anthropic.Claude Sonnet 4 20250514

Us.Anthropic.Claude Sonnet 4 20250514 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Ca Central 1/Meta.Llama3 70b Instruct

Ca Central 1/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $3.05/1M input tokens, $4.03/1M output tokens.

$3.05 / 1M in 8K context

Ap South 1/Meta.Llama3 70b Instruct

Ap South 1/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $3.18/1M input tokens, $4.20/1M output tokens.

$3.18 / 1M in 8K context

Us.Anthropic.Claude Sonnet 4 6

Us.Anthropic.Claude Sonnet 4 6 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 1M context

Eu.Anthropic.Claude Sonnet 4 6

Eu.Anthropic.Claude Sonnet 4 6 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 1M context

Au.Anthropic.Claude Sonnet 4 6

Au.Anthropic.Claude Sonnet 4 6 is available via AWS Bedrock with a 1M context window and up to 64,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 1M context

Au.Anthropic.Claude Sonnet 4 5 20250929

Au.Anthropic.Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 200K context

Eu.Anthropic.Claude Sonnet 4 5 20250929

Eu.Anthropic.Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 200K context

Jp.Anthropic.Claude Sonnet 4 5 20250929

Jp.Anthropic.Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 200K context

Us.Anthropic.Claude Sonnet 4 5 20250929

Us.Anthropic.Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 200K context

Eu West 2/Meta.Llama3 70b Instruct

Eu West 2/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $3.45/1M input tokens, $4.55/1M output tokens.

$3.45 / 1M in 8K context

Anthropic.Claude 3 7 Sonnet 20240620

Anthropic.Claude 3 7 Sonnet 20240620 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.60/1M input tokens, $18.00/1M output tokens.

$3.60 / 1M in 200K context

Us Gov East 1/Anthropic.Claude 3 5 Sonnet 20240620

Us Gov East 1/Anthropic.Claude 3 5 Sonnet 20240620 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.60/1M input tokens, $18.00/1M output tokens.

$3.60 / 1M in 200K context

Us Gov East 1/Claude Sonnet 4 5 20250929

Us Gov East 1/Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.60/1M input tokens, $18.00/1M output tokens.

$3.60 / 1M in 200K context

Us Gov West 1/Anthropic.Claude 3 7 Sonnet 20250219

Us Gov West 1/Anthropic.Claude 3 7 Sonnet 20250219 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.60/1M input tokens, $18.00/1M output tokens.

$3.60 / 1M in 200K context

Us Gov West 1/Anthropic.Claude 3 5 Sonnet 20240620

Us Gov West 1/Anthropic.Claude 3 5 Sonnet 20240620 is available via AWS Bedrock with a 200K context window and up to 8,192 output tokens. Pricing: $3.60/1M input tokens, $18.00/1M output tokens.

$3.60 / 1M in 200K context

Us Gov West 1/Claude Sonnet 4 5 20250929

Us Gov West 1/Claude Sonnet 4 5 20250929 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $3.60/1M input tokens, $18.00/1M output tokens.

$3.60 / 1M in 200K context

Sa East 1/Meta.Llama3 70b Instruct

Sa East 1/Meta.Llama3 70b Instruct is available via AWS Bedrock with a 8K context window and up to 8,192 output tokens. Pricing: $4.45/1M input tokens, $5.88/1M output tokens.

$4.45 / 1M in 8K context

Anthropic.Claude Opus 4 5 20251101

Anthropic.Claude Opus 4 5 20251101 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Anthropic.Claude Opus 4 6

Anthropic.Claude Opus 4 6 is available via AWS Bedrock with a 1M context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 1M context

Global.Anthropic.Claude Opus 4 6

Global.Anthropic.Claude Opus 4 6 is available via AWS Bedrock with a 1M context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 1M context

Global.Anthropic.Claude Opus 4 5 20251101

Global.Anthropic.Claude Opus 4 5 20251101 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Eu.Anthropic.Claude Opus 4 5 20251101

Eu.Anthropic.Claude Opus 4 5 20251101 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Meta.Llama3 1 405b Instruct

Meta.Llama3 1 405b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $5.32/1M input tokens, $16.00/1M output tokens.

$5.32 / 1M in 128K context

Us.Meta.Llama3 1 405b Instruct

Us.Meta.Llama3 1 405b Instruct is available via AWS Bedrock with a 128K context window and up to 4,096 output tokens. Pricing: $5.32/1M input tokens, $16.00/1M output tokens.

$5.32 / 1M in 128K context

Us.Anthropic.Claude Opus 4 6

Us.Anthropic.Claude Opus 4 6 is available via AWS Bedrock with a 1M context window and up to 128,000 output tokens. Pricing: $5.50/1M input tokens, $27.50/1M output tokens.

$5.50 / 1M in 1M context

Eu.Anthropic.Claude Opus 4 6

Eu.Anthropic.Claude Opus 4 6 is available via AWS Bedrock with a 1M context window and up to 128,000 output tokens. Pricing: $5.50/1M input tokens, $27.50/1M output tokens.

$5.50 / 1M in 1M context

Au.Anthropic.Claude Opus 4 6

Au.Anthropic.Claude Opus 4 6 is available via AWS Bedrock with a 1M context window and up to 128,000 output tokens. Pricing: $5.50/1M input tokens, $27.50/1M output tokens.

$5.50 / 1M in 1M context

Us.Anthropic.Claude Opus 4 5 20251101

Us.Anthropic.Claude Opus 4 5 20251101 is available via AWS Bedrock with a 200K context window and up to 64,000 output tokens. Pricing: $5.50/1M input tokens, $27.50/1M output tokens.

$5.50 / 1M in 200K context

Anthropic.Claude

Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Anthropic.Claude

Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Ap Northeast 1/Anthropic.Claude

Ap Northeast 1/Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Ap Northeast 1/Anthropic.Claude

Ap Northeast 1/Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Eu Central 1/Anthropic.Claude

Eu Central 1/Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Eu Central 1/Anthropic.Claude

Eu Central 1/Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Us East 1/Anthropic.Claude

Us East 1/Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Us East 1/Anthropic.Claude

Us East 1/Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Us East 1/Mistral.Mistral Large 2402

Us East 1/Mistral.Mistral Large 2402 is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 32K context

Us West 2/Anthropic.Claude

Us West 2/Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Us West 2/Anthropic.Claude

Us West 2/Anthropic.Claude is available via AWS Bedrock with a 100K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 100K context

Us West 2/Mistral.Mistral Large 2402

Us West 2/Mistral.Mistral Large 2402 is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 32K context

Mistral.Mistral Large 2402

Mistral.Mistral Large 2402 is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 32K context

Eu West 3/Mistral.Mistral Large 2402

Eu West 3/Mistral.Mistral Large 2402 is available via AWS Bedrock with a 32K context window and up to 8,191 output tokens. Pricing: $10.40/1M input tokens, $31.20/1M output tokens.

$10.40 / 1M in 32K context

Ai21.J2 Mid

Ai21.J2 Mid is available via AWS Bedrock with a 8K context window and up to 8,191 output tokens. Pricing: $12.50/1M input tokens, $12.50/1M output tokens.

$12.50 / 1M in 8K context

Anthropic.Claude 3 Opus 20240229

Anthropic.Claude 3 Opus 20240229 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Anthropic.Claude Opus 4 1 20250805

Anthropic.Claude Opus 4 1 20250805 is available via AWS Bedrock with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Anthropic.Claude Opus 4 20250514

Anthropic.Claude Opus 4 20250514 is available via AWS Bedrock with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Eu.Anthropic.Claude 3 Opus 20240229

Eu.Anthropic.Claude 3 Opus 20240229 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Eu.Anthropic.Claude Opus 4 1 20250805

Eu.Anthropic.Claude Opus 4 1 20250805 is available via AWS Bedrock with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Eu.Anthropic.Claude Opus 4 20250514

Eu.Anthropic.Claude Opus 4 20250514 is available via AWS Bedrock with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Us.Anthropic.Claude 3 Opus 20240229

Us.Anthropic.Claude 3 Opus 20240229 is available via AWS Bedrock with a 200K context window and up to 4,096 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Us.Anthropic.Claude Opus 4 1 20250805

Us.Anthropic.Claude Opus 4 1 20250805 is available via AWS Bedrock with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Us.Anthropic.Claude Opus 4 20250514

Us.Anthropic.Claude Opus 4 20250514 is available via AWS Bedrock with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Ai21.J2 Ultra

Ai21.J2 Ultra is available via AWS Bedrock with a 8K context window and up to 8,191 output tokens. Pricing: $18.80/1M input tokens, $18.80/1M output tokens.

$18.80 / 1M in 8K context

Fireworks AI Models

View provider details →

Accounts/Fireworks/Models/Flux 1 Dev Controlnet Union

Accounts/Fireworks/Models/Flux 1 Dev Controlnet Union is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.001000/1M input tokens, $0.001000/1M output tokens.

$0.001 / 1M in 4K context

Accounts/Fireworks/Models/Gpt Oss 20b

Accounts/Fireworks/Models/Gpt Oss 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.0500/1M input tokens, $0.2000/1M output tokens.

$0.050 / 1M in 131K context

Accounts/Fireworks/Models/Llama V3p1 8b Instruct

Accounts/Fireworks/Models/Llama V3p1 8b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 16K context

Accounts/Fireworks/Models/Llama V3p2 1b Instruct

Accounts/Fireworks/Models/Llama V3p2 1b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 16K context

Accounts/Fireworks/Models/Llama V3p2 3b Instruct

Accounts/Fireworks/Models/Llama V3p2 3b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 16K context

Accounts/Fireworks/Models/Codegemma 2b

Accounts/Fireworks/Models/Codegemma 2b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 8K context

Accounts/Fireworks/Models/Cogito V1 Preview Llama 3b

Accounts/Fireworks/Models/Cogito V1 Preview Llama 3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Accounts/Fireworks/Models/Deepseek Coder 1b Base

Accounts/Fireworks/Models/Deepseek Coder 1b Base is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 16K context

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 1p5b

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 1p5b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Accounts/Fireworks/Models/Ernie 4p5 21b A3b Pt

Accounts/Fireworks/Models/Ernie 4p5 21b A3b Pt is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Ernie 4p5 300b A47b Pt

Accounts/Fireworks/Models/Ernie 4p5 300b A47b Pt is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Flux 1 Dev

Accounts/Fireworks/Models/Flux 1 Dev is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Flux 1 Schnell

Accounts/Fireworks/Models/Flux 1 Schnell is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Gemma 2b It

Accounts/Fireworks/Models/Gemma 2b It is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 8K context

Accounts/Fireworks/Models/Llama Guard 3 1b

Accounts/Fireworks/Models/Llama Guard 3 1b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Accounts/Fireworks/Models/Llama V2 70b

Accounts/Fireworks/Models/Llama V2 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Llama V3p1 405b Instruct Long

Accounts/Fireworks/Models/Llama V3p1 405b Instruct Long is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Llama V3p1 70b Instruct 1b

Accounts/Fireworks/Models/Llama V3p1 70b Instruct 1b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Llama V3p2 1b

Accounts/Fireworks/Models/Llama V3p2 1b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Accounts/Fireworks/Models/Llama V3p2 3b

Accounts/Fireworks/Models/Llama V3p2 3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Accounts/Fireworks/Models/Minimax M1 80k

Accounts/Fireworks/Models/Minimax M1 80k is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Ministral 3 3b Instruct 2512

Accounts/Fireworks/Models/Ministral 3 3b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 256K context

Accounts/Fireworks/Models/Nemotron Nano V2 12b Vl

Accounts/Fireworks/Models/Nemotron Nano V2 12b Vl is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Phi 2 3b

Accounts/Fireworks/Models/Phi 2 3b is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 2K context

Accounts/Fireworks/Models/Phi 3 Mini 128k Instruct

Accounts/Fireworks/Models/Phi 3 Mini 128k Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Accounts/Fireworks/Models/Qwen2 Vl 2b Instruct

Accounts/Fireworks/Models/Qwen2 Vl 2b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 0p5b Instruct

Accounts/Fireworks/Models/Qwen2p5 0p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 1p5b Instruct

Accounts/Fireworks/Models/Qwen2p5 1p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b

Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b

Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 3b

Accounts/Fireworks/Models/Qwen2p5 Coder 3b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 3b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 3b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 33K context

Accounts/Fireworks/Models/Qwen3 0p6b

Accounts/Fireworks/Models/Qwen3 0p6b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 41K context

Accounts/Fireworks/Models/Qwen3 1p7b

Accounts/Fireworks/Models/Qwen3 1p7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 262K context

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 131072

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 131072 is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 40960

Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 40960 is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 41K context

Accounts/Fireworks/Models/Stablecode 3b

Accounts/Fireworks/Models/Stablecode 3b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 4K context

Accounts/Fireworks/Models/Starcoder2 3b

Accounts/Fireworks/Models/Starcoder2 3b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 16K context

Accounts/Fireworks/Models/Gpt Oss 120b

Accounts/Fireworks/Models/Gpt Oss 120b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Accounts/Fireworks/Models/Llama4 Scout Instruct Basic

Accounts/Fireworks/Models/Llama4 Scout Instruct Basic is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Accounts/Fireworks/Models/Qwen3 30b A3b

Accounts/Fireworks/Models/Qwen3 30b A3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Accounts/Fireworks/Models/Qwen3 Coder 30b A3b Instruct

Accounts/Fireworks/Models/Qwen3 Coder 30b A3b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 262K context

Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Instruct

Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 262K context

Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Thinking

Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 262K context

Accounts/Fireworks/Models/Llama V3p2 11b Vision Instruct

Accounts/Fireworks/Models/Llama V3p2 11b Vision Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Chronos Hermes 13b

Accounts/Fireworks/Models/Chronos Hermes 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Code Llama 13b

Accounts/Fireworks/Models/Code Llama 13b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Code Llama 13b Instruct

Accounts/Fireworks/Models/Code Llama 13b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Code Llama 13b Python

Accounts/Fireworks/Models/Code Llama 13b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Code Llama 7b

Accounts/Fireworks/Models/Code Llama 7b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Code Llama 7b Instruct

Accounts/Fireworks/Models/Code Llama 7b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Code Llama 7b Python

Accounts/Fireworks/Models/Code Llama 7b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Code Qwen 1p5 7b

Accounts/Fireworks/Models/Code Qwen 1p5 7b is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 66K context

Accounts/Fireworks/Models/Codegemma 7b

Accounts/Fireworks/Models/Codegemma 7b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Cogito V1 Preview Llama 8b

Accounts/Fireworks/Models/Cogito V1 Preview Llama 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Cogito V1 Preview Qwen 14b

Accounts/Fireworks/Models/Cogito V1 Preview Qwen 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Deepseek Coder 7b Base

Accounts/Fireworks/Models/Deepseek Coder 7b Base is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Deepseek Coder 7b Base V1p5

Accounts/Fireworks/Models/Deepseek Coder 7b Base V1p5 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Deepseek Coder 7b Instruct V1p5

Accounts/Fireworks/Models/Deepseek Coder 7b Instruct V1p5 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Deepseek R1 0528 Distill Qwen3 8b

Accounts/Fireworks/Models/Deepseek R1 0528 Distill Qwen3 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Deepseek R1 Distill Llama 8b

Accounts/Fireworks/Models/Deepseek R1 Distill Llama 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 14b

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 7b

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Dobby Mini Unhinged Plus Llama 3 1 8b

Accounts/Fireworks/Models/Dobby Mini Unhinged Plus Llama 3 1 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Firellava 13b

Accounts/Fireworks/Models/Firellava 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Firesearch Ocr

Accounts/Fireworks/Models/Firesearch Ocr is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Gemma 7b

Accounts/Fireworks/Models/Gemma 7b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Gemma 7b It

Accounts/Fireworks/Models/Gemma 7b It is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Gemma2 9b It

Accounts/Fireworks/Models/Gemma2 9b It is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Hermes 2 Pro Mistral 7b

Accounts/Fireworks/Models/Hermes 2 Pro Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Internvl3 8b

Accounts/Fireworks/Models/Internvl3 8b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Llama Guard 2 8b

Accounts/Fireworks/Models/Llama Guard 2 8b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Llama Guard 3 8b

Accounts/Fireworks/Models/Llama Guard 3 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Llama V2 13b

Accounts/Fireworks/Models/Llama V2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Llama V2 13b Chat

Accounts/Fireworks/Models/Llama V2 13b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Llama V2 7b

Accounts/Fireworks/Models/Llama V2 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Llama V2 7b Chat

Accounts/Fireworks/Models/Llama V2 7b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Llama V3 8b

Accounts/Fireworks/Models/Llama V3 8b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Llama V3 8b Instruct Hf

Accounts/Fireworks/Models/Llama V3 8b Instruct Hf is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Llamaguard 7b

Accounts/Fireworks/Models/Llamaguard 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Ministral 3 14b Instruct 2512

Accounts/Fireworks/Models/Ministral 3 14b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 256K context

Accounts/Fireworks/Models/Ministral 3 8b Instruct 2512

Accounts/Fireworks/Models/Ministral 3 8b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 256K context

Accounts/Fireworks/Models/Mistral 7b

Accounts/Fireworks/Models/Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Mistral 7b Instruct 4k

Accounts/Fireworks/Models/Mistral 7b Instruct 4k is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Mistral 7b Instruct V0p2

Accounts/Fireworks/Models/Mistral 7b Instruct V0p2 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Mistral 7b Instruct

Accounts/Fireworks/Models/Mistral 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Mistral 7b V0p2

Accounts/Fireworks/Models/Mistral 7b V0p2 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Mistral Nemo Base 2407

Accounts/Fireworks/Models/Mistral Nemo Base 2407 is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 128K context

Accounts/Fireworks/Models/Mistral Nemo Instruct 2407

Accounts/Fireworks/Models/Mistral Nemo Instruct 2407 is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 128K context

Accounts/Fireworks/Models/Mythomax L2 13b

Accounts/Fireworks/Models/Mythomax L2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Nous Capybara 7b V1p9

Accounts/Fireworks/Models/Nous Capybara 7b V1p9 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Nous Hermes Llama2 13b

Accounts/Fireworks/Models/Nous Hermes Llama2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Nous Hermes Llama2 7b

Accounts/Fireworks/Models/Nous Hermes Llama2 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Nvidia Nemotron Nano 12b

Accounts/Fireworks/Models/Nvidia Nemotron Nano 12b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Nvidia Nemotron Nano 9b

Accounts/Fireworks/Models/Nvidia Nemotron Nano 9b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Openchat 3p5 0106 7b

Accounts/Fireworks/Models/Openchat 3p5 0106 7b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Openhermes 2 Mistral 7b

Accounts/Fireworks/Models/Openhermes 2 Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Openhermes 2p5 Mistral 7b

Accounts/Fireworks/Models/Openhermes 2p5 Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Openorca 7b

Accounts/Fireworks/Models/Openorca 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Phi 3 Vision 128k Instruct

Accounts/Fireworks/Models/Phi 3 Vision 128k Instruct is available via Fireworks AI with a 32K context window and up to 32,064 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 32K context

Accounts/Fireworks/Models/Pythia 12b

Accounts/Fireworks/Models/Pythia 12b is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 2K context

Accounts/Fireworks/Models/Qwen V2p5 14b Instruct

Accounts/Fireworks/Models/Qwen V2p5 14b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Qwen V2p5 7b

Accounts/Fireworks/Models/Qwen V2p5 7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Qwen2 7b Instruct

Accounts/Fireworks/Models/Qwen2 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2 Vl 7b Instruct

Accounts/Fireworks/Models/Qwen2 Vl 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 14b

Accounts/Fireworks/Models/Qwen2p5 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Accounts/Fireworks/Models/Qwen2p5 7b Instruct

Accounts/Fireworks/Models/Qwen2p5 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 14b

Accounts/Fireworks/Models/Qwen2p5 Coder 14b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 14b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 14b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 7b

Accounts/Fireworks/Models/Qwen2p5 Coder 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 7b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Vl 3b Instruct

Accounts/Fireworks/Models/Qwen2p5 Vl 3b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 128K context

Accounts/Fireworks/Models/Qwen2p5 Vl 7b Instruct

Accounts/Fireworks/Models/Qwen2p5 Vl 7b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 128K context

Accounts/Fireworks/Models/Qwen3 14b

Accounts/Fireworks/Models/Qwen3 14b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 41K context

Accounts/Fireworks/Models/Qwen3 4b

Accounts/Fireworks/Models/Qwen3 4b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 41K context

Accounts/Fireworks/Models/Qwen3 4b Instruct 2507

Accounts/Fireworks/Models/Qwen3 4b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 262K context

Accounts/Fireworks/Models/Qwen3 8b

Accounts/Fireworks/Models/Qwen3 8b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 41K context

Accounts/Fireworks/Models/Qwen3 Vl 8b Instruct

Accounts/Fireworks/Models/Qwen3 Vl 8b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Rolm Ocr

Accounts/Fireworks/Models/Rolm Ocr is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 128K context

Accounts/Fireworks/Models/Snorkel Mistral 7b Pairrm Dpo

Accounts/Fireworks/Models/Snorkel Mistral 7b Pairrm Dpo is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Starcoder 16b

Accounts/Fireworks/Models/Starcoder 16b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Starcoder 7b

Accounts/Fireworks/Models/Starcoder 7b is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Accounts/Fireworks/Models/Starcoder2 15b

Accounts/Fireworks/Models/Starcoder2 15b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Starcoder2 7b

Accounts/Fireworks/Models/Starcoder2 7b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 16K context

Accounts/Fireworks/Models/Toppy M 7b

Accounts/Fireworks/Models/Toppy M 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Yi 6b

Accounts/Fireworks/Models/Yi 6b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 4K context

Accounts/Fireworks/Models/Zephyr 7b Beta

Accounts/Fireworks/Models/Zephyr 7b Beta is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Accounts/Fireworks/Models/Glm 4p5 Air

Accounts/Fireworks/Models/Glm 4p5 Air is available via Fireworks AI with a 128K context window and up to 96,000 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

$0.22 / 1M in 128K context

Accounts/Fireworks/Models/Llama4 Maverick Instruct Basic

Accounts/Fireworks/Models/Llama4 Maverick Instruct Basic is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

$0.22 / 1M in 131K context

Accounts/Fireworks/Models/Qwen3 235b A22b

Accounts/Fireworks/Models/Qwen3 235b A22b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

$0.22 / 1M in 131K context

Accounts/Fireworks/Models/Qwen3 235b A22b Instruct 2507

Accounts/Fireworks/Models/Qwen3 235b A22b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

$0.22 / 1M in 262K context

Accounts/Fireworks/Models/Qwen3 235b A22b Thinking 2507

Accounts/Fireworks/Models/Qwen3 235b A22b Thinking 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

$0.22 / 1M in 262K context

Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Instruct

Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

$0.22 / 1M in 262K context

Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Thinking

Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.

$0.22 / 1M in 262K context

Accounts/Fireworks/Models/Minimax M2p1

Accounts/Fireworks/Models/Minimax M2p1 is available via Fireworks AI with a 205K context window and up to 204,800 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 205K context

Minimax M2p1

Minimax M2p1 is available via Fireworks AI with a 205K context window and up to 204,800 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 205K context

Accounts/Fireworks/Models/Minimax M2

Accounts/Fireworks/Models/Minimax M2 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 4K context

Accounts/Fireworks/Models/Qwen3 Coder 480b A35b Instruct

Accounts/Fireworks/Models/Qwen3 Coder 480b A35b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.4500/1M input tokens, $1.80/1M output tokens.

$0.45 / 1M in 262K context

Accounts/Fireworks/Models/Deepseek Coder V2 Lite Base

Accounts/Fireworks/Models/Deepseek Coder V2 Lite Base is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 164K context

Accounts/Fireworks/Models/Deepseek Coder V2 Lite Instruct

Accounts/Fireworks/Models/Deepseek Coder V2 Lite Instruct is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 164K context

Accounts/Fireworks/Models/Deepseek V2 Lite Chat

Accounts/Fireworks/Models/Deepseek V2 Lite Chat is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 164K context

Accounts/Fireworks/Models/Dolphin 2p6 Mixtral 8x7b

Accounts/Fireworks/Models/Dolphin 2p6 Mixtral 8x7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 33K context

Accounts/Fireworks/Models/Firefunction

Accounts/Fireworks/Models/Firefunction is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 33K context

Accounts/Fireworks/Models/Gpt Oss Safeguard 20b

Accounts/Fireworks/Models/Gpt Oss Safeguard 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 131K context

Accounts/Fireworks/Models/Mixtral 8x7b

Accounts/Fireworks/Models/Mixtral 8x7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 33K context

Accounts/Fireworks/Models/Mixtral 8x7b Instruct

Accounts/Fireworks/Models/Mixtral 8x7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 33K context

Accounts/Fireworks/Models/Mixtral 8x7b Instruct Hf

Accounts/Fireworks/Models/Mixtral 8x7b Instruct Hf is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 33K context

Accounts/Fireworks/Models/Nous Hermes 2 Mixtral 8x7b Dpo

Accounts/Fireworks/Models/Nous Hermes 2 Mixtral 8x7b Dpo is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 33K context

Accounts/Fireworks/Models/Qwen3 30b A3b Instruct 2507

Accounts/Fireworks/Models/Qwen3 30b A3b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 262K context

Accounts/Fireworks/Models/Deepseek R1 Basic

Accounts/Fireworks/Models/Deepseek R1 Basic is available via Fireworks AI with a 128K context window and up to 20,480 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

$0.55 / 1M in 128K context

Accounts/Fireworks/Models/Glm 4p5

Accounts/Fireworks/Models/Glm 4p5 is available via Fireworks AI with a 128K context window and up to 96,000 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

$0.55 / 1M in 128K context

Accounts/Fireworks/Models/Glm 4p6

Accounts/Fireworks/Models/Glm 4p6 is available via Fireworks AI with a 203K context window and up to 202,800 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

$0.55 / 1M in 203K context

Accounts/Fireworks/Models/Deepseek V3p1

Accounts/Fireworks/Models/Deepseek V3p1 is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.

$0.56 / 1M in 128K context

Accounts/Fireworks/Models/Deepseek V3p1 Terminus

Accounts/Fireworks/Models/Deepseek V3p1 Terminus is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.

$0.56 / 1M in 128K context

Accounts/Fireworks/Models/Deepseek V3p2

Accounts/Fireworks/Models/Deepseek V3p2 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.

$0.56 / 1M in 164K context

Accounts/Fireworks/Models/Glm 4p7

Accounts/Fireworks/Models/Glm 4p7 is available via Fireworks AI with a 203K context window and up to 202,800 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 203K context

Accounts/Fireworks/Models/Kimi K2 Instruct

Accounts/Fireworks/Models/Kimi K2 Instruct is available via Fireworks AI with a 131K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 131K context

Accounts/Fireworks/Models/Kimi K2 Instruct 0905

Accounts/Fireworks/Models/Kimi K2 Instruct 0905 is available via Fireworks AI with a 262K context window and up to 32,768 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Accounts/Fireworks/Models/Kimi K2 Thinking

Accounts/Fireworks/Models/Kimi K2 Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Accounts/Fireworks/Models/Kimi K2p5

Accounts/Fireworks/Models/Kimi K2p5 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Glm 4p7

Glm 4p7 is available via Fireworks AI with a 203K context window and up to 202,800 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 203K context

Kimi K2p5

Kimi K2p5 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Accounts/Fireworks/Models/Deepseek

Accounts/Fireworks/Models/Deepseek is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 128K context

Accounts/Fireworks/Models/Deepseek V3 0324

Accounts/Fireworks/Models/Deepseek V3 0324 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 164K context

Accounts/Fireworks/Models/Firefunction

Accounts/Fireworks/Models/Firefunction is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 8K context

Accounts/Fireworks/Models/Llama V3p2 90b Vision Instruct

Accounts/Fireworks/Models/Llama V3p2 90b Vision Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Qwen2 72b Instruct

Accounts/Fireworks/Models/Qwen2 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Code Llama 34b

Accounts/Fireworks/Models/Code Llama 34b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Code Llama 34b Instruct

Accounts/Fireworks/Models/Code Llama 34b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Code Llama 34b Python

Accounts/Fireworks/Models/Code Llama 34b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Code Llama 70b

Accounts/Fireworks/Models/Code Llama 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Code Llama 70b Instruct

Accounts/Fireworks/Models/Code Llama 70b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Code Llama 70b Python

Accounts/Fireworks/Models/Code Llama 70b Python is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Cogito V1 Preview Llama 70b

Accounts/Fireworks/Models/Cogito V1 Preview Llama 70b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Cogito V1 Preview Qwen 32b

Accounts/Fireworks/Models/Cogito V1 Preview Qwen 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Deepseek Coder 33b Instruct

Accounts/Fireworks/Models/Deepseek Coder 33b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Deepseek R1 Distill Llama 70b

Accounts/Fireworks/Models/Deepseek R1 Distill Llama 70b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 32b

Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Devstral Small 2505

Accounts/Fireworks/Models/Devstral Small 2505 is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Dobby Unhinged Llama 3 3 70b New

Accounts/Fireworks/Models/Dobby Unhinged Llama 3 3 70b New is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Dolphin 2 9 2 Qwen2 72b

Accounts/Fireworks/Models/Dolphin 2 9 2 Qwen2 72b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Fare 20b

Accounts/Fireworks/Models/Fare 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Gemma 3 27b It

Accounts/Fireworks/Models/Gemma 3 27b It is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Internvl3 38b

Accounts/Fireworks/Models/Internvl3 38b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Internvl3 78b

Accounts/Fireworks/Models/Internvl3 78b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Kat Coder

Accounts/Fireworks/Models/Kat Coder is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 262K context

Accounts/Fireworks/Models/Kat Dev 32b

Accounts/Fireworks/Models/Kat Dev 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Kat Dev 72b Exp

Accounts/Fireworks/Models/Kat Dev 72b Exp is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Llama V2 70b Chat

Accounts/Fireworks/Models/Llama V2 70b Chat is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 2K context

Accounts/Fireworks/Models/Llama V3 70b Instruct

Accounts/Fireworks/Models/Llama V3 70b Instruct is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 8K context

Accounts/Fireworks/Models/Llama V3 70b Instruct Hf

Accounts/Fireworks/Models/Llama V3 70b Instruct Hf is available via Fireworks AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 8K context

Accounts/Fireworks/Models/Llama V3p1 70b Instruct

Accounts/Fireworks/Models/Llama V3p1 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Llama V3p1 Nemotron 70b Instruct

Accounts/Fireworks/Models/Llama V3p1 Nemotron 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Llama V3p3 70b Instruct

Accounts/Fireworks/Models/Llama V3p3 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Llava Yi 34b

Accounts/Fireworks/Models/Llava Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Mistral Small 24b Instruct 2501

Accounts/Fireworks/Models/Mistral Small 24b Instruct 2501 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Nous Hermes 2 Yi 34b

Accounts/Fireworks/Models/Nous Hermes 2 Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Nous Hermes Llama2 70b

Accounts/Fireworks/Models/Nous Hermes Llama2 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Phind Code Llama 34b Python

Accounts/Fireworks/Models/Phind Code Llama 34b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Phind Code Llama 34b

Accounts/Fireworks/Models/Phind Code Llama 34b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Phind Code Llama 34b

Accounts/Fireworks/Models/Phind Code Llama 34b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 16K context

Accounts/Fireworks/Models/Qwen Qwq 32b Preview

Accounts/Fireworks/Models/Qwen Qwq 32b Preview is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Qwen1p5 72b Chat

Accounts/Fireworks/Models/Qwen1p5 72b Chat is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2 Vl 72b Instruct

Accounts/Fireworks/Models/Qwen2 Vl 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 32b

Accounts/Fireworks/Models/Qwen2p5 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Qwen2p5 32b Instruct

Accounts/Fireworks/Models/Qwen2p5 32b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 72b

Accounts/Fireworks/Models/Qwen2p5 72b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Qwen2p5 72b Instruct

Accounts/Fireworks/Models/Qwen2p5 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 32b

Accounts/Fireworks/Models/Qwen2p5 Coder 32b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 128k

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 128k is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 32k Rope

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 32k Rope is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 33K context

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 64k

Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 64k is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 66K context

Accounts/Fireworks/Models/Qwen2p5 Math 72b Instruct

Accounts/Fireworks/Models/Qwen2p5 Math 72b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Qwen2p5 Vl 32b Instruct

Accounts/Fireworks/Models/Qwen2p5 Vl 32b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 128K context

Accounts/Fireworks/Models/Qwen2p5 Vl 72b Instruct

Accounts/Fireworks/Models/Qwen2p5 Vl 72b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 128K context

Accounts/Fireworks/Models/Qwen3 30b A3b Thinking 2507

Accounts/Fireworks/Models/Qwen3 30b A3b Thinking 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 262K context

Accounts/Fireworks/Models/Qwen3 32b

Accounts/Fireworks/Models/Qwen3 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Qwen3 Coder 480b Instruct Bf16

Accounts/Fireworks/Models/Qwen3 Coder 480b Instruct Bf16 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Qwen3 Next 80b A3b Instruct

Accounts/Fireworks/Models/Qwen3 Next 80b A3b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Qwen3 Next 80b A3b Thinking

Accounts/Fireworks/Models/Qwen3 Next 80b A3b Thinking is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Qwen3 Vl 32b Instruct

Accounts/Fireworks/Models/Qwen3 Vl 32b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Qwq 32b

Accounts/Fireworks/Models/Qwq 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 131K context

Accounts/Fireworks/Models/Yi 34b

Accounts/Fireworks/Models/Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Yi 34b 200k Capybara

Accounts/Fireworks/Models/Yi 34b 200k Capybara is available via Fireworks AI with a 200K context window and up to 200,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 200K context

Accounts/Fireworks/Models/Yi 34b Chat

Accounts/Fireworks/Models/Yi 34b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 4K context

Accounts/Fireworks/Models/Deepseek Coder V2 Instruct

Accounts/Fireworks/Models/Deepseek Coder V2 Instruct is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 66K context

Accounts/Fireworks/Models/Mixtral 8x22b Instruct Hf

Accounts/Fireworks/Models/Mixtral 8x22b Instruct Hf is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 66K context

Accounts/Fireworks/Models/Cogito 671b V2 P1

Accounts/Fireworks/Models/Cogito 671b V2 P1 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 164K context

Accounts/Fireworks/Models/Dbrx Instruct

Accounts/Fireworks/Models/Dbrx Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 33K context

Accounts/Fireworks/Models/Deepseek Prover

Accounts/Fireworks/Models/Deepseek Prover is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 164K context

Accounts/Fireworks/Models/Deepseek V2p5

Accounts/Fireworks/Models/Deepseek V2p5 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 33K context

Accounts/Fireworks/Models/Glm 4p5v

Accounts/Fireworks/Models/Glm 4p5v is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 131K context

Accounts/Fireworks/Models/Gpt Oss Safeguard 120b

Accounts/Fireworks/Models/Gpt Oss Safeguard 120b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 131K context

Accounts/Fireworks/Models/Mistral Large 3 Fp8

Accounts/Fireworks/Models/Mistral Large 3 Fp8 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 256K context

Accounts/Fireworks/Models/Mixtral 8x22b

Accounts/Fireworks/Models/Mixtral 8x22b is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 66K context

Accounts/Fireworks/Models/Mixtral 8x22b Instruct

Accounts/Fireworks/Models/Mixtral 8x22b Instruct is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 66K context

Accounts/Fireworks/Models/Deepseek R1

Accounts/Fireworks/Models/Deepseek R1 is available via Fireworks AI with a 128K context window and up to 20,480 output tokens. Pricing: $3.00/1M input tokens, $8.00/1M output tokens.

$3.00 / 1M in 128K context

Accounts/Fireworks/Models/Deepseek R1 0528

Accounts/Fireworks/Models/Deepseek R1 0528 is available via Fireworks AI with a 160K context window and up to 160,000 output tokens. Pricing: $3.00/1M input tokens, $8.00/1M output tokens.

$3.00 / 1M in 160K context

Accounts/Fireworks/Models/Llama V3p1 405b Instruct

Accounts/Fireworks/Models/Llama V3p1 405b Instruct is available via Fireworks AI with a 128K context window and up to 16,384 output tokens. Pricing: $3.00/1M input tokens, $3.00/1M output tokens.

$3.00 / 1M in 128K context

Accounts/Fireworks/Models/Yi Large

Accounts/Fireworks/Models/Yi Large is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $3.00/1M output tokens.

$3.00 / 1M in 33K context

Azure OpenAI Models

View provider details →

Gpt 5 Nano

Gpt 5 Nano is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.0500/1M input tokens, $0.4000/1M output tokens.

$0.050 / 1M in 272K context

Gpt 5 Nano 2025 08 07

Gpt 5 Nano 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.0500/1M input tokens, $0.4000/1M output tokens.

$0.050 / 1M in 272K context

Eu/Gpt 5 Nano 2025 08 07

Eu/Gpt 5 Nano 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.0550/1M input tokens, $0.4400/1M output tokens.

$0.055 / 1M in 272K context

Us/Gpt 5 Nano 2025 08 07

Us/Gpt 5 Nano 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.0550/1M input tokens, $0.4400/1M output tokens.

$0.055 / 1M in 272K context

Gpt 4.1 Nano

Gpt 4.1 Nano is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gpt 4.1 Nano 2025 04 14

Gpt 4.1 Nano 2025 04 14 is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Us/Gpt 4.1 Nano 2025 04 14

Us/Gpt 4.1 Nano 2025 04 14 is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.1100/1M input tokens, $0.4400/1M output tokens.

$0.11 / 1M in 1.0M context

Global Standard/Gpt 4o Mini

Global Standard/Gpt 4o Mini is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Eu/Gpt 4o Mini 2024 07 18

Eu/Gpt 4o Mini 2024 07 18 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1650/1M input tokens, $0.6600/1M output tokens.

$0.17 / 1M in 128K context

Gpt 4o Mini

Gpt 4o Mini is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1650/1M input tokens, $0.6600/1M output tokens.

$0.17 / 1M in 128K context

Gpt 4o Mini 2024 07 18

Gpt 4o Mini 2024 07 18 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1650/1M input tokens, $0.6600/1M output tokens.

$0.17 / 1M in 128K context

Us/Gpt 4o Mini 2024 07 18

Us/Gpt 4o Mini 2024 07 18 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1650/1M input tokens, $0.6600/1M output tokens.

$0.17 / 1M in 128K context

Gpt 5.4 Nano

Gpt 5.4 Nano is available via Azure OpenAI with a 1.1M context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $1.25/1M output tokens.

$0.20 / 1M in 1.1M context

Gpt 5 Mini

Gpt 5 Mini is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.2500/1M input tokens, $2.00/1M output tokens.

$0.25 / 1M in 272K context

Gpt 5 Mini 2025 08 07

Gpt 5 Mini 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.2500/1M input tokens, $2.00/1M output tokens.

$0.25 / 1M in 272K context

Eu/Gpt 5 Mini 2025 08 07

Eu/Gpt 5 Mini 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.2750/1M input tokens, $2.20/1M output tokens.

$0.28 / 1M in 272K context

Us/Gpt 5 Mini 2025 08 07

Us/Gpt 5 Mini 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.2750/1M input tokens, $2.20/1M output tokens.

$0.28 / 1M in 272K context

Gpt 4.1 Mini

Gpt 4.1 Mini is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 1.0M context

Gpt 4.1 Mini 2025 04 14

Gpt 4.1 Mini 2025 04 14 is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 1.0M context

Us/Gpt 4.1 Mini 2025 04 14

Us/Gpt 4.1 Mini 2025 04 14 is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.4400/1M input tokens, $1.76/1M output tokens.

$0.44 / 1M in 1.0M context

Gpt 3.5 Turbo

Gpt 3.5 Turbo is available via Azure OpenAI with a 4K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 4K context

Gpt 3.5 Turbo 0125

Gpt 3.5 Turbo 0125 is available via Azure OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 16K context

Gpt 35 Turbo

Gpt 35 Turbo is available via Azure OpenAI with a 4K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 4K context

Gpt 35 Turbo 0125

Gpt 35 Turbo 0125 is available via Azure OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 16K context

Gpt Audio Mini 2025 10 06

Gpt Audio Mini 2025 10 06 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt 4o Mini Realtime Preview 2024 12 17

Gpt 4o Mini Realtime Preview 2024 12 17 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt Realtime Mini 2025 10 06

Gpt Realtime Mini 2025 10 06 is available via Azure OpenAI with a 32K context window and up to 4,096 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 32K context

Eu/Gpt 4o Mini Realtime Preview 2024 12 17

Eu/Gpt 4o Mini Realtime Preview 2024 12 17 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $0.6600/1M input tokens, $2.64/1M output tokens.

$0.66 / 1M in 128K context

Us/Gpt 4o Mini Realtime Preview 2024 12 17

Us/Gpt 4o Mini Realtime Preview 2024 12 17 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $0.6600/1M input tokens, $2.64/1M output tokens.

$0.66 / 1M in 128K context

Gpt 5.4 Mini

Gpt 5.4 Mini is available via Azure OpenAI with a 1.1M context window and up to 128,000 output tokens. Pricing: $0.7500/1M input tokens, $4.50/1M output tokens.

$0.75 / 1M in 1.1M context

Gpt 35 Turbo 1106

Gpt 35 Turbo 1106 is available via Azure OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $2.00/1M output tokens.

$1.00 / 1M in 16K context

O1 Mini 2024 09 12

O1 Mini 2024 09 12 is available via Azure OpenAI with a 128K context window and up to 65,536 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 128K context

O3 Mini

O3 Mini is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

O3 Mini 2025 01 31

O3 Mini 2025 01 31 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

O4 Mini

O4 Mini is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

O4 Mini 2025 04 16

O4 Mini 2025 04 16 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

Eu/O1 Mini 2024 09 12

Eu/O1 Mini 2024 09 12 is available via Azure OpenAI with a 128K context window and up to 65,536 output tokens. Pricing: $1.21/1M input tokens, $4.84/1M output tokens.

$1.21 / 1M in 128K context

Eu/O3 Mini 2025 01 31

Eu/O3 Mini 2025 01 31 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.21/1M input tokens, $4.84/1M output tokens.

$1.21 / 1M in 200K context

O1 Mini

O1 Mini is available via Azure OpenAI with a 128K context window and up to 65,536 output tokens. Pricing: $1.21/1M input tokens, $4.84/1M output tokens.

$1.21 / 1M in 128K context

Us/O1 Mini 2024 09 12

Us/O1 Mini 2024 09 12 is available via Azure OpenAI with a 128K context window and up to 65,536 output tokens. Pricing: $1.21/1M input tokens, $4.84/1M output tokens.

$1.21 / 1M in 128K context

Us/O3 Mini 2025 01 31

Us/O3 Mini 2025 01 31 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.21/1M input tokens, $4.84/1M output tokens.

$1.21 / 1M in 200K context

Us/O4 Mini 2025 04 16

Us/O4 Mini 2025 04 16 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.21/1M input tokens, $4.84/1M output tokens.

$1.21 / 1M in 200K context

Global/Gpt 5.1

Global/Gpt 5.1 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Global/Gpt 5.1 Chat

Global/Gpt 5.1 Chat is available via Azure OpenAI with a 128K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Gpt 5.1 2025 11 13

Gpt 5.1 2025 11 13 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5.1 Chat 2025 11 13

Gpt 5.1 Chat 2025 11 13 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Gpt 5

Gpt 5 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5 2025 08 07

Gpt 5 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5 Chat

Gpt 5 Chat is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Gpt 5 Chat Latest

Gpt 5 Chat Latest is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Gpt 5.1

Gpt 5.1 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5.1 Chat

Gpt 5.1 Chat is available via Azure OpenAI with a 128K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Eu/Gpt 5 2025 08 07

Eu/Gpt 5 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.38/1M input tokens, $11.00/1M output tokens.

$1.38 / 1M in 272K context

Us/Gpt 5 2025 08 07

Us/Gpt 5 2025 08 07 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.38/1M input tokens, $11.00/1M output tokens.

$1.38 / 1M in 272K context

Eu/Gpt 5.1

Eu/Gpt 5.1 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.38/1M input tokens, $11.00/1M output tokens.

$1.38 / 1M in 272K context

Eu/Gpt 5.1 Chat

Eu/Gpt 5.1 Chat is available via Azure OpenAI with a 128K context window and up to 128,000 output tokens. Pricing: $1.38/1M input tokens, $11.00/1M output tokens.

$1.38 / 1M in 128K context

Us/Gpt 5.1

Us/Gpt 5.1 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.38/1M input tokens, $11.00/1M output tokens.

$1.38 / 1M in 272K context

Us/Gpt 5.1 Chat

Us/Gpt 5.1 Chat is available via Azure OpenAI with a 128K context window and up to 128,000 output tokens. Pricing: $1.38/1M input tokens, $11.00/1M output tokens.

$1.38 / 1M in 128K context

Gpt 5.2

Gpt 5.2 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 272K context

Gpt 5.2 2025 12 11

Gpt 5.2 2025 12 11 is available via Azure OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 272K context

Gpt 5.2 Chat

Gpt 5.2 Chat is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 128K context

Gpt 5.2 Chat 2025 12 11

Gpt 5.2 Chat 2025 12 11 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 128K context

Gpt 5.3 Chat

Gpt 5.3 Chat is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 128K context

Gpt 4.1

Gpt 4.1 is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 1.0M context

Gpt 4.1 2025 04 14

Gpt 4.1 2025 04 14 is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 1.0M context

O3

O3 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 200K context

O3 2025 04 16

O3 2025 04 16 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 200K context

Us/Gpt 4.1 2025 04 14

Us/Gpt 4.1 2025 04 14 is available via Azure OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $2.20/1M input tokens, $8.80/1M output tokens.

$2.20 / 1M in 1.0M context

Us/O3 2025 04 16

Us/O3 2025 04 16 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $2.20/1M input tokens, $8.80/1M output tokens.

$2.20 / 1M in 200K context

Global Standard/Gpt 4o 2024 08 06

Global Standard/Gpt 4o 2024 08 06 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Global Standard/Gpt 4o 2024 11 20

Global Standard/Gpt 4o 2024 11 20 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Global/Gpt 4o 2024 08 06

Global/Gpt 4o 2024 08 06 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Global/Gpt 4o 2024 11 20

Global/Gpt 4o 2024 11 20 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o

Gpt 4o is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o 2024 08 06

Gpt 4o 2024 08 06 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt Audio 2025 08 28

Gpt Audio 2025 08 28 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt Audio 1.5 2026 02 23

Gpt Audio 1.5 2026 02 23 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o Audio Preview 2024 12 17

Gpt 4o Audio Preview 2024 12 17 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o Mini Audio Preview 2024 12 17

Gpt 4o Mini Audio Preview 2024 12 17 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 5.4

Gpt 5.4 is available via Azure OpenAI with a 1.1M context window and up to 128,000 output tokens. Pricing: $2.50/1M input tokens, $15.00/1M output tokens.

$2.50 / 1M in 1.1M context

Gpt 5.4 2026 03 05

Gpt 5.4 2026 03 05 is available via Azure OpenAI with a 1.1M context window and up to 128,000 output tokens. Pricing: $2.50/1M input tokens, $15.00/1M output tokens.

$2.50 / 1M in 1.1M context

Eu/Gpt 4o 2024 08 06

Eu/Gpt 4o 2024 08 06 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.75/1M input tokens, $11.00/1M output tokens.

$2.75 / 1M in 128K context

Eu/Gpt 4o 2024 11 20

Eu/Gpt 4o 2024 11 20 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.75/1M input tokens, $11.00/1M output tokens.

$2.75 / 1M in 128K context

Gpt 4o 2024 11 20

Gpt 4o 2024 11 20 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.75/1M input tokens, $11.00/1M output tokens.

$2.75 / 1M in 128K context

Us/Gpt 4o 2024 08 06

Us/Gpt 4o 2024 08 06 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.75/1M input tokens, $11.00/1M output tokens.

$2.75 / 1M in 128K context

Us/Gpt 4o 2024 11 20

Us/Gpt 4o 2024 11 20 is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.75/1M input tokens, $11.00/1M output tokens.

$2.75 / 1M in 128K context

Command R Plus

Command R Plus is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 128K context

Computer Use Preview

Computer Use Preview is available via Azure OpenAI with a 8K context window and up to 1,024 output tokens. Pricing: $3.00/1M input tokens, $12.00/1M output tokens.

$3.00 / 1M in 8K context

Gpt 35 Turbo 16k

Gpt 35 Turbo 16k is available via Azure OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $4.00/1M output tokens.

$3.00 / 1M in 16K context

Gpt 35 Turbo 16k 0613

Gpt 35 Turbo 16k 0613 is available via Azure OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $4.00/1M output tokens.

$3.00 / 1M in 16K context

Computer Use Preview

Computer Use Preview is available via Azure OpenAI with a 8K context window and up to 1,024 output tokens. Pricing: $3.00/1M input tokens, $12.00/1M output tokens.

$3.00 / 1M in 8K context

Gpt Realtime 2025 08 28

Gpt Realtime 2025 08 28 is available via Azure OpenAI with a 32K context window and up to 4,096 output tokens. Pricing: $4.00/1M input tokens, $16.00/1M output tokens.

$4.00 / 1M in 32K context

Gpt Realtime 1.5 2026 02 23

Gpt Realtime 1.5 2026 02 23 is available via Azure OpenAI with a 32K context window and up to 4,096 output tokens. Pricing: $4.00/1M input tokens, $16.00/1M output tokens.

$4.00 / 1M in 32K context

Gpt 4o 2024 05 13

Gpt 4o 2024 05 13 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $15.00/1M output tokens.

$5.00 / 1M in 128K context

Gpt 4o Realtime Preview 2024 10 01

Gpt 4o Realtime Preview 2024 10 01 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $20.00/1M output tokens.

$5.00 / 1M in 128K context

Gpt 4o Realtime Preview 2024 12 17

Gpt 4o Realtime Preview 2024 12 17 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $20.00/1M output tokens.

$5.00 / 1M in 128K context

Eu/Gpt 4o Realtime Preview 2024 10 01

Eu/Gpt 4o Realtime Preview 2024 10 01 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.50/1M input tokens, $22.00/1M output tokens.

$5.50 / 1M in 128K context

Eu/Gpt 4o Realtime Preview 2024 12 17

Eu/Gpt 4o Realtime Preview 2024 12 17 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.50/1M input tokens, $22.00/1M output tokens.

$5.50 / 1M in 128K context

Us/Gpt 4o Realtime Preview 2024 10 01

Us/Gpt 4o Realtime Preview 2024 10 01 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.50/1M input tokens, $22.00/1M output tokens.

$5.50 / 1M in 128K context

Us/Gpt 4o Realtime Preview 2024 12 17

Us/Gpt 4o Realtime Preview 2024 12 17 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.50/1M input tokens, $22.00/1M output tokens.

$5.50 / 1M in 128K context

Mistral Large 2402

Mistral Large 2402 is available via Azure OpenAI with a 32K context window and up to 32,000 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 32K context

Mistral Large Latest

Mistral Large Latest is available via Azure OpenAI with a 32K context window and up to 32,000 output tokens. Pricing: $8.00/1M input tokens, $24.00/1M output tokens.

$8.00 / 1M in 32K context

Gpt 4 0125 Preview

Gpt 4 0125 Preview is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Gpt 4 1106 Preview

Gpt 4 1106 Preview is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Gpt 4 Turbo

Gpt 4 Turbo is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Gpt 4 Turbo 2024 04 09

Gpt 4 Turbo 2024 04 09 is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Gpt 4 Turbo Vision Preview

Gpt 4 Turbo Vision Preview is available via Azure OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

O1

O1 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $15.00/1M input tokens, $60.00/1M output tokens.

$15.00 / 1M in 200K context

O1 2024 12 17

O1 2024 12 17 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $15.00/1M input tokens, $60.00/1M output tokens.

$15.00 / 1M in 200K context

O1 Preview

O1 Preview is available via Azure OpenAI with a 128K context window and up to 32,768 output tokens. Pricing: $15.00/1M input tokens, $60.00/1M output tokens.

$15.00 / 1M in 128K context

O1 Preview 2024 09 12

O1 Preview 2024 09 12 is available via Azure OpenAI with a 128K context window and up to 32,768 output tokens. Pricing: $15.00/1M input tokens, $60.00/1M output tokens.

$15.00 / 1M in 128K context

Eu/O1 2024 12 17

Eu/O1 2024 12 17 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $16.50/1M input tokens, $66.00/1M output tokens.

$16.50 / 1M in 200K context

Eu/O1 Preview 2024 09 12

Eu/O1 Preview 2024 09 12 is available via Azure OpenAI with a 128K context window and up to 32,768 output tokens. Pricing: $16.50/1M input tokens, $66.00/1M output tokens.

$16.50 / 1M in 128K context

Us/O1 2024 12 17

Us/O1 2024 12 17 is available via Azure OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $16.50/1M input tokens, $66.00/1M output tokens.

$16.50 / 1M in 200K context

Us/O1 Preview 2024 09 12

Us/O1 Preview 2024 09 12 is available via Azure OpenAI with a 128K context window and up to 32,768 output tokens. Pricing: $16.50/1M input tokens, $66.00/1M output tokens.

$16.50 / 1M in 128K context

Gpt 4

Gpt 4 is available via Azure OpenAI with a 8K context window and up to 4,096 output tokens. Pricing: $30.00/1M input tokens, $60.00/1M output tokens.

$30.00 / 1M in 8K context

Gpt 4 0613

Gpt 4 0613 is available via Azure OpenAI with a 8K context window and up to 4,096 output tokens. Pricing: $30.00/1M input tokens, $60.00/1M output tokens.

$30.00 / 1M in 8K context

Gpt 4 32k

Gpt 4 32k is available via Azure OpenAI with a 33K context window and up to 4,096 output tokens. Pricing: $60.00/1M input tokens, $120.00/1M output tokens.

$60.00 / 1M in 33K context

Gpt 4 32k 0613

Gpt 4 32k 0613 is available via Azure OpenAI with a 33K context window and up to 4,096 output tokens. Pricing: $60.00/1M input tokens, $120.00/1M output tokens.

$60.00 / 1M in 33K context

Gpt 4.5 Preview

Gpt 4.5 Preview is available via Azure OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $75.00/1M input tokens, $150.00/1M output tokens.

$75.00 / 1M in 128K context

Google Vertex AI Models

View provider details →

Meta/Llama 3.1 70b Instruct Maas

Meta/Llama 3.1 70b Instruct Maas is available via Google Vertex AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

Meta/Llama 3.1 8b Instruct Maas

Meta/Llama 3.1 8b Instruct Maas is available via Google Vertex AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

Meta/Llama 3.2 90b Vision Instruct Maas

Meta/Llama 3.2 90b Vision Instruct Maas is available via Google Vertex AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

Meta/Llama3 405b Instruct Maas

Meta/Llama3 405b Instruct Maas is available via Google Vertex AI with a 32K context window and up to 32,000 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 32K context

Meta/Llama3 70b Instruct Maas

Meta/Llama3 70b Instruct Maas is available via Google Vertex AI with a 32K context window and up to 32,000 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 32K context

Meta/Llama3 8b Instruct Maas

Meta/Llama3 8b Instruct Maas is available via Google Vertex AI with a 32K context window and up to 32,000 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 32K context

Gemini 2.0 Flash Lite

Gemini 2.0 Flash Lite is available via Google Vertex AI with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 1.0M context

Gemini 2.0 Flash Lite 001

Gemini 2.0 Flash Lite 001 is available via Google Vertex AI with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 1.0M context

Openai/Gpt Oss 20b Maas

Openai/Gpt Oss 20b Maas is available via Google Vertex AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 131K context

Gemini 2.0 Flash

Gemini 2.0 Flash is available via Google Vertex AI with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini 2.5 Flash Lite

Gemini 2.5 Flash Lite is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini 2.5 Flash Lite Preview 09 2025

Gemini 2.5 Flash Lite Preview 09 2025 is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini 2.5 Flash Lite Preview 06 17

Gemini 2.5 Flash Lite Preview 06 17 is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini 2.0 Flash 001

Gemini 2.0 Flash 001 is available via Google Vertex AI with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 1.0M context

Mistral Nemo@Latest

Mistral Nemo@Latest is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Openai/Gpt Oss 120b Maas

Openai/Gpt Oss 120b Maas is available via Google Vertex AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Qwen/Qwen3 Next 80b A3b Instruct Maas

Qwen/Qwen3 Next 80b A3b Instruct Maas is available via Google Vertex AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $1.20/1M output tokens.

$0.15 / 1M in 262K context

Qwen/Qwen3 Next 80b A3b Thinking Maas

Qwen/Qwen3 Next 80b A3b Thinking Maas is available via Google Vertex AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $1.20/1M output tokens.

$0.15 / 1M in 262K context

Codestral 2501

Codestral 2501 is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 128K context

Codestral

Codestral is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 128K context

Codestral@Latest

Codestral@Latest is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 128K context

Jamba 1.5

Jamba 1.5 is available via Google Vertex AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Jamba 1.5 Mini

Jamba 1.5 Mini is available via Google Vertex AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Jamba 1.5 Mini

Jamba 1.5 Mini is available via Google Vertex AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is available via Google Vertex AI with a 1.0M context window and up to 65,536 output tokens. Pricing: $0.2500/1M input tokens, $1.50/1M output tokens.

$0.25 / 1M in 1.0M context

Claude 3 Haiku

Claude 3 Haiku is available via Google Vertex AI with a 200K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Claude 3 Haiku

Claude 3 Haiku is available via Google Vertex AI with a 200K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is available via Google Vertex AI with a 1.0M context window and up to 65,536 output tokens. Pricing: $0.2500/1M input tokens, $1.50/1M output tokens.

$0.25 / 1M in 1.0M context

Meta/Llama 4 Scout 17b 128e Instruct Maas

Meta/Llama 4 Scout 17b 128e Instruct Maas is available via Google Vertex AI with a 10M context window and up to 10,000,000 output tokens. Pricing: $0.2500/1M input tokens, $0.7000/1M output tokens.

$0.25 / 1M in 10M context

Meta/Llama 4 Scout 17b 16e Instruct Maas

Meta/Llama 4 Scout 17b 16e Instruct Maas is available via Google Vertex AI with a 10M context window and up to 10,000,000 output tokens. Pricing: $0.2500/1M input tokens, $0.7000/1M output tokens.

$0.25 / 1M in 10M context

Qwen/Qwen3 235b A22b Instruct 2507 Maas

Qwen/Qwen3 235b A22b Instruct 2507 Maas is available via Google Vertex AI with a 262K context window and up to 16,384 output tokens. Pricing: $0.2500/1M input tokens, $1.00/1M output tokens.

$0.25 / 1M in 262K context

Gemini 2.5 Flash

Gemini 2.5 Flash is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash Preview 09 2025

Gemini 2.5 Flash Preview 09 2025 is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini Robotics Er 1.5 Preview

Gemini Robotics Er 1.5 Preview is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Mistralai/Codestral 2

Mistralai/Codestral 2 is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.3000/1M input tokens, $0.9000/1M output tokens.

$0.30 / 1M in 128K context

Codestral 2

Codestral 2 is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.3000/1M input tokens, $0.9000/1M output tokens.

$0.30 / 1M in 128K context

Codestral 2

Codestral 2 is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.3000/1M input tokens, $0.9000/1M output tokens.

$0.30 / 1M in 128K context

Mistralai/Codestral 2

Mistralai/Codestral 2 is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.3000/1M input tokens, $0.9000/1M output tokens.

$0.30 / 1M in 128K context

Minimaxai/Minimax M2 Maas

Minimaxai/Minimax M2 Maas is available via Google Vertex AI with a 197K context window and up to 196,608 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 197K context

Meta/Llama 4 Maverick 17b 128e Instruct Maas

Meta/Llama 4 Maverick 17b 128e Instruct Maas is available via Google Vertex AI with a 1M context window and up to 1,000,000 output tokens. Pricing: $0.3500/1M input tokens, $1.15/1M output tokens.

$0.35 / 1M in 1M context

Meta/Llama 4 Maverick 17b 16e Instruct Maas

Meta/Llama 4 Maverick 17b 16e Instruct Maas is available via Google Vertex AI with a 1M context window and up to 1,000,000 output tokens. Pricing: $0.3500/1M input tokens, $1.15/1M output tokens.

$0.35 / 1M in 1M context

Mistral Medium 3

Mistral Medium 3 is available via Google Vertex AI with a 128K context window and up to 8,191 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 128K context

Mistral Medium 3

Mistral Medium 3 is available via Google Vertex AI with a 128K context window and up to 8,191 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 128K context

Mistralai/Mistral Medium 3

Mistralai/Mistral Medium 3 is available via Google Vertex AI with a 128K context window and up to 8,191 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 128K context

Mistralai/Mistral Medium 3

Mistralai/Mistral Medium 3 is available via Google Vertex AI with a 128K context window and up to 8,191 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 128K context

Gemini 3 Flash Preview

Gemini 3 Flash Preview is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.5000/1M input tokens, $3.00/1M output tokens.

$0.50 / 1M in 1.0M context

Gemini 3 Flash Preview

Gemini 3 Flash Preview is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.5000/1M input tokens, $3.00/1M output tokens.

$0.50 / 1M in 1.0M context

Deepseek Ai/Deepseek V3.2 Maas

Deepseek Ai/Deepseek V3.2 Maas is available via Google Vertex AI with a 164K context window and up to 32,768 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.

$0.56 / 1M in 164K context

Moonshotai/Kimi K2 Thinking Maas

Moonshotai/Kimi K2 Thinking Maas is available via Google Vertex AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 256K context

Zai Org/Glm 4.7 Maas

Zai Org/Glm 4.7 Maas is available via Google Vertex AI with a 200K context window and up to 128,000 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 200K context

Claude 3 5 Haiku

Claude 3 5 Haiku is available via Google Vertex AI with a 200K context window and up to 8,192 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Claude 3 5 Haiku

Claude 3 5 Haiku is available via Google Vertex AI with a 200K context window and up to 8,192 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Claude Haiku 4 5

Claude Haiku 4 5 is available via Google Vertex AI with a 200K context window and up to 8,192 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Claude Haiku 4 5

Claude Haiku 4 5 is available via Google Vertex AI with a 200K context window and up to 8,192 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Zai Org/Glm 5 Maas

Zai Org/Glm 5 Maas is available via Google Vertex AI with a 200K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $3.20/1M output tokens.

$1.00 / 1M in 200K context

Mistral Small 2503

Mistral Small 2503 is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 128K context

Mistral Small 2503

Mistral Small 2503 is available via Google Vertex AI with a 32K context window and up to 8,191 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 32K context

Qwen/Qwen3 Coder 480b A35b Instruct Maas

Qwen/Qwen3 Coder 480b A35b Instruct Maas is available via Google Vertex AI with a 262K context window and up to 32,768 output tokens. Pricing: $1.00/1M input tokens, $4.00/1M output tokens.

$1.00 / 1M in 262K context

Gemini 2.5 Pro

Gemini 2.5 Pro is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Gemini 2.5 Pro Preview Tts

Gemini 2.5 Pro Preview Tts is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Gemini 2.5 Computer Use Preview 10 2025

Gemini 2.5 Computer Use Preview 10 2025 is available via Google Vertex AI with a 128K context window and up to 64,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Deepseek Ai/Deepseek V3.1 Maas

Deepseek Ai/Deepseek V3.1 Maas is available via Google Vertex AI with a 164K context window and up to 32,768 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.

$1.35 / 1M in 164K context

Deepseek Ai/Deepseek R1 0528 Maas

Deepseek Ai/Deepseek R1 0528 Maas is available via Google Vertex AI with a 65K context window and up to 8,192 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.

$1.35 / 1M in 65K context

Gemini 3 Pro Preview

Gemini 3 Pro Preview is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is available via Google Vertex AI with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Gemini 3.1 Pro Preview Customtools

Gemini 3.1 Pro Preview Customtools is available via Google Vertex AI with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Gemini 3 Pro Preview

Gemini 3 Pro Preview is available via Google Vertex AI with a 1.0M context window and up to 65,535 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is available via Google Vertex AI with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Gemini 3.1 Pro Preview Customtools

Gemini 3.1 Pro Preview Customtools is available via Google Vertex AI with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Jamba 1.5 Large

Jamba 1.5 Large is available via Google Vertex AI with a 256K context window and up to 256,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 256K context

Jamba 1.5 Large

Jamba 1.5 Large is available via Google Vertex AI with a 256K context window and up to 256,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 256K context

Mistral Large 2411

Mistral Large 2411 is available via Google Vertex AI with a 128K context window and up to 8,191 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Mistral Large

Mistral Large is available via Google Vertex AI with a 128K context window and up to 8,191 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Mistral Large@2411 001

Mistral Large@2411 001 is available via Google Vertex AI with a 128K context window and up to 8,191 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Mistral Large@Latest

Mistral Large@Latest is available via Google Vertex AI with a 128K context window and up to 8,191 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Claude 3 5 Sonnet

Claude 3 5 Sonnet is available via Google Vertex AI with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude 3 5 Sonnet

Claude 3 5 Sonnet is available via Google Vertex AI with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude 3 7 Sonnet

Claude 3 7 Sonnet is available via Google Vertex AI with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude 3 Sonnet

Claude 3 Sonnet is available via Google Vertex AI with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude 3 Sonnet

Claude 3 Sonnet is available via Google Vertex AI with a 200K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude Sonnet 4 5

Claude Sonnet 4 5 is available via Google Vertex AI with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude Sonnet 4 6

Claude Sonnet 4 6 is available via Google Vertex AI with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Claude Sonnet 4 5

Claude Sonnet 4 5 is available via Google Vertex AI with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude Sonnet 4

Claude Sonnet 4 is available via Google Vertex AI with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Claude Sonnet 4

Claude Sonnet 4 is available via Google Vertex AI with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Mistral Nemo

Mistral Nemo is available via Google Vertex AI with a 128K context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $3.00/1M output tokens.

$3.00 / 1M in 128K context

Claude Sonnet 4 6@Default

Claude Sonnet 4 6@Default is available via Google Vertex AI with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Claude Opus 4 5

Claude Opus 4 5 is available via Google Vertex AI with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Claude Opus 4 5

Claude Opus 4 5 is available via Google Vertex AI with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Claude Opus 4 6

Claude Opus 4 6 is available via Google Vertex AI with a 1M context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 1M context

Claude Opus 4 6@Default

Claude Opus 4 6@Default is available via Google Vertex AI with a 1M context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 1M context

Meta/Llama 3.1 405b Instruct Maas

Meta/Llama 3.1 405b Instruct Maas is available via Google Vertex AI with a 128K context window and up to 2,048 output tokens. Pricing: $5.00/1M input tokens, $16.00/1M output tokens.

$5.00 / 1M in 128K context

Claude 3 Opus

Claude 3 Opus is available via Google Vertex AI with a 200K context window and up to 4,096 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude 3 Opus

Claude 3 Opus is available via Google Vertex AI with a 200K context window and up to 4,096 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude Opus 4

Claude Opus 4 is available via Google Vertex AI with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude Opus 4 1

Claude Opus 4 1 is available via Google Vertex AI with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude Opus 4 1

Claude Opus 4 1 is available via Google Vertex AI with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude Opus 4

Claude Opus 4 is available via Google Vertex AI with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

OpenAI Models

View provider details →

Gpt 5 Nano

Gpt 5 Nano is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.0500/1M input tokens, $0.4000/1M output tokens.

$0.050 / 1M in 272K context

Gpt 5 Nano 2025 08 07

Gpt 5 Nano 2025 08 07 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.0500/1M input tokens, $0.4000/1M output tokens.

$0.050 / 1M in 272K context

Gpt 4.1 Nano

NEW

GPT-4.1 Nano is OpenAI's fastest and cheapest model, designed for classification, autocompletion, and lightweight agentic tasks. Despite its low cost, it supports the full 1M context window.

$0.10 / 1M in 1.0M context

Gpt 4.1 Nano 2025 04 14

Gpt 4.1 Nano 2025 04 14 is available via OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gpt 4o Mini

GPT-4o Mini is OpenAI's most affordable model, replacing GPT-3.5 Turbo as the default for high-volume applications. It scores higher than GPT-4 on many benchmarks while costing 60% less than GPT-3.5 Turbo.

$0.15 / 1M in 128K context

Gpt 4o Mini 2024 07 18

Gpt 4o Mini 2024 07 18 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Gpt 4o Mini Audio Preview

Gpt 4o Mini Audio Preview is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Gpt 4o Mini Audio Preview 2024 12 17

Gpt 4o Mini Audio Preview 2024 12 17 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Gpt 4o Mini Search Preview

Gpt 4o Mini Search Preview is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Gpt 4o Mini Search Preview 2025 03 11

Gpt 4o Mini Search Preview 2025 03 11 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Ft:Gpt 4.1 Nano 2025 04 14

Ft:Gpt 4.1 Nano 2025 04 14 is available via OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.8000/1M output tokens.

$0.20 / 1M in 1.0M context

Gpt 5.4 Nano

Gpt 5.4 Nano is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $1.25/1M output tokens.

$0.20 / 1M in 272K context

Gpt 5 Mini

Gpt 5 Mini is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.2500/1M input tokens, $2.00/1M output tokens.

$0.25 / 1M in 272K context

Gpt 5 Mini 2025 08 07

Gpt 5 Mini 2025 08 07 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.2500/1M input tokens, $2.00/1M output tokens.

$0.25 / 1M in 272K context

Ft:Gpt 4o Mini 2024 07 18

Ft:Gpt 4o Mini 2024 07 18 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 128K context

Gpt 4.1 Mini

NEW

GPT-4.1 Mini balances performance and cost with a 1M token context window. It delivers strong coding and instruction-following ability at a price point suitable for high-volume production workloads.

$0.40 / 1M in 1.0M context

Gpt 4.1 Mini 2025 04 14

Gpt 4.1 Mini 2025 04 14 is available via OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 1.0M context

Gpt 3.5 Turbo

Gpt 3.5 Turbo is available via OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 16K context

Gpt 3.5 Turbo 0125

Gpt 3.5 Turbo 0125 is available via OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 16K context

Gpt Audio Mini

Gpt Audio Mini is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt Audio Mini 2025 10 06

Gpt Audio Mini 2025 10 06 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt Audio Mini 2025 12 15

Gpt Audio Mini 2025 12 15 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt 4o Mini Realtime Preview

Gpt 4o Mini Realtime Preview is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt 4o Mini Realtime Preview 2024 12 17

Gpt 4o Mini Realtime Preview 2024 12 17 is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt Realtime Mini

Gpt Realtime Mini is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt Realtime Mini 2025 10 06

Gpt Realtime Mini 2025 10 06 is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt Realtime Mini 2025 12 15

Gpt Realtime Mini 2025 12 15 is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $0.6000/1M input tokens, $2.40/1M output tokens.

$0.60 / 1M in 128K context

Gpt 5.4 Mini

Gpt 5.4 Mini is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $0.7500/1M input tokens, $4.50/1M output tokens.

$0.75 / 1M in 272K context

Ft:Gpt 4.1 Mini 2025 04 14

Ft:Gpt 4.1 Mini 2025 04 14 is available via OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.8000/1M input tokens, $3.20/1M output tokens.

$0.80 / 1M in 1.0M context

Gpt 3.5 Turbo 1106

Gpt 3.5 Turbo 1106 is available via OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $2.00/1M output tokens.

$1.00 / 1M in 16K context

O3 Mini

O3 Mini is available via OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

O3 Mini 2025 01 31

O3 Mini 2025 01 31 is available via OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

O4 Mini

O4 Mini is available via OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

O4 Mini 2025 04 16

O4 Mini 2025 04 16 is available via OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

Gpt 5

Gpt 5 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5.1

Gpt 5.1 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5.1 2025 11 13

Gpt 5.1 2025 11 13 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5.1 Chat Latest

Gpt 5.1 Chat Latest is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Gpt 5 2025 08 07

Gpt 5 2025 08 07 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5 Chat

Gpt 5 Chat is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Gpt 5 Chat Latest

Gpt 5 Chat Latest is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Gpt 5 Search Api

Gpt 5 Search Api is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5 Search Api 2025 10 14

Gpt 5 Search Api 2025 10 14 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Gpt 5.2

Gpt 5.2 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 272K context

Gpt 5.2 2025 12 11

Gpt 5.2 2025 12 11 is available via OpenAI with a 272K context window and up to 128,000 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 272K context

Gpt 5.2 Chat Latest

Gpt 5.2 Chat Latest is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 128K context

Gpt 5.3 Chat Latest

Gpt 5.3 Chat Latest is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 128K context

Gpt 4.1

NEW

GPT-4.1 is OpenAI's latest flagship model optimized for coding, instruction-following, and long-context tasks. With a 1M token context window and strong agentic performance, it is the recommended default for most applications.

$2.00 / 1M in 1.0M context

Gpt 4.1 2025 04 14

Gpt 4.1 2025 04 14 is available via OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 1.0M context

O3

NEW

o3 is OpenAI's most powerful reasoning model, surpassing o1 on math, coding, and science benchmarks. It supports tool use and vision, making it the top choice for the hardest analytical and code-generation tasks.

$2.00 / 1M in 200K context

O3 2025 04 16

O3 2025 04 16 is available via OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 200K context

Gpt 4o

GPT-4o is OpenAI's flagship multimodal model, processing text, images, and audio in a single architecture. It delivers GPT-4-class intelligence at half the price and 2x the speed of GPT-4 Turbo.

$2.50 / 1M in 128K context

Gpt 4o 2024 08 06

Gpt 4o 2024 08 06 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o 2024 11 20

Gpt 4o 2024 11 20 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o Audio Preview

Gpt 4o Audio Preview is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o Audio Preview 2024 12 17

Gpt 4o Audio Preview 2024 12 17 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o Audio Preview 2025 06 03

Gpt 4o Audio Preview 2025 06 03 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt Audio

Gpt Audio is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt Audio 1.5

Gpt Audio 1.5 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt Audio 2025 08 28

Gpt Audio 2025 08 28 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o Search Preview

Gpt 4o Search Preview is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 4o Search Preview 2025 03 11

Gpt 4o Search Preview 2025 03 11 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Gpt 5.4

Gpt 5.4 is available via OpenAI with a 1.1M context window and up to 128,000 output tokens. Pricing: $2.50/1M input tokens, $15.00/1M output tokens.

$2.50 / 1M in 1.1M context

Gpt 5.4 2026 03 05

Gpt 5.4 2026 03 05 is available via OpenAI with a 1.1M context window and up to 128,000 output tokens. Pricing: $2.50/1M input tokens, $15.00/1M output tokens.

$2.50 / 1M in 1.1M context

Ft:Gpt 3.5 Turbo

Ft:Gpt 3.5 Turbo is available via OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $6.00/1M output tokens.

$3.00 / 1M in 16K context

Ft:Gpt 3.5 Turbo 0125

Ft:Gpt 3.5 Turbo 0125 is available via OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $6.00/1M output tokens.

$3.00 / 1M in 16K context

Ft:Gpt 3.5 Turbo 0613

Ft:Gpt 3.5 Turbo 0613 is available via OpenAI with a 4K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $6.00/1M output tokens.

$3.00 / 1M in 4K context

Ft:Gpt 3.5 Turbo 1106

Ft:Gpt 3.5 Turbo 1106 is available via OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $6.00/1M output tokens.

$3.00 / 1M in 16K context

Ft:Gpt 4.1 2025 04 14

Ft:Gpt 4.1 2025 04 14 is available via OpenAI with a 1.0M context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $12.00/1M output tokens.

$3.00 / 1M in 1.0M context

Gpt 3.5 Turbo 16k

Gpt 3.5 Turbo 16k is available via OpenAI with a 16K context window and up to 4,096 output tokens. Pricing: $3.00/1M input tokens, $4.00/1M output tokens.

$3.00 / 1M in 16K context

Ft:Gpt 4o 2024 08 06

Ft:Gpt 4o 2024 08 06 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $3.75/1M input tokens, $15.00/1M output tokens.

$3.75 / 1M in 128K context

Ft:Gpt 4o 2024 11 20

Ft:Gpt 4o 2024 11 20 is available via OpenAI with a 128K context window and up to 16,384 output tokens. Pricing: $3.75/1M input tokens, $15.00/1M output tokens.

$3.75 / 1M in 128K context

Ft:O4 Mini 2025 04 16

Ft:O4 Mini 2025 04 16 is available via OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $4.00/1M input tokens, $16.00/1M output tokens.

$4.00 / 1M in 200K context

Gpt Realtime

Gpt Realtime is available via OpenAI with a 32K context window and up to 4,096 output tokens. Pricing: $4.00/1M input tokens, $16.00/1M output tokens.

$4.00 / 1M in 32K context

Gpt Realtime 1.5

Gpt Realtime 1.5 is available via OpenAI with a 32K context window and up to 4,096 output tokens. Pricing: $4.00/1M input tokens, $16.00/1M output tokens.

$4.00 / 1M in 32K context

Gpt Realtime 2025 08 28

Gpt Realtime 2025 08 28 is available via OpenAI with a 32K context window and up to 4,096 output tokens. Pricing: $4.00/1M input tokens, $16.00/1M output tokens.

$4.00 / 1M in 32K context

Chatgpt 4o Latest

Chatgpt 4o Latest is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $15.00/1M output tokens.

$5.00 / 1M in 128K context

Gpt 4o 2024 05 13

Gpt 4o 2024 05 13 is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $15.00/1M output tokens.

$5.00 / 1M in 128K context

Gpt 4o Realtime Preview

Gpt 4o Realtime Preview is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $20.00/1M output tokens.

$5.00 / 1M in 128K context

Gpt 4o Realtime Preview 2024 12 17

Gpt 4o Realtime Preview 2024 12 17 is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $20.00/1M output tokens.

$5.00 / 1M in 128K context

Gpt 4o Realtime Preview 2025 06 03

Gpt 4o Realtime Preview 2025 06 03 is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $20.00/1M output tokens.

$5.00 / 1M in 128K context

Gpt 4 0125 Preview

Gpt 4 0125 Preview is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Gpt 4 1106 Preview

Gpt 4 1106 Preview is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Gpt 4 Turbo

Gpt 4 Turbo is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Gpt 4 Turbo 2024 04 09

Gpt 4 Turbo 2024 04 09 is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Gpt 4 Turbo Preview

Gpt 4 Turbo Preview is available via OpenAI with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

O1

O1 is available via OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $15.00/1M input tokens, $60.00/1M output tokens.

$15.00 / 1M in 200K context

O1 2024 12 17

O1 2024 12 17 is available via OpenAI with a 200K context window and up to 100,000 output tokens. Pricing: $15.00/1M input tokens, $60.00/1M output tokens.

$15.00 / 1M in 200K context

Ft:Gpt 4 0613

Ft:Gpt 4 0613 is available via OpenAI with a 8K context window and up to 4,096 output tokens. Pricing: $30.00/1M input tokens, $60.00/1M output tokens.

$30.00 / 1M in 8K context

Gpt 4

Gpt 4 is available via OpenAI with a 8K context window and up to 4,096 output tokens. Pricing: $30.00/1M input tokens, $60.00/1M output tokens.

$30.00 / 1M in 8K context

Gpt 4 0314

Gpt 4 0314 is available via OpenAI with a 8K context window and up to 4,096 output tokens. Pricing: $30.00/1M input tokens, $60.00/1M output tokens.

$30.00 / 1M in 8K context

Gpt 4 0613

Gpt 4 0613 is available via OpenAI with a 8K context window and up to 4,096 output tokens. Pricing: $30.00/1M input tokens, $60.00/1M output tokens.

$30.00 / 1M in 8K context

Vercel AI Gateway Models

View provider details →

Amazon/Nova Micro

Amazon/Nova Micro is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.0350/1M input tokens, $0.1400/1M output tokens.

$0.035 / 1M in 128K context

Mistral/Ministral 3b

Mistral/Ministral 3b is available via Vercel AI Gateway with a 128K context window and up to 4,000 output tokens. Pricing: $0.0400/1M input tokens, $0.0400/1M output tokens.

$0.040 / 1M in 128K context

Meta/Llama 3 8b

Meta/Llama 3 8b is available via Vercel AI Gateway with a 8K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.0800/1M output tokens.

$0.050 / 1M in 8K context

Meta/Llama 3.1 8b

Meta/Llama 3.1 8b is available via Vercel AI Gateway with a 131K context window and up to 131,072 output tokens. Pricing: $0.0500/1M input tokens, $0.0800/1M output tokens.

$0.050 / 1M in 131K context

Amazon/Nova Lite

Amazon/Nova Lite is available via Vercel AI Gateway with a 300K context window and up to 8,192 output tokens. Pricing: $0.0600/1M input tokens, $0.2400/1M output tokens.

$0.060 / 1M in 300K context

Mistral/Devstral Small

Mistral/Devstral Small is available via Vercel AI Gateway with a 128K context window and up to 128,000 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

$0.070 / 1M in 128K context

Google/Gemini 2.0 Flash Lite

Google/Gemini 2.0 Flash Lite is available via Vercel AI Gateway with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 1.0M context

Alibaba/Qwen 3 14b

Alibaba/Qwen 3 14b is available via Vercel AI Gateway with a 41K context window and up to 16,384 output tokens. Pricing: $0.0800/1M input tokens, $0.2400/1M output tokens.

$0.080 / 1M in 41K context

Alibaba/Qwen 3 30b

Alibaba/Qwen 3 30b is available via Vercel AI Gateway with a 41K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 41K context

Alibaba/Qwen 3 32b

Alibaba/Qwen 3 32b is available via Vercel AI Gateway with a 41K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 41K context

Meta/Llama 3.2 1b

Meta/Llama 3.2 1b is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 128K context

Meta/Llama 4 Scout

Meta/Llama 4 Scout is available via Vercel AI Gateway with a 131K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 131K context

Mistral/Ministral 8b

Mistral/Ministral 8b is available via Vercel AI Gateway with a 128K context window and up to 4,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 128K context

Mistral/Mistral Small

Mistral/Mistral Small is available via Vercel AI Gateway with a 32K context window and up to 4,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 32K context

Openai/Gpt 4.1 Nano

Openai/Gpt 4.1 Nano is available via Vercel AI Gateway with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Cohere/Command R

Cohere/Command R is available via Vercel AI Gateway with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Google/Gemini 2.0 Flash

Google/Gemini 2.0 Flash is available via Vercel AI Gateway with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 1.0M context

Meta/Llama 3.2 3b

Meta/Llama 3.2 3b is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Mistral/Pixtral 12b

Mistral/Pixtral 12b is available via Vercel AI Gateway with a 128K context window and up to 4,000 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Openai/Gpt 4o Mini

Openai/Gpt 4o Mini is available via Vercel AI Gateway with a 128K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Meta/Llama 3.2 11b

Meta/Llama 3.2 11b is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.1600/1M input tokens, $0.1600/1M output tokens.

$0.16 / 1M in 128K context

Alibaba/Qwen 3 235b

Alibaba/Qwen 3 235b is available via Vercel AI Gateway with a 41K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 41K context

Google/Gemma 2 9b

Google/Gemma 2 9b is available via Vercel AI Gateway with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Meta/Llama 4 Maverick

Meta/Llama 4 Maverick is available via Vercel AI Gateway with a 131K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 131K context

Zai/Glm 4.5 Air

Zai/Glm 4.5 Air is available via Vercel AI Gateway with a 128K context window and up to 96,000 output tokens. Pricing: $0.2000/1M input tokens, $1.10/1M output tokens.

$0.20 / 1M in 128K context

Anthropic/Claude 3 Haiku

Anthropic/Claude 3 Haiku is available via Vercel AI Gateway with a 200K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Inception/Mercury Coder Small

Inception/Mercury Coder Small is available via Vercel AI Gateway with a 32K context window and up to 16,384 output tokens. Pricing: $0.2500/1M input tokens, $1.00/1M output tokens.

$0.25 / 1M in 32K context

Google/Gemini 2.5 Flash

Google/Gemini 2.5 Flash is available via Vercel AI Gateway with a 1M context window and up to 65,536 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1M context

Mistral/Codestral

Mistral/Codestral is available via Vercel AI Gateway with a 256K context window and up to 4,000 output tokens. Pricing: $0.3000/1M input tokens, $0.9000/1M output tokens.

$0.30 / 1M in 256K context

Xai/Grok 3 Mini

Xai/Grok 3 Mini is available via Vercel AI Gateway with a 131K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $0.5000/1M output tokens.

$0.30 / 1M in 131K context

Alibaba/Qwen3 Coder

Alibaba/Qwen3 Coder is available via Vercel AI Gateway with a 262K context window and up to 66,536 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 262K context

Openai/Gpt 4.1 Mini

Openai/Gpt 4.1 Mini is available via Vercel AI Gateway with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 1.0M context

Zai/Glm 4.6

Zai/Glm 4.6 is available via Vercel AI Gateway with a 200K context window and up to 200,000 output tokens. Pricing: $0.4500/1M input tokens, $1.80/1M output tokens.

$0.45 / 1M in 200K context

Mistral/Magistral Small

Mistral/Magistral Small is available via Vercel AI Gateway with a 128K context window and up to 64,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 128K context

Openai/Gpt 3.5 Turbo

Openai/Gpt 3.5 Turbo is available via Vercel AI Gateway with a 16K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 16K context

Deepseek/Deepseek R1

Deepseek/Deepseek R1 is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

$0.55 / 1M in 128K context

Moonshotai/Kimi K2

Moonshotai/Kimi K2 is available via Vercel AI Gateway with a 131K context window and up to 16,384 output tokens. Pricing: $0.5500/1M input tokens, $2.20/1M output tokens.

$0.55 / 1M in 131K context

Meta/Llama 3 70b

Meta/Llama 3 70b is available via Vercel AI Gateway with a 8K context window and up to 8,192 output tokens. Pricing: $0.5900/1M input tokens, $0.7900/1M output tokens.

$0.59 / 1M in 8K context

Xai/Grok 3 Mini Fast

Xai/Grok 3 Mini Fast is available via Vercel AI Gateway with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $4.00/1M output tokens.

$0.60 / 1M in 131K context

Zai/Glm 4.5

Zai/Glm 4.5 is available via Vercel AI Gateway with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 131K context

Meta/Llama 3.1 70b

Meta/Llama 3.1 70b is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 128K context

Meta/Llama 3.2 90b

Meta/Llama 3.2 90b is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 128K context

Meta/Llama 3.3 70b

Meta/Llama 3.3 70b is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 128K context

Deepseek/Deepseek R1 Distill Llama 70b

Deepseek/Deepseek R1 Distill Llama 70b is available via Vercel AI Gateway with a 131K context window and up to 131,072 output tokens. Pricing: $0.7500/1M input tokens, $0.9900/1M output tokens.

$0.75 / 1M in 131K context

Mistral/Mistral Saba 24b

Mistral/Mistral Saba 24b is available via Vercel AI Gateway with a 33K context window and up to 32,768 output tokens. Pricing: $0.7900/1M input tokens, $0.7900/1M output tokens.

$0.79 / 1M in 33K context

Amazon/Nova Pro

Amazon/Nova Pro is available via Vercel AI Gateway with a 300K context window and up to 8,192 output tokens. Pricing: $0.8000/1M input tokens, $3.20/1M output tokens.

$0.80 / 1M in 300K context

Anthropic/Claude 3.5 Haiku

Anthropic/Claude 3.5 Haiku is available via Vercel AI Gateway with a 200K context window and up to 8,192 output tokens. Pricing: $0.8000/1M input tokens, $4.00/1M output tokens.

$0.80 / 1M in 200K context

Morph/Morph V3 Fast

Morph/Morph V3 Fast is available via Vercel AI Gateway with a 33K context window and up to 16,384 output tokens. Pricing: $0.8000/1M input tokens, $1.20/1M output tokens.

$0.80 / 1M in 33K context

Deepseek/Deepseek

Deepseek/Deepseek is available via Vercel AI Gateway with a 128K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 128K context

Morph/Morph V3 Large

Morph/Morph V3 Large is available via Vercel AI Gateway with a 33K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $1.90/1M output tokens.

$0.90 / 1M in 33K context

Anthropic/Claude Haiku 4.5

Anthropic/Claude Haiku 4.5 is available via Vercel AI Gateway with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Perplexity/Sonar

Perplexity/Sonar is available via Vercel AI Gateway with a 127K context window and up to 8,000 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 127K context

Perplexity/Sonar Reasoning

Perplexity/Sonar Reasoning is available via Vercel AI Gateway with a 127K context window and up to 8,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 127K context

Openai/O3 Mini

Openai/O3 Mini is available via Vercel AI Gateway with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

Openai/O4 Mini

Openai/O4 Mini is available via Vercel AI Gateway with a 200K context window and up to 100,000 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 200K context

Mistral/Mixtral 8x22b Instruct

Mistral/Mixtral 8x22b Instruct is available via Vercel AI Gateway with a 66K context window and up to 2,048 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.

$1.20 / 1M in 66K context

Openai/Gpt 3.5 Turbo Instruct

Openai/Gpt 3.5 Turbo Instruct is available via Vercel AI Gateway with a 8K context window and up to 4,096 output tokens. Pricing: $1.50/1M input tokens, $2.00/1M output tokens.

$1.50 / 1M in 8K context

Mistral/Magistral Medium

Mistral/Magistral Medium is available via Vercel AI Gateway with a 128K context window and up to 64,000 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 128K context

Mistral/Mistral Large

Mistral/Mistral Large is available via Vercel AI Gateway with a 32K context window and up to 4,000 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 32K context

Mistral/Pixtral Large

Mistral/Pixtral Large is available via Vercel AI Gateway with a 128K context window and up to 4,000 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Openai/Gpt 4.1

Openai/Gpt 4.1 is available via Vercel AI Gateway with a 1.0M context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 1.0M context

Openai/O3

Openai/O3 is available via Vercel AI Gateway with a 200K context window and up to 100,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 200K context

Perplexity/Sonar Reasoning Pro

Perplexity/Sonar Reasoning Pro is available via Vercel AI Gateway with a 127K context window and up to 8,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 127K context

Xai/Grok 2

Xai/Grok 2 is available via Vercel AI Gateway with a 131K context window and up to 4,000 output tokens. Pricing: $2.00/1M input tokens, $10.00/1M output tokens.

$2.00 / 1M in 131K context

Xai/Grok 2 Vision

Xai/Grok 2 Vision is available via Vercel AI Gateway with a 33K context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $10.00/1M output tokens.

$2.00 / 1M in 33K context

Cohere/Command A

Cohere/Command A is available via Vercel AI Gateway with a 256K context window and up to 8,000 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 256K context

Cohere/Command R Plus

Cohere/Command R Plus is available via Vercel AI Gateway with a 128K context window and up to 4,096 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Google/Gemini 2.5 Pro

Google/Gemini 2.5 Pro is available via Vercel AI Gateway with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 1.0M context

Openai/Gpt 4o

Openai/Gpt 4o is available via Vercel AI Gateway with a 128K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Anthropic/Claude 3.5 Sonnet

Anthropic/Claude 3.5 Sonnet is available via Vercel AI Gateway with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude 3.7 Sonnet

Anthropic/Claude 3.7 Sonnet is available via Vercel AI Gateway with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude 4 Sonnet

Anthropic/Claude 4 Sonnet is available via Vercel AI Gateway with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude 3 5 Sonnet

Anthropic/Claude 3 5 Sonnet is available via Vercel AI Gateway with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude 3 5 Sonnet

Anthropic/Claude 3 5 Sonnet is available via Vercel AI Gateway with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude 3 7 Sonnet

Anthropic/Claude 3 7 Sonnet is available via Vercel AI Gateway with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude Sonnet 4

Anthropic/Claude Sonnet 4 is available via Vercel AI Gateway with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude Sonnet 4.5

Anthropic/Claude Sonnet 4.5 is available via Vercel AI Gateway with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Perplexity/Sonar Pro

Perplexity/Sonar Pro is available via Vercel AI Gateway with a 200K context window and up to 8,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Vercel/V0 1.0 Md

Vercel/V0 1.0 Md is available via Vercel AI Gateway with a 128K context window and up to 32,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 128K context

Vercel/V0 1.5 Md

Vercel/V0 1.5 Md is available via Vercel AI Gateway with a 128K context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 128K context

Xai/Grok 3

Xai/Grok 3 is available via Vercel AI Gateway with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Xai/Grok 4

Xai/Grok 4 is available via Vercel AI Gateway with a 256K context window and up to 256,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 256K context

Anthropic/Claude Opus 4.5

Anthropic/Claude Opus 4.5 is available via Vercel AI Gateway with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Anthropic/Claude Opus 4.6

Anthropic/Claude Opus 4.6 is available via Vercel AI Gateway with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Xai/Grok 3 Fast

Xai/Grok 3 Fast is available via Vercel AI Gateway with a 131K context window and up to 131,072 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 131K context

Openai/Gpt 4 Turbo

Openai/Gpt 4 Turbo is available via Vercel AI Gateway with a 128K context window and up to 4,096 output tokens. Pricing: $10.00/1M input tokens, $30.00/1M output tokens.

$10.00 / 1M in 128K context

Anthropic/Claude 3 Opus

Anthropic/Claude 3 Opus is available via Vercel AI Gateway with a 200K context window and up to 4,096 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Anthropic/Claude 4 Opus

Anthropic/Claude 4 Opus is available via Vercel AI Gateway with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Anthropic/Claude Opus 4

Anthropic/Claude Opus 4 is available via Vercel AI Gateway with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Anthropic/Claude Opus 4.1

Anthropic/Claude Opus 4.1 is available via Vercel AI Gateway with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Openai/O1

Openai/O1 is available via Vercel AI Gateway with a 200K context window and up to 100,000 output tokens. Pricing: $15.00/1M input tokens, $60.00/1M output tokens.

$15.00 / 1M in 200K context

Novita AI Models

View provider details →

Paddlepaddle/Paddleocr Vl

Paddlepaddle/Paddleocr Vl is available via Novita AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.0200/1M input tokens, $0.0200/1M output tokens.

$0.020 / 1M in 16K context

Meta Llama/Llama 3.1 8b Instruct

Meta Llama/Llama 3.1 8b Instruct is available via Novita AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.0200/1M input tokens, $0.0500/1M output tokens.

$0.020 / 1M in 16K context

Deepseek/Deepseek Ocr

Deepseek/Deepseek Ocr is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0300/1M output tokens.

$0.030 / 1M in 8K context

Qwen/Qwen3 4b Fp8

Qwen/Qwen3 4b Fp8 is available via Novita AI with a 128K context window and up to 20,000 output tokens. Pricing: $0.0300/1M input tokens, $0.0300/1M output tokens.

$0.030 / 1M in 128K context

Meta Llama/Llama 3.2 3b Instruct

Meta Llama/Llama 3.2 3b Instruct is available via Novita AI with a 33K context window and up to 32,000 output tokens. Pricing: $0.0300/1M input tokens, $0.0500/1M output tokens.

$0.030 / 1M in 33K context

Zai Org/Autoglm Phone 9b Multilingual

Zai Org/Autoglm Phone 9b Multilingual is available via Novita AI with a 66K context window and up to 65,536 output tokens. Pricing: $0.0350/1M input tokens, $0.1380/1M output tokens.

$0.035 / 1M in 66K context

Qwen/Qwen3 8b Fp8

Qwen/Qwen3 8b Fp8 is available via Novita AI with a 128K context window and up to 20,000 output tokens. Pricing: $0.0350/1M input tokens, $0.1380/1M output tokens.

$0.035 / 1M in 128K context

Openai/Gpt Oss 20b

Openai/Gpt Oss 20b is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.0400/1M input tokens, $0.1500/1M output tokens.

$0.040 / 1M in 131K context

Mistralai/Mistral Nemo

Mistralai/Mistral Nemo is available via Novita AI with a 60K context window and up to 16,000 output tokens. Pricing: $0.0400/1M input tokens, $0.1700/1M output tokens.

$0.040 / 1M in 60K context

Meta Llama/Llama 3 8b Instruct

Meta Llama/Llama 3 8b Instruct is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.0400/1M input tokens, $0.0400/1M output tokens.

$0.040 / 1M in 8K context

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.0500/1M input tokens, $0.2500/1M output tokens.

$0.050 / 1M in 131K context

Google/Gemma 3 12b It

Google/Gemma 3 12b It is available via Novita AI with a 131K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.1000/1M output tokens.

$0.050 / 1M in 131K context

Sao10k/L3 8b Lunaris

Sao10k/L3 8b Lunaris is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.0500/1M output tokens.

$0.050 / 1M in 8K context

Sao10K/L3 8B Stheno V3.2

Sao10K/L3 8B Stheno V3.2 is available via Novita AI with a 8K context window and up to 32,000 output tokens. Pricing: $0.0500/1M input tokens, $0.0500/1M output tokens.

$0.050 / 1M in 8K context

Deepseek/Deepseek R1 0528 Qwen3 8b

Deepseek/Deepseek R1 0528 Qwen3 8b is available via Novita AI with a 128K context window and up to 32,000 output tokens. Pricing: $0.0600/1M input tokens, $0.0900/1M output tokens.

$0.060 / 1M in 128K context

Qwen/Qwen3 Coder 30b A3b Instruct

Qwen/Qwen3 Coder 30b A3b Instruct is available via Novita AI with a 160K context window and up to 32,768 output tokens. Pricing: $0.0700/1M input tokens, $0.2700/1M output tokens.

$0.070 / 1M in 160K context

Baidu/Ernie 4.5 21B A3b Thinking

Baidu/Ernie 4.5 21B A3b Thinking is available via Novita AI with a 131K context window and up to 65,536 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

$0.070 / 1M in 131K context

Baichuan/Baichuan M2 32b

Baichuan/Baichuan M2 32b is available via Novita AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.0700/1M input tokens, $0.0700/1M output tokens.

$0.070 / 1M in 131K context

Baidu/Ernie 4.5 21B A3b

Baidu/Ernie 4.5 21B A3b is available via Novita AI with a 120K context window and up to 8,000 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

$0.070 / 1M in 120K context

Qwen/Qwen2.5 7b Instruct

Qwen/Qwen2.5 7b Instruct is available via Novita AI with a 32K context window and up to 32,000 output tokens. Pricing: $0.0700/1M input tokens, $0.0700/1M output tokens.

$0.070 / 1M in 32K context

Qwen/Qwen3 Vl 8b Instruct

Qwen/Qwen3 Vl 8b Instruct is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.0800/1M input tokens, $0.5000/1M output tokens.

$0.080 / 1M in 131K context

Qwen/Qwen3 235b A22b Instruct 2507

Qwen/Qwen3 235b A22b Instruct 2507 is available via Novita AI with a 131K context window and up to 16,384 output tokens. Pricing: $0.0900/1M input tokens, $0.5800/1M output tokens.

$0.090 / 1M in 131K context

Qwen/Qwen3 30b A3b Fp8

Qwen/Qwen3 30b A3b Fp8 is available via Novita AI with a 41K context window and up to 20,000 output tokens. Pricing: $0.0900/1M input tokens, $0.4500/1M output tokens.

$0.090 / 1M in 41K context

Gryphe/Mythomax L2 13b

Gryphe/Mythomax L2 13b is available via Novita AI with a 4K context window and up to 3,200 output tokens. Pricing: $0.0900/1M input tokens, $0.0900/1M output tokens.

$0.090 / 1M in 4K context

Xiaomimimo/Mimo V2 Flash

Xiaomimimo/Mimo V2 Flash is available via Novita AI with a 262K context window and up to 32,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 262K context

Qwen/Qwen3 32b Fp8

Qwen/Qwen3 32b Fp8 is available via Novita AI with a 41K context window and up to 20,000 output tokens. Pricing: $0.1000/1M input tokens, $0.4500/1M output tokens.

$0.10 / 1M in 41K context

Google/Gemma 3 27b It

Google/Gemma 3 27b It is available via Novita AI with a 98K context window and up to 16,384 output tokens. Pricing: $0.1190/1M input tokens, $0.2000/1M output tokens.

$0.12 / 1M in 98K context

Zai Org/Glm 4.5 Air

Zai Org/Glm 4.5 Air is available via Novita AI with a 131K context window and up to 98,304 output tokens. Pricing: $0.1300/1M input tokens, $0.8500/1M output tokens.

$0.13 / 1M in 131K context

Meta Llama/Llama 3.3 70b Instruct

Meta Llama/Llama 3.3 70b Instruct is available via Novita AI with a 131K context window and up to 120,000 output tokens. Pricing: $0.1350/1M input tokens, $0.4000/1M output tokens.

$0.14 / 1M in 131K context

Nousresearch/Hermes 2 Pro Llama 3 8b

Nousresearch/Hermes 2 Pro Llama 3 8b is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.1400/1M input tokens, $0.1400/1M output tokens.

$0.14 / 1M in 8K context

Baidu/Ernie 4.5 Vl 28b A3b

Baidu/Ernie 4.5 Vl 28b A3b is available via Novita AI with a 30K context window and up to 8,000 output tokens. Pricing: $0.1400/1M input tokens, $0.5600/1M output tokens.

$0.14 / 1M in 30K context

Qwen/Qwen3 Next 80b A3b Instruct

Qwen/Qwen3 Next 80b A3b Instruct is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $1.50/1M output tokens.

$0.15 / 1M in 131K context

Qwen/Qwen3 Next 80b A3b Thinking

Qwen/Qwen3 Next 80b A3b Thinking is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $1.50/1M output tokens.

$0.15 / 1M in 131K context

Deepseek/Deepseek R1 Distill Qwen 14b

Deepseek/Deepseek R1 Distill Qwen 14b is available via Novita AI with a 33K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 33K context

Meta Llama/Llama 4 Scout 17b 16e Instruct

Meta Llama/Llama 4 Scout 17b 16e Instruct is available via Novita AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1800/1M input tokens, $0.5900/1M output tokens.

$0.18 / 1M in 131K context

Skywork/R1v4 Lite

Skywork/R1v4 Lite is available via Novita AI with a 262K context window and up to 65,536 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 262K context

Qwen/Qwen3 235b A22b Fp8

Qwen/Qwen3 235b A22b Fp8 is available via Novita AI with a 41K context window and up to 20,000 output tokens. Pricing: $0.2000/1M input tokens, $0.8000/1M output tokens.

$0.20 / 1M in 41K context

Qwen/Qwen3 Vl 30b A3b Instruct

Qwen/Qwen3 Vl 30b A3b Instruct is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.7000/1M output tokens.

$0.20 / 1M in 131K context

Qwen/Qwen3 Vl 30b A3b Thinking

Qwen/Qwen3 Vl 30b A3b Thinking is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $1.00/1M output tokens.

$0.20 / 1M in 131K context

Qwen/Qwen3 Omni 30b A3b Thinking

Qwen/Qwen3 Omni 30b A3b Thinking is available via Novita AI with a 66K context window and up to 16,384 output tokens. Pricing: $0.2500/1M input tokens, $0.9700/1M output tokens.

$0.25 / 1M in 66K context

Qwen/Qwen3 Omni 30b A3b Instruct

Qwen/Qwen3 Omni 30b A3b Instruct is available via Novita AI with a 66K context window and up to 16,384 output tokens. Pricing: $0.2500/1M input tokens, $0.9700/1M output tokens.

$0.25 / 1M in 66K context

Qwen/Qwen Mt Plus

Qwen/Qwen Mt Plus is available via Novita AI with a 16K context window and up to 8,192 output tokens. Pricing: $0.2500/1M input tokens, $0.7500/1M output tokens.

$0.25 / 1M in 16K context

Deepseek/Deepseek V3.2

Deepseek/Deepseek V3.2 is available via Novita AI with a 164K context window and up to 65,536 output tokens. Pricing: $0.2690/1M input tokens, $0.4000/1M output tokens.

$0.27 / 1M in 164K context

Deepseek/Deepseek V3.2 Exp

Deepseek/Deepseek V3.2 Exp is available via Novita AI with a 164K context window and up to 65,536 output tokens. Pricing: $0.2700/1M input tokens, $0.4100/1M output tokens.

$0.27 / 1M in 164K context

Deepseek/Deepseek V3.1 Terminus

Deepseek/Deepseek V3.1 Terminus is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.2700/1M input tokens, $1.00/1M output tokens.

$0.27 / 1M in 131K context

Deepseek/Deepseek V3.1

Deepseek/Deepseek V3.1 is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.2700/1M input tokens, $1.00/1M output tokens.

$0.27 / 1M in 131K context

Deepseek/Deepseek V3 0324

Deepseek/Deepseek V3 0324 is available via Novita AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.2700/1M input tokens, $1.12/1M output tokens.

$0.27 / 1M in 164K context

Meta Llama/Llama 4 Maverick 17b 128e Instruct Fp8

Meta Llama/Llama 4 Maverick 17b 128e Instruct Fp8 is available via Novita AI with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.2700/1M input tokens, $0.8500/1M output tokens.

$0.27 / 1M in 1.0M context

Baidu/Ernie 4.5 300b A47b Paddle

Baidu/Ernie 4.5 300b A47b Paddle is available via Novita AI with a 123K context window and up to 12,000 output tokens. Pricing: $0.2800/1M input tokens, $1.10/1M output tokens.

$0.28 / 1M in 123K context

Minimax/Minimax M2.1

Minimax/Minimax M2.1 is available via Novita AI with a 205K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 205K context

Minimax/Minimax M2

Minimax/Minimax M2 is available via Novita AI with a 205K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 205K context

Zai Org/Glm 4.6v

Zai Org/Glm 4.6v is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.3000/1M input tokens, $0.9000/1M output tokens.

$0.30 / 1M in 131K context

Kwaipilot/Kat Coder Pro

Kwaipilot/Kat Coder Pro is available via Novita AI with a 256K context window and up to 128,000 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 256K context

Qwen/Qwen3 Vl 235b A22b Instruct

Qwen/Qwen3 Vl 235b A22b Instruct is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.3000/1M input tokens, $1.50/1M output tokens.

$0.30 / 1M in 131K context

Qwen/Qwen3 Coder 480b A35b Instruct

Qwen/Qwen3 Coder 480b A35b Instruct is available via Novita AI with a 262K context window and up to 65,536 output tokens. Pricing: $0.3000/1M input tokens, $1.30/1M output tokens.

$0.30 / 1M in 262K context

Qwen/Qwen3 235b A22b Thinking 2507

Qwen/Qwen3 235b A22b Thinking 2507 is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.3000/1M input tokens, $3.00/1M output tokens.

$0.30 / 1M in 131K context

Deepseek/Deepseek R1 Distill Qwen 32b

Deepseek/Deepseek R1 Distill Qwen 32b is available via Novita AI with a 64K context window and up to 32,000 output tokens. Pricing: $0.3000/1M input tokens, $0.3000/1M output tokens.

$0.30 / 1M in 64K context

Qwen/Qwen 2.5 72b Instruct

Qwen/Qwen 2.5 72b Instruct is available via Novita AI with a 32K context window and up to 8,192 output tokens. Pricing: $0.3800/1M input tokens, $0.4000/1M output tokens.

$0.38 / 1M in 32K context

Baidu/Ernie 4.5 Vl 28b A3b Thinking

Baidu/Ernie 4.5 Vl 28b A3b Thinking is available via Novita AI with a 131K context window and up to 65,536 output tokens. Pricing: $0.3900/1M input tokens, $0.3900/1M output tokens.

$0.39 / 1M in 131K context

Deepseek/Deepseek V3 Turbo

Deepseek/Deepseek V3 Turbo is available via Novita AI with a 64K context window and up to 16,000 output tokens. Pricing: $0.4000/1M input tokens, $1.30/1M output tokens.

$0.40 / 1M in 64K context

Baidu/Ernie 4.5 Vl 424b A47b

Baidu/Ernie 4.5 Vl 424b A47b is available via Novita AI with a 123K context window and up to 16,000 output tokens. Pricing: $0.4200/1M input tokens, $1.25/1M output tokens.

$0.42 / 1M in 123K context

Meta Llama/Llama 3 70b Instruct

Meta Llama/Llama 3 70b Instruct is available via Novita AI with a 8K context window and up to 8,000 output tokens. Pricing: $0.5100/1M input tokens, $0.7400/1M output tokens.

$0.51 / 1M in 8K context

Zai Org/Glm 4.6

Zai Org/Glm 4.6 is available via Novita AI with a 205K context window and up to 131,072 output tokens. Pricing: $0.5500/1M input tokens, $2.20/1M output tokens.

$0.55 / 1M in 205K context

Minimaxai/Minimax M1 80k

Minimaxai/Minimax M1 80k is available via Novita AI with a 1M context window and up to 40,000 output tokens. Pricing: $0.5500/1M input tokens, $2.20/1M output tokens.

$0.55 / 1M in 1M context

Moonshotai/Kimi K2 Instruct

Moonshotai/Kimi K2 Instruct is available via Novita AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.5700/1M input tokens, $2.30/1M output tokens.

$0.57 / 1M in 131K context

Zai Org/Glm 4.7

Zai Org/Glm 4.7 is available via Novita AI with a 205K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 205K context

Moonshotai/Kimi K2 Thinking

Moonshotai/Kimi K2 Thinking is available via Novita AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Moonshotai/Kimi K2 0905

Moonshotai/Kimi K2 0905 is available via Novita AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Zai Org/Glm 4.5

Zai Org/Glm 4.5 is available via Novita AI with a 131K context window and up to 98,304 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 131K context

Zai Org/Glm 4.5v

Zai Org/Glm 4.5v is available via Novita AI with a 66K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $1.80/1M output tokens.

$0.60 / 1M in 66K context

Microsoft/Wizardlm 2 8x22b

Microsoft/Wizardlm 2 8x22b is available via Novita AI with a 66K context window and up to 8,000 output tokens. Pricing: $0.6200/1M input tokens, $0.6200/1M output tokens.

$0.62 / 1M in 66K context

Deepseek/Deepseek R1 0528

Deepseek/Deepseek R1 0528 is available via Novita AI with a 164K context window and up to 32,768 output tokens. Pricing: $0.7000/1M input tokens, $2.50/1M output tokens.

$0.70 / 1M in 164K context

Deepseek/Deepseek Prover V2 671b

Deepseek/Deepseek Prover V2 671b is available via Novita AI with a 160K context window and up to 160,000 output tokens. Pricing: $0.7000/1M input tokens, $2.50/1M output tokens.

$0.70 / 1M in 160K context

Deepseek/Deepseek R1 Turbo

Deepseek/Deepseek R1 Turbo is available via Novita AI with a 64K context window and up to 16,000 output tokens. Pricing: $0.7000/1M input tokens, $2.50/1M output tokens.

$0.70 / 1M in 64K context

Deepseek/Deepseek R1 Distill Llama 70b

Deepseek/Deepseek R1 Distill Llama 70b is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.8000/1M input tokens, $0.8000/1M output tokens.

$0.80 / 1M in 8K context

Qwen/Qwen2.5 Vl 72b Instruct

Qwen/Qwen2.5 Vl 72b Instruct is available via Novita AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.8000/1M input tokens, $0.8000/1M output tokens.

$0.80 / 1M in 33K context

Qwen/Qwen3 Vl 235b A22b Thinking

Qwen/Qwen3 Vl 235b A22b Thinking is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.9800/1M input tokens, $3.95/1M output tokens.

$0.98 / 1M in 131K context

Sao10k/L3 70b Euryale V2.1

Sao10k/L3 70b Euryale V2.1 is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $1.48/1M input tokens, $1.48/1M output tokens.

$1.48 / 1M in 8K context

Sao10k/L31 70b Euryale V2.2

Sao10k/L31 70b Euryale V2.2 is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $1.48/1M input tokens, $1.48/1M output tokens.

$1.48 / 1M in 8K context

Qwen/Qwen3 Max

Qwen/Qwen3 Max is available via Novita AI with a 262K context window and up to 65,536 output tokens. Pricing: $2.11/1M input tokens, $8.45/1M output tokens.

$2.11 / 1M in 262K context

OpenRouter Models

View provider details →

Openrouter/Auto

Openrouter/Auto is available via OpenRouter with a 2M context window and up to 2,000,000 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 2M context

Openrouter/Free

Openrouter/Free is available via OpenRouter with a 200K context window and up to 200,000 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 200K context

Openrouter/Bodybuilder

Openrouter/Bodybuilder is available via OpenRouter with a 128K context window and up to 128,000 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

Openai/Gpt Oss 20b

Openai/Gpt Oss 20b is available via OpenRouter with a 131K context window and up to 32,768 output tokens. Pricing: $0.0200/1M input tokens, $0.1000/1M output tokens.

$0.020 / 1M in 131K context

Openai/Gpt 5 Nano

Openai/Gpt 5 Nano is available via OpenRouter with a 272K context window and up to 128,000 output tokens. Pricing: $0.0500/1M input tokens, $0.4000/1M output tokens.

$0.050 / 1M in 272K context

Z Ai/Glm 4.7 Flash

Z Ai/Glm 4.7 Flash is available via OpenRouter with a 200K context window and up to 32,000 output tokens. Pricing: $0.0700/1M input tokens, $0.4000/1M output tokens.

$0.070 / 1M in 200K context

Qwen/Qwen3 235b A22b 2507

Qwen/Qwen3 235b A22b 2507 is available via OpenRouter with a 262K context window and up to 262,144 output tokens. Pricing: $0.0710/1M input tokens, $0.1000/1M output tokens.

$0.071 / 1M in 262K context

Xiaomi/Mimo V2 Flash

Xiaomi/Mimo V2 Flash is available via OpenRouter with a 262K context window and up to 16,384 output tokens. Pricing: $0.0900/1M input tokens, $0.2900/1M output tokens.

$0.090 / 1M in 262K context

Bytedance/Ui Tars 1.5 7b

Bytedance/Ui Tars 1.5 7b is available via OpenRouter with a 131K context window and up to 2,048 output tokens. Pricing: $0.1000/1M input tokens, $0.2000/1M output tokens.

$0.10 / 1M in 131K context

Google/Gemini 2.0 Flash 001

Google/Gemini 2.0 Flash 001 is available via OpenRouter with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Mistralai/Ministral 3b 2512

Mistralai/Ministral 3b 2512 is available via OpenRouter with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Openai/Gpt 4.1 Nano

Openai/Gpt 4.1 Nano is available via OpenRouter with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Qwen/Qwen3.5 Flash 02 23

Qwen/Qwen3.5 Flash 02 23 is available via OpenRouter with a 1M context window and up to 65,536 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1M context

Qwen/Qwen3 235b A22b Thinking 2507

Qwen/Qwen3 235b A22b Thinking 2507 is available via OpenRouter with a 262K context window and up to 262,144 output tokens. Pricing: $0.1100/1M input tokens, $0.6000/1M output tokens.

$0.11 / 1M in 262K context

Deepseek/Deepseek Chat

Deepseek/Deepseek Chat is available via OpenRouter with a 66K context window and up to 8,192 output tokens. Pricing: $0.1400/1M input tokens, $0.2800/1M output tokens.

$0.14 / 1M in 66K context

Deepseek/Deepseek Chat V3 0324

Deepseek/Deepseek Chat V3 0324 is available via OpenRouter with a 66K context window and up to 8,192 output tokens. Pricing: $0.1400/1M input tokens, $0.2800/1M output tokens.

$0.14 / 1M in 66K context

Mistralai/Devstral 2512

Mistralai/Devstral 2512 is available via OpenRouter with a 262K context window and up to 65,536 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 262K context

Mistralai/Ministral 8b 2512

Mistralai/Ministral 8b 2512 is available via OpenRouter with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 262K context

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via OpenRouter with a 131K context window and up to 32,768 output tokens. Pricing: $0.1800/1M input tokens, $0.8000/1M output tokens.

$0.18 / 1M in 131K context

Qwen/Qwen 2.5 Coder 32b Instruct

Qwen/Qwen 2.5 Coder 32b Instruct is available via OpenRouter with a 34K context window and up to 33,792 output tokens. Pricing: $0.1800/1M input tokens, $0.1800/1M output tokens.

$0.18 / 1M in 34K context

Deepseek/Deepseek Chat V3.1

Deepseek/Deepseek Chat V3.1 is available via OpenRouter with a 164K context window and up to 163,840 output tokens. Pricing: $0.2000/1M input tokens, $0.8000/1M output tokens.

$0.20 / 1M in 164K context

Deepseek/Deepseek V3.2 Exp

Deepseek/Deepseek V3.2 Exp is available via OpenRouter with a 164K context window and up to 163,840 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 164K context

Mistralai/Ministral 14b 2512

Mistralai/Ministral 14b 2512 is available via OpenRouter with a 262K context window and up to 262,144 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 262K context

Qwen/Qwen Vl Plus

Qwen/Qwen Vl Plus is available via OpenRouter with a 8K context window and up to 2,048 output tokens. Pricing: $0.2100/1M input tokens, $0.6300/1M output tokens.

$0.21 / 1M in 8K context

Qwen/Qwen3 Coder

Qwen/Qwen3 Coder is available via OpenRouter with a 262K context window and up to 262,100 output tokens. Pricing: $0.2200/1M input tokens, $0.9500/1M output tokens.

$0.22 / 1M in 262K context

Openai/Gpt 5 Mini

Openai/Gpt 5 Mini is available via OpenRouter with a 272K context window and up to 128,000 output tokens. Pricing: $0.2500/1M input tokens, $2.00/1M output tokens.

$0.25 / 1M in 272K context

Qwen/Qwen3.5 35b A3b

Qwen/Qwen3.5 35b A3b is available via OpenRouter with a 262K context window and up to 65,536 output tokens. Pricing: $0.2500/1M input tokens, $2.00/1M output tokens.

$0.25 / 1M in 262K context

Minimax/Minimax M2

Minimax/Minimax M2 is available via OpenRouter with a 205K context window and up to 204,800 output tokens. Pricing: $0.2550/1M input tokens, $1.02/1M output tokens.

$0.26 / 1M in 205K context

Minimax/Minimax M2.1

Minimax/Minimax M2.1 is available via OpenRouter with a 204K context window and up to 64,000 output tokens. Pricing: $0.2700/1M input tokens, $1.20/1M output tokens.

$0.27 / 1M in 204K context

Deepseek/Deepseek V3.2

Deepseek/Deepseek V3.2 is available via OpenRouter with a 164K context window and up to 163,840 output tokens. Pricing: $0.2800/1M input tokens, $0.4000/1M output tokens.

$0.28 / 1M in 164K context

Google/Gemini 2.5 Flash

Google/Gemini 2.5 Flash is available via OpenRouter with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Qwen/Qwen3.5 27b

Qwen/Qwen3.5 27b is available via OpenRouter with a 262K context window and up to 65,536 output tokens. Pricing: $0.3000/1M input tokens, $2.40/1M output tokens.

$0.30 / 1M in 262K context

Minimax/Minimax M2.5

Minimax/Minimax M2.5 is available via OpenRouter with a 197K context window and up to 65,536 output tokens. Pricing: $0.3000/1M input tokens, $1.10/1M output tokens.

$0.30 / 1M in 197K context

Openai/Gpt 4.1 Mini

Openai/Gpt 4.1 Mini is available via OpenRouter with a 1.0M context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 1.0M context

Qwen/Qwen3.5 122b A10b

Qwen/Qwen3.5 122b A10b is available via OpenRouter with a 262K context window and up to 65,536 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 262K context

Qwen/Qwen3.5 Plus 02 15

Qwen/Qwen3.5 Plus 02 15 is available via OpenRouter with a 1M context window and up to 65,536 output tokens. Pricing: $0.4000/1M input tokens, $2.40/1M output tokens.

$0.40 / 1M in 1M context

Z Ai/Glm 4.6

Z Ai/Glm 4.6 is available via OpenRouter with a 203K context window and up to 131,000 output tokens. Pricing: $0.4000/1M input tokens, $1.75/1M output tokens.

$0.40 / 1M in 203K context

Z Ai/Glm 4.7

Z Ai/Glm 4.7 is available via OpenRouter with a 203K context window and up to 64,000 output tokens. Pricing: $0.4000/1M input tokens, $1.50/1M output tokens.

$0.40 / 1M in 203K context

Z Ai/Glm 4.6:Exacto

Z Ai/Glm 4.6:Exacto is available via OpenRouter with a 203K context window and up to 131,000 output tokens. Pricing: $0.4500/1M input tokens, $1.90/1M output tokens.

$0.45 / 1M in 203K context

Deepseek/Deepseek R1 0528

Deepseek/Deepseek R1 0528 is available via OpenRouter with a 65K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $2.15/1M output tokens.

$0.50 / 1M in 65K context

Google/Gemini 3 Flash Preview

Google/Gemini 3 Flash Preview is available via OpenRouter with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.5000/1M input tokens, $3.00/1M output tokens.

$0.50 / 1M in 1.0M context

Mistralai/Mistral Large 2512

Mistralai/Mistral Large 2512 is available via OpenRouter with a 262K context window and up to 262,144 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 262K context

Deepseek/Deepseek R1

Deepseek/Deepseek R1 is available via OpenRouter with a 65K context window and up to 8,192 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

$0.55 / 1M in 65K context

Moonshotai/Kimi K2.5

Moonshotai/Kimi K2.5 is available via OpenRouter with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Qwen/Qwen3.5 397b A17b

Qwen/Qwen3.5 397b A17b is available via OpenRouter with a 262K context window and up to 65,536 output tokens. Pricing: $0.6000/1M input tokens, $3.60/1M output tokens.

$0.60 / 1M in 262K context

Z Ai/Glm 5

Z Ai/Glm 5 is available via OpenRouter with a 203K context window and up to 128,000 output tokens. Pricing: $0.8000/1M input tokens, $2.56/1M output tokens.

$0.80 / 1M in 203K context

Switchpoint/Router

Switchpoint/Router is available via OpenRouter with a 131K context window and up to 131,072 output tokens. Pricing: $0.8500/1M input tokens, $3.40/1M output tokens.

$0.85 / 1M in 131K context

Anthropic/Claude Haiku 4.5

Anthropic/Claude Haiku 4.5 is available via OpenRouter with a 200K context window and up to 200,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Qwen/Qwen3 Coder Plus

Qwen/Qwen3 Coder Plus is available via OpenRouter with a 998K context window and up to 65,536 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 998K context

Openai/O3 Mini

Openai/O3 Mini is available via OpenRouter with a 128K context window and up to 65,536 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 128K context

Openai/O3 Mini High

Openai/O3 Mini High is available via OpenRouter with a 128K context window and up to 65,536 output tokens. Pricing: $1.10/1M input tokens, $4.40/1M output tokens.

$1.10 / 1M in 128K context

Google/Gemini 2.5 Pro

Google/Gemini 2.5 Pro is available via OpenRouter with a 1.0M context window and up to 8,192 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Openai/Gpt 5 Chat

Openai/Gpt 5 Chat is available via OpenRouter with a 128K context window and up to 16,384 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Openai/Gpt 5 Codex

Openai/Gpt 5 Codex is available via OpenRouter with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Openai/Gpt 5

Openai/Gpt 5 is available via OpenRouter with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Openai/Gpt 5.1 Codex Max

Openai/Gpt 5.1 Codex Max is available via OpenRouter with a 400K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 400K context

Openai/Gpt 5.2 Codex

Openai/Gpt 5.2 Codex is available via OpenRouter with a 272K context window and up to 128,000 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 272K context

Openai/Gpt 5.2

Openai/Gpt 5.2 is available via OpenRouter with a 272K context window and up to 128,000 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 272K context

Openai/Gpt 5.2 Chat

Openai/Gpt 5.2 Chat is available via OpenRouter with a 128K context window and up to 16,384 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 128K context

Google/Gemini 3 Pro Preview

Google/Gemini 3 Pro Preview is available via OpenRouter with a 1.0M context window and up to 65,535 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Google/Gemini 3.1 Pro Preview

Google/Gemini 3.1 Pro Preview is available via OpenRouter with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Openai/Gpt 4.1

Openai/Gpt 4.1 is available via OpenRouter with a 1.0M context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 1.0M context

Openai/Gpt 4o

Openai/Gpt 4o is available via OpenRouter with a 128K context window and up to 4,096 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Anthropic/Claude 3.5 Sonnet

Anthropic/Claude 3.5 Sonnet is available via OpenRouter with a 200K context window and up to 8,192 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude 3.7 Sonnet

Anthropic/Claude 3.7 Sonnet is available via OpenRouter with a 200K context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic/Claude Sonnet 4

Anthropic/Claude Sonnet 4 is available via OpenRouter with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Anthropic/Claude Sonnet 4.6

Anthropic/Claude Sonnet 4.6 is available via OpenRouter with a 1M context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Anthropic/Claude Sonnet 4.5

Anthropic/Claude Sonnet 4.5 is available via OpenRouter with a 1M context window and up to 1,000,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

X Ai/Grok 4

X Ai/Grok 4 is available via OpenRouter with a 256K context window and up to 256,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 256K context

Anthropic/Claude Opus 4.5

Anthropic/Claude Opus 4.5 is available via OpenRouter with a 200K context window and up to 32,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Anthropic/Claude Opus 4.6

Anthropic/Claude Opus 4.6 is available via OpenRouter with a 1M context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 1M context

Openai/Gpt 4o 2024 05 13

Openai/Gpt 4o 2024 05 13 is available via OpenRouter with a 128K context window and up to 4,096 output tokens. Pricing: $5.00/1M input tokens, $15.00/1M output tokens.

$5.00 / 1M in 128K context

Anthropic/Claude Opus 4

Anthropic/Claude Opus 4 is available via OpenRouter with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Anthropic/Claude Opus 4.1

Anthropic/Claude Opus 4.1 is available via OpenRouter with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Openai/O1

Openai/O1 is available via OpenRouter with a 200K context window and up to 100,000 output tokens. Pricing: $15.00/1M input tokens, $60.00/1M output tokens.

$15.00 / 1M in 200K context

Openai/Gpt 5.2 Pro

Openai/Gpt 5.2 Pro is available via OpenRouter with a 272K context window and up to 128,000 output tokens. Pricing: $21.00/1M input tokens, $168.00/1M output tokens.

$21.00 / 1M in 272K context

DeepInfra Models

View provider details →

Meta Llama/Llama 3.2 3B Instruct

Meta Llama/Llama 3.2 3B Instruct is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0200/1M input tokens, $0.0200/1M output tokens.

$0.020 / 1M in 131K context

Meta Llama/Meta Llama 3.1 8B Instruct Turbo

Meta Llama/Meta Llama 3.1 8B Instruct Turbo is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0200/1M input tokens, $0.0300/1M output tokens.

$0.020 / 1M in 131K context

Mistralai/Mistral Nemo Instruct 2407

Mistralai/Mistral Nemo Instruct 2407 is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0200/1M input tokens, $0.0400/1M output tokens.

$0.020 / 1M in 131K context

Meta Llama/Meta Llama 3 8B Instruct

Meta Llama/Meta Llama 3 8B Instruct is available via DeepInfra with a 8K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0600/1M output tokens.

$0.030 / 1M in 8K context

Meta Llama/Meta Llama 3.1 8B Instruct

Meta Llama/Meta Llama 3.1 8B Instruct is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0300/1M input tokens, $0.0500/1M output tokens.

$0.030 / 1M in 131K context

Qwen/Qwen2.5 7B Instruct

Qwen/Qwen2.5 7B Instruct is available via DeepInfra with a 33K context window and up to 32,768 output tokens. Pricing: $0.0400/1M input tokens, $0.1000/1M output tokens.

$0.040 / 1M in 33K context

Sao10K/L3 8B Lunaris V1 Turbo

Sao10K/L3 8B Lunaris V1 Turbo is available via DeepInfra with a 8K context window and up to 8,192 output tokens. Pricing: $0.0400/1M input tokens, $0.0500/1M output tokens.

$0.040 / 1M in 8K context

Google/Gemma 3 4b It

Google/Gemma 3 4b It is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0400/1M input tokens, $0.0800/1M output tokens.

$0.040 / 1M in 131K context

Nvidia/NVIDIA Nemotron Nano 9B

Nvidia/NVIDIA Nemotron Nano 9B is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0400/1M input tokens, $0.1600/1M output tokens.

$0.040 / 1M in 131K context

Openai/Gpt Oss 20b

Openai/Gpt Oss 20b is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0400/1M input tokens, $0.1500/1M output tokens.

$0.040 / 1M in 131K context

Meta Llama/Llama 3.2 11B Vision Instruct

Meta Llama/Llama 3.2 11B Vision Instruct is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0490/1M input tokens, $0.0490/1M output tokens.

$0.049 / 1M in 131K context

Google/Gemma 3 12b It

Google/Gemma 3 12b It is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0500/1M input tokens, $0.1000/1M output tokens.

$0.050 / 1M in 131K context

Mistralai/Mistral Small 24B Instruct 2501

Mistralai/Mistral Small 24B Instruct 2501 is available via DeepInfra with a 33K context window and up to 32,768 output tokens. Pricing: $0.0500/1M input tokens, $0.0800/1M output tokens.

$0.050 / 1M in 33K context

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0500/1M input tokens, $0.4500/1M output tokens.

$0.050 / 1M in 131K context

Meta Llama/Llama Guard 3 8B

Meta Llama/Llama Guard 3 8B is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0550/1M input tokens, $0.0550/1M output tokens.

$0.055 / 1M in 131K context

Qwen/Qwen3 14B

Qwen/Qwen3 14B is available via DeepInfra with a 41K context window and up to 40,960 output tokens. Pricing: $0.0600/1M input tokens, $0.2400/1M output tokens.

$0.060 / 1M in 41K context

Microsoft/Phi 4

Microsoft/Phi 4 is available via DeepInfra with a 16K context window and up to 16,384 output tokens. Pricing: $0.0700/1M input tokens, $0.1400/1M output tokens.

$0.070 / 1M in 16K context

Mistralai/Mistral Small 3.2 24B Instruct 2506

Mistralai/Mistral Small 3.2 24B Instruct 2506 is available via DeepInfra with a 128K context window and up to 128,000 output tokens. Pricing: $0.0750/1M input tokens, $0.2000/1M output tokens.

$0.075 / 1M in 128K context

Gryphe/MythoMax L2 13b

Gryphe/MythoMax L2 13b is available via DeepInfra with a 4K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.0900/1M output tokens.

$0.080 / 1M in 4K context

Qwen/Qwen3 30B A3B

Qwen/Qwen3 30B A3B is available via DeepInfra with a 41K context window and up to 40,960 output tokens. Pricing: $0.0800/1M input tokens, $0.2900/1M output tokens.

$0.080 / 1M in 41K context

Meta Llama/Llama 4 Scout 17B 16E Instruct

Meta Llama/Llama 4 Scout 17B 16E Instruct is available via DeepInfra with a 328K context window and up to 327,680 output tokens. Pricing: $0.0800/1M input tokens, $0.3000/1M output tokens.

$0.080 / 1M in 328K context

Qwen/Qwen3 235B A22B Instruct 2507

Qwen/Qwen3 235B A22B Instruct 2507 is available via DeepInfra with a 262K context window and up to 262,144 output tokens. Pricing: $0.0900/1M input tokens, $0.6000/1M output tokens.

$0.090 / 1M in 262K context

Google/Gemma 3 27b It

Google/Gemma 3 27b It is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.0900/1M input tokens, $0.1600/1M output tokens.

$0.090 / 1M in 131K context

Qwen/Qwen3 32B

Qwen/Qwen3 32B is available via DeepInfra with a 41K context window and up to 40,960 output tokens. Pricing: $0.1000/1M input tokens, $0.2800/1M output tokens.

$0.10 / 1M in 41K context

Google/Gemini 2.0 Flash 001

Google/Gemini 2.0 Flash 001 is available via DeepInfra with a 1M context window and up to 1,000,000 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1M context

Meta Llama/Meta Llama 3.1 70B Instruct Turbo

Meta Llama/Meta Llama 3.1 70B Instruct Turbo is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.2800/1M output tokens.

$0.10 / 1M in 131K context

Nvidia/Llama 3.3 Nemotron Super 49B V1.5

Nvidia/Llama 3.3 Nemotron Super 49B V1.5 is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 131K context

Qwen/Qwen2.5 72B Instruct

Qwen/Qwen2.5 72B Instruct is available via DeepInfra with a 33K context window and up to 32,768 output tokens. Pricing: $0.1200/1M input tokens, $0.3900/1M output tokens.

$0.12 / 1M in 33K context

Meta Llama/Llama 3.3 70B Instruct Turbo

Meta Llama/Llama 3.3 70B Instruct Turbo is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.1300/1M input tokens, $0.3900/1M output tokens.

$0.13 / 1M in 131K context

Qwen/Qwen3 Next 80B A3B Instruct

Qwen/Qwen3 Next 80B A3B Instruct is available via DeepInfra with a 262K context window and up to 262,144 output tokens. Pricing: $0.1400/1M input tokens, $1.40/1M output tokens.

$0.14 / 1M in 262K context

Qwen/Qwen3 Next 80B A3B Thinking

Qwen/Qwen3 Next 80B A3B Thinking is available via DeepInfra with a 262K context window and up to 262,144 output tokens. Pricing: $0.1400/1M input tokens, $1.40/1M output tokens.

$0.14 / 1M in 262K context

Qwen/QwQ 32B

Qwen/QwQ 32B is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.4000/1M output tokens.

$0.15 / 1M in 131K context

Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8

Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 is available via DeepInfra with a 1.0M context window and up to 1,048,576 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 1.0M context

Qwen/Qwen3 235B A22B

Qwen/Qwen3 235B A22B is available via DeepInfra with a 41K context window and up to 40,960 output tokens. Pricing: $0.1800/1M input tokens, $0.5400/1M output tokens.

$0.18 / 1M in 41K context

Meta Llama/Llama Guard 4 12B

Meta Llama/Llama Guard 4 12B is available via DeepInfra with a 164K context window and up to 163,840 output tokens. Pricing: $0.1800/1M input tokens, $0.1800/1M output tokens.

$0.18 / 1M in 164K context

Qwen/Qwen2.5 VL 32B Instruct

Qwen/Qwen2.5 VL 32B Instruct is available via DeepInfra with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 128K context

Deepseek Ai/DeepSeek R1 Distill Llama 70B

Deepseek Ai/DeepSeek R1 Distill Llama 70B is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 131K context

Meta Llama/Llama 3.3 70B Instruct

Meta Llama/Llama 3.3 70B Instruct is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.2300/1M input tokens, $0.4000/1M output tokens.

$0.23 / 1M in 131K context

Deepseek Ai/DeepSeek V3 0324

Deepseek Ai/DeepSeek V3 0324 is available via DeepInfra with a 164K context window and up to 163,840 output tokens. Pricing: $0.2500/1M input tokens, $0.8800/1M output tokens.

$0.25 / 1M in 164K context

Allenai/OlmOCR 7B 0725 FP8

Allenai/OlmOCR 7B 0725 FP8 is available via DeepInfra with a 16K context window and up to 16,384 output tokens. Pricing: $0.2700/1M input tokens, $1.50/1M output tokens.

$0.27 / 1M in 16K context

Deepseek Ai/DeepSeek R1 Distill Qwen 32B

Deepseek Ai/DeepSeek R1 Distill Qwen 32B is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.2700/1M input tokens, $0.2700/1M output tokens.

$0.27 / 1M in 131K context

Deepseek Ai/DeepSeek V3.1

Deepseek Ai/DeepSeek V3.1 is available via DeepInfra with a 164K context window and up to 163,840 output tokens. Pricing: $0.2700/1M input tokens, $1.00/1M output tokens.

$0.27 / 1M in 164K context

Deepseek Ai/DeepSeek V3.1 Terminus

Deepseek Ai/DeepSeek V3.1 Terminus is available via DeepInfra with a 164K context window and up to 163,840 output tokens. Pricing: $0.2700/1M input tokens, $1.00/1M output tokens.

$0.27 / 1M in 164K context

Qwen/Qwen3 Coder 480B A35B Instruct Turbo

Qwen/Qwen3 Coder 480B A35B Instruct Turbo is available via DeepInfra with a 262K context window and up to 262,144 output tokens. Pricing: $0.2900/1M input tokens, $1.20/1M output tokens.

$0.29 / 1M in 262K context

NousResearch/Hermes 3 Llama 3.1 70B

NousResearch/Hermes 3 Llama 3.1 70B is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $0.3000/1M output tokens.

$0.30 / 1M in 131K context

Qwen/Qwen3 235B A22B Thinking 2507

Qwen/Qwen3 235B A22B Thinking 2507 is available via DeepInfra with a 262K context window and up to 262,144 output tokens. Pricing: $0.3000/1M input tokens, $2.90/1M output tokens.

$0.30 / 1M in 262K context

Google/Gemini 2.5 Flash

Google/Gemini 2.5 Flash is available via DeepInfra with a 1M context window and up to 1,000,000 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1M context

Deepseek Ai/DeepSeek V3

Deepseek Ai/DeepSeek V3 is available via DeepInfra with a 164K context window and up to 163,840 output tokens. Pricing: $0.3800/1M input tokens, $0.8900/1M output tokens.

$0.38 / 1M in 164K context

Qwen/Qwen3 Coder 480B A35B Instruct

Qwen/Qwen3 Coder 480B A35B Instruct is available via DeepInfra with a 262K context window and up to 262,144 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 262K context

Meta Llama/Meta Llama 3.1 70B Instruct

Meta Llama/Meta Llama 3.1 70B Instruct is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.4000/1M input tokens, $0.4000/1M output tokens.

$0.40 / 1M in 131K context

Mistralai/Mixtral 8x7B Instruct V0.1

Mistralai/Mixtral 8x7B Instruct V0.1 is available via DeepInfra with a 33K context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $0.4000/1M output tokens.

$0.40 / 1M in 33K context

Zai Org/GLM 4.5

Zai Org/GLM 4.5 is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 131K context

Microsoft/WizardLM 2 8x22B

Microsoft/WizardLM 2 8x22B is available via DeepInfra with a 66K context window and up to 65,536 output tokens. Pricing: $0.4800/1M input tokens, $0.4800/1M output tokens.

$0.48 / 1M in 66K context

Deepseek Ai/DeepSeek R1 0528

Deepseek Ai/DeepSeek R1 0528 is available via DeepInfra with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $2.15/1M output tokens.

$0.50 / 1M in 164K context

Moonshotai/Kimi K2 Instruct

Moonshotai/Kimi K2 Instruct is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.5000/1M input tokens, $2.00/1M output tokens.

$0.50 / 1M in 131K context

Moonshotai/Kimi K2 Instruct 0905

Moonshotai/Kimi K2 Instruct 0905 is available via DeepInfra with a 262K context window and up to 262,144 output tokens. Pricing: $0.5000/1M input tokens, $2.00/1M output tokens.

$0.50 / 1M in 262K context

Nvidia/Llama 3.1 Nemotron 70B Instruct

Nvidia/Llama 3.1 Nemotron 70B Instruct is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $0.6000/1M output tokens.

$0.60 / 1M in 131K context

Sao10K/L3.1 70B Euryale V2.2

Sao10K/L3.1 70B Euryale V2.2 is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.6500/1M input tokens, $0.7500/1M output tokens.

$0.65 / 1M in 131K context

Sao10K/L3.3 70B Euryale V2.3

Sao10K/L3.3 70B Euryale V2.3 is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $0.6500/1M input tokens, $0.7500/1M output tokens.

$0.65 / 1M in 131K context

Deepseek Ai/DeepSeek R1

Deepseek Ai/DeepSeek R1 is available via DeepInfra with a 164K context window and up to 163,840 output tokens. Pricing: $0.7000/1M input tokens, $2.40/1M output tokens.

$0.70 / 1M in 164K context

NousResearch/Hermes 3 Llama 3.1 405B

NousResearch/Hermes 3 Llama 3.1 405B is available via DeepInfra with a 131K context window and up to 131,072 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 131K context

Deepseek Ai/DeepSeek R1 0528 Turbo

Deepseek Ai/DeepSeek R1 0528 Turbo is available via DeepInfra with a 33K context window and up to 32,768 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 33K context

Deepseek Ai/DeepSeek R1 Turbo

Deepseek Ai/DeepSeek R1 Turbo is available via DeepInfra with a 41K context window and up to 40,960 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 41K context

Google/Gemini 2.5 Pro

Google/Gemini 2.5 Pro is available via DeepInfra with a 1M context window and up to 1,000,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1M context

Anthropic/Claude 3 7 Sonnet Latest

Anthropic/Claude 3 7 Sonnet Latest is available via DeepInfra with a 200K context window and up to 200,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 200K context

Anthropic/Claude 4 Sonnet

Anthropic/Claude 4 Sonnet is available via DeepInfra with a 200K context window and up to 200,000 output tokens. Pricing: $3.30/1M input tokens, $16.50/1M output tokens.

$3.30 / 1M in 200K context

Anthropic/Claude 4 Opus

Anthropic/Claude 4 Opus is available via DeepInfra with a 200K context window and up to 200,000 output tokens. Pricing: $16.50/1M input tokens, $82.50/1M output tokens.

$16.50 / 1M in 200K context

Azure AI Models

View provider details →

Ministral 3b

Ministral 3b is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.0400/1M input tokens, $0.0400/1M output tokens.

$0.040 / 1M in 128K context

Phi 4 Mini Instruct

Phi 4 Mini Instruct is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 131K context

Phi 4 Multimodal Instruct

Phi 4 Multimodal Instruct is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.3200/1M output tokens.

$0.080 / 1M in 131K context

Phi 4 Mini Reasoning

Phi 4 Mini Reasoning is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.3200/1M output tokens.

$0.080 / 1M in 131K context

Mistral Small 2503

Mistral Small 2503 is available via Azure AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 128K context

Phi 4

Phi 4 is available via Azure AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1250/1M input tokens, $0.5000/1M output tokens.

$0.13 / 1M in 16K context

Phi 4 Reasoning

Phi 4 Reasoning is available via Azure AI with a 33K context window and up to 4,096 output tokens. Pricing: $0.1250/1M input tokens, $0.5000/1M output tokens.

$0.13 / 1M in 33K context

Phi 3 Mini 128k Instruct

Phi 3 Mini 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.

$0.13 / 1M in 128K context

Phi 3 Mini 4k Instruct

Phi 3 Mini 4k Instruct is available via Azure AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.

$0.13 / 1M in 4K context

Phi 3.5 Mini Instruct

Phi 3.5 Mini Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.

$0.13 / 1M in 128K context

Phi 3.5 Vision Instruct

Phi 3.5 Vision Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.

$0.13 / 1M in 128K context

Gpt Oss 120b

Gpt Oss 120b is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Phi 3 Small 128k Instruct

Phi 3 Small 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Phi 3 Small 8k Instruct

Phi 3 Small 8k Instruct is available via Azure AI with a 8K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 8K context

Mistral Nemo

Mistral Nemo is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 131K context

Phi 3.5 MoE Instruct

Phi 3.5 MoE Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1600/1M input tokens, $0.6400/1M output tokens.

$0.16 / 1M in 128K context

Phi 3 Medium 128k Instruct

Phi 3 Medium 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1700/1M input tokens, $0.6800/1M output tokens.

$0.17 / 1M in 128K context

Phi 3 Medium 4k Instruct

Phi 3 Medium 4k Instruct is available via Azure AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1700/1M input tokens, $0.6800/1M output tokens.

$0.17 / 1M in 4K context

Llama 4 Scout 17B 16E Instruct

Llama 4 Scout 17B 16E Instruct is available via Azure AI with a 10M context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.7800/1M output tokens.

$0.20 / 1M in 10M context

Grok 4 Fast Non Reasoning

Grok 4 Fast Non Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 131K context

Grok 4 Fast Reasoning

Grok 4 Fast Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 131K context

Grok 4 1 Fast Non Reasoning

Grok 4 1 Fast Non Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 131K context

Grok 4 1 Fast Reasoning

Grok 4 1 Fast Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 131K context

Grok Code Fast 1

Grok Code Fast 1 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $1.50/1M output tokens.

$0.20 / 1M in 131K context

Global/Grok 3 Mini

Global/Grok 3 Mini is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2500/1M input tokens, $1.27/1M output tokens.

$0.25 / 1M in 131K context

Grok 3 Mini

Grok 3 Mini is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2500/1M input tokens, $1.27/1M output tokens.

$0.25 / 1M in 131K context

Meta Llama 3.1 8B Instruct

Meta Llama 3.1 8B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.3000/1M input tokens, $0.6100/1M output tokens.

$0.30 / 1M in 128K context

Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.3700/1M input tokens, $0.3700/1M output tokens.

$0.37 / 1M in 128K context

Mistral Medium 2505

Mistral Medium 2505 is available via Azure AI with a 131K context window and up to 8,191 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 131K context

Jamba Instruct

Jamba Instruct is available via Azure AI with a 70K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $0.7000/1M output tokens.

$0.50 / 1M in 70K context

Mistral Large 3

Mistral Large 3 is available via Azure AI with a 256K context window and up to 8,191 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 256K context

Deepseek V3.2

Deepseek V3.2 is available via Azure AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5800/1M input tokens, $1.68/1M output tokens.

$0.58 / 1M in 164K context

Deepseek V3.2 Speciale

Deepseek V3.2 Speciale is available via Azure AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5800/1M input tokens, $1.68/1M output tokens.

$0.58 / 1M in 164K context

Kimi K2.5

Kimi K2.5 is available via Azure AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Llama 3.3 70B Instruct

Llama 3.3 70B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.7100/1M input tokens, $0.7100/1M output tokens.

$0.71 / 1M in 128K context

Claude Haiku 4 5

Claude Haiku 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Mistral Small

Mistral Small is available via Azure AI with a 32K context window and up to 8,191 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 32K context

Meta Llama 3 70B Instruct

Meta Llama 3 70B Instruct is available via Azure AI with a 8K context window and up to 2,048 output tokens. Pricing: $1.10/1M input tokens, $0.3700/1M output tokens.

$1.10 / 1M in 8K context

Deepseek

Deepseek is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.14/1M input tokens, $4.56/1M output tokens.

$1.14 / 1M in 128K context

Deepseek V3 0324

Deepseek V3 0324 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.14/1M input tokens, $4.56/1M output tokens.

$1.14 / 1M in 128K context

MAI DS R1

MAI DS R1 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.

$1.35 / 1M in 128K context

Deepseek R1

Deepseek R1 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.

$1.35 / 1M in 128K context

Llama 4 Maverick 17B 128E Instruct FP8

Llama 4 Maverick 17B 128E Instruct FP8 is available via Azure AI with a 1M context window and up to 16,384 output tokens. Pricing: $1.41/1M input tokens, $0.3500/1M output tokens.

$1.41 / 1M in 1M context

Mistral Large 2407

Mistral Large 2407 is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Mistral Large Latest

Mistral Large Latest is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Llama 3.2 90B Vision Instruct

Llama 3.2 90B Vision Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $2.04/1M input tokens, $2.04/1M output tokens.

$2.04 / 1M in 128K context

Meta Llama 3.1 70B Instruct

Meta Llama 3.1 70B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $2.68/1M input tokens, $3.54/1M output tokens.

$2.68 / 1M in 128K context

Claude Sonnet 4 5

Claude Sonnet 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude Sonnet 4 6

Claude Sonnet 4 6 is available via Azure AI with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Global/Grok 3

Global/Grok 3 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Grok 3

Grok 3 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Grok 4

Grok 4 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Mistral Large

Mistral Large is available via Azure AI with a 32K context window and up to 8,191 output tokens. Pricing: $4.00/1M input tokens, $12.00/1M output tokens.

$4.00 / 1M in 32K context

Claude Opus 4 5

Claude Opus 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Claude Opus 4 6

Claude Opus 4 6 is available via Azure AI with a 200K context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Meta Llama 3.1 405B Instruct

Meta Llama 3.1 405B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $5.33/1M input tokens, $16.00/1M output tokens.

$5.33 / 1M in 128K context

Claude Opus 4 1

Claude Opus 4 1 is available via Azure AI with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Jais 30b Chat

Jais 30b Chat is available via Azure AI with a 8K context window and up to 8,192 output tokens. Pricing: $3200.00/1M input tokens, $9710.00/1M output tokens.

$3200.00 / 1M in 8K context

Mistral Models

View provider details →

Codestral 2405

Codestral 2405 is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 32K context

Codestral Latest

Codestral Latest is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 32K context

Mistral Small Latest

Mistral Small Latest is available via Mistral with a 131K context window and up to 131,072 output tokens. Pricing: $0.0600/1M input tokens, $0.1800/1M output tokens.

$0.060 / 1M in 131K context

Mistral Small 3 2 2506

Mistral Small 3 2 2506 is available via Mistral with a 131K context window and up to 131,072 output tokens. Pricing: $0.0600/1M input tokens, $0.1800/1M output tokens.

$0.060 / 1M in 131K context

Devstral Small 2505

Devstral Small 2505 is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 128K context

Devstral Small 2507

Devstral Small 2507 is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 128K context

Devstral Small Latest

Devstral Small Latest is available via Mistral with a 256K context window and up to 256,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 256K context

Labs Devstral Small 2512

Labs Devstral Small 2512 is available via Mistral with a 256K context window and up to 256,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 256K context

Mistral Small

Mistral Small is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 32K context

Ministral 3 3b 2512

Ministral 3 3b 2512 is available via Mistral with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Ministral 3 8b 2512

Ministral 3 8b 2512 is available via Mistral with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 262K context

Pixtral 12b 2409

Pixtral 12b 2409 is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Ministral 3 14b 2512

Ministral 3 14b 2512 is available via Mistral with a 262K context window and up to 262,144 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 262K context

Codestral Mamba Latest

Codestral Mamba Latest is available via Mistral with a 256K context window and up to 256,000 output tokens. Pricing: $0.2500/1M input tokens, $0.2500/1M output tokens.

$0.25 / 1M in 256K context

Mistral Tiny

Mistral Tiny is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $0.2500/1M input tokens, $0.2500/1M output tokens.

$0.25 / 1M in 32K context

Open Codestral Mamba

Open Codestral Mamba is available via Mistral with a 256K context window and up to 256,000 output tokens. Pricing: $0.2500/1M input tokens, $0.2500/1M output tokens.

$0.25 / 1M in 256K context

Open Mistral 7b

Open Mistral 7b is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $0.2500/1M input tokens, $0.2500/1M output tokens.

$0.25 / 1M in 32K context

Codestral 2508

Codestral 2508 is available via Mistral with a 256K context window and up to 256,000 output tokens. Pricing: $0.3000/1M input tokens, $0.9000/1M output tokens.

$0.30 / 1M in 256K context

Open Mistral Nemo

Open Mistral Nemo is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $0.3000/1M input tokens, $0.3000/1M output tokens.

$0.30 / 1M in 128K context

Open Mistral Nemo 2407

Open Mistral Nemo 2407 is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $0.3000/1M input tokens, $0.3000/1M output tokens.

$0.30 / 1M in 128K context

Devstral Medium 2507

Devstral Medium 2507 is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 128K context

Devstral Latest

Devstral Latest is available via Mistral with a 256K context window and up to 256,000 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 256K context

Devstral Medium Latest

Devstral Medium Latest is available via Mistral with a 256K context window and up to 256,000 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 256K context

Devstral 2512

Devstral 2512 is available via Mistral with a 256K context window and up to 256,000 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 256K context

Mistral Medium 2505

Mistral Medium 2505 is available via Mistral with a 131K context window and up to 8,191 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 131K context

Mistral Medium Latest

Mistral Medium Latest is available via Mistral with a 131K context window and up to 131,072 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 131K context

Mistral Medium 3 1 2508

Mistral Medium 3 1 2508 is available via Mistral with a 131K context window and up to 131,072 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 131K context

Magistral Small 2506

Magistral Small 2506 is available via Mistral with a 40K context window and up to 40,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 40K context

Magistral Small Latest

Magistral Small Latest is available via Mistral with a 40K context window and up to 40,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 40K context

Magistral Small 1 2 2509

Magistral Small 1 2 2509 is available via Mistral with a 40K context window and up to 40,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 40K context

Mistral Large Latest

Mistral Large Latest is available via Mistral with a 262K context window and up to 262,144 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 262K context

Mistral Large 3

Mistral Large 3 is available via Mistral with a 262K context window and up to 262,144 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 262K context

Mistral Large 2512

Mistral Large 2512 is available via Mistral with a 262K context window and up to 262,144 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 262K context

Open Mixtral 8x7b

Open Mixtral 8x7b is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $0.7000/1M input tokens, $0.7000/1M output tokens.

$0.70 / 1M in 32K context

Codestral 2405

Codestral 2405 is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 32K context

Codestral Latest

Codestral Latest is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 32K context

Magistral Medium 2506

Magistral Medium 2506 is available via Mistral with a 40K context window and up to 40,000 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 40K context

Magistral Medium 2509

Magistral Medium 2509 is available via Mistral with a 40K context window and up to 40,000 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 40K context

Magistral Medium 1 2 2509

Magistral Medium 1 2 2509 is available via Mistral with a 40K context window and up to 40,000 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 40K context

Magistral Medium Latest

Magistral Medium Latest is available via Mistral with a 40K context window and up to 40,000 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 40K context

Mistral Large 2411

Mistral Large 2411 is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Open Mixtral 8x22b

Open Mixtral 8x22b is available via Mistral with a 65K context window and up to 8,191 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 65K context

Pixtral Large 2411

Pixtral Large 2411 is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Pixtral Large Latest

Pixtral Large Latest is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 128K context

Mistral Medium

Mistral Medium is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $2.70/1M input tokens, $8.10/1M output tokens.

$2.70 / 1M in 32K context

Mistral Medium 2312

Mistral Medium 2312 is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $2.70/1M input tokens, $8.10/1M output tokens.

$2.70 / 1M in 32K context

Mistral Large 2407

Mistral Large 2407 is available via Mistral with a 128K context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $9.00/1M output tokens.

$3.00 / 1M in 128K context

Mistral Large 2402

Mistral Large 2402 is available via Mistral with a 32K context window and up to 8,191 output tokens. Pricing: $4.00/1M input tokens, $12.00/1M output tokens.

$4.00 / 1M in 32K context

Google Gemini Models

View provider details →

Gemini Exp 1114

Gemini Exp 1114 is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 1.0M context

Gemini Exp 1206

Gemini Exp 1206 is available via Google Gemini with a 2.1M context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 2.1M context

Gemma 3 27b It

Gemma 3 27b It is available via Google Gemini with a 131K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 131K context

Learnlm 1.5 Pro Experimental

Learnlm 1.5 Pro Experimental is available via Google Gemini with a 33K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Lyria 3 Clip Preview

Lyria 3 Clip Preview is available via Google Gemini with a 131K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 131K context

Lyria 3 Pro Preview

Lyria 3 Pro Preview is available via Google Gemini with a 131K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 131K context

Gemini 2.0 Flash Lite

Gemini 2.0 Flash Lite is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 1.0M context

Gemini 2.0 Flash Lite 001

Gemini 2.0 Flash Lite 001 is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 1.0M context

Gemini 2.0 Flash

NEW

Gemini 2.0 Flash is Google's fastest and most cost-effective model, designed for high-volume agentic tasks. It supports native tool use, code execution, and multimodal input at extremely low per-token pricing.

$0.10 / 1M in 1.0M context

Gemini 2.0 Flash 001

Gemini 2.0 Flash 001 is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini 2.5 Flash Lite

Gemini 2.5 Flash Lite is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini 2.5 Flash Lite Preview 09 2025

Gemini 2.5 Flash Lite Preview 09 2025 is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini Flash Lite Latest

Gemini Flash Lite Latest is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini 2.5 Flash Lite Preview 06 17

Gemini 2.5 Flash Lite Preview 06 17 is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini Flash Lite Latest

Gemini Flash Lite Latest is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 1.0M context

Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is available via Google Gemini with a 1.0M context window and up to 65,536 output tokens. Pricing: $0.2500/1M input tokens, $1.50/1M output tokens.

$0.25 / 1M in 1.0M context

Gemini Robotics Er 1.5 Preview

Gemini Robotics Er 1.5 Preview is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash

Gemini 2.5 Flash is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash Preview 09 2025

Gemini 2.5 Flash Preview 09 2025 is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini Flash Latest

Gemini Flash Latest is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash Native Audio Latest

Gemini 2.5 Flash Native Audio Latest is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash Native Audio Preview 09 2025

Gemini 2.5 Flash Native Audio Preview 09 2025 is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash Native Audio Preview 12 2025

Gemini 2.5 Flash Native Audio Preview 12 2025 is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash Native Audio Latest

Gemini 2.5 Flash Native Audio Latest is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash Native Audio Preview 09 2025

Gemini 2.5 Flash Native Audio Preview 09 2025 is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 2.5 Flash Native Audio Preview 12 2025

Gemini 2.5 Flash Native Audio Preview 12 2025 is available via Google Gemini with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini Flash Latest

Gemini Flash Latest is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini Exp 1206

Gemini Exp 1206 is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Gemini 3 Flash Preview

Gemini 3 Flash Preview is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.5000/1M input tokens, $3.00/1M output tokens.

$0.50 / 1M in 1.0M context

Gemini 3.1 Flash Live Preview

Gemini 3.1 Flash Live Preview is available via Google Gemini with a 131K context window and up to 65,536 output tokens. Pricing: $0.7500/1M input tokens, $4.50/1M output tokens.

$0.75 / 1M in 131K context

Gemini 3.1 Flash Live Preview

Gemini 3.1 Flash Live Preview is available via Google Gemini with a 131K context window and up to 65,536 output tokens. Pricing: $0.7500/1M input tokens, $4.50/1M output tokens.

$0.75 / 1M in 131K context

Gemini 2.5 Pro

Gemini 2.5 Pro is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Gemini 2.5 Computer Use Preview 10 2025

Gemini 2.5 Computer Use Preview 10 2025 is available via Google Gemini with a 128K context window and up to 64,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 128K context

Gemini 2.5 Pro Preview Tts

Gemini 2.5 Pro Preview Tts is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Gemini Pro Latest

Gemini Pro Latest is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Gemini Pro Latest

Gemini Pro Latest is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Gemini 3 Pro Preview

Gemini 3 Pro Preview is available via Google Gemini with a 1.0M context window and up to 65,535 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is available via Google Gemini with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Gemini 3.1 Pro Preview Customtools

Gemini 3.1 Pro Preview Customtools is available via Google Gemini with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Xai Models

View provider details →

Grok 4 Fast Reasoning

Grok 4 Fast Reasoning is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 2M context

Grok 4 Fast Non Reasoning

Grok 4 Fast Non Reasoning is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 2M context

Grok 4 1 Fast

Grok 4 1 Fast is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 2M context

Grok 4 1 Fast Reasoning

Grok 4 1 Fast Reasoning is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 2M context

Grok 4 1 Fast Reasoning Latest

Grok 4 1 Fast Reasoning Latest is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 2M context

Grok 4 1 Fast Non Reasoning

Grok 4 1 Fast Non Reasoning is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 2M context

Grok 4 1 Fast Non Reasoning Latest

Grok 4 1 Fast Non Reasoning Latest is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

$0.20 / 1M in 2M context

Grok Code Fast

Grok Code Fast is available via Xai with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $1.50/1M output tokens.

$0.20 / 1M in 256K context

Grok Code Fast 1

Grok Code Fast 1 is available via Xai with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $1.50/1M output tokens.

$0.20 / 1M in 256K context

Grok Code Fast 1 0825

Grok Code Fast 1 0825 is available via Xai with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $1.50/1M output tokens.

$0.20 / 1M in 256K context

Grok 3 Mini

Grok 3 Mini is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $0.5000/1M output tokens.

$0.30 / 1M in 131K context

Grok 3 Mini Beta

Grok 3 Mini Beta is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $0.5000/1M output tokens.

$0.30 / 1M in 131K context

Grok 3 Mini Latest

Grok 3 Mini Latest is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $0.5000/1M output tokens.

$0.30 / 1M in 131K context

Grok 3 Mini Fast

Grok 3 Mini Fast is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $4.00/1M output tokens.

$0.60 / 1M in 131K context

Grok 3 Mini Fast Beta

Grok 3 Mini Fast Beta is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $4.00/1M output tokens.

$0.60 / 1M in 131K context

Grok 3 Mini Fast Latest

Grok 3 Mini Fast Latest is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $4.00/1M output tokens.

$0.60 / 1M in 131K context

Grok 2

Grok 2 is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $10.00/1M output tokens.

$2.00 / 1M in 131K context

Grok 2 1212

Grok 2 1212 is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $10.00/1M output tokens.

$2.00 / 1M in 131K context

Grok 2 Latest

Grok 2 Latest is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $10.00/1M output tokens.

$2.00 / 1M in 131K context

Grok 2 Vision

Grok 2 Vision is available via Xai with a 33K context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $10.00/1M output tokens.

$2.00 / 1M in 33K context

Grok 2 Vision 1212

Grok 2 Vision 1212 is available via Xai with a 33K context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $10.00/1M output tokens.

$2.00 / 1M in 33K context

Grok 2 Vision Latest

Grok 2 Vision Latest is available via Xai with a 33K context window and up to 32,768 output tokens. Pricing: $2.00/1M input tokens, $10.00/1M output tokens.

$2.00 / 1M in 33K context

Grok 4.20 Multi Agent Beta 0309

Grok 4.20 Multi Agent Beta 0309 is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 2M context

Grok 4.20 Beta 0309 Reasoning

Grok 4.20 Beta 0309 Reasoning is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 2M context

Grok 4.20 Beta 0309 Non Reasoning

Grok 4.20 Beta 0309 Non Reasoning is available via Xai with a 2M context window and up to 2,000,000 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

$2.00 / 1M in 2M context

Grok 3

Grok 3 is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Grok 3 Beta

Grok 3 Beta is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Grok 3 Latest

Grok 3 Latest is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Grok 4

Grok 4 is available via Xai with a 256K context window and up to 256,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 256K context

Grok 4 0709

Grok 4 0709 is available via Xai with a 256K context window and up to 256,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 256K context

Grok 4 Latest

Grok 4 Latest is available via Xai with a 256K context window and up to 256,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 256K context

Grok 3 Fast Beta

Grok 3 Fast Beta is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 131K context

Grok 3 Fast Latest

Grok 3 Fast Latest is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 131K context

Grok Beta

Grok Beta is available via Xai with a 131K context window and up to 131,072 output tokens. Pricing: $5.00/1M input tokens, $15.00/1M output tokens.

$5.00 / 1M in 131K context

Grok Vision Beta

Grok Vision Beta is available via Xai with a 8K context window and up to 8,192 output tokens. Pricing: $5.00/1M input tokens, $15.00/1M output tokens.

$5.00 / 1M in 8K context

Oci Models

View provider details →

Google.Gemini 2.5 Flash Lite

Google.Gemini 2.5 Flash Lite is available via Oci with a 1.0M context window and up to 65,536 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 1.0M context

Cohere.Command A Translate 08 2025

Cohere.Command A Translate 08 2025 is available via Oci with a 256K context window and up to 4,000 output tokens. Pricing: $0.0900/1M input tokens, $0.0900/1M output tokens.

$0.090 / 1M in 256K context

Cohere.Command R 08 2024

Cohere.Command R 08 2024 is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Google.Gemini 2.5 Flash

Google.Gemini 2.5 Flash is available via Oci with a 1.0M context window and up to 65,536 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 1.0M context

Xai.Grok 3 Mini

Xai.Grok 3 Mini is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $0.5000/1M output tokens.

$0.30 / 1M in 131K context

Xai.Grok 3 Mini Fast

Xai.Grok 3 Mini Fast is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $4.00/1M output tokens.

$0.60 / 1M in 131K context

Meta.Llama 3.3 70b Instruct

Meta.Llama 3.3 70b Instruct is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 128K context

Meta.Llama 4 Maverick 17b 128e Instruct Fp8

Meta.Llama 4 Maverick 17b 128e Instruct Fp8 is available via Oci with a 512K context window and up to 4,000 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 512K context

Meta.Llama 4 Scout 17b 16e Instruct

Meta.Llama 4 Scout 17b 16e Instruct is available via Oci with a 192K context window and up to 4,000 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 192K context

Meta.Llama 3.1 70b Instruct

Meta.Llama 3.1 70b Instruct is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 128K context

Meta.Llama 3.3 70b Instruct Fp8 Dynamic

Meta.Llama 3.3 70b Instruct Fp8 Dynamic is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $0.7200/1M input tokens, $0.7200/1M output tokens.

$0.72 / 1M in 128K context

Google.Gemini 2.5 Pro

Google.Gemini 2.5 Pro is available via Oci with a 1.0M context window and up to 65,536 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Cohere.Command Latest

Cohere.Command Latest is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $1.56/1M input tokens, $1.56/1M output tokens.

$1.56 / 1M in 128K context

Cohere.Command A 03 2025

Cohere.Command A 03 2025 is available via Oci with a 256K context window and up to 4,000 output tokens. Pricing: $1.56/1M input tokens, $1.56/1M output tokens.

$1.56 / 1M in 256K context

Cohere.Command Plus Latest

Cohere.Command Plus Latest is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $1.56/1M input tokens, $1.56/1M output tokens.

$1.56 / 1M in 128K context

Cohere.Command A Reasoning 08 2025

Cohere.Command A Reasoning 08 2025 is available via Oci with a 256K context window and up to 4,000 output tokens. Pricing: $1.56/1M input tokens, $1.56/1M output tokens.

$1.56 / 1M in 256K context

Cohere.Command A Vision 07 2025

Cohere.Command A Vision 07 2025 is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $1.56/1M input tokens, $1.56/1M output tokens.

$1.56 / 1M in 128K context

Cohere.Command R Plus 08 2024

Cohere.Command R Plus 08 2024 is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $1.56/1M input tokens, $1.56/1M output tokens.

$1.56 / 1M in 128K context

Meta.Llama 3.2 90b Vision Instruct

Meta.Llama 3.2 90b Vision Instruct is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

$2.00 / 1M in 128K context

Meta.Llama 3.2 11b Vision Instruct

Meta.Llama 3.2 11b Vision Instruct is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

$2.00 / 1M in 128K context

Xai.Grok 3

Xai.Grok 3 is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Xai.Grok 4

Xai.Grok 4 is available via Oci with a 128K context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 128K context

Xai.Grok 4.20

Xai.Grok 4.20 is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Xai.Grok 4.20 Multi Agent

Xai.Grok 4.20 Multi Agent is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 131K context

Xai.Grok 3 Fast

Xai.Grok 3 Fast is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 131K context

Xai.Grok 4 Fast

Xai.Grok 4 Fast is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 131K context

Xai.Grok 4.1 Fast

Xai.Grok 4.1 Fast is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 131K context

Xai.Grok Code Fast 1

Xai.Grok Code Fast 1 is available via Oci with a 131K context window and up to 131,072 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 131K context

Meta.Llama 3.1 405b Instruct

Meta.Llama 3.1 405b Instruct is available via Oci with a 128K context window and up to 4,000 output tokens. Pricing: $10.68/1M input tokens, $10.68/1M output tokens.

$10.68 / 1M in 128K context

IBM Watsonx Models

View provider details →

Ibm/Granite 4 H Small

Ibm/Granite 4 H Small is available via IBM Watsonx with a 20K context window and up to 20,480 output tokens. Pricing: $0.0600/1M input tokens, $0.2500/1M output tokens.

$0.060 / 1M in 20K context

Ibm/Granite Guardian 3 2 2b

Ibm/Granite Guardian 3 2 2b is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 8K context

Ibm/Granite Vision 3 2 2b

Ibm/Granite Vision 3 2 2b is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 8K context

Meta Llama/Llama 3 2 1b Instruct

Meta Llama/Llama 3 2 1b Instruct is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 128K context

Mistralai/Mistral Small 2503

Mistralai/Mistral Small 2503 is available via IBM Watsonx with a 32K context window and up to 32,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 32K context

Mistralai/Mistral Small 3 1 24b Instruct 2503

Mistralai/Mistral Small 3 1 24b Instruct 2503 is available via IBM Watsonx with a 32K context window and up to 32,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 32K context

Meta Llama/Llama 3 2 3b Instruct

Meta Llama/Llama 3 2 3b Instruct is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 128K context

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 8K context

Ibm/Granite 3 8b Instruct

Ibm/Granite 3 8b Instruct is available via IBM Watsonx with a 8K context window and up to 1,024 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Ibm/Granite 3 3 8b Instruct

Ibm/Granite 3 3 8b Instruct is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Ibm/Granite Guardian 3 3 8b

Ibm/Granite Guardian 3 3 8b is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Meta Llama/Llama 3 2 11b Vision Instruct

Meta Llama/Llama 3 2 11b Vision Instruct is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $0.3500/1M input tokens, $0.3500/1M output tokens.

$0.35 / 1M in 128K context

Meta Llama/Llama 4 Maverick 17b

Meta Llama/Llama 4 Maverick 17b is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $0.3500/1M input tokens, $1.40/1M output tokens.

$0.35 / 1M in 128K context

Meta Llama/Llama Guard 3 11b Vision

Meta Llama/Llama Guard 3 11b Vision is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $0.3500/1M input tokens, $0.3500/1M output tokens.

$0.35 / 1M in 128K context

Mistralai/Pixtral 12b 2409

Mistralai/Pixtral 12b 2409 is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $0.3500/1M input tokens, $0.3500/1M output tokens.

$0.35 / 1M in 128K context

Ibm/Granite Ttm 1024 96 R2

Ibm/Granite Ttm 1024 96 R2 is available via IBM Watsonx with a 1K context window and up to 512 output tokens. Pricing: $0.3800/1M input tokens, $0.3800/1M output tokens.

$0.38 / 1M in 1K context

Ibm/Granite Ttm 1536 96 R2

Ibm/Granite Ttm 1536 96 R2 is available via IBM Watsonx with a 1K context window and up to 512 output tokens. Pricing: $0.3800/1M input tokens, $0.3800/1M output tokens.

$0.38 / 1M in 1K context

Ibm/Granite Ttm 512 96 R2

Ibm/Granite Ttm 512 96 R2 is available via IBM Watsonx with a 1K context window and up to 512 output tokens. Pricing: $0.3800/1M input tokens, $0.3800/1M output tokens.

$0.38 / 1M in 1K context

Google/Flan T5 Xl 3b

Google/Flan T5 Xl 3b is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $0.6000/1M output tokens.

$0.60 / 1M in 8K context

Ibm/Granite 13b Chat

Ibm/Granite 13b Chat is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $0.6000/1M output tokens.

$0.60 / 1M in 8K context

Ibm/Granite 13b Instruct

Ibm/Granite 13b Instruct is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $0.6000/1M output tokens.

$0.60 / 1M in 8K context

Meta Llama/Llama 3 3 70b Instruct

Meta Llama/Llama 3 3 70b Instruct is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $0.7100/1M input tokens, $0.7100/1M output tokens.

$0.71 / 1M in 128K context

Sdaia/Allam 1 13b Instruct

Sdaia/Allam 1 13b Instruct is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $1.80/1M input tokens, $1.80/1M output tokens.

$1.80 / 1M in 8K context

Meta Llama/Llama 3 2 90b Vision Instruct

Meta Llama/Llama 3 2 90b Vision Instruct is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

$2.00 / 1M in 128K context

Mistralai/Mistral Large

Mistralai/Mistral Large is available via IBM Watsonx with a 131K context window and up to 16,384 output tokens. Pricing: $3.00/1M input tokens, $10.00/1M output tokens.

$3.00 / 1M in 131K context

Mistralai/Mistral Medium 2505

Mistralai/Mistral Medium 2505 is available via IBM Watsonx with a 128K context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $10.00/1M output tokens.

$3.00 / 1M in 128K context

Bigscience/Mt0 Xxl 13b

Bigscience/Mt0 Xxl 13b is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $500.00/1M input tokens, $2000.00/1M output tokens.

$500.00 / 1M in 8K context

Core42/Jais 13b Chat

Core42/Jais 13b Chat is available via IBM Watsonx with a 8K context window and up to 8,192 output tokens. Pricing: $500.00/1M input tokens, $2000.00/1M output tokens.

$500.00 / 1M in 8K context

Nebius Models

View provider details →

Qwen/Qwen2.5 Coder 7B

Qwen/Qwen2.5 Coder 7B is available via Nebius with a 33K context window and up to 32,768 output tokens. Pricing: $0.0100/1M input tokens, $0.0300/1M output tokens.

$0.010 / 1M in 33K context

Meta Llama/Llama Guard 3 8B

Meta Llama/Llama Guard 3 8B is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.0200/1M input tokens, $0.0600/1M output tokens.

$0.020 / 1M in 128K context

Meta Llama/Meta Llama 3.1 8B Instruct

Meta Llama/Meta Llama 3.1 8B Instruct is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.0200/1M input tokens, $0.0600/1M output tokens.

$0.020 / 1M in 128K context

Qwen/Qwen2 VL 7B Instruct

Qwen/Qwen2 VL 7B Instruct is available via Nebius with a 131K context window and up to 131,072 output tokens. Pricing: $0.0200/1M input tokens, $0.0600/1M output tokens.

$0.020 / 1M in 131K context

Mistralai/Mistral Nemo Instruct 2407

Mistralai/Mistral Nemo Instruct 2407 is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.0400/1M input tokens, $0.1200/1M output tokens.

$0.040 / 1M in 128K context

Google/Gemma 3 27b It

Google/Gemma 3 27b It is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.0600/1M input tokens, $0.2000/1M output tokens.

$0.060 / 1M in 128K context

Qwen/Qwen2.5 32B Instruct

Qwen/Qwen2.5 32B Instruct is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.0600/1M input tokens, $0.2000/1M output tokens.

$0.060 / 1M in 128K context

Qwen/Qwen3 14B

Qwen/Qwen3 14B is available via Nebius with a 33K context window and up to 32,768 output tokens. Pricing: $0.0800/1M input tokens, $0.2400/1M output tokens.

$0.080 / 1M in 33K context

Qwen/Qwen3 4B

Qwen/Qwen3 4B is available via Nebius with a 33K context window and up to 32,768 output tokens. Pricing: $0.0800/1M input tokens, $0.2400/1M output tokens.

$0.080 / 1M in 33K context

Nvidia/Llama 3.3 Nemotron Super 49B

Nvidia/Llama 3.3 Nemotron Super 49B is available via Nebius with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.4000/1M output tokens.

$0.10 / 1M in 131K context

Qwen/Qwen3 32B

Qwen/Qwen3 32B is available via Nebius with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 33K context

Qwen/Qwen3 30B A3B

Qwen/Qwen3 30B A3B is available via Nebius with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

$0.10 / 1M in 33K context

Meta Llama/Llama 3.3 70B Instruct

Meta Llama/Llama 3.3 70B Instruct is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.1300/1M input tokens, $0.4000/1M output tokens.

$0.13 / 1M in 128K context

Meta Llama/Meta Llama 3.1 70B Instruct

Meta Llama/Meta Llama 3.1 70B Instruct is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.1300/1M input tokens, $0.4000/1M output tokens.

$0.13 / 1M in 128K context

Qwen/Qwen2.5 72B Instruct

Qwen/Qwen2.5 72B Instruct is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.1300/1M input tokens, $0.4000/1M output tokens.

$0.13 / 1M in 128K context

Qwen/Qwen2.5 VL 72B Instruct

Qwen/Qwen2.5 VL 72B Instruct is available via Nebius with a 131K context window and up to 131,072 output tokens. Pricing: $0.1300/1M input tokens, $0.4000/1M output tokens.

$0.13 / 1M in 131K context

Qwen/Qwen2 VL 72B Instruct

Qwen/Qwen2 VL 72B Instruct is available via Nebius with a 131K context window and up to 131,072 output tokens. Pricing: $0.1300/1M input tokens, $0.4000/1M output tokens.

$0.13 / 1M in 131K context

Qwen/QwQ 32B

Qwen/QwQ 32B is available via Nebius with a 33K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $0.4500/1M output tokens.

$0.15 / 1M in 33K context

Qwen/Qwen3 235B A22B

Qwen/Qwen3 235B A22B is available via Nebius with a 262K context window and up to 262,144 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 262K context

Deepseek Ai/DeepSeek R1 Distill Llama 70B

Deepseek Ai/DeepSeek R1 Distill Llama 70B is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.2500/1M input tokens, $0.7500/1M output tokens.

$0.25 / 1M in 128K context

Deepseek Ai/DeepSeek V3

Deepseek Ai/DeepSeek V3 is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 128K context

Deepseek Ai/DeepSeek V3 0324

Deepseek Ai/DeepSeek V3 0324 is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 128K context

Nvidia/Llama 3.1 Nemotron Ultra 253B

Nvidia/Llama 3.1 Nemotron Ultra 253B is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.6000/1M input tokens, $1.80/1M output tokens.

$0.60 / 1M in 128K context

Deepseek Ai/DeepSeek R1

Deepseek Ai/DeepSeek R1 is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $0.8000/1M input tokens, $2.40/1M output tokens.

$0.80 / 1M in 128K context

Deepseek Ai/DeepSeek R1 0528

Deepseek Ai/DeepSeek R1 0528 is available via Nebius with a 164K context window and up to 164,000 output tokens. Pricing: $0.8000/1M input tokens, $2.40/1M output tokens.

$0.80 / 1M in 164K context

Meta Llama/Meta Llama 3.1 405B Instruct

Meta Llama/Meta Llama 3.1 405B Instruct is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 128K context

NousResearch/Hermes 3 Llama 3.1 405B

NousResearch/Hermes 3 Llama 3.1 405B is available via Nebius with a 128K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 128K context

Databricks Models

View provider details →

Databricks Gpt 5 Nano

Databricks Gpt 5 Nano is available via Databricks with a 272K context window and up to 128,000 output tokens. Pricing: $0.0500/1M input tokens, $0.4000/1M output tokens.

$0.050 / 1M in 272K context

Databricks Gpt Oss 20b

Databricks Gpt Oss 20b is available via Databricks with a 131K context window and up to 131,072 output tokens. Pricing: $0.0700/1M input tokens, $0.3000/1M output tokens.

$0.070 / 1M in 131K context

Databricks Gemma 3 12b

Databricks Gemma 3 12b is available via Databricks with a 128K context window and up to 32,000 output tokens. Pricing: $0.1500/1M input tokens, $0.5000/1M output tokens.

$0.15 / 1M in 128K context

Databricks Gpt Oss 120b

Databricks Gpt Oss 120b is available via Databricks with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Databricks Meta Llama 3 1 8b Instruct

Databricks Meta Llama 3 1 8b Instruct is available via Databricks with a 200K context window and up to 128,000 output tokens. Pricing: $0.1500/1M input tokens, $0.4500/1M output tokens.

$0.15 / 1M in 200K context

Databricks Gpt 5 Mini

Databricks Gpt 5 Mini is available via Databricks with a 272K context window and up to 128,000 output tokens. Pricing: $0.2500/1M input tokens, $2.00/1M output tokens.

$0.25 / 1M in 272K context

Databricks Gemini 2 5 Flash

Databricks Gemini 2 5 Flash is available via Databricks with a 1.0M context window and up to 65,535 output tokens. Pricing: $0.3000/1M input tokens, $2.50/1M output tokens.

$0.30 / 1M in 1.0M context

Databricks Llama 2 70b Chat

Databricks Llama 2 70b Chat is available via Databricks with a 4K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 4K context

Databricks Llama 4 Maverick

Databricks Llama 4 Maverick is available via Databricks with a 128K context window and up to 128,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 128K context

Databricks Meta Llama 3 3 70b Instruct

Databricks Meta Llama 3 3 70b Instruct is available via Databricks with a 128K context window and up to 128,000 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

$0.50 / 1M in 128K context

Databricks Mixtral 8x7b Instruct

Databricks Mixtral 8x7b Instruct is available via Databricks with a 4K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $1.00/1M output tokens.

$0.50 / 1M in 4K context

Databricks Mpt 7b Instruct

Databricks Mpt 7b Instruct is available via Databricks with a 8K context window and up to 8,192 output tokens. Pricing: $0.5000/1M input tokens, $0.000000/1M output tokens.

$0.50 / 1M in 8K context

Databricks Claude Haiku 4 5

Databricks Claude Haiku 4 5 is available via Databricks with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Databricks Meta Llama 3 70b Instruct

Databricks Meta Llama 3 70b Instruct is available via Databricks with a 128K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 128K context

Databricks Mpt 30b Instruct

Databricks Mpt 30b Instruct is available via Databricks with a 8K context window and up to 8,192 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 8K context

Databricks Gemini 2 5 Pro

Databricks Gemini 2 5 Pro is available via Databricks with a 1.0M context window and up to 65,536 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 1.0M context

Databricks Gpt 5

Databricks Gpt 5 is available via Databricks with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Databricks Gpt 5 1

Databricks Gpt 5 1 is available via Databricks with a 272K context window and up to 128,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 272K context

Databricks Claude 3 7 Sonnet

Databricks Claude 3 7 Sonnet is available via Databricks with a 200K context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Databricks Claude Sonnet 4

Databricks Claude Sonnet 4 is available via Databricks with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Databricks Claude Sonnet 4 1

Databricks Claude Sonnet 4 1 is available via Databricks with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Databricks Claude Sonnet 4 5

Databricks Claude Sonnet 4 5 is available via Databricks with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Databricks Claude Opus 4 5

Databricks Claude Opus 4 5 is available via Databricks with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Databricks Meta Llama 3 1 405b Instruct

Databricks Meta Llama 3 1 405b Instruct is available via Databricks with a 128K context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $15.00/1M output tokens.

$5.00 / 1M in 128K context

Databricks Claude Opus 4

Databricks Claude Opus 4 is available via Databricks with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Databricks Claude Opus 4 1

Databricks Claude Opus 4 1 is available via Databricks with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Moonshot Models

View provider details →

Kimi Latest 8k

Kimi Latest 8k is available via Moonshot with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $2.00/1M output tokens.

$0.20 / 1M in 8K context

Moonshot V1 8k

Moonshot V1 8k is available via Moonshot with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $2.00/1M output tokens.

$0.20 / 1M in 8K context

Moonshot V1 8k 0430

Moonshot V1 8k 0430 is available via Moonshot with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $2.00/1M output tokens.

$0.20 / 1M in 8K context

Moonshot V1 8k Vision Preview

Moonshot V1 8k Vision Preview is available via Moonshot with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $2.00/1M output tokens.

$0.20 / 1M in 8K context

Kimi K2 0711 Preview

Kimi K2 0711 Preview is available via Moonshot with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 131K context

Kimi K2 0905 Preview

Kimi K2 0905 Preview is available via Moonshot with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Kimi K2.5

Kimi K2.5 is available via Moonshot with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

$0.60 / 1M in 262K context

Kimi Thinking Preview

Kimi Thinking Preview is available via Moonshot with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 131K context

Kimi K2 Thinking

Kimi K2 Thinking is available via Moonshot with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 262K context

Kimi Latest 32k

Kimi Latest 32k is available via Moonshot with a 33K context window and up to 32,768 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 33K context

Moonshot V1 32k

Moonshot V1 32k is available via Moonshot with a 33K context window and up to 32,768 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 33K context

Moonshot V1 32k 0430

Moonshot V1 32k 0430 is available via Moonshot with a 33K context window and up to 32,768 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 33K context

Moonshot V1 32k Vision Preview

Moonshot V1 32k Vision Preview is available via Moonshot with a 33K context window and up to 32,768 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 33K context

Kimi K2 Turbo Preview

Kimi K2 Turbo Preview is available via Moonshot with a 262K context window and up to 262,144 output tokens. Pricing: $1.15/1M input tokens, $8.00/1M output tokens.

$1.15 / 1M in 262K context

Kimi K2 Thinking Turbo

Kimi K2 Thinking Turbo is available via Moonshot with a 262K context window and up to 262,144 output tokens. Pricing: $1.15/1M input tokens, $8.00/1M output tokens.

$1.15 / 1M in 262K context

Kimi Latest

Kimi Latest is available via Moonshot with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 131K context

Kimi Latest 128k

Kimi Latest 128k is available via Moonshot with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 131K context

Moonshot V1 128k

Moonshot V1 128k is available via Moonshot with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 131K context

Moonshot V1 128k 0430

Moonshot V1 128k 0430 is available via Moonshot with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 131K context

Moonshot V1 128k Vision Preview

Moonshot V1 128k Vision Preview is available via Moonshot with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 131K context

Moonshot V1 Auto

Moonshot V1 Auto is available via Moonshot with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $5.00/1M output tokens.

$2.00 / 1M in 131K context

Ollama Models

View provider details →

Codegeex4

Codegeex4 is available via Ollama with a 33K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Deepseek Coder V2 Instruct

Deepseek Coder V2 Instruct is available via Ollama with a 33K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Deepseek Coder V2 Lite Instruct

Deepseek Coder V2 Lite Instruct is available via Ollama with a 33K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Deepseek V3.1:671b Cloud

Deepseek V3.1:671b Cloud is available via Ollama with a 164K context window and up to 163,840 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 164K context

Gpt Oss:120b Cloud

Gpt Oss:120b Cloud is available via Ollama with a 131K context window and up to 131,072 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 131K context

Gpt Oss:20b Cloud

Gpt Oss:20b Cloud is available via Ollama with a 131K context window and up to 131,072 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 131K context

Internlm2 5 20b Chat

Internlm2 5 20b Chat is available via Ollama with a 33K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Llama2

Llama2 is available via Ollama with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 4K context

Llama2:13b

Llama2:13b is available via Ollama with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 4K context

Llama2:70b

Llama2:70b is available via Ollama with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 4K context

Llama2:7b

Llama2:7b is available via Ollama with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 4K context

Llama3

Llama3 is available via Ollama with a 8K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Llama3.1

Llama3.1 is available via Ollama with a 8K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Llama3:70b

Llama3:70b is available via Ollama with a 8K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Llama3:8b

Llama3:8b is available via Ollama with a 8K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Mistral 7B Instruct V0.1

Mistral 7B Instruct V0.1 is available via Ollama with a 8K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Mistral 7B Instruct V0.2

Mistral 7B Instruct V0.2 is available via Ollama with a 33K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Mistral Large Instruct 2407

Mistral Large Instruct 2407 is available via Ollama with a 66K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 66K context

Mixtral 8x22B Instruct V0.1

Mixtral 8x22B Instruct V0.1 is available via Ollama with a 66K context window and up to 65,536 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 66K context

Mixtral 8x7B Instruct V0.1

Mixtral 8x7B Instruct V0.1 is available via Ollama with a 33K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Qwen3 Coder:480b Cloud

Qwen3 Coder:480b Cloud is available via Ollama with a 262K context window and up to 262,144 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 262K context

Lambda Ai Models

View provider details →

Llama3.2 11b Vision Instruct

Llama3.2 11b Vision Instruct is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.0150/1M input tokens, $0.0250/1M output tokens.

$0.015 / 1M in 131K context

Llama3.2 3b Instruct

Llama3.2 3b Instruct is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.0150/1M input tokens, $0.0250/1M output tokens.

$0.015 / 1M in 131K context

Hermes3 8b

Hermes3 8b is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.0250/1M input tokens, $0.0400/1M output tokens.

$0.025 / 1M in 131K context

Lfm 7b

Lfm 7b is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.0250/1M input tokens, $0.0400/1M output tokens.

$0.025 / 1M in 131K context

Llama3.1 8b Instruct

Llama3.1 8b Instruct is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.0250/1M input tokens, $0.0400/1M output tokens.

$0.025 / 1M in 131K context

Llama 4 Maverick 17b 128e Instruct Fp8

Llama 4 Maverick 17b 128e Instruct Fp8 is available via Lambda Ai with a 131K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.1000/1M output tokens.

$0.050 / 1M in 131K context

Llama 4 Scout 17b 16e Instruct

Llama 4 Scout 17b 16e Instruct is available via Lambda Ai with a 16K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.1000/1M output tokens.

$0.050 / 1M in 16K context

Qwen25 Coder 32b Instruct

Qwen25 Coder 32b Instruct is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.0500/1M input tokens, $0.1000/1M output tokens.

$0.050 / 1M in 131K context

Qwen3 32b Fp8

Qwen3 32b Fp8 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.0500/1M input tokens, $0.1000/1M output tokens.

$0.050 / 1M in 131K context

Lfm 40b

Lfm 40b is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.2000/1M output tokens.

$0.10 / 1M in 131K context

Hermes3 70b

Hermes3 70b is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 131K context

Llama3.1 70b Instruct Fp8

Llama3.1 70b Instruct Fp8 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 131K context

Llama3.1 Nemotron 70b Instruct Fp8

Llama3.1 Nemotron 70b Instruct Fp8 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 131K context

Llama3.3 70b Instruct Fp8

Llama3.3 70b Instruct Fp8 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 131K context

Deepseek Llama3.3 70b

Deepseek Llama3.3 70b is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 131K context

Deepseek R1 0528

Deepseek R1 0528 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 131K context

Deepseek V3 0324

Deepseek V3 0324 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 131K context

Deepseek R1 671b

Deepseek R1 671b is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.8000/1M input tokens, $0.8000/1M output tokens.

$0.80 / 1M in 131K context

Hermes3 405b

Hermes3 405b is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.8000/1M input tokens, $0.8000/1M output tokens.

$0.80 / 1M in 131K context

Llama3.1 405b Instruct Fp8

Llama3.1 405b Instruct Fp8 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.8000/1M input tokens, $0.8000/1M output tokens.

$0.80 / 1M in 131K context

Perplexity Models

View provider details →

Pplx 70b Online

Pplx 70b Online is available via Perplexity with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $2.80/1M output tokens.

$0.000 / 1M in 4K context

Pplx 7b Online

Pplx 7b Online is available via Perplexity with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.2800/1M output tokens.

$0.000 / 1M in 4K context

Sonar Medium Online

Sonar Medium Online is available via Perplexity with a 12K context window and up to 12,000 output tokens. Pricing: $0.000000/1M input tokens, $1.80/1M output tokens.

$0.000 / 1M in 12K context

Sonar Small Online

Sonar Small Online is available via Perplexity with a 12K context window and up to 12,000 output tokens. Pricing: $0.000000/1M input tokens, $0.2800/1M output tokens.

$0.000 / 1M in 12K context

Mistral 7b Instruct

Mistral 7b Instruct is available via Perplexity with a 4K context window and up to 4,096 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

$0.070 / 1M in 4K context

Mixtral 8x7b Instruct

Mixtral 8x7b Instruct is available via Perplexity with a 4K context window and up to 4,096 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

$0.070 / 1M in 4K context

Pplx 7b Chat

Pplx 7b Chat is available via Perplexity with a 8K context window and up to 8,192 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

$0.070 / 1M in 8K context

Sonar Small Chat

Sonar Small Chat is available via Perplexity with a 16K context window and up to 16,384 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

$0.070 / 1M in 16K context

Llama 3.1 8b Instruct

Llama 3.1 8b Instruct is available via Perplexity with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Codellama 34b Instruct

Codellama 34b Instruct is available via Perplexity with a 16K context window and up to 16,384 output tokens. Pricing: $0.3500/1M input tokens, $1.40/1M output tokens.

$0.35 / 1M in 16K context

Sonar Medium Chat

Sonar Medium Chat is available via Perplexity with a 16K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $1.80/1M output tokens.

$0.60 / 1M in 16K context

Codellama 70b Instruct

Codellama 70b Instruct is available via Perplexity with a 16K context window and up to 16,384 output tokens. Pricing: $0.7000/1M input tokens, $2.80/1M output tokens.

$0.70 / 1M in 16K context

Llama 2 70b Chat

Llama 2 70b Chat is available via Perplexity with a 4K context window and up to 4,096 output tokens. Pricing: $0.7000/1M input tokens, $2.80/1M output tokens.

$0.70 / 1M in 4K context

Pplx 70b Chat

Pplx 70b Chat is available via Perplexity with a 4K context window and up to 4,096 output tokens. Pricing: $0.7000/1M input tokens, $2.80/1M output tokens.

$0.70 / 1M in 4K context

Llama 3.1 70b Instruct

Llama 3.1 70b Instruct is available via Perplexity with a 131K context window and up to 131,072 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 131K context

Sonar

Sonar is available via Perplexity with a 128K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 128K context

Sonar Reasoning

Sonar Reasoning is available via Perplexity with a 128K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 128K context

Sonar Deep Research

Sonar Deep Research is available via Perplexity with a 128K context window and up to 128,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 128K context

Sonar Reasoning Pro

Sonar Reasoning Pro is available via Perplexity with a 128K context window and up to 128,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 128K context

Sonar Pro

Sonar Pro is available via Perplexity with a 200K context window and up to 8,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Anthropic Models

View provider details →

Claude 3 Haiku

Claude 3 Haiku is available via Anthropic with a 200K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $1.25/1M output tokens.

$0.25 / 1M in 200K context

Claude Haiku 4 5

Claude Haiku 4 5 is available via Anthropic with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Claude Haiku 4 5

Claude Haiku 4 5 is available via Anthropic with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

$1.00 / 1M in 200K context

Claude 3 7 Sonnet

Claude 3 7 Sonnet is available via Anthropic with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude 4 Sonnet

Claude 4 Sonnet is available via Anthropic with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Claude Sonnet 4 5

Claude Sonnet 4 5 is available via Anthropic with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude Sonnet 4 5

Claude Sonnet 4 5 is available via Anthropic with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 200K context

Claude Sonnet 4 6

Claude Sonnet 4 6 is available via Anthropic with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 1M context

Claude Sonnet 4

NEW

Claude 4 Sonnet is Anthropic's latest and most capable model, excelling at coding, analysis, and complex instruction-following. It features extended context and improved tool use compared to previous Claude generations.

$3.00 / 1M in 1M context

Claude Opus 4 5

Claude Opus 4 5 is available via Anthropic with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Claude Opus 4 5

Claude Opus 4 5 is available via Anthropic with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 200K context

Claude Opus 4 6

Claude Opus 4 6 is available via Anthropic with a 1M context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 1M context

Claude Opus 4 6

Claude Opus 4 6 is available via Anthropic with a 1M context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 1M context

Claude 3 Opus

Claude 3 Opus is available via Anthropic with a 200K context window and up to 4,096 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude 4 Opus

Claude 4 Opus is available via Anthropic with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude Opus 4 1

Claude Opus 4 1 is available via Anthropic with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude Opus 4 1

Claude Opus 4 1 is available via Anthropic with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Claude Opus 4

Claude Opus 4 is available via Anthropic with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 200K context

Dashscope Models

View provider details →

Qwen Turbo

Qwen Turbo is available via Dashscope with a 129K context window and up to 16,384 output tokens. Pricing: $0.0500/1M input tokens, $0.2000/1M output tokens.

$0.050 / 1M in 129K context

Qwen Turbo 2024 11 01

Qwen Turbo 2024 11 01 is available via Dashscope with a 1M context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.2000/1M output tokens.

$0.050 / 1M in 1M context

Qwen Turbo 2025 04 28

Qwen Turbo 2025 04 28 is available via Dashscope with a 1M context window and up to 16,384 output tokens. Pricing: $0.0500/1M input tokens, $0.2000/1M output tokens.

$0.050 / 1M in 1M context

Qwen Turbo Latest

Qwen Turbo Latest is available via Dashscope with a 1M context window and up to 16,384 output tokens. Pricing: $0.0500/1M input tokens, $0.2000/1M output tokens.

$0.050 / 1M in 1M context

Qwen3 Next 80b A3b Instruct

Qwen3 Next 80b A3b Instruct is available via Dashscope with a 262K context window and up to 65,536 output tokens. Pricing: $0.1500/1M input tokens, $1.20/1M output tokens.

$0.15 / 1M in 262K context

Qwen3 Next 80b A3b Thinking

Qwen3 Next 80b A3b Thinking is available via Dashscope with a 262K context window and up to 65,536 output tokens. Pricing: $0.1500/1M input tokens, $1.20/1M output tokens.

$0.15 / 1M in 262K context

Qwen3 Vl 32b Instruct

Qwen3 Vl 32b Instruct is available via Dashscope with a 131K context window and up to 32,768 output tokens. Pricing: $0.1600/1M input tokens, $0.6400/1M output tokens.

$0.16 / 1M in 131K context

Qwen3 Vl 32b Thinking

Qwen3 Vl 32b Thinking is available via Dashscope with a 131K context window and up to 32,768 output tokens. Pricing: $0.1600/1M input tokens, $2.87/1M output tokens.

$0.16 / 1M in 131K context

Qwen Coder

Qwen Coder is available via Dashscope with a 1M context window and up to 16,384 output tokens. Pricing: $0.3000/1M input tokens, $1.50/1M output tokens.

$0.30 / 1M in 1M context

Qwen Plus

Qwen Plus is available via Dashscope with a 129K context window and up to 16,384 output tokens. Pricing: $0.4000/1M input tokens, $1.20/1M output tokens.

$0.40 / 1M in 129K context

Qwen Plus 2025 01 25

Qwen Plus 2025 01 25 is available via Dashscope with a 129K context window and up to 8,192 output tokens. Pricing: $0.4000/1M input tokens, $1.20/1M output tokens.

$0.40 / 1M in 129K context

Qwen Plus 2025 04 28

Qwen Plus 2025 04 28 is available via Dashscope with a 129K context window and up to 16,384 output tokens. Pricing: $0.4000/1M input tokens, $1.20/1M output tokens.

$0.40 / 1M in 129K context

Qwen Plus 2025 07 14

Qwen Plus 2025 07 14 is available via Dashscope with a 129K context window and up to 16,384 output tokens. Pricing: $0.4000/1M input tokens, $1.20/1M output tokens.

$0.40 / 1M in 129K context

Qwen3 Vl 235b A22b Instruct

Qwen3 Vl 235b A22b Instruct is available via Dashscope with a 131K context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $1.60/1M output tokens.

$0.40 / 1M in 131K context

Qwen3 Vl 235b A22b Thinking

Qwen3 Vl 235b A22b Thinking is available via Dashscope with a 131K context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $4.00/1M output tokens.

$0.40 / 1M in 131K context

Qwq Plus

Qwq Plus is available via Dashscope with a 98K context window and up to 8,192 output tokens. Pricing: $0.8000/1M input tokens, $2.40/1M output tokens.

$0.80 / 1M in 98K context

Qwen Max

Qwen Max is available via Dashscope with a 31K context window and up to 8,192 output tokens. Pricing: $1.60/1M input tokens, $6.40/1M output tokens.

$1.60 / 1M in 31K context

Gmi Models

View provider details →

Openai/Gpt 4o Mini

Openai/Gpt 4o Mini is available via Gmi with a 131K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Deepseek Ai/DeepSeek V3.2

Deepseek Ai/DeepSeek V3.2 is available via Gmi with a 164K context window and up to 16,384 output tokens. Pricing: $0.2800/1M input tokens, $0.4000/1M output tokens.

$0.28 / 1M in 164K context

Deepseek Ai/DeepSeek V3 0324

Deepseek Ai/DeepSeek V3 0324 is available via Gmi with a 164K context window and up to 16,384 output tokens. Pricing: $0.2800/1M input tokens, $0.8800/1M output tokens.

$0.28 / 1M in 164K context

MiniMaxAI/MiniMax M2.1

MiniMaxAI/MiniMax M2.1 is available via Gmi with a 197K context window and up to 16,384 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 197K context

Qwen/Qwen3 VL 235B A22B Instruct FP8

Qwen/Qwen3 VL 235B A22B Instruct FP8 is available via Gmi with a 262K context window and up to 16,384 output tokens. Pricing: $0.3000/1M input tokens, $1.40/1M output tokens.

$0.30 / 1M in 262K context

Zai Org/GLM 4.7 FP8

Zai Org/GLM 4.7 FP8 is available via Gmi with a 203K context window and up to 16,384 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

$0.40 / 1M in 203K context

Google/Gemini 3 Flash Preview

Google/Gemini 3 Flash Preview is available via Gmi with a 1.0M context window and up to 65,536 output tokens. Pricing: $0.5000/1M input tokens, $3.00/1M output tokens.

$0.50 / 1M in 1.0M context

Moonshotai/Kimi K2 Thinking

Moonshotai/Kimi K2 Thinking is available via Gmi with a 262K context window and up to 16,384 output tokens. Pricing: $0.8000/1M input tokens, $1.20/1M output tokens.

$0.80 / 1M in 262K context

Openai/Gpt 5.1

Openai/Gpt 5.1 is available via Gmi with a 410K context window and up to 32,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 410K context

Openai/Gpt 5

Openai/Gpt 5 is available via Gmi with a 410K context window and up to 32,000 output tokens. Pricing: $1.25/1M input tokens, $10.00/1M output tokens.

$1.25 / 1M in 410K context

Openai/Gpt 5.2

Openai/Gpt 5.2 is available via Gmi with a 410K context window and up to 32,000 output tokens. Pricing: $1.75/1M input tokens, $14.00/1M output tokens.

$1.75 / 1M in 410K context

Google/Gemini 3 Pro Preview

Google/Gemini 3 Pro Preview is available via Gmi with a 1.0M context window and up to 65,536 output tokens. Pricing: $2.00/1M input tokens, $12.00/1M output tokens.

$2.00 / 1M in 1.0M context

Openai/Gpt 4o

Openai/Gpt 4o is available via Gmi with a 131K context window and up to 16,384 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 131K context

Anthropic/Claude Sonnet 4.5

Anthropic/Claude Sonnet 4.5 is available via Gmi with a 410K context window and up to 32,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 410K context

Anthropic/Claude Sonnet 4

Anthropic/Claude Sonnet 4 is available via Gmi with a 410K context window and up to 32,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 410K context

Anthropic/Claude Opus 4.5

Anthropic/Claude Opus 4.5 is available via Gmi with a 410K context window and up to 32,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

$5.00 / 1M in 410K context

Anthropic/Claude Opus 4

Anthropic/Claude Opus 4 is available via Gmi with a 410K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 410K context

Together AI Models

View provider details →

Openai/Gpt Oss 20b

Openai/Gpt Oss 20b is available via Together AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.0500/1M input tokens, $0.2000/1M output tokens.

$0.050 / 1M in 128K context

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via Together AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Qwen/Qwen3 Next 80B A3B Instruct

Qwen/Qwen3 Next 80B A3B Instruct is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $1.50/1M output tokens.

$0.15 / 1M in 262K context

Qwen/Qwen3 Next 80B A3B Thinking

Qwen/Qwen3 Next 80B A3B Thinking is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $1.50/1M output tokens.

$0.15 / 1M in 262K context

Qwen/Qwen3 235B A22B Instruct 2507 Tput

Qwen/Qwen3 235B A22B Instruct 2507 Tput is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $6.00/1M output tokens.

$0.20 / 1M in 262K context

Qwen/Qwen3 235B A22B Fp8 Tput

Qwen/Qwen3 235B A22B Fp8 Tput is available via Together AI with a 40K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 40K context

Zai Org/GLM 4.5 Air FP8

Zai Org/GLM 4.5 Air FP8 is available via Together AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $1.10/1M output tokens.

$0.20 / 1M in 128K context

Zai Org/GLM 4.7

Zai Org/GLM 4.7 is available via Together AI with a 200K context window and up to 200,000 output tokens. Pricing: $0.4500/1M input tokens, $2.00/1M output tokens.

$0.45 / 1M in 200K context

Moonshotai/Kimi K2.5

Moonshotai/Kimi K2.5 is available via Together AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.5000/1M input tokens, $2.80/1M output tokens.

$0.50 / 1M in 256K context

Deepseek Ai/DeepSeek R1 0528 Tput

Deepseek Ai/DeepSeek R1 0528 Tput is available via Together AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

$0.55 / 1M in 128K context

Zai Org/GLM 4.6

Zai Org/GLM 4.6 is available via Together AI with a 200K context window and up to 200,000 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 200K context

Qwen/Qwen3.5 397B A17B

Qwen/Qwen3.5 397B A17B is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $0.6000/1M input tokens, $3.60/1M output tokens.

$0.60 / 1M in 262K context

Qwen/Qwen3 235B A22B Thinking 2507

Qwen/Qwen3 235B A22B Thinking 2507 is available via Together AI with a 256K context window and up to 4,096 output tokens. Pricing: $0.6500/1M input tokens, $3.00/1M output tokens.

$0.65 / 1M in 256K context

Moonshotai/Kimi K2 Instruct 0905

Moonshotai/Kimi K2 Instruct 0905 is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 262K context

Deepseek Ai/DeepSeek V3

Deepseek Ai/DeepSeek V3 is available via Together AI with a 66K context window and up to 8,192 output tokens. Pricing: $1.25/1M input tokens, $1.25/1M output tokens.

$1.25 / 1M in 66K context

Qwen/Qwen3 Coder 480B A35B Instruct FP8

Qwen/Qwen3 Coder 480B A35B Instruct FP8 is available via Together AI with a 256K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

$2.00 / 1M in 256K context

Deepseek Ai/DeepSeek R1

Deepseek Ai/DeepSeek R1 is available via Together AI with a 128K context window and up to 20,480 output tokens. Pricing: $3.00/1M input tokens, $7.00/1M output tokens.

$3.00 / 1M in 128K context

Hyperbolic Models

View provider details →

NousResearch/Hermes 3 Llama 3.1 70B

NousResearch/Hermes 3 Llama 3.1 70B is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 33K context

Qwen/Qwen2.5 72B Instruct

Qwen/Qwen2.5 72B Instruct is available via Hyperbolic with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 131K context

Qwen/Qwen2.5 Coder 32B Instruct

Qwen/Qwen2.5 Coder 32B Instruct is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 33K context

Meta Llama/Llama 3.2 3B Instruct

Meta Llama/Llama 3.2 3B Instruct is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 33K context

Meta Llama/Llama 3.3 70B Instruct

Meta Llama/Llama 3.3 70B Instruct is available via Hyperbolic with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 131K context

Meta Llama/Meta Llama 3 70B Instruct

Meta Llama/Meta Llama 3 70B Instruct is available via Hyperbolic with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 131K context

Meta Llama/Meta Llama 3.1 405B Instruct

Meta Llama/Meta Llama 3.1 405B Instruct is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 33K context

Meta Llama/Meta Llama 3.1 70B Instruct

Meta Llama/Meta Llama 3.1 70B Instruct is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 33K context

Meta Llama/Meta Llama 3.1 8B Instruct

Meta Llama/Meta Llama 3.1 8B Instruct is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

$0.12 / 1M in 33K context

Qwen/QwQ 32B

Qwen/QwQ 32B is available via Hyperbolic with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 131K context

Deepseek Ai/DeepSeek V3

Deepseek Ai/DeepSeek V3 is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 33K context

Deepseek Ai/DeepSeek R1 0528

Deepseek Ai/DeepSeek R1 0528 is available via Hyperbolic with a 131K context window and up to 131,072 output tokens. Pricing: $0.2500/1M input tokens, $0.2500/1M output tokens.

$0.25 / 1M in 131K context

Deepseek Ai/DeepSeek R1

Deepseek Ai/DeepSeek R1 is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $0.4000/1M output tokens.

$0.40 / 1M in 33K context

Deepseek Ai/DeepSeek V3 0324

Deepseek Ai/DeepSeek V3 0324 is available via Hyperbolic with a 33K context window and up to 32,768 output tokens. Pricing: $0.4000/1M input tokens, $0.4000/1M output tokens.

$0.40 / 1M in 33K context

Qwen/Qwen3 235B A22B

Qwen/Qwen3 235B A22B is available via Hyperbolic with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

$2.00 / 1M in 131K context

Moonshotai/Kimi K2 Instruct

Moonshotai/Kimi K2 Instruct is available via Hyperbolic with a 131K context window and up to 131,072 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

$2.00 / 1M in 131K context

Replicate Models

View provider details →

Meta/Llama 2 7b

Meta/Llama 2 7b is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.0500/1M input tokens, $0.2500/1M output tokens.

$0.050 / 1M in 4K context

Meta/Llama 2 7b Chat

Meta/Llama 2 7b Chat is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.0500/1M input tokens, $0.2500/1M output tokens.

$0.050 / 1M in 4K context

Meta/Llama 3 8b

Meta/Llama 3 8b is available via Replicate with a 8K context window and up to 8,086 output tokens. Pricing: $0.0500/1M input tokens, $0.2500/1M output tokens.

$0.050 / 1M in 8K context

Meta/Llama 3 8b Instruct

Meta/Llama 3 8b Instruct is available via Replicate with a 8K context window and up to 8,086 output tokens. Pricing: $0.0500/1M input tokens, $0.2500/1M output tokens.

$0.050 / 1M in 8K context

Mistralai/Mistral 7b Instruct V0.2

Mistralai/Mistral 7b Instruct V0.2 is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.0500/1M input tokens, $0.2500/1M output tokens.

$0.050 / 1M in 4K context

Mistralai/Mistral 7b V0.1

Mistralai/Mistral 7b V0.1 is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.0500/1M input tokens, $0.2500/1M output tokens.

$0.050 / 1M in 4K context

Meta/Llama 2 13b

Meta/Llama 2 13b is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.5000/1M output tokens.

$0.10 / 1M in 4K context

Meta/Llama 2 13b Chat

Meta/Llama 2 13b Chat is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.5000/1M output tokens.

$0.10 / 1M in 4K context

Mistralai/Mixtral 8x7b Instruct V0.1

Mistralai/Mixtral 8x7b Instruct V0.1 is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.3000/1M input tokens, $1.00/1M output tokens.

$0.30 / 1M in 4K context

Meta/Llama 2 70b

Meta/Llama 2 70b is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.6500/1M input tokens, $2.75/1M output tokens.

$0.65 / 1M in 4K context

Meta/Llama 2 70b Chat

Meta/Llama 2 70b Chat is available via Replicate with a 4K context window and up to 4,096 output tokens. Pricing: $0.6500/1M input tokens, $2.75/1M output tokens.

$0.65 / 1M in 4K context

Meta/Llama 3 70b

Meta/Llama 3 70b is available via Replicate with a 8K context window and up to 8,192 output tokens. Pricing: $0.6500/1M input tokens, $2.75/1M output tokens.

$0.65 / 1M in 8K context

Meta/Llama 3 70b Instruct

Meta/Llama 3 70b Instruct is available via Replicate with a 8K context window and up to 8,192 output tokens. Pricing: $0.6500/1M input tokens, $2.75/1M output tokens.

$0.65 / 1M in 8K context

Deepseek Ai/Deepseek V3.1

Deepseek Ai/Deepseek V3.1 is available via Replicate with a 164K context window and up to 163,840 output tokens. Pricing: $0.6720/1M input tokens, $2.02/1M output tokens.

$0.67 / 1M in 164K context

Deepseek Ai/Deepseek

Deepseek Ai/Deepseek is available via Replicate with a 66K context window and up to 8,192 output tokens. Pricing: $1.45/1M input tokens, $1.45/1M output tokens.

$1.45 / 1M in 66K context

Deepseek Ai/Deepseek R1

Deepseek Ai/Deepseek R1 is available via Replicate with a 66K context window and up to 8,192 output tokens. Pricing: $3.75/1M input tokens, $10.00/1M output tokens.

$3.75 / 1M in 66K context

SambaNova Models

View provider details →

Meta Llama 3.2 1B Instruct

Meta Llama 3.2 1B Instruct is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $0.0400/1M input tokens, $0.0800/1M output tokens.

$0.040 / 1M in 16K context

Meta Llama 3.2 3B Instruct

Meta Llama 3.2 3B Instruct is available via SambaNova with a 4K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.1600/1M output tokens.

$0.080 / 1M in 4K context

Meta Llama 3.1 8B Instruct

Meta Llama 3.1 8B Instruct is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.2000/1M output tokens.

$0.10 / 1M in 16K context

Meta Llama Guard 3 8B

Meta Llama Guard 3 8B is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $0.3000/1M input tokens, $0.3000/1M output tokens.

$0.30 / 1M in 16K context

Llama 4 Scout 17B 16E Instruct

Llama 4 Scout 17B 16E Instruct is available via SambaNova with a 8K context window and up to 8,192 output tokens. Pricing: $0.4000/1M input tokens, $0.7000/1M output tokens.

$0.40 / 1M in 8K context

Qwen3 32B

Qwen3 32B is available via SambaNova with a 8K context window and up to 8,192 output tokens. Pricing: $0.4000/1M input tokens, $0.8000/1M output tokens.

$0.40 / 1M in 8K context

QwQ 32B

QwQ 32B is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $0.5000/1M input tokens, $1.00/1M output tokens.

$0.50 / 1M in 16K context

Qwen2 Audio 7B Instruct

Qwen2 Audio 7B Instruct is available via SambaNova with a 4K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $100.00/1M output tokens.

$0.50 / 1M in 4K context

Meta Llama 3.3 70B Instruct

Meta Llama 3.3 70B Instruct is available via SambaNova with a 131K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $1.20/1M output tokens.

$0.60 / 1M in 131K context

Llama 4 Maverick 17B 128E Instruct

Llama 4 Maverick 17B 128E Instruct is available via SambaNova with a 131K context window and up to 131,072 output tokens. Pricing: $0.6300/1M input tokens, $1.80/1M output tokens.

$0.63 / 1M in 131K context

DeepSeek R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is available via SambaNova with a 131K context window and up to 131,072 output tokens. Pricing: $0.7000/1M input tokens, $1.40/1M output tokens.

$0.70 / 1M in 131K context

DeepSeek V3 0324

DeepSeek V3 0324 is available via SambaNova with a 33K context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $4.50/1M output tokens.

$3.00 / 1M in 33K context

DeepSeek V3.1

DeepSeek V3.1 is available via SambaNova with a 33K context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $4.50/1M output tokens.

$3.00 / 1M in 33K context

Gpt Oss 120b

Gpt Oss 120b is available via SambaNova with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $4.50/1M output tokens.

$3.00 / 1M in 131K context

DeepSeek R1

DeepSeek R1 is available via SambaNova with a 33K context window and up to 32,768 output tokens. Pricing: $5.00/1M input tokens, $7.00/1M output tokens.

$5.00 / 1M in 33K context

Meta Llama 3.1 405B Instruct

Meta Llama 3.1 405B Instruct is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $5.00/1M input tokens, $10.00/1M output tokens.

$5.00 / 1M in 16K context

Ovhcloud Models

View provider details →

Gpt Oss 20b

Gpt Oss 20b is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.0400/1M input tokens, $0.1500/1M output tokens.

$0.040 / 1M in 131K context

Qwen3 32B

Qwen3 32B is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.0800/1M input tokens, $0.2300/1M output tokens.

$0.080 / 1M in 32K context

Gpt Oss 120b

Gpt Oss 120b is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.0800/1M input tokens, $0.4000/1M output tokens.

$0.080 / 1M in 131K context

Mistral Small 3.2 24B Instruct 2506

Mistral Small 3.2 24B Instruct 2506 is available via Ovhcloud with a 128K context window and up to 128,000 output tokens. Pricing: $0.0900/1M input tokens, $0.2800/1M output tokens.

$0.090 / 1M in 128K context

Llama 3.1 8B Instruct

Llama 3.1 8B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 131K context

Mistral 7B Instruct V0.3

Mistral 7B Instruct V0.3 is available via Ovhcloud with a 127K context window and up to 127,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 127K context

Mistral Nemo Instruct 2407

Mistral Nemo Instruct 2407 is available via Ovhcloud with a 118K context window and up to 118,000 output tokens. Pricing: $0.1300/1M input tokens, $0.1300/1M output tokens.

$0.13 / 1M in 118K context

Mamba Codestral 7B V0.1

Mamba Codestral 7B V0.1 is available via Ovhcloud with a 256K context window and up to 256,000 output tokens. Pricing: $0.1900/1M input tokens, $0.1900/1M output tokens.

$0.19 / 1M in 256K context

Llava V1.6 Mistral 7b Hf

Llava V1.6 Mistral 7b Hf is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.2900/1M input tokens, $0.2900/1M output tokens.

$0.29 / 1M in 32K context

Mixtral 8x7B Instruct V0.1

Mixtral 8x7B Instruct V0.1 is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.6300/1M input tokens, $0.6300/1M output tokens.

$0.63 / 1M in 32K context

DeepSeek R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

$0.67 / 1M in 131K context

Meta Llama 3 1 70B Instruct

Meta Llama 3 1 70B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

$0.67 / 1M in 131K context

Meta Llama 3 3 70B Instruct

Meta Llama 3 3 70B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

$0.67 / 1M in 131K context

Qwen2.5 Coder 32B Instruct

Qwen2.5 Coder 32B Instruct is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.8700/1M input tokens, $0.8700/1M output tokens.

$0.87 / 1M in 32K context

Qwen2.5 VL 72B Instruct

Qwen2.5 VL 72B Instruct is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.9100/1M input tokens, $0.9100/1M output tokens.

$0.91 / 1M in 32K context

Llamagate Models

View provider details →

Llama 3.1 8b

Llama 3.1 8b is available via Llamagate with a 131K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0500/1M output tokens.

$0.030 / 1M in 131K context

Gemma3 4b

Gemma3 4b is available via Llamagate with a 128K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0800/1M output tokens.

$0.030 / 1M in 128K context

Llama 3.2 3b

Llama 3.2 3b is available via Llamagate with a 131K context window and up to 8,192 output tokens. Pricing: $0.0400/1M input tokens, $0.0800/1M output tokens.

$0.040 / 1M in 131K context

Qwen3 8b

Qwen3 8b is available via Llamagate with a 33K context window and up to 8,192 output tokens. Pricing: $0.0400/1M input tokens, $0.1400/1M output tokens.

$0.040 / 1M in 33K context

Qwen2.5 Coder 7b

Qwen2.5 Coder 7b is available via Llamagate with a 33K context window and up to 8,192 output tokens. Pricing: $0.0600/1M input tokens, $0.1200/1M output tokens.

$0.060 / 1M in 33K context

Deepseek Coder 6.7b

Deepseek Coder 6.7b is available via Llamagate with a 16K context window and up to 4,096 output tokens. Pricing: $0.0600/1M input tokens, $0.1200/1M output tokens.

$0.060 / 1M in 16K context

Codellama 7b

Codellama 7b is available via Llamagate with a 16K context window and up to 4,096 output tokens. Pricing: $0.0600/1M input tokens, $0.1200/1M output tokens.

$0.060 / 1M in 16K context

Dolphin3 8b

Dolphin3 8b is available via Llamagate with a 128K context window and up to 8,192 output tokens. Pricing: $0.0800/1M input tokens, $0.1500/1M output tokens.

$0.080 / 1M in 128K context

Deepseek R1 7b Qwen

Deepseek R1 7b Qwen is available via Llamagate with a 131K context window and up to 16,384 output tokens. Pricing: $0.0800/1M input tokens, $0.1500/1M output tokens.

$0.080 / 1M in 131K context

Openthinker 7b

Openthinker 7b is available via Llamagate with a 33K context window and up to 8,192 output tokens. Pricing: $0.0800/1M input tokens, $0.1500/1M output tokens.

$0.080 / 1M in 33K context

Mistral 7b V0.3

Mistral 7b V0.3 is available via Llamagate with a 33K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1500/1M output tokens.

$0.10 / 1M in 33K context

Deepseek R1 8b

Deepseek R1 8b is available via Llamagate with a 66K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.2000/1M output tokens.

$0.10 / 1M in 66K context

Llava 7b

Llava 7b is available via Llamagate with a 4K context window and up to 2,048 output tokens. Pricing: $0.1000/1M input tokens, $0.2000/1M output tokens.

$0.10 / 1M in 4K context

Qwen3 Vl 8b

Qwen3 Vl 8b is available via Llamagate with a 33K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.5500/1M output tokens.

$0.15 / 1M in 33K context

Wandb Models

View provider details →

Moonshotai/Kimi K2 Instruct

Moonshotai/Kimi K2 Instruct is available via Wandb with a 128K context window and up to 128,000 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

$0.60 / 1M in 128K context

Openai/Gpt Oss 20b

Openai/Gpt Oss 20b is available via Wandb with a 131K context window and up to 131,072 output tokens. Pricing: $5000.00/1M input tokens, $20000.00/1M output tokens.

$5000.00 / 1M in 131K context

Microsoft/Phi 4 Mini Instruct

Microsoft/Phi 4 Mini Instruct is available via Wandb with a 128K context window and up to 128,000 output tokens. Pricing: $8000.00/1M input tokens, $35000.00/1M output tokens.

$8000.00 / 1M in 128K context

Qwen/Qwen3 235B A22B Instruct 2507

Qwen/Qwen3 235B A22B Instruct 2507 is available via Wandb with a 262K context window and up to 262,144 output tokens. Pricing: $10000.00/1M input tokens, $10000.00/1M output tokens.

$10000.00 / 1M in 262K context

Qwen/Qwen3 235B A22B Thinking 2507

Qwen/Qwen3 235B A22B Thinking 2507 is available via Wandb with a 262K context window and up to 262,144 output tokens. Pricing: $10000.00/1M input tokens, $10000.00/1M output tokens.

$10000.00 / 1M in 262K context

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via Wandb with a 131K context window and up to 131,072 output tokens. Pricing: $15000.00/1M input tokens, $60000.00/1M output tokens.

$15000.00 / 1M in 131K context

Meta Llama/Llama 4 Scout 17B 16E Instruct

Meta Llama/Llama 4 Scout 17B 16E Instruct is available via Wandb with a 64K context window and up to 64,000 output tokens. Pricing: $17000.00/1M input tokens, $66000.00/1M output tokens.

$17000.00 / 1M in 64K context

Meta Llama/Llama 3.1 8B Instruct

Meta Llama/Llama 3.1 8B Instruct is available via Wandb with a 128K context window and up to 128,000 output tokens. Pricing: $22000.00/1M input tokens, $22000.00/1M output tokens.

$22000.00 / 1M in 128K context

Zai Org/GLM 4.5

Zai Org/GLM 4.5 is available via Wandb with a 131K context window and up to 131,072 output tokens. Pricing: $55000.00/1M input tokens, $200000.00/1M output tokens.

$55000.00 / 1M in 131K context

Deepseek Ai/DeepSeek V3.1

Deepseek Ai/DeepSeek V3.1 is available via Wandb with a 128K context window and up to 128,000 output tokens. Pricing: $55000.00/1M input tokens, $165000.00/1M output tokens.

$55000.00 / 1M in 128K context

Meta Llama/Llama 3.3 70B Instruct

Meta Llama/Llama 3.3 70B Instruct is available via Wandb with a 128K context window and up to 128,000 output tokens. Pricing: $71000.00/1M input tokens, $71000.00/1M output tokens.

$71000.00 / 1M in 128K context

Qwen/Qwen3 Coder 480B A35B Instruct

Qwen/Qwen3 Coder 480B A35B Instruct is available via Wandb with a 262K context window and up to 262,144 output tokens. Pricing: $100000.00/1M input tokens, $150000.00/1M output tokens.

$100000.00 / 1M in 262K context

Deepseek Ai/DeepSeek V3 0324

Deepseek Ai/DeepSeek V3 0324 is available via Wandb with a 161K context window and up to 161,000 output tokens. Pricing: $114000.00/1M input tokens, $275000.00/1M output tokens.

$114000.00 / 1M in 161K context

Deepseek Ai/DeepSeek R1 0528

Deepseek Ai/DeepSeek R1 0528 is available via Wandb with a 161K context window and up to 161,000 output tokens. Pricing: $135000.00/1M input tokens, $540000.00/1M output tokens.

$135000.00 / 1M in 161K context

Anyscale Models

View provider details →

HuggingFaceH4/Zephyr 7b Beta

HuggingFaceH4/Zephyr 7b Beta is available via Anyscale with a 16K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 16K context

Google/Gemma 7b It

Google/Gemma 7b It is available via Anyscale with a 8K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 8K context

Meta Llama/Llama 2 7b Chat Hf

Meta Llama/Llama 2 7b Chat Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 4K context

Meta Llama/Meta Llama 3 8B Instruct

Meta Llama/Meta Llama 3 8B Instruct is available via Anyscale with a 8K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 8K context

Mistralai/Mistral 7B Instruct V0.1

Mistralai/Mistral 7B Instruct V0.1 is available via Anyscale with a 16K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 16K context

Mistralai/Mixtral 8x7B Instruct V0.1

Mistralai/Mixtral 8x7B Instruct V0.1 is available via Anyscale with a 16K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

$0.15 / 1M in 16K context

Meta Llama/Llama 2 13b Chat Hf

Meta Llama/Llama 2 13b Chat Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $0.2500/1M output tokens.

$0.25 / 1M in 4K context

Mistralai/Mixtral 8x22B Instruct V0.1

Mistralai/Mixtral 8x22B Instruct V0.1 is available via Anyscale with a 66K context window and up to 65,536 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

$0.90 / 1M in 66K context

Codellama/CodeLlama 34b Instruct Hf

Codellama/CodeLlama 34b Instruct Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 4K context

Codellama/CodeLlama 70b Instruct Hf

Codellama/CodeLlama 70b Instruct Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 4K context

Meta Llama/Llama 2 70b Chat Hf

Meta Llama/Llama 2 70b Chat Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 4K context

Meta Llama/Meta Llama 3 70B Instruct

Meta Llama/Meta Llama 3 70B Instruct is available via Anyscale with a 8K context window and up to 8,192 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

$1.00 / 1M in 8K context

Groq Models

View provider details →

Llama 3.1 8b Instant

Llama 3.1 8b Instant is available via Groq with a 128K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.0800/1M output tokens.

$0.050 / 1M in 128K context

Gemma 7b It

Gemma 7b It is available via Groq with a 8K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.0800/1M output tokens.

$0.050 / 1M in 8K context

Openai/Gpt Oss 20b

Openai/Gpt Oss 20b is available via Groq with a 131K context window and up to 32,768 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 131K context

Openai/Gpt Oss Safeguard 20b

Openai/Gpt Oss Safeguard 20b is available via Groq with a 131K context window and up to 65,536 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 131K context

Meta Llama/Llama 4 Scout 17b 16e Instruct

Meta Llama/Llama 4 Scout 17b 16e Instruct is available via Groq with a 131K context window and up to 8,192 output tokens. Pricing: $0.1100/1M input tokens, $0.3400/1M output tokens.

$0.11 / 1M in 131K context

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via Groq with a 131K context window and up to 32,766 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Meta Llama/Llama Guard 4 12b

Meta Llama/Llama Guard 4 12b is available via Groq with a 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.

$0.20 / 1M in 8K context

Meta Llama/Llama 4 Maverick 17b 128e Instruct

Meta Llama/Llama 4 Maverick 17b 128e Instruct is available via Groq with a 131K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

$0.20 / 1M in 131K context

Qwen/Qwen3 32b

Qwen/Qwen3 32b is available via Groq with a 131K context window and up to 131,000 output tokens. Pricing: $0.2900/1M input tokens, $0.5900/1M output tokens.

$0.29 / 1M in 131K context

Llama 3.3 70b Versatile

Llama 3.3 70b Versatile is available via Groq with a 128K context window and up to 32,768 output tokens. Pricing: $0.5900/1M input tokens, $0.7900/1M output tokens.

$0.59 / 1M in 128K context

Moonshotai/Kimi K2 Instruct 0905

Moonshotai/Kimi K2 Instruct 0905 is available via Groq with a 262K context window and up to 16,384 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

$1.00 / 1M in 262K context

Zai Models

View provider details →

Glm 4.5 Flash

Glm 4.5 Flash is available via Zai with a 128K context window and up to 32,000 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

Glm 4 32b 0414 128k

Glm 4 32b 0414 128k is available via Zai with a 128K context window and up to 32,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 128K context

Glm 4.5 Air

Glm 4.5 Air is available via Zai with a 128K context window and up to 32,000 output tokens. Pricing: $0.2000/1M input tokens, $1.10/1M output tokens.

$0.20 / 1M in 128K context

Glm 4.7

Glm 4.7 is available via Zai with a 200K context window and up to 128,000 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 200K context

Glm 4.6

Glm 4.6 is available via Zai with a 200K context window and up to 128,000 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 200K context

Glm 4.5

Glm 4.5 is available via Zai with a 128K context window and up to 32,000 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

$0.60 / 1M in 128K context

Glm 4.5v

Glm 4.5v is available via Zai with a 128K context window and up to 32,000 output tokens. Pricing: $0.6000/1M input tokens, $1.80/1M output tokens.

$0.60 / 1M in 128K context

Glm 5

Glm 5 is available via Zai with a 200K context window and up to 128,000 output tokens. Pricing: $1.00/1M input tokens, $3.20/1M output tokens.

$1.00 / 1M in 200K context

Glm 4.5 Airx

Glm 4.5 Airx is available via Zai with a 128K context window and up to 32,000 output tokens. Pricing: $1.10/1M input tokens, $4.50/1M output tokens.

$1.10 / 1M in 128K context

Glm 5 Code

Glm 5 Code is available via Zai with a 200K context window and up to 128,000 output tokens. Pricing: $1.20/1M input tokens, $5.00/1M output tokens.

$1.20 / 1M in 200K context

Glm 4.5 X

Glm 4.5 X is available via Zai with a 128K context window and up to 32,000 output tokens. Pricing: $2.20/1M input tokens, $8.90/1M output tokens.

$2.20 / 1M in 128K context

AI21 Models

View provider details →

Jamba 1.5

Jamba 1.5 is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Jamba 1.5 Mini

Jamba 1.5 Mini is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Jamba 1.5 Mini

Jamba 1.5 Mini is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Jamba Mini 1.6

Jamba Mini 1.6 is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Jamba Mini 1.7

Jamba Mini 1.7 is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.4000/1M output tokens.

$0.20 / 1M in 256K context

Jamba 1.5 Large

Jamba 1.5 Large is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 256K context

Jamba 1.5 Large

Jamba 1.5 Large is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 256K context

Jamba Large 1.6

Jamba Large 1.6 is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 256K context

Jamba Large 1.7

Jamba Large 1.7 is available via AI21 with a 256K context window and up to 256,000 output tokens. Pricing: $2.00/1M input tokens, $8.00/1M output tokens.

$2.00 / 1M in 256K context

Publicai Models

View provider details →

Swiss Ai/Apertus 8b Instruct

Swiss Ai/Apertus 8b Instruct is available via Publicai with a 8K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Swiss Ai/Apertus 70b Instruct

Swiss Ai/Apertus 70b Instruct is available via Publicai with a 8K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Aisingapore/Gemma SEA LION V4 27B IT

Aisingapore/Gemma SEA LION V4 27B IT is available via Publicai with a 8K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

BSC LT/Salamandra 7b Instruct Tools 16k

BSC LT/Salamandra 7b Instruct Tools 16k is available via Publicai with a 16K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 16K context

BSC LT/ALIA 40b Instruct Q8 0

BSC LT/ALIA 40b Instruct Q8 0 is available via Publicai with a 8K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Allenai/Olmo 3 7B Instruct

Allenai/Olmo 3 7B Instruct is available via Publicai with a 33K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Aisingapore/Qwen SEA LION V4 32B IT

Aisingapore/Qwen SEA LION V4 32B IT is available via Publicai with a 33K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Allenai/Olmo 3 7B Think

Allenai/Olmo 3 7B Think is available via Publicai with a 33K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

Allenai/Olmo 3 32B Think

Allenai/Olmo 3 32B Think is available via Publicai with a 33K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 33K context

DeepSeek Models

View provider details →

Deepseek Coder

Deepseek Coder is available via DeepSeek with a 128K context window and up to 4,096 output tokens. Pricing: $0.1400/1M input tokens, $0.2800/1M output tokens.

$0.14 / 1M in 128K context

Deepseek

Deepseek is available via DeepSeek with a 66K context window and up to 8,192 output tokens. Pricing: $0.2700/1M input tokens, $1.10/1M output tokens.

$0.27 / 1M in 66K context

Deepseek Chat

NEW

DeepSeek V3 is a 671B Mixture-of-Experts model that matches GPT-4o performance at a dramatically lower price. Its efficient architecture activates only 37B parameters per token, enabling fast inference at low cost.

$0.28 / 1M in 131K context

Deepseek Reasoner

NEW

DeepSeek R1 is a reasoning model that uses chain-of-thought to solve complex math and coding problems. It achieves performance comparable to OpenAI o1 on major benchmarks at roughly 95% less cost.

$0.28 / 1M in 131K context

Deepseek Chat

Deepseek Chat is available via DeepSeek with a 131K context window and up to 8,192 output tokens. Pricing: $0.2800/1M input tokens, $0.4200/1M output tokens.

$0.28 / 1M in 131K context

Deepseek Reasoner

Deepseek Reasoner is available via DeepSeek with a 131K context window and up to 65,536 output tokens. Pricing: $0.2800/1M input tokens, $0.4200/1M output tokens.

$0.28 / 1M in 131K context

Deepseek V3.2

Deepseek V3.2 is available via DeepSeek with a 164K context window and up to 163,840 output tokens. Pricing: $0.2800/1M input tokens, $0.4000/1M output tokens.

$0.28 / 1M in 164K context

Deepseek R1

Deepseek R1 is available via DeepSeek with a 66K context window and up to 8,192 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.

$0.55 / 1M in 66K context

Cerebras Models

View provider details →

Llama3.1 8b

Llama3.1 8b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 128K context

Gpt Oss 120b

Gpt Oss 120b is available via Cerebras with a 131K context window and up to 32,768 output tokens. Pricing: $0.3500/1M input tokens, $0.7500/1M output tokens.

$0.35 / 1M in 131K context

Qwen 3 32b

Qwen 3 32b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.4000/1M input tokens, $0.8000/1M output tokens.

$0.40 / 1M in 128K context

Llama3.1 70b

Llama3.1 70b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.6000/1M input tokens, $0.6000/1M output tokens.

$0.60 / 1M in 128K context

Llama 3.3 70b

Llama 3.3 70b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.8500/1M input tokens, $1.20/1M output tokens.

$0.85 / 1M in 128K context

Zai Glm 4.6

Zai Glm 4.6 is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $2.25/1M input tokens, $2.75/1M output tokens.

$2.25 / 1M in 128K context

Zai Glm 4.7

Zai Glm 4.7 is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $2.25/1M input tokens, $2.75/1M output tokens.

$2.25 / 1M in 128K context

Cohere Models

View provider details →

Command R

Command R is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Command R 08 2024

Command R 08 2024 is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 128K context

Command R7b 12 2024

Command R7b 12 2024 is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.0375/1M output tokens.

$0.15 / 1M in 128K context

Command Light

Command Light is available via Cohere with a 4K context window and up to 4,096 output tokens. Pricing: $0.3000/1M input tokens, $0.6000/1M output tokens.

$0.30 / 1M in 4K context

Command A 03 2025

Command A 03 2025 is available via Cohere with a 256K context window and up to 8,000 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 256K context

Command R Plus

Command R Plus is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Command R Plus 08 2024

Command R Plus 08 2024 is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

$2.50 / 1M in 128K context

Lemonade Models

View provider details →

Qwen3 Coder 30B A3B Instruct GGUF

Qwen3 Coder 30B A3B Instruct GGUF is available via Lemonade with a 262K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 262K context

Gpt Oss 20b Mxfp4 GGUF

Gpt Oss 20b Mxfp4 GGUF is available via Lemonade with a 131K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 131K context

Gpt Oss 120b Mxfp GGUF

Gpt Oss 120b Mxfp GGUF is available via Lemonade with a 131K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 131K context

Gemma 3 4b It GGUF

Gemma 3 4b It GGUF is available via Lemonade with a 128K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

Qwen3 4B Instruct 2507 GGUF

Qwen3 4B Instruct 2507 GGUF is available via Lemonade with a 262K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 262K context

Minimax Models

View provider details →

MiniMax M2.1

MiniMax M2.1 is available via Minimax with a 1M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 1M context

MiniMax M2.1 Lightning

MiniMax M2.1 Lightning is available via Minimax with a 1M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.40/1M output tokens.

$0.30 / 1M in 1M context

MiniMax M2.5

MiniMax M2.5 is available via Minimax with a 1M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 1M context

MiniMax M2.5 Lightning

MiniMax M2.5 Lightning is available via Minimax with a 1M context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $2.40/1M output tokens.

$0.30 / 1M in 1M context

MiniMax M2

MiniMax M2 is available via Minimax with a 200K context window and up to 8,192 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

$0.30 / 1M in 200K context

Amazon Nova Models

View provider details →

Nova Micro

Nova Micro is available via Amazon Nova with a 128K context window and up to 10,000 output tokens. Pricing: $0.0350/1M input tokens, $0.1400/1M output tokens.

$0.035 / 1M in 128K context

Nova Lite

Nova Lite is available via Amazon Nova with a 300K context window and up to 10,000 output tokens. Pricing: $0.0600/1M input tokens, $0.2400/1M output tokens.

$0.060 / 1M in 300K context

Nova Pro

Nova Pro is available via Amazon Nova with a 300K context window and up to 10,000 output tokens. Pricing: $0.8000/1M input tokens, $3.20/1M output tokens.

$0.80 / 1M in 300K context

Nova Premier

Nova Premier is available via Amazon Nova with a 1M context window and up to 10,000 output tokens. Pricing: $2.50/1M input tokens, $12.50/1M output tokens.

$2.50 / 1M in 1M context

Bedrock Mantle Models

View provider details →

Openai.Gpt Oss 20b

Openai.Gpt Oss 20b is available via Bedrock Mantle with a 131K context window and up to 32,768 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 131K context

Openai.Gpt Oss Safeguard 20b

Openai.Gpt Oss Safeguard 20b is available via Bedrock Mantle with a 131K context window and up to 65,536 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

$0.075 / 1M in 131K context

Openai.Gpt Oss 120b

Openai.Gpt Oss 120b is available via Bedrock Mantle with a 131K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Openai.Gpt Oss Safeguard 120b

Openai.Gpt Oss Safeguard 120b is available via Bedrock Mantle with a 131K context window and up to 65,536 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

$0.15 / 1M in 131K context

Cloudflare Models

View provider details →

@Cf/Meta/Llama 2 7b Chat Fp16

@Cf/Meta/Llama 2 7b Chat Fp16 is available via Cloudflare with a 3K context window and up to 3,072 output tokens. Pricing: $1.92/1M input tokens, $1.92/1M output tokens.

$1.92 / 1M in 3K context

@Cf/Meta/Llama 2 7b Chat Int8

@Cf/Meta/Llama 2 7b Chat Int8 is available via Cloudflare with a 2K context window and up to 2,048 output tokens. Pricing: $1.92/1M input tokens, $1.92/1M output tokens.

$1.92 / 1M in 2K context

@Cf/Mistral/Mistral 7b Instruct V0.1

@Cf/Mistral/Mistral 7b Instruct V0.1 is available via Cloudflare with a 8K context window and up to 8,192 output tokens. Pricing: $1.92/1M input tokens, $1.92/1M output tokens.

$1.92 / 1M in 8K context

@Hf/Thebloke/Codellama 7b Instruct Awq

@Hf/Thebloke/Codellama 7b Instruct Awq is available via Cloudflare with a 4K context window and up to 4,096 output tokens. Pricing: $1.92/1M input tokens, $1.92/1M output tokens.

$1.92 / 1M in 4K context

Gigachat Models

View provider details →

GigaChat 2 Lite

GigaChat 2 Lite is available via Gigachat with a 128K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

GigaChat 2 Max

GigaChat 2 Max is available via Gigachat with a 128K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

GigaChat 2 Pro

GigaChat 2 Pro is available via Gigachat with a 128K context window and up to 8,192 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 128K context

AWS SageMaker Models

View provider details →

Meta Textgeneration Llama 2 13b F

Meta Textgeneration Llama 2 13b F is available via AWS SageMaker with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 4K context

Meta Textgeneration Llama 2 70b B F

Meta Textgeneration Llama 2 70b B F is available via AWS SageMaker with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 4K context

Meta Textgeneration Llama 2 7b F

Meta Textgeneration Llama 2 7b F is available via AWS SageMaker with a 4K context window and up to 4,096 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 4K context

V0 Models

View provider details →

V0 1.0 Md

V0 1.0 Md is available via V0 with a 128K context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 128K context

V0 1.5 Md

V0 1.5 Md is available via V0 with a 128K context window and up to 128,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

$3.00 / 1M in 128K context

V0 1.5 Lg

V0 1.5 Lg is available via V0 with a 512K context window and up to 512,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

$15.00 / 1M in 512K context

Volcengine Models

View provider details →

Deepseek V3 2 251201

Deepseek V3 2 251201 is available via Volcengine with a 98K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 98K context

Glm 4 7 251222

Glm 4 7 251222 is available via Volcengine with a 205K context window and up to 131,072 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 205K context

Kimi K2 Thinking 251104

Kimi K2 Thinking 251104 is available via Volcengine with a 229K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 229K context

FriendliAI Models

View provider details →

Meta Llama 3.1 8b Instruct

Meta Llama 3.1 8b Instruct is available via FriendliAI with a 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

$0.10 / 1M in 8K context

Meta Llama 3.1 70b Instruct

Meta Llama 3.1 70b Instruct is available via FriendliAI with a 8K context window and up to 8,192 output tokens. Pricing: $0.6000/1M input tokens, $0.6000/1M output tokens.

$0.60 / 1M in 8K context

Morph Models

View provider details →

Morph V3 Fast

Morph V3 Fast is available via Morph with a 16K context window and up to 16,000 output tokens. Pricing: $0.8000/1M input tokens, $1.20/1M output tokens.

$0.80 / 1M in 16K context

Morph V3 Large

Morph V3 Large is available via Morph with a 16K context window and up to 16,000 output tokens. Pricing: $0.9000/1M input tokens, $1.90/1M output tokens.

$0.90 / 1M in 16K context

Palm Models

View provider details →

Chat Bison

Chat Bison is available via Palm with a 8K context window and up to 4,096 output tokens. Pricing: $0.1250/1M input tokens, $0.1250/1M output tokens.

$0.13 / 1M in 8K context

Chat Bison 001

Chat Bison 001 is available via Palm with a 8K context window and up to 4,096 output tokens. Pricing: $0.1250/1M input tokens, $0.1250/1M output tokens.

$0.13 / 1M in 8K context

NLP Cloud Models

View provider details →

Chatdolphin

Chatdolphin is available via NLP Cloud with a 16K context window and up to 16,384 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.

$0.50 / 1M in 16K context

Sarvam Models

View provider details →

Sarvam M

Sarvam M is available via Sarvam with a 8K context window and up to 32,000 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.

$0.000 / 1M in 8K context

Calculate token costs for any model

Use our free tools to count tokens, compare pricing, and estimate API costs.

Token Counter Pricing Calculator Compare Models