Skip to content

Novita AI Models

Novita AI provides 80 AI models accessible via API.

Visit Novita AI →

80

Models Available

$0.020

Cheapest Input / 1M

1.0M

Largest Context

What is Novita AI?

Novita AI is an AI model provider offering 80 large language models for developers. Their cheapest model starts at $0.020 per 1M input tokens, and their largest context window reaches 1.0M. Novita AI provides 80 AI models accessible via API.

Novita AI Strengths

All Novita AI Models

Model Input $/1M Output $/1M Context Max Output Released
Paddlepaddle/Paddleocr Vl $0.020 $0.020 16K 16,384
Meta Llama/Llama 3.1 8b Instruct $0.020 $0.050 16K 16,384
Deepseek/Deepseek Ocr $0.030 $0.030 8K 8,192
Qwen/Qwen3 4b Fp8 $0.030 $0.030 128K 20,000
Meta Llama/Llama 3.2 3b Instruct $0.030 $0.050 33K 32,000
Zai Org/Autoglm Phone 9b Multilingual $0.035 $0.14 66K 65,536
Qwen/Qwen3 8b Fp8 $0.035 $0.14 128K 20,000
Openai/Gpt Oss 20b $0.040 $0.15 131K 32,768
Mistralai/Mistral Nemo $0.040 $0.17 60K 16,000
Meta Llama/Llama 3 8b Instruct $0.040 $0.040 8K 8,192
Openai/Gpt Oss 120b $0.050 $0.25 131K 32,768
Google/Gemma 3 12b It $0.050 $0.10 131K 8,192
Sao10k/L3 8b Lunaris $0.050 $0.050 8K 8,192
Sao10K/L3 8B Stheno V3.2 $0.050 $0.050 8K 32,000
Deepseek/Deepseek R1 0528 Qwen3 8b $0.060 $0.090 128K 32,000
Qwen/Qwen3 Coder 30b A3b Instruct $0.070 $0.27 160K 32,768
Baidu/Ernie 4.5 21B A3b Thinking $0.070 $0.28 131K 65,536
Baichuan/Baichuan M2 32b $0.070 $0.070 131K 131,072
Baidu/Ernie 4.5 21B A3b $0.070 $0.28 120K 8,000
Qwen/Qwen2.5 7b Instruct $0.070 $0.070 32K 32,000
Qwen/Qwen3 Vl 8b Instruct $0.080 $0.50 131K 32,768
Qwen/Qwen3 235b A22b Instruct 2507 $0.090 $0.58 131K 16,384
Qwen/Qwen3 30b A3b Fp8 $0.090 $0.45 41K 20,000
Gryphe/Mythomax L2 13b $0.090 $0.090 4K 3,200
Xiaomimimo/Mimo V2 Flash $0.10 $0.30 262K 32,000
Qwen/Qwen3 32b Fp8 $0.10 $0.45 41K 20,000
Google/Gemma 3 27b It $0.12 $0.20 98K 16,384
Zai Org/Glm 4.5 Air $0.13 $0.85 131K 98,304
Meta Llama/Llama 3.3 70b Instruct $0.14 $0.40 131K 120,000
Nousresearch/Hermes 2 Pro Llama 3 8b $0.14 $0.14 8K 8,192
Baidu/Ernie 4.5 Vl 28b A3b $0.14 $0.56 30K 8,000
Qwen/Qwen3 Next 80b A3b Instruct $0.15 $1.50 131K 32,768
Qwen/Qwen3 Next 80b A3b Thinking $0.15 $1.50 131K 32,768
Deepseek/Deepseek R1 Distill Qwen 14b $0.15 $0.15 33K 16,384
Meta Llama/Llama 4 Scout 17b 16e Instruct $0.18 $0.59 131K 131,072
Skywork/R1v4 Lite $0.20 $0.60 262K 65,536
Qwen/Qwen3 235b A22b Fp8 $0.20 $0.80 41K 20,000
Qwen/Qwen3 Vl 30b A3b Instruct $0.20 $0.70 131K 32,768
Qwen/Qwen3 Vl 30b A3b Thinking $0.20 $1.00 131K 32,768
Qwen/Qwen3 Omni 30b A3b Thinking $0.25 $0.97 66K 16,384
Qwen/Qwen3 Omni 30b A3b Instruct $0.25 $0.97 66K 16,384
Qwen/Qwen Mt Plus $0.25 $0.75 16K 8,192
Deepseek/Deepseek V3.2 $0.27 $0.40 164K 65,536
Deepseek/Deepseek V3.2 Exp $0.27 $0.41 164K 65,536
Deepseek/Deepseek V3.1 Terminus $0.27 $1.00 131K 32,768
Deepseek/Deepseek V3.1 $0.27 $1.00 131K 32,768
Deepseek/Deepseek V3 0324 $0.27 $1.12 164K 163,840
Meta Llama/Llama 4 Maverick 17b 128e Instruct Fp8 $0.27 $0.85 1.0M 8,192
Baidu/Ernie 4.5 300b A47b Paddle $0.28 $1.10 123K 12,000
Minimax/Minimax M2.1 $0.30 $1.20 205K 131,072
Minimax/Minimax M2 $0.30 $1.20 205K 131,072
Zai Org/Glm 4.6v $0.30 $0.90 131K 32,768
Kwaipilot/Kat Coder Pro $0.30 $1.20 256K 128,000
Qwen/Qwen3 Vl 235b A22b Instruct $0.30 $1.50 131K 32,768
Qwen/Qwen3 Coder 480b A35b Instruct $0.30 $1.30 262K 65,536
Qwen/Qwen3 235b A22b Thinking 2507 $0.30 $3.00 131K 32,768
Deepseek/Deepseek R1 Distill Qwen 32b $0.30 $0.30 64K 32,000
Qwen/Qwen 2.5 72b Instruct $0.38 $0.40 32K 8,192
Baidu/Ernie 4.5 Vl 28b A3b Thinking $0.39 $0.39 131K 65,536
Deepseek/Deepseek V3 Turbo $0.40 $1.30 64K 16,000
Baidu/Ernie 4.5 Vl 424b A47b $0.42 $1.25 123K 16,000
Meta Llama/Llama 3 70b Instruct $0.51 $0.74 8K 8,000
Zai Org/Glm 4.6 $0.55 $2.20 205K 131,072
Minimaxai/Minimax M1 80k $0.55 $2.20 1M 40,000
Moonshotai/Kimi K2 Instruct $0.57 $2.30 131K 131,072
Zai Org/Glm 4.7 $0.60 $2.20 205K 131,072
Moonshotai/Kimi K2 Thinking $0.60 $2.50 262K 262,144
Moonshotai/Kimi K2 0905 $0.60 $2.50 262K 262,144
Zai Org/Glm 4.5 $0.60 $2.20 131K 98,304
Zai Org/Glm 4.5v $0.60 $1.80 66K 16,384
Microsoft/Wizardlm 2 8x22b $0.62 $0.62 66K 8,000
Deepseek/Deepseek R1 0528 $0.70 $2.50 164K 32,768
Deepseek/Deepseek Prover V2 671b $0.70 $2.50 160K 160,000
Deepseek/Deepseek R1 Turbo $0.70 $2.50 64K 16,000
Deepseek/Deepseek R1 Distill Llama 70b $0.80 $0.80 8K 8,192
Qwen/Qwen2.5 Vl 72b Instruct $0.80 $0.80 33K 32,768
Qwen/Qwen3 Vl 235b A22b Thinking $0.98 $3.95 131K 32,768
Sao10k/L3 70b Euryale V2.1 $1.48 $1.48 8K 8,192
Sao10k/L31 70b Euryale V2.2 $1.48 $1.48 8K 8,192
Qwen/Qwen3 Max $2.11 $8.45 262K 65,536

Model Details

Paddlepaddle/Paddleocr Vl

Paddlepaddle/Paddleocr Vl is available via Novita AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.0200/1M input tokens, $0.0200/1M output tokens.

Input: $0.020/1M Output: $0.020/1M Context: 16K
text vision

Meta Llama/Llama 3.1 8b Instruct

Meta Llama/Llama 3.1 8b Instruct is available via Novita AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.0200/1M input tokens, $0.0500/1M output tokens.

Input: $0.020/1M Output: $0.050/1M Context: 16K
text

Deepseek/Deepseek Ocr

Deepseek/Deepseek Ocr is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0300/1M output tokens.

Input: $0.030/1M Output: $0.030/1M Context: 8K
text vision json mode

Qwen/Qwen3 4b Fp8

Qwen/Qwen3 4b Fp8 is available via Novita AI with a 128K context window and up to 20,000 output tokens. Pricing: $0.0300/1M input tokens, $0.0300/1M output tokens.

Input: $0.030/1M Output: $0.030/1M Context: 128K
text reasoning

Meta Llama/Llama 3.2 3b Instruct

Meta Llama/Llama 3.2 3b Instruct is available via Novita AI with a 33K context window and up to 32,000 output tokens. Pricing: $0.0300/1M input tokens, $0.0500/1M output tokens.

Input: $0.030/1M Output: $0.050/1M Context: 33K
text function calling

Zai Org/Autoglm Phone 9b Multilingual

Zai Org/Autoglm Phone 9b Multilingual is available via Novita AI with a 66K context window and up to 65,536 output tokens. Pricing: $0.0350/1M input tokens, $0.1380/1M output tokens.

Input: $0.035/1M Output: $0.14/1M Context: 66K
text vision

Qwen/Qwen3 8b Fp8

Qwen/Qwen3 8b Fp8 is available via Novita AI with a 128K context window and up to 20,000 output tokens. Pricing: $0.0350/1M input tokens, $0.1380/1M output tokens.

Input: $0.035/1M Output: $0.14/1M Context: 128K
text reasoning

Openai/Gpt Oss 20b

Openai/Gpt Oss 20b is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.0400/1M input tokens, $0.1500/1M output tokens.

Input: $0.040/1M Output: $0.15/1M Context: 131K
text vision reasoning json mode

Mistralai/Mistral Nemo

Mistralai/Mistral Nemo is available via Novita AI with a 60K context window and up to 16,000 output tokens. Pricing: $0.0400/1M input tokens, $0.1700/1M output tokens.

Input: $0.040/1M Output: $0.17/1M Context: 60K
text json mode

Meta Llama/Llama 3 8b Instruct

Meta Llama/Llama 3 8b Instruct is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.0400/1M input tokens, $0.0400/1M output tokens.

Input: $0.040/1M Output: $0.040/1M Context: 8K
text

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.0500/1M input tokens, $0.2500/1M output tokens.

Input: $0.050/1M Output: $0.25/1M Context: 131K
text vision function calling reasoning json mode

Google/Gemma 3 12b It

Google/Gemma 3 12b It is available via Novita AI with a 131K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.1000/1M output tokens.

Input: $0.050/1M Output: $0.10/1M Context: 131K
text vision json mode

Sao10k/L3 8b Lunaris

Sao10k/L3 8b Lunaris is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.0500/1M output tokens.

Input: $0.050/1M Output: $0.050/1M Context: 8K
text json mode

Sao10K/L3 8B Stheno V3.2

Sao10K/L3 8B Stheno V3.2 is available via Novita AI with a 8K context window and up to 32,000 output tokens. Pricing: $0.0500/1M input tokens, $0.0500/1M output tokens.

Input: $0.050/1M Output: $0.050/1M Context: 8K
text function calling

Deepseek/Deepseek R1 0528 Qwen3 8b

Deepseek/Deepseek R1 0528 Qwen3 8b is available via Novita AI with a 128K context window and up to 32,000 output tokens. Pricing: $0.0600/1M input tokens, $0.0900/1M output tokens.

Input: $0.060/1M Output: $0.090/1M Context: 128K
text reasoning

Qwen/Qwen3 Coder 30b A3b Instruct

Qwen/Qwen3 Coder 30b A3b Instruct is available via Novita AI with a 160K context window and up to 32,768 output tokens. Pricing: $0.0700/1M input tokens, $0.2700/1M output tokens.

Input: $0.070/1M Output: $0.27/1M Context: 160K
text function calling json mode

Baidu/Ernie 4.5 21B A3b Thinking

Baidu/Ernie 4.5 21B A3b Thinking is available via Novita AI with a 131K context window and up to 65,536 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

Input: $0.070/1M Output: $0.28/1M Context: 131K
text reasoning

Baichuan/Baichuan M2 32b

Baichuan/Baichuan M2 32b is available via Novita AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.0700/1M input tokens, $0.0700/1M output tokens.

Input: $0.070/1M Output: $0.070/1M Context: 131K
text

Baidu/Ernie 4.5 21B A3b

Baidu/Ernie 4.5 21B A3b is available via Novita AI with a 120K context window and up to 8,000 output tokens. Pricing: $0.0700/1M input tokens, $0.2800/1M output tokens.

Input: $0.070/1M Output: $0.28/1M Context: 120K
text function calling

Qwen/Qwen2.5 7b Instruct

Qwen/Qwen2.5 7b Instruct is available via Novita AI with a 32K context window and up to 32,000 output tokens. Pricing: $0.0700/1M input tokens, $0.0700/1M output tokens.

Input: $0.070/1M Output: $0.070/1M Context: 32K
text function calling json mode

Qwen/Qwen3 Vl 8b Instruct

Qwen/Qwen3 Vl 8b Instruct is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.0800/1M input tokens, $0.5000/1M output tokens.

Input: $0.080/1M Output: $0.50/1M Context: 131K
text vision function calling json mode

Qwen/Qwen3 235b A22b Instruct 2507

Qwen/Qwen3 235b A22b Instruct 2507 is available via Novita AI with a 131K context window and up to 16,384 output tokens. Pricing: $0.0900/1M input tokens, $0.5800/1M output tokens.

Input: $0.090/1M Output: $0.58/1M Context: 131K
text function calling json mode

Qwen/Qwen3 30b A3b Fp8

Qwen/Qwen3 30b A3b Fp8 is available via Novita AI with a 41K context window and up to 20,000 output tokens. Pricing: $0.0900/1M input tokens, $0.4500/1M output tokens.

Input: $0.090/1M Output: $0.45/1M Context: 41K
text reasoning

Gryphe/Mythomax L2 13b

Gryphe/Mythomax L2 13b is available via Novita AI with a 4K context window and up to 3,200 output tokens. Pricing: $0.0900/1M input tokens, $0.0900/1M output tokens.

Input: $0.090/1M Output: $0.090/1M Context: 4K
text

Xiaomimimo/Mimo V2 Flash

Xiaomimimo/Mimo V2 Flash is available via Novita AI with a 262K context window and up to 32,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

Input: $0.10/1M Output: $0.30/1M Context: 262K
text function calling reasoning json mode

Qwen/Qwen3 32b Fp8

Qwen/Qwen3 32b Fp8 is available via Novita AI with a 41K context window and up to 20,000 output tokens. Pricing: $0.1000/1M input tokens, $0.4500/1M output tokens.

Input: $0.10/1M Output: $0.45/1M Context: 41K
text reasoning

Google/Gemma 3 27b It

Google/Gemma 3 27b It is available via Novita AI with a 98K context window and up to 16,384 output tokens. Pricing: $0.1190/1M input tokens, $0.2000/1M output tokens.

Input: $0.12/1M Output: $0.20/1M Context: 98K
text vision

Zai Org/Glm 4.5 Air

Zai Org/Glm 4.5 Air is available via Novita AI with a 131K context window and up to 98,304 output tokens. Pricing: $0.1300/1M input tokens, $0.8500/1M output tokens.

Input: $0.13/1M Output: $0.85/1M Context: 131K
text function calling reasoning

Meta Llama/Llama 3.3 70b Instruct

Meta Llama/Llama 3.3 70b Instruct is available via Novita AI with a 131K context window and up to 120,000 output tokens. Pricing: $0.1350/1M input tokens, $0.4000/1M output tokens.

Input: $0.14/1M Output: $0.40/1M Context: 131K
text function calling

Nousresearch/Hermes 2 Pro Llama 3 8b

Nousresearch/Hermes 2 Pro Llama 3 8b is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.1400/1M input tokens, $0.1400/1M output tokens.

Input: $0.14/1M Output: $0.14/1M Context: 8K
text json mode

Baidu/Ernie 4.5 Vl 28b A3b

Baidu/Ernie 4.5 Vl 28b A3b is available via Novita AI with a 30K context window and up to 8,000 output tokens. Pricing: $0.1400/1M input tokens, $0.5600/1M output tokens.

Input: $0.14/1M Output: $0.56/1M Context: 30K
text vision function calling reasoning

Qwen/Qwen3 Next 80b A3b Instruct

Qwen/Qwen3 Next 80b A3b Instruct is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $1.50/1M output tokens.

Input: $0.15/1M Output: $1.50/1M Context: 131K
text function calling json mode

Qwen/Qwen3 Next 80b A3b Thinking

Qwen/Qwen3 Next 80b A3b Thinking is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $1.50/1M output tokens.

Input: $0.15/1M Output: $1.50/1M Context: 131K
text function calling reasoning json mode

Deepseek/Deepseek R1 Distill Qwen 14b

Deepseek/Deepseek R1 Distill Qwen 14b is available via Novita AI with a 33K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

Input: $0.15/1M Output: $0.15/1M Context: 33K
text reasoning json mode

Meta Llama/Llama 4 Scout 17b 16e Instruct

Meta Llama/Llama 4 Scout 17b 16e Instruct is available via Novita AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1800/1M input tokens, $0.5900/1M output tokens.

Input: $0.18/1M Output: $0.59/1M Context: 131K
text vision

Skywork/R1v4 Lite

Skywork/R1v4 Lite is available via Novita AI with a 262K context window and up to 65,536 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

Input: $0.20/1M Output: $0.60/1M Context: 262K
text vision json mode

Qwen/Qwen3 235b A22b Fp8

Qwen/Qwen3 235b A22b Fp8 is available via Novita AI with a 41K context window and up to 20,000 output tokens. Pricing: $0.2000/1M input tokens, $0.8000/1M output tokens.

Input: $0.20/1M Output: $0.80/1M Context: 41K
text reasoning

Qwen/Qwen3 Vl 30b A3b Instruct

Qwen/Qwen3 Vl 30b A3b Instruct is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.7000/1M output tokens.

Input: $0.20/1M Output: $0.70/1M Context: 131K
text vision function calling json mode

Qwen/Qwen3 Vl 30b A3b Thinking

Qwen/Qwen3 Vl 30b A3b Thinking is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $1.00/1M output tokens.

Input: $0.20/1M Output: $1.00/1M Context: 131K
text vision function calling json mode

Qwen/Qwen3 Omni 30b A3b Thinking

Qwen/Qwen3 Omni 30b A3b Thinking is available via Novita AI with a 66K context window and up to 16,384 output tokens. Pricing: $0.2500/1M input tokens, $0.9700/1M output tokens.

Input: $0.25/1M Output: $0.97/1M Context: 66K
text vision function calling reasoning audio json mode

Qwen/Qwen3 Omni 30b A3b Instruct

Qwen/Qwen3 Omni 30b A3b Instruct is available via Novita AI with a 66K context window and up to 16,384 output tokens. Pricing: $0.2500/1M input tokens, $0.9700/1M output tokens.

Input: $0.25/1M Output: $0.97/1M Context: 66K
text vision function calling audio json mode

Qwen/Qwen Mt Plus

Qwen/Qwen Mt Plus is available via Novita AI with a 16K context window and up to 8,192 output tokens. Pricing: $0.2500/1M input tokens, $0.7500/1M output tokens.

Input: $0.25/1M Output: $0.75/1M Context: 16K
text

Deepseek/Deepseek V3.2

Deepseek/Deepseek V3.2 is available via Novita AI with a 164K context window and up to 65,536 output tokens. Pricing: $0.2690/1M input tokens, $0.4000/1M output tokens.

Input: $0.27/1M Output: $0.40/1M Context: 164K
text function calling reasoning json mode

Deepseek/Deepseek V3.2 Exp

Deepseek/Deepseek V3.2 Exp is available via Novita AI with a 164K context window and up to 65,536 output tokens. Pricing: $0.2700/1M input tokens, $0.4100/1M output tokens.

Input: $0.27/1M Output: $0.41/1M Context: 164K
text function calling reasoning json mode

Deepseek/Deepseek V3.1 Terminus

Deepseek/Deepseek V3.1 Terminus is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.2700/1M input tokens, $1.00/1M output tokens.

Input: $0.27/1M Output: $1.00/1M Context: 131K
text function calling reasoning json mode

Deepseek/Deepseek V3.1

Deepseek/Deepseek V3.1 is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.2700/1M input tokens, $1.00/1M output tokens.

Input: $0.27/1M Output: $1.00/1M Context: 131K
text function calling reasoning json mode

Deepseek/Deepseek V3 0324

Deepseek/Deepseek V3 0324 is available via Novita AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.2700/1M input tokens, $1.12/1M output tokens.

Input: $0.27/1M Output: $1.12/1M Context: 164K
text function calling json mode

Meta Llama/Llama 4 Maverick 17b 128e Instruct Fp8

Meta Llama/Llama 4 Maverick 17b 128e Instruct Fp8 is available via Novita AI with a 1.0M context window and up to 8,192 output tokens. Pricing: $0.2700/1M input tokens, $0.8500/1M output tokens.

Input: $0.27/1M Output: $0.85/1M Context: 1.0M
text vision

Baidu/Ernie 4.5 300b A47b Paddle

Baidu/Ernie 4.5 300b A47b Paddle is available via Novita AI with a 123K context window and up to 12,000 output tokens. Pricing: $0.2800/1M input tokens, $1.10/1M output tokens.

Input: $0.28/1M Output: $1.10/1M Context: 123K
text json mode

Minimax/Minimax M2.1

Minimax/Minimax M2.1 is available via Novita AI with a 205K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

Input: $0.30/1M Output: $1.20/1M Context: 205K
text function calling json mode

Minimax/Minimax M2

Minimax/Minimax M2 is available via Novita AI with a 205K context window and up to 131,072 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

Input: $0.30/1M Output: $1.20/1M Context: 205K
text function calling reasoning

Zai Org/Glm 4.6v

Zai Org/Glm 4.6v is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.3000/1M input tokens, $0.9000/1M output tokens.

Input: $0.30/1M Output: $0.90/1M Context: 131K
text vision function calling reasoning json mode

Kwaipilot/Kat Coder Pro

Kwaipilot/Kat Coder Pro is available via Novita AI with a 256K context window and up to 128,000 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.

Input: $0.30/1M Output: $1.20/1M Context: 256K
text function calling json mode

Qwen/Qwen3 Vl 235b A22b Instruct

Qwen/Qwen3 Vl 235b A22b Instruct is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.3000/1M input tokens, $1.50/1M output tokens.

Input: $0.30/1M Output: $1.50/1M Context: 131K
text vision function calling json mode

Qwen/Qwen3 Coder 480b A35b Instruct

Qwen/Qwen3 Coder 480b A35b Instruct is available via Novita AI with a 262K context window and up to 65,536 output tokens. Pricing: $0.3000/1M input tokens, $1.30/1M output tokens.

Input: $0.30/1M Output: $1.30/1M Context: 262K
text function calling json mode

Qwen/Qwen3 235b A22b Thinking 2507

Qwen/Qwen3 235b A22b Thinking 2507 is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.3000/1M input tokens, $3.00/1M output tokens.

Input: $0.30/1M Output: $3.00/1M Context: 131K
text function calling reasoning

Deepseek/Deepseek R1 Distill Qwen 32b

Deepseek/Deepseek R1 Distill Qwen 32b is available via Novita AI with a 64K context window and up to 32,000 output tokens. Pricing: $0.3000/1M input tokens, $0.3000/1M output tokens.

Input: $0.30/1M Output: $0.30/1M Context: 64K
text reasoning json mode

Qwen/Qwen 2.5 72b Instruct

Qwen/Qwen 2.5 72b Instruct is available via Novita AI with a 32K context window and up to 8,192 output tokens. Pricing: $0.3800/1M input tokens, $0.4000/1M output tokens.

Input: $0.38/1M Output: $0.40/1M Context: 32K
text function calling json mode

Baidu/Ernie 4.5 Vl 28b A3b Thinking

Baidu/Ernie 4.5 Vl 28b A3b Thinking is available via Novita AI with a 131K context window and up to 65,536 output tokens. Pricing: $0.3900/1M input tokens, $0.3900/1M output tokens.

Input: $0.39/1M Output: $0.39/1M Context: 131K
text vision function calling reasoning json mode

Deepseek/Deepseek V3 Turbo

Deepseek/Deepseek V3 Turbo is available via Novita AI with a 64K context window and up to 16,000 output tokens. Pricing: $0.4000/1M input tokens, $1.30/1M output tokens.

Input: $0.40/1M Output: $1.30/1M Context: 64K
text function calling

Baidu/Ernie 4.5 Vl 424b A47b

Baidu/Ernie 4.5 Vl 424b A47b is available via Novita AI with a 123K context window and up to 16,000 output tokens. Pricing: $0.4200/1M input tokens, $1.25/1M output tokens.

Input: $0.42/1M Output: $1.25/1M Context: 123K
text vision reasoning

Meta Llama/Llama 3 70b Instruct

Meta Llama/Llama 3 70b Instruct is available via Novita AI with a 8K context window and up to 8,000 output tokens. Pricing: $0.5100/1M input tokens, $0.7400/1M output tokens.

Input: $0.51/1M Output: $0.74/1M Context: 8K
text json mode

Zai Org/Glm 4.6

Zai Org/Glm 4.6 is available via Novita AI with a 205K context window and up to 131,072 output tokens. Pricing: $0.5500/1M input tokens, $2.20/1M output tokens.

Input: $0.55/1M Output: $2.20/1M Context: 205K
text function calling reasoning json mode

Minimaxai/Minimax M1 80k

Minimaxai/Minimax M1 80k is available via Novita AI with a 1M context window and up to 40,000 output tokens. Pricing: $0.5500/1M input tokens, $2.20/1M output tokens.

Input: $0.55/1M Output: $2.20/1M Context: 1M
text function calling reasoning

Moonshotai/Kimi K2 Instruct

Moonshotai/Kimi K2 Instruct is available via Novita AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.5700/1M input tokens, $2.30/1M output tokens.

Input: $0.57/1M Output: $2.30/1M Context: 131K
text function calling json mode

Zai Org/Glm 4.7

Zai Org/Glm 4.7 is available via Novita AI with a 205K context window and up to 131,072 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

Input: $0.60/1M Output: $2.20/1M Context: 205K
text function calling reasoning json mode

Moonshotai/Kimi K2 Thinking

Moonshotai/Kimi K2 Thinking is available via Novita AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

Input: $0.60/1M Output: $2.50/1M Context: 262K
text function calling reasoning json mode

Moonshotai/Kimi K2 0905

Moonshotai/Kimi K2 0905 is available via Novita AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.

Input: $0.60/1M Output: $2.50/1M Context: 262K
text function calling json mode

Zai Org/Glm 4.5

Zai Org/Glm 4.5 is available via Novita AI with a 131K context window and up to 98,304 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.

Input: $0.60/1M Output: $2.20/1M Context: 131K
text function calling reasoning

Zai Org/Glm 4.5v

Zai Org/Glm 4.5v is available via Novita AI with a 66K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $1.80/1M output tokens.

Input: $0.60/1M Output: $1.80/1M Context: 66K
text vision function calling reasoning json mode

Microsoft/Wizardlm 2 8x22b

Microsoft/Wizardlm 2 8x22b is available via Novita AI with a 66K context window and up to 8,000 output tokens. Pricing: $0.6200/1M input tokens, $0.6200/1M output tokens.

Input: $0.62/1M Output: $0.62/1M Context: 66K
text

Deepseek/Deepseek R1 0528

Deepseek/Deepseek R1 0528 is available via Novita AI with a 164K context window and up to 32,768 output tokens. Pricing: $0.7000/1M input tokens, $2.50/1M output tokens.

Input: $0.70/1M Output: $2.50/1M Context: 164K
text function calling reasoning json mode

Deepseek/Deepseek Prover V2 671b

Deepseek/Deepseek Prover V2 671b is available via Novita AI with a 160K context window and up to 160,000 output tokens. Pricing: $0.7000/1M input tokens, $2.50/1M output tokens.

Input: $0.70/1M Output: $2.50/1M Context: 160K
text

Deepseek/Deepseek R1 Turbo

Deepseek/Deepseek R1 Turbo is available via Novita AI with a 64K context window and up to 16,000 output tokens. Pricing: $0.7000/1M input tokens, $2.50/1M output tokens.

Input: $0.70/1M Output: $2.50/1M Context: 64K
text function calling reasoning

Deepseek/Deepseek R1 Distill Llama 70b

Deepseek/Deepseek R1 Distill Llama 70b is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $0.8000/1M input tokens, $0.8000/1M output tokens.

Input: $0.80/1M Output: $0.80/1M Context: 8K
text reasoning json mode

Qwen/Qwen2.5 Vl 72b Instruct

Qwen/Qwen2.5 Vl 72b Instruct is available via Novita AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.8000/1M input tokens, $0.8000/1M output tokens.

Input: $0.80/1M Output: $0.80/1M Context: 33K
text vision

Qwen/Qwen3 Vl 235b A22b Thinking

Qwen/Qwen3 Vl 235b A22b Thinking is available via Novita AI with a 131K context window and up to 32,768 output tokens. Pricing: $0.9800/1M input tokens, $3.95/1M output tokens.

Input: $0.98/1M Output: $3.95/1M Context: 131K
text vision reasoning

Sao10k/L3 70b Euryale V2.1

Sao10k/L3 70b Euryale V2.1 is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $1.48/1M input tokens, $1.48/1M output tokens.

Input: $1.48/1M Output: $1.48/1M Context: 8K
text function calling

Sao10k/L31 70b Euryale V2.2

Sao10k/L31 70b Euryale V2.2 is available via Novita AI with a 8K context window and up to 8,192 output tokens. Pricing: $1.48/1M input tokens, $1.48/1M output tokens.

Input: $1.48/1M Output: $1.48/1M Context: 8K
text function calling

Qwen/Qwen3 Max

Qwen/Qwen3 Max is available via Novita AI with a 262K context window and up to 65,536 output tokens. Pricing: $2.11/1M input tokens, $8.45/1M output tokens.

Input: $2.11/1M Output: $8.45/1M Context: 262K
text function calling json mode

Compare Novita AI model pricing

Use our pricing calculator to find the cheapest Novita AI model for your workload.

Pricing Calculator Compare Models All Models Directory

Related Reading

OpenAI vs Anthropic vs Google: Which AI API Should You Choose? → Cheapest LLM API in 2026: Complete Pricing Comparison → OpenAI API Pricing Guide 2026 →