Together AI Models

Together AI provides 17 AI models accessible via API.

Models available: 17
Cheapest input price: $0.050 per 1M tokens
Largest context window: 262K tokens

What is Together AI?

Together AI is an AI model provider offering 17 large language models to developers through its API. Its cheapest model starts at $0.050 per 1M input tokens, and its largest context window reaches 262K tokens.

All Together AI Models

Model | Input $/1M | Output $/1M | Context | Max Output
Openai/Gpt Oss 20b | $0.050 | $0.20 | 128K | 4,096
Openai/Gpt Oss 120b | $0.15 | $0.60 | 128K | 4,096
Qwen/Qwen3 Next 80B A3B Instruct | $0.15 | $1.50 | 262K | 4,096
Qwen/Qwen3 Next 80B A3B Thinking | $0.15 | $1.50 | 262K | 4,096
Qwen/Qwen3 235B A22B Instruct 2507 Tput | $0.20 | $6.00 | 262K | 4,096
Qwen/Qwen3 235B A22B Fp8 Tput | $0.20 | $0.60 | 40K | 4,096
Zai Org/GLM 4.5 Air FP8 | $0.20 | $1.10 | 128K | 4,096
Zai Org/GLM 4.7 | $0.45 | $2.00 | 200K | 200,000
Moonshotai/Kimi K2.5 | $0.50 | $2.80 | 256K | 256,000
Deepseek Ai/DeepSeek R1 0528 Tput | $0.55 | $2.19 | 128K | 4,096
Zai Org/GLM 4.6 | $0.60 | $2.20 | 200K | 200,000
Qwen/Qwen3.5 397B A17B | $0.60 | $3.60 | 262K | 4,096
Qwen/Qwen3 235B A22B Thinking 2507 | $0.65 | $3.00 | 256K | 4,096
Moonshotai/Kimi K2 Instruct 0905 | $1.00 | $3.00 | 262K | 4,096
Deepseek Ai/DeepSeek V3 | $1.25 | $1.25 | 66K | 8,192
Qwen/Qwen3 Coder 480B A35B Instruct FP8 | $2.00 | $2.00 | 256K | 4,096
Deepseek Ai/DeepSeek R1 | $3.00 | $7.00 | 128K | 20,480
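
Because prices are quoted per 1M tokens, the cost of a workload is (input tokens × input price + output tokens × output price) / 1,000,000. The following is a minimal sketch of that arithmetic, using three rows copied from the table above. The `PRICES` dictionary and the `estimate_cost` helper are illustrative names, not part of any Together AI SDK, and the figures should be checked against current pricing before being relied on.

```python
# Prices in USD per 1M tokens, copied from three rows of the table above.
# PRICES and estimate_cost are hypothetical helpers for illustration only.
PRICES = {
    "Openai/Gpt Oss 20b": (0.05, 0.20),
    "Deepseek Ai/DeepSeek V3": (1.25, 1.25),
    "Deepseek Ai/DeepSeek R1": (3.00, 7.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload on one model."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500K output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.2f}")
```

Input and output tokens are billed at different rates, so a model with a low input price can still be the more expensive choice for output-heavy workloads.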

Model Details

Openai/Gpt Oss 20b

Openai/Gpt Oss 20b is available via Together AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.050/1M input tokens, $0.20/1M output tokens.

Input: $0.050/1M | Output: $0.20/1M | Context: 128K
Capabilities: text, function calling, JSON mode

Openai/Gpt Oss 120b

Openai/Gpt Oss 120b is available via Together AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.15/1M input tokens, $0.60/1M output tokens.

Input: $0.15/1M | Output: $0.60/1M | Context: 128K
Capabilities: text, function calling, JSON mode

Qwen/Qwen3 Next 80B A3B Instruct

Qwen/Qwen3 Next 80B A3B Instruct is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $0.15/1M input tokens, $1.50/1M output tokens.

Input: $0.15/1M | Output: $1.50/1M | Context: 262K
Capabilities: text, function calling, JSON mode

Qwen/Qwen3 Next 80B A3B Thinking

Qwen/Qwen3 Next 80B A3B Thinking is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $0.15/1M input tokens, $1.50/1M output tokens.

Input: $0.15/1M | Output: $1.50/1M | Context: 262K
Capabilities: text, function calling, JSON mode

Qwen/Qwen3 235B A22B Instruct 2507 Tput

Qwen/Qwen3 235B A22B Instruct 2507 Tput is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $0.20/1M input tokens, $6.00/1M output tokens.

Input: $0.20/1M | Output: $6.00/1M | Context: 262K
Capabilities: text, function calling, JSON mode

Qwen/Qwen3 235B A22B Fp8 Tput

Qwen/Qwen3 235B A22B Fp8 Tput is available via Together AI with a 40K context window and up to 4,096 output tokens. Pricing: $0.20/1M input tokens, $0.60/1M output tokens.

Input: $0.20/1M | Output: $0.60/1M | Context: 40K
Capabilities: text

Zai Org/GLM 4.5 Air FP8

Zai Org/GLM 4.5 Air FP8 is available via Together AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.20/1M input tokens, $1.10/1M output tokens.

Input: $0.20/1M | Output: $1.10/1M | Context: 128K
Capabilities: text, function calling, JSON mode

Zai Org/GLM 4.7

Zai Org/GLM 4.7 is available via Together AI with a 200K context window and up to 200,000 output tokens. Pricing: $0.45/1M input tokens, $2.00/1M output tokens.

Input: $0.45/1M | Output: $2.00/1M | Context: 200K
Capabilities: text, function calling, reasoning

Moonshotai/Kimi K2.5

Moonshotai/Kimi K2.5 is available via Together AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.50/1M input tokens, $2.80/1M output tokens.

Input: $0.50/1M | Output: $2.80/1M | Context: 256K
Capabilities: text, vision, function calling, reasoning

Deepseek Ai/DeepSeek R1 0528 Tput

Deepseek Ai/DeepSeek R1 0528 Tput is available via Together AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.55/1M input tokens, $2.19/1M output tokens.

Input: $0.55/1M | Output: $2.19/1M | Context: 128K
Capabilities: text, function calling, JSON mode

Zai Org/GLM 4.6

Zai Org/GLM 4.6 is available via Together AI with a 200K context window and up to 200,000 output tokens. Pricing: $0.60/1M input tokens, $2.20/1M output tokens.

Input: $0.60/1M | Output: $2.20/1M | Context: 200K
Capabilities: text, function calling, reasoning

Qwen/Qwen3.5 397B A17B

Qwen/Qwen3.5 397B A17B is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $0.60/1M input tokens, $3.60/1M output tokens.

Input: $0.60/1M | Output: $3.60/1M | Context: 262K
Capabilities: text, function calling, JSON mode

Qwen/Qwen3 235B A22B Thinking 2507

Qwen/Qwen3 235B A22B Thinking 2507 is available via Together AI with a 256K context window and up to 4,096 output tokens. Pricing: $0.65/1M input tokens, $3.00/1M output tokens.

Input: $0.65/1M | Output: $3.00/1M | Context: 256K
Capabilities: text, function calling, JSON mode

Moonshotai/Kimi K2 Instruct 0905

Moonshotai/Kimi K2 Instruct 0905 is available via Together AI with a 262K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

Input: $1.00/1M | Output: $3.00/1M | Context: 262K
Capabilities: text, function calling

Deepseek Ai/DeepSeek V3

Deepseek Ai/DeepSeek V3 is available via Together AI with a 66K context window and up to 8,192 output tokens. Pricing: $1.25/1M input tokens, $1.25/1M output tokens.

Input: $1.25/1M | Output: $1.25/1M | Context: 66K
Capabilities: text, function calling, JSON mode

Qwen/Qwen3 Coder 480B A35B Instruct FP8

Qwen/Qwen3 Coder 480B A35B Instruct FP8 is available via Together AI with a 256K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $2.00/1M output tokens.

Input: $2.00/1M | Output: $2.00/1M | Context: 256K
Capabilities: text, function calling, JSON mode

Deepseek Ai/DeepSeek R1

Deepseek Ai/DeepSeek R1 is available via Together AI with a 128K context window and up to 20,480 output tokens. Pricing: $3.00/1M input tokens, $7.00/1M output tokens.

Input: $3.00/1M | Output: $7.00/1M | Context: 128K
Capabilities: text, function calling, JSON mode

Compare Together AI model pricing

Use our pricing calculator to find the cheapest Together AI model for your workload.
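
Finding the cheapest model for a workload usually means filtering by hard requirements first (context window, maximum output length) and only then comparing cost. The sketch below shows one way to do that over a handful of rows from the listing on this page. The `Model` class, the `MODELS` list, and the `cheapest` function are hypothetical names used for illustration; this is not the calculator's actual implementation.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Model:
    name: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    context_k: int       # context window, in thousands of tokens
    max_output: int      # maximum output tokens


# A subset of the models listed on this page (illustrative, not exhaustive).
MODELS = [
    Model("Openai/Gpt Oss 20b", 0.05, 0.20, 128, 4096),
    Model("Qwen/Qwen3 235B A22B Fp8 Tput", 0.20, 0.60, 40, 4096),
    Model("Zai Org/GLM 4.7", 0.45, 2.00, 200, 200000),
    Model("Moonshotai/Kimi K2.5", 0.50, 2.80, 256, 256000),
    Model("Deepseek Ai/DeepSeek R1", 3.00, 7.00, 128, 20480),
]


def cheapest(models: List[Model], input_tokens: int, output_tokens: int,
             min_context_k: int = 0, min_output: int = 0) -> Optional[Model]:
    """Return the lowest-cost model that meets the limits, or None."""
    candidates = [
        m for m in models
        if m.context_k >= min_context_k and m.max_output >= min_output
    ]
    if not candidates:
        return None

    def cost(m: Model) -> float:
        total = input_tokens * m.input_price + output_tokens * m.output_price
        return total / 1_000_000

    return min(candidates, key=cost)


if __name__ == "__main__":
    # Workload needing more than 10,000 output tokens per response.
    pick = cheapest(MODELS, 1_000_000, 1_000_000, min_output=10_000)
    print(pick.name if pick else "no model meets the requirements")
```

Filtering before sorting matters here because most of the listed models cap output at 4,096 tokens, which rules out the cheapest options for long generations.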