Skip to content

LLM Providers

Compare 53 AI model providers by pricing, model selection, and capabilities. Click any provider to see their full model lineup.

AWS Bedrock

AWS Bedrock provides access to foundation models from multiple providers including Anthropic, Meta, Mistral, and Amazon's own Nova models. It offers enterprise-grade security and seamless AWS integration.

Multi-provider access Enterprise-grade SLAs AWS ecosystem integration

297

Models

$0.035

From / 1M

1M

Max Context

Fireworks AI

Fireworks AI provides 244 AI models accessible via API.

244

Models

$0.001

From / 1M

262K

Max Context

Azure OpenAI

Azure OpenAI Service provides access to OpenAI models with enterprise features including private networking, content filtering, and regional deployment. It is the primary choice for enterprises already on Microsoft Azure.

Enterprise security & compliance Private networking Regional deployment

120

Models

$0.050

From / 1M

1.1M

Max Context

Google Vertex AI

Google Vertex AI provides 98 AI models accessible via API.

98

Models

$0.000

From / 1M

10M

Max Context

OpenAI

OpenAI is the creator of GPT-4o, o3, and the GPT-4.1 series. They pioneered large-scale commercial LLMs and remain the most widely adopted API provider for AI-powered applications.

Largest developer ecosystem Best function-calling support Widest third-party integration

92

Models

$0.050

From / 1M

1.1M

Max Context

Vercel AI Gateway

Vercel AI Gateway provides 91 AI models accessible via API.

91

Models

$0.035

From / 1M

1.0M

Max Context

Novita AI

Novita AI provides 80 AI models accessible via API.

80

Models

$0.020

From / 1M

1.0M

Max Context

OpenRouter

OpenRouter provides 76 AI models accessible via API.

76

Models

$0.000

From / 1M

2M

Max Context

DeepInfra

DeepInfra provides 67 AI models accessible via API.

67

Models

$0.020

From / 1M

1.0M

Max Context

Azure AI

Azure AI provides 58 AI models accessible via API.

58

Models

$0.040

From / 1M

10M

Max Context

Mistral

Mistral AI is a Paris-based lab building efficient open and commercial LLMs. Mistral Large competes with frontier models, while Mistral Small and Nemo offer strong cost-performance ratios for production use.

European data sovereignty Efficient model architectures Strong code generation (Codestral)

48

Models

$0.000

From / 1M

262K

Max Context

Google Gemini

Google DeepMind develops the Gemini model family, offering industry-leading context windows up to 1 million tokens. Gemini 2.5 Pro delivers frontier performance while Gemini 2.0 Flash provides cost-effective speed.

Largest context windows (1M+ tokens) Native multimodal support Competitive pricing

39

Models

$0.000

From / 1M

2.1M

Max Context

Xai

Xai provides 35 AI models accessible via API.

35

Models

$0.20

From / 1M

2M

Max Context

Oci

Oci provides 29 AI models accessible via API.

29

Models

$0.075

From / 1M

1.0M

Max Context

IBM Watsonx

IBM Watsonx provides 28 AI models accessible via API.

28

Models

$0.060

From / 1M

131K

Max Context

Nebius

Nebius provides 27 AI models accessible via API.

27

Models

$0.010

From / 1M

262K

Max Context

Databricks

Databricks provides 26 AI models accessible via API.

26

Models

$0.050

From / 1M

1.0M

Max Context

Moonshot

Moonshot provides 21 AI models accessible via API.

21

Models

$0.20

From / 1M

262K

Max Context

Ollama

Ollama provides 21 AI models accessible via API.

21

Models

$0.000

From / 1M

262K

Max Context

Lambda Ai

Lambda Ai provides 20 AI models accessible via API.

20

Models

$0.015

From / 1M

131K

Max Context

Perplexity

Perplexity provides 20 AI models accessible via API.

20

Models

$0.000

From / 1M

200K

Max Context

Anthropic

Anthropic builds the Claude model family, known for strong instruction-following, safety alignment, and extended context windows. Claude 4 Sonnet and Claude 3.5 Haiku are popular choices for production workloads.

Best instruction-following 200K+ context on all models Strong safety alignment

18

Models

$0.25

From / 1M

1M

Max Context

Dashscope

Dashscope provides 17 AI models accessible via API.

17

Models

$0.050

From / 1M

1M

Max Context

Gmi

Gmi provides 17 AI models accessible via API.

17

Models

$0.15

From / 1M

1.0M

Max Context

Together AI

Together AI provides 17 AI models accessible via API.

17

Models

$0.050

From / 1M

262K

Max Context

Hyperbolic

Hyperbolic provides 16 AI models accessible via API.

16

Models

$0.12

From / 1M

131K

Max Context

Replicate

Replicate provides 16 AI models accessible via API.

16

Models

$0.050

From / 1M

164K

Max Context

SambaNova

SambaNova provides 16 AI models accessible via API.

16

Models

$0.040

From / 1M

131K

Max Context

Ovhcloud

Ovhcloud provides 15 AI models accessible via API.

15

Models

$0.040

From / 1M

256K

Max Context

Llamagate

Llamagate provides 14 AI models accessible via API.

14

Models

$0.030

From / 1M

131K

Max Context

Wandb

Wandb provides 14 AI models accessible via API.

14

Models

$0.60

From / 1M

262K

Max Context

Anyscale

Anyscale provides 12 AI models accessible via API.

12

Models

$0.15

From / 1M

66K

Max Context

Groq

Groq provides ultra-fast inference using custom LPU hardware, delivering the fastest token generation speeds available. They host popular open-weight models like Llama and Mixtral with industry-leading latency.

Fastest inference speeds Custom LPU hardware Competitive open-model pricing

11

Models

$0.050

From / 1M

262K

Max Context

Zai

Zai provides 11 AI models accessible via API.

11

Models

$0.000

From / 1M

200K

Max Context

AI21

AI21 provides 9 AI models accessible via API.

9

Models

$0.20

From / 1M

256K

Max Context

Publicai

Publicai provides 9 AI models accessible via API.

9

Models

$0.000

From / 1M

33K

Max Context

DeepSeek

DeepSeek is a Chinese AI lab that gained attention with DeepSeek V3 and R1, offering frontier-level performance at dramatically lower prices. Their Mixture-of-Experts architecture delivers strong reasoning at a fraction of competitor costs.

Lowest pricing among frontier models Strong reasoning (R1) Mixture-of-Experts efficiency

8

Models

$0.14

From / 1M

164K

Max Context

Cerebras

Cerebras provides 7 AI models accessible via API.

7

Models

$0.10

From / 1M

131K

Max Context

Cohere

Cohere focuses on enterprise AI with retrieval-augmented generation (RAG) and search as first-class features. Command R+ is optimized for business workflows that combine generation with structured data retrieval.

Best-in-class RAG support Enterprise-focused features Strong multilingual embeddings

7

Models

$0.15

From / 1M

256K

Max Context

Lemonade

Lemonade provides 5 AI models accessible via API.

5

Models

$0.000

From / 1M

262K

Max Context

Minimax

Minimax provides 5 AI models accessible via API.

5

Models

$0.30

From / 1M

1M

Max Context

Amazon Nova

Amazon Nova provides 4 AI models accessible via API.

4

Models

$0.035

From / 1M

1M

Max Context

Bedrock Mantle

Bedrock Mantle provides 4 AI models accessible via API.

4

Models

$0.075

From / 1M

131K

Max Context

Cloudflare

Cloudflare provides 4 AI models accessible via API.

4

Models

$1.92

From / 1M

8K

Max Context

Gigachat

Gigachat provides 3 AI models accessible via API.

3

Models

$0.000

From / 1M

128K

Max Context

AWS SageMaker

AWS SageMaker provides 3 AI models accessible via API.

3

Models

$0.000

From / 1M

4K

Max Context

V0

V0 provides 3 AI models accessible via API.

3

Models

$3.00

From / 1M

512K

Max Context

Volcengine

Volcengine provides 3 AI models accessible via API.

3

Models

$0.000

From / 1M

229K

Max Context

FriendliAI

FriendliAI provides 2 AI models accessible via API.

2

Models

$0.10

From / 1M

8K

Max Context

Morph

Morph provides 2 AI models accessible via API.

2

Models

$0.80

From / 1M

16K

Max Context

Palm

Palm provides 2 AI models accessible via API.

2

Models

$0.13

From / 1M

8K

Max Context

NLP Cloud

NLP Cloud provides 1 AI models accessible via API.

1

Models

$0.50

From / 1M

16K

Max Context

Sarvam

Sarvam provides 1 AI models accessible via API.

1

Models

$0.000

From / 1M

8K

Max Context

Provider Comparison at a Glance

Provider Models Cheapest $/1M Max Context
AWS Bedrock 297 $0.035 1M
Fireworks AI 244 $0.001 262K
Azure OpenAI 120 $0.050 1.1M
Google Vertex AI 98 $0.000 10M
OpenAI 92 $0.050 1.1M
Vercel AI Gateway 91 $0.035 1.0M
Novita AI 80 $0.020 1.0M
OpenRouter 76 $0.000 2M
DeepInfra 67 $0.020 1.0M
Azure AI 58 $0.040 10M
Mistral 48 $0.000 262K
Google Gemini 39 $0.000 2.1M
Xai 35 $0.20 2M
Oci 29 $0.075 1.0M
IBM Watsonx 28 $0.060 131K
Nebius 27 $0.010 262K
Databricks 26 $0.050 1.0M
Moonshot 21 $0.20 262K
Ollama 21 $0.000 262K
Lambda Ai 20 $0.015 131K
Perplexity 20 $0.000 200K
Anthropic 18 $0.25 1M
Dashscope 17 $0.050 1M
Gmi 17 $0.15 1.0M
Together AI 17 $0.050 262K
Hyperbolic 16 $0.12 131K
Replicate 16 $0.050 164K
SambaNova 16 $0.040 131K
Ovhcloud 15 $0.040 256K
Llamagate 14 $0.030 131K
Wandb 14 $0.60 262K
Anyscale 12 $0.15 66K
Groq 11 $0.050 262K
Zai 11 $0.000 200K
AI21 9 $0.20 256K
Publicai 9 $0.000 33K
DeepSeek 8 $0.14 164K
Cerebras 7 $0.10 131K
Cohere 7 $0.15 256K
Lemonade 5 $0.000 262K
Minimax 5 $0.30 1M
Amazon Nova 4 $0.035 1M
Bedrock Mantle 4 $0.075 131K
Cloudflare 4 $1.92 8K
Gigachat 3 $0.000 128K
AWS SageMaker 3 $0.000 4K
V0 3 $3.00 512K
Volcengine 3 $0.000 229K
FriendliAI 2 $0.10 8K
Morph 2 $0.80 16K
Palm 2 $0.13 8K
NLP Cloud 1 $0.50 16K
Sarvam 1 $0.000 8K

Find the cheapest model for your workload

Use our pricing calculator to compare API costs across all providers and models.

Pricing Calculator Browse All Models