4
Models Available
$1.92
Cheapest Input / 1M
8K
Largest Context
What is Cloudflare?
Cloudflare is an AI model provider offering 4 large language models for developers. Their cheapest model starts at $1.92 per 1M input tokens, and their largest context window reaches 8K. Cloudflare provides 4 AI models accessible via API.
Cloudflare Strengths
All Cloudflare Models
| Model | Input $/1M | Output $/1M | Context | Max Output | Released |
|---|---|---|---|---|---|
| @Cf/Meta/Llama 2 7b Chat Fp16 | $1.92 | $1.92 | 3K | 3,072 | — |
| @Cf/Meta/Llama 2 7b Chat Int8 | $1.92 | $1.92 | 2K | 2,048 | — |
| @Cf/Mistral/Mistral 7b Instruct V0.1 | $1.92 | $1.92 | 8K | 8,192 | — |
| @Hf/Thebloke/Codellama 7b Instruct Awq | $1.92 | $1.92 | 4K | 4,096 | — |
Model Details
@Cf/Meta/Llama 2 7b Chat Fp16
@Cf/Meta/Llama 2 7b Chat Fp16 is available via Cloudflare with a 3K context window and up to 3,072 output tokens. Pricing: $1.92/1M input tokens, $1.92/1M output tokens.
@Cf/Meta/Llama 2 7b Chat Int8
@Cf/Meta/Llama 2 7b Chat Int8 is available via Cloudflare with a 2K context window and up to 2,048 output tokens. Pricing: $1.92/1M input tokens, $1.92/1M output tokens.
@Cf/Mistral/Mistral 7b Instruct V0.1
@Cf/Mistral/Mistral 7b Instruct V0.1 is available via Cloudflare with a 8K context window and up to 8,192 output tokens. Pricing: $1.92/1M input tokens, $1.92/1M output tokens.
@Hf/Thebloke/Codellama 7b Instruct Awq
@Hf/Thebloke/Codellama 7b Instruct Awq is available via Cloudflare with a 4K context window and up to 4,096 output tokens. Pricing: $1.92/1M input tokens, $1.92/1M output tokens.
Compare Cloudflare model pricing
Use our pricing calculator to find the cheapest Cloudflare model for your workload.