Moonshotai/Kimi K2 Thinking Maas
Moonshotai/Kimi K2 Thinking Maas is available via Google Vertex AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.60 per 1M input tokens, $2.50 per 1M output tokens.
Moonshotai/Kimi K2 Thinking Maas Pricing & Specifications
What is Moonshotai/Kimi K2 Thinking Maas?
Moonshotai/Kimi K2 Thinking Maas is a large language model from Moonshot AI, offered through Google Vertex AI with a 256K context window and up to 256,000 output tokens. It costs $0.60 per 1M input tokens and $2.50 per 1M output tokens.
Capabilities
text, function calling, web search
Moonshotai/Kimi K2 Thinking Maas Cost Examples
Short prompt (500 input tokens): $0.0003
Medium prompt (2K input tokens): $0.0012
Long output (4K output tokens): $0.0100
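The figures above follow directly from the per-million-token rates. A minimal sketch of that arithmetic, assuming the rates quoted on this page; the helper name `estimate_cost` and its parameters are hypothetical and not part of any Vertex AI SDK:

```python
from decimal import Decimal

# Rates quoted on this page, in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.60")
OUTPUT_USD_PER_MILLION = Decimal("2.50")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int = 0, output_tokens: int = 0) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed at its own rate, then the two are summed.
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # The three examples listed above.
    print("short prompt:", estimate_cost(input_tokens=500))
    print("medium prompt:", estimate_cost(input_tokens=2_000))
    print("long output:", estimate_cost(output_tokens=4_000))
```

Decimal is used instead of float so that very small per-request amounts do not pick up binary rounding noise when summed over many requests.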
Frequently Asked Questions
How much does Moonshotai/Kimi K2 Thinking Maas cost per token?
Moonshotai/Kimi K2 Thinking Maas costs $0.60 per 1M input tokens and $2.50 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to $0.00185: $0.00060 for input plus $0.00125 for output.
What is the context window for Moonshotai/Kimi K2 Thinking Maas?
Moonshotai/Kimi K2 Thinking Maas supports a context window of 256,000 tokens (256K). This determines the maximum combined length of your prompt and conversation history in a single API call.
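One way to stay inside that limit is to drop the oldest conversation turns before sending a request. The sketch below assumes you already have a token count for each message (for example from a token counter); the function name `trim_history` and its parameters are hypothetical, and it budgets only for the prompt and history as described above:

```python
CONTEXT_WINDOW_TOKENS = 256_000  # context window stated on this page


def trim_history(message_token_counts: list[int], prompt_tokens: int,
                 limit: int = CONTEXT_WINDOW_TOKENS) -> list[int]:
    """Return indices of history messages to keep, preferring the newest ones."""
    if prompt_tokens > limit:
        raise ValueError("prompt alone exceeds the context window")
    budget = limit - prompt_tokens
    kept = []
    # Walk from the newest message backwards; stop at the first one that does not fit
    # so the kept history stays contiguous.
    for index in range(len(message_token_counts) - 1, -1, -1):
        cost = message_token_counts[index]
        if cost > budget:
            break
        budget -= cost
        kept.append(index)
    kept.reverse()  # restore chronological order
    return kept
```

For example, with three 100,000-token history messages and a 50,000-token prompt, only the two most recent messages fit, so the function returns their indices.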
What is the maximum output length for Moonshotai/Kimi K2 Thinking Maas?
Moonshotai/Kimi K2 Thinking Maas can generate up to 256,000 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
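The multi-call approach can be sketched as a small loop. This is an illustration under stated assumptions, not a Vertex AI API: `generate` is a hypothetical callable you supply that takes a prompt string and returns the generated text plus a flag saying whether the model finished on its own, and `generate_long` and `max_calls` are likewise made-up names:

```python
from typing import Callable, Tuple


def generate_long(prompt: str,
                  generate: Callable[[str], Tuple[str, bool]],
                  max_calls: int = 5) -> str:
    """Concatenate text from repeated calls until the output is reported finished.

    `generate` is a hypothetical callable: prompt in, (text, finished) out.
    """
    parts = []
    current_prompt = prompt
    for _ in range(max_calls):
        text, finished = generate(current_prompt)
        parts.append(text)
        if finished:
            break
        # Feed everything produced so far back in so the next call continues from it.
        current_prompt = prompt + "".join(parts)
    return "".join(parts)
```

The `max_calls` cap keeps a response that never reports completion from looping indefinitely. Each continuation call resends the earlier output as input, so it is billed at the input rate again.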
Is Moonshotai/Kimi K2 Thinking Maas good for coding tasks?
Yes. The text and function calling capabilities of Moonshotai/Kimi K2 Thinking Maas suit coding tasks such as code generation, debugging, and refactoring.