Skip to content
AWS Bedrock

Nvidia.Nemotron Nano 12b

Nvidia.Nemotron Nano 12b is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

Nvidia.Nemotron Nano 12b Pricing & Specifications

Input Price$0.20 per 1M tokens
Output Price$0.60 per 1M tokens
Context Window128,000 tokens (128K)
Max Output8,192 tokens
ProviderAWS Bedrock

What is Nvidia.Nemotron Nano 12b?

Nvidia.Nemotron Nano 12b is a large language model by AWS Bedrock with a 128K context window and up to 8,192 output tokens. It costs $0.20 per 1M input tokens and $0.60 per 1M output tokens. Nvidia.Nemotron Nano 12b is available via AWS Bedrock with a 128K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.6000/1M output tokens.

Capabilities

text vision

Nvidia.Nemotron Nano 12b Cost Examples

Short prompt (500 tokens)

$0.000100

Medium prompt (2K tokens)

$0.00040

Long output (4K tokens)

$0.00240

Count tokens for Nvidia.Nemotron Nano 12b

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Nvidia.Nemotron Nano 12b

AWS Bedrock

Ai21.Jamba 1 5 Mini

$0.20/1M in 256K ctx

AWS Bedrock

Eu West 3/Mistral.Mistral 7b Instruct

$0.20/1M in 32K ctx

AWS Bedrock

Mistral.Ministral 3 14b Instruct

$0.20/1M in 128K ctx

AWS Bedrock

Eu.Meta.Llama3 2 3b Instruct

$0.19/1M in 128K ctx

Frequently Asked Questions

How much does Nvidia.Nemotron Nano 12b cost per token? +
Nvidia.Nemotron Nano 12b costs $0.20 per 1M input tokens and $0.60 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000500.
What is the context window for Nvidia.Nemotron Nano 12b? +
Nvidia.Nemotron Nano 12b supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Nvidia.Nemotron Nano 12b? +
Nvidia.Nemotron Nano 12b can generate up to 8,192 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Nvidia.Nemotron Nano 12b good for coding tasks? +
Nvidia.Nemotron Nano 12b can handle basic coding tasks, but there are models specifically optimized for code generation that may perform better on complex programming problems.
Token Counter | Pricing Calculator | Model Comparison | All AWS Bedrock Models