NVIDIA: Llama 3.1 Nemotron 70B Instruct API Pricing
by nvidia · 131K context window ·
$1.200
Input /1M tokens
$1.200
Output /1M tokens
131K
Context window
Monthly Cost Examples
Assuming 50% input / 50% output token split
| Usage | Monthly cost |
|---|---|
| 100K tokens/month | $0.12 |
| 1M tokens/month | $1.20 |
| 10M tokens/month | $12.00 |
| 100M tokens/month | $120.00 |
Compare with other models
NVIDIA: Llama 3.1 Nemotron 70B Instruct vs OpenAI: GPT-4o →NVIDIA: Llama 3.1 Nemotron 70B Instruct vs OpenAI: GPT-4o-mini →NVIDIA: Llama 3.1 Nemotron 70B Instruct vs OpenAI: o1 →Automate your model selection
StormRouter sends each request to the cheapest model that can handle it.
Only use NVIDIA: Llama 3.1 Nemotron 70B Instruct when your quality requirements demand it.