NVIDIA: Llama 3.1 Nemotron 70B Instruct API Pricing

by nvidia · 131K context window ·

$1.200

Input /1M tokens

$1.200

Output /1M tokens

131K

Context window

Monthly Cost Examples

Assuming 50% input / 50% output token split

Usage	Monthly cost
100K tokens/month	$0.12
1M tokens/month	$1.20
10M tokens/month	$12.00
100M tokens/month	$120.00

Compare with other models

NVIDIA: Llama 3.1 Nemotron 70B Instruct vs OpenAI: GPT-4o →NVIDIA: Llama 3.1 Nemotron 70B Instruct vs OpenAI: GPT-4o-mini →NVIDIA: Llama 3.1 Nemotron 70B Instruct vs OpenAI: o1 →

Automate your model selection

StormRouter sends each request to the cheapest model that can handle it.
Only use NVIDIA: Llama 3.1 Nemotron 70B Instruct when your quality requirements demand it.

Try StormRouter free →

NVIDIA: Llama 3.1 Nemotron 70B Instruct API Pricing

Monthly Cost Examples

Compare with other models

Automate your model selection

Similar models