Meta: Llama 3.1 70B Instruct API Pricing
by meta-llama · 131K context window
Explore the pricing details for Meta's Llama 3.1 70B Instruct API. Input and output tokens are priced identically: $0.40 per 1 million input tokens and $0.40 per 1 million output tokens, so a workload of 1 million input tokens plus 1 million output tokens costs $0.80 in total. Because the two rates are equal, the blended rate is $0.40 per million tokens regardless of the input/output split. The context window is 131,072 tokens, and the model is provided by meta-llama. For example, processing 10 million tokens daily would cost approximately $4 per day, or about $120 over a 30-day month, a figure ML engineers can weigh against other LLM API options on both cost and context window size.
Monthly Cost Examples
Assuming 50% input / 50% output token split
| Usage | Monthly cost |
|---|---|
| 100K tokens/month | $0.04 |
| 1M tokens/month | $0.40 |
| 10M tokens/month | $4.00 |
| 100M tokens/month | $40.00 |
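The table rows can be reproduced with a short calculation. The following is a minimal sketch, assuming the $0.40 per million rates listed above; the function name `monthly_cost` and the `input_share` parameter are illustrative and not part of any provider API.

```python
from decimal import Decimal

# Prices taken from this page (USD per 1 million tokens).
INPUT_PRICE_PER_M = Decimal("0.40")
OUTPUT_PRICE_PER_M = Decimal("0.40")
TOKENS_PER_M = Decimal(1_000_000)


def monthly_cost(total_tokens: int, input_share: Decimal = Decimal("0.5")) -> Decimal:
    """Return the USD cost for total_tokens, split between input and output.

    input_share is the fraction of tokens billed as input (hypothetical
    parameter; the table assumes 0.5).
    """
    input_tokens = Decimal(total_tokens) * input_share
    output_tokens = Decimal(total_tokens) - input_tokens
    cost = (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M
    # Round to whole cents for display.
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    for tokens in (100_000, 1_000_000, 10_000_000, 100_000_000):
        print(f"{tokens:>11,} tokens/month -> ${monthly_cost(tokens)}")
```

Since both rates are equal here, changing `input_share` does not change the result; the parameter matters only when comparing against models whose input and output prices differ.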
Compare with other models
Meta: Llama 3.1 70B Instruct vs OpenAI: GPT-4o
Meta: Llama 3.1 70B Instruct vs OpenAI: GPT-4o-mini
Meta: Llama 3.1 70B Instruct vs OpenAI: o1
Automate your model selection
StormRouter sends each request to the cheapest model that can handle it.
Only use Meta: Llama 3.1 70B Instruct when your quality requirements demand it.