LLM API cost estimate, monthly token math

Calculate GPT, Claude, Gemini, and other model API costs with monthly requests, input/output tokens, cache hit ratio, batch discounts, and context limits.

Cost estimateLLM Table

Scenario

Model341 models

Monthly requestsInput tokens / requestOutput tokens / requestCache hit ratio: 0%

This model has no cached-input price on file. The ratio is ignored and standard input pricing applies.

Batch discount: 0%

Estimated batch discount on input + cached input only when the provider supports batch/async APIs.

Tune volumes, cache, and batch discount—totals update instantly with the same teal field language as the home quick estimate.

Total monthly cost

$250.00

OpenRouter

Results

Input cost

$100.00

Cached input

$0.00

Output cost

$150.00

Per request

$0.002500

Per 1K requests

$2.50

Per 100K requests

$250.00

Per 1M requests

$2,500.00

Monthly tokens

130,000,000

Cheaper alternatives

inclusionAI: Ling-2.6-flash

OpenRouter

$1.90

Mistral: Mistral Nemo

OpenRouter

$2.90

Meta: Llama 3.1 8B Instruct

OpenRouter

$3.50

IBM: Granite 4.0 Micro

OpenRouter

$5.06

Meta: Llama 3 8B Instruct

OpenRouter

$5.20

Free routes (limits and quality may differ)

Google: Gemma 4 26B A4B (free)Free route

OpenRouter

$0.00

Google: Gemma 4 31B (free)Free route

OpenRouter

$0.00

Meta: Llama 3.2 3B Instruct (free)Free route

OpenRouter

$0.00

This is an estimate, not a real-world result.

All prices, model data, token counts, and calculator outputs are example estimates only. Accuracy, completeness, timeliness, or fitness is not guaranteed; LLMRateRadar accepts no responsibility for decisions made from these results.

FAQ

Cost calculation notes

How is LLM API cost calculated?

The calculator multiplies monthly input and output token volume by each model's per-1M-token prices. Cache hit ratio and batch discount are applied only when you enter them.

Are missing prices treated as free?

No. If a model is missing input or output pricing, LLMRateRadar marks that scenario as unavailable instead of treating unknown prices as zero.

Why do direct API and OpenRouter prices differ?

The same underlying model can have different prices by route. LLMRateRadar keeps direct provider records and aggregator route records separate.

Should I verify prices before production use?

Yes. Prices can change quickly, so production decisions should always be checked against the official provider pricing page.