How is LLM API cost calculated?
The calculator multiplies monthly input and output token volume by each model's per-1M-token prices. Cache hit ratio and batch discount are applied only when you enter them.
Calculate GPT, Claude, Gemini, and other model API costs with monthly requests, input/output tokens, cache hit ratio, batch discounts, and context limits.
Tune volumes, cache, and batch discount—totals update instantly with the same teal field language as the home quick estimate.
This is an estimate, not a real-world result.
All prices, model data, token counts, and calculator outputs are example estimates only. Accuracy, completeness, timeliness, or fitness is not guaranteed; LLMRateRadar accepts no responsibility for decisions made from these results.
The calculator multiplies monthly input and output token volume by each model's per-1M-token prices. Cache hit ratio and batch discount are applied only when you enter them.
No. If a model is missing input or output pricing, LLMRateRadar marks that scenario as unavailable instead of treating unknown prices as zero.
The same underlying model can have different prices by route. LLMRateRadar keeps direct provider records and aggregator route records separate.
Yes. Prices can change quickly, so production decisions should always be checked against the official provider pricing page.