LLM API Cost Calculator
Estimate what an AI feature will cost before you ship it. Pick a model, set your input/output tokens per request and your daily volume, and get the per-request, monthly and yearly bill. Prices are editable indicative defaults — verify them on each provider's pricing page. Everything runs in your browser; nothing is sent to our servers.
Indicative price (USD / 1M tokens) — editable. Verify on the provider's pricing page.
60.0M input → $150 · 15.0M output → $150 per month
Tip: output tokens usually dominate the bill — they cost several times more than input on most models.
How LLM API pricing works
Most LLM APIs bill per token, split into input (your prompt + context) and output (the model's response), priced separately per million tokens. Output tokens almost always cost several times more than input, so verbose responses dominate the bill. Your monthly cost is roughly (input tokens × input price + output tokens × output price) × requests. Context size, retries, and system prompts all add input tokens, so trimming context and capping output length are the biggest levers for cutting cost. This calculator makes those trade-offs visible before you commit to a model.
FAQ
- How do I estimate LLM API costs?
- Multiply your input tokens per request by the model's input price (per million tokens), add output tokens times the output price, then multiply by your number of requests. This calculator does it for you and shows per-request, monthly, and yearly totals — adjust the editable prices to match the provider's current rates.
- Why are output tokens more expensive than input?
- Generating tokens is more compute-intensive than reading them, so most providers price output several times higher than input (often 3–5×). That means long, verbose responses drive your bill more than large prompts — capping max output length is one of the most effective cost controls.
- Are the prices in this calculator accurate?
- They are indicative defaults as of June 2026 and are fully editable. API pricing changes frequently, so always confirm the current input and output price on the provider's official pricing page and update the fields accordingly.
- How can I reduce my LLM API bill?
- Trim context and system prompts (fewer input tokens), cap output length, use a smaller/cheaper model where quality allows, cache repeated context, and batch where possible. For privacy-sensitive or high-volume workloads, running a local LLM can remove per-token cost entirely.
This tool runs entirely in your browser; no data is transmitted or stored. Prices are indicative, editable defaults (as of June 2026) — always confirm current pricing with the provider. The link to Proton is an affiliate link, disclosed and marked rel="sponsored".