Question 1

How are token costs calculated?

Accepted Answer

Cost = (tokens × price per million) ÷ 1,000,000. Input and output tokens are priced separately because output generation is more compute-intensive. Cached input tokens (where supported) are priced at a discount — typically 50–90% off the standard input rate.

Question 2

What is prompt caching and how does it affect cost?

Accepted Answer

Prompt caching lets you reuse a previously processed prefix (system prompt, documents, few-shot examples) across multiple requests. The provider stores the KV cache and charges a reduced rate for cache hits. Anthropic charges ~10% of the input rate for cached tokens; OpenAI charges ~50%. If your system prompt is large and reused across many calls, caching can cut costs dramatically.

Question 3

How current are the prices?

Accepted Answer

Prices are hardcoded with a last-updated date shown on the page. LLM pricing changes frequently — always verify against the provider's official pricing page before committing to a budget. The tool is designed for quick estimates, not billing-accurate quotes.

Question 4

Why does the cheapest model not always win?

Accepted Answer

Cost is only one dimension. Cheaper models may require more tokens to produce the same quality output (more retries, longer prompts for few-shot examples), which can erase the per-token savings. The comparison table shows raw token cost; factor in quality and retry rate for a true total-cost-of-ownership comparison.

Question 5

How do I estimate tokens without running the model?

Accepted Answer

Use the LLM Token Counter tool on this site to get an approximate count for your prompt text. For exact counts, use the provider's tokenizer: tiktoken for OpenAI, the Anthropic SDK's token counting endpoint, or sentencepiece for Gemini/Llama.

Cheapest models for this workload	Total
Gemini 2.0 Flash-LiteGoogle	$0.1500
Gemini 2.0 FlashGoogle	$0.2000
GPT-4o miniOpenAI	$0.3000
DeepSeek V3DeepSeek	$0.5450
Llama 3.3 70BMeta	$0.7875
GPT-4.1 miniOpenAI	$0.8000
o3-miniOpenAI	$2.20
Claude Haiku 4Anthropic	$2.25

LLM Cost Calculator

About LLM Cost Calculator

What this tool does

Pipeline

Frequently asked