AI Infrastructure Pricing Intelligence

Know the real cost of AI

Token pricing, API costs, model comparisons, and ROI analysis — everything you need to make informed decisions before you ship.

40+
Models tracked
6
Calculator tools
$0
Cost to use
Calculations

Pick your calculator

Six precision tools for every stage of your AI cost analysis — from quick token estimates to full ROI breakdowns.

Current model pricing

Loading…

Prices per 1M tokens. Updated regularly. Click any row to open the token calculator with that model pre-selected.

Model Input ($/1M) Output ($/1M) Context Speed

Why this exists

AI API pricing is opaque, inconsistent, and changes frequently. We built costofaicalc.com to give developers and teams a single, clear tool for understanding and forecasting AI infrastructure costs.

No paywalls. No accounts. Just open the calculator you need and get answers in seconds. Prices are sourced directly from provider documentation and updated when changes are announced.

Missing a model?

If a model or pricing update is missing, let us know and we'll add it.

Token Cost Calculator

Estimate the exact API cost for a single call or average request across any major language model.

Prompt, system message, context, documents
Generated response length
~4 chars per token estimated
Multiply for batch estimation
Total cost
$0.0000
per request
Input cost $0.0000
Output cost $0.0000
Input price
Output price
Total tokens
Cost per 1M tokens
1,000 requests
1M requests
Daily (100 req/day)
Monthly (3k req/day)
Tip: Output tokens are typically 2–5× more expensive than input tokens. Caching repeated prompts can save 80–90% on input costs with providers that support prompt caching.

Monthly Spend Estimator

Forecast your monthly API bill from real usage patterns. Set volumes, average lengths, and see a full cost breakdown.

Include system prompt in this count
Cache hits typically cost 10% of normal input price
Month 1 cost
$0.00
Month 6 cost (with growth)
$0.00
Month 12 cost
$0.00
Monthly requests
Monthly input tokens
Monthly output tokens
Cache savings
Cost per request
At scale: Costs compound quickly with growth. Consider negotiating volume discounts at $10k+/month spend. Switching to a smaller model can cut costs 80–95%.

ROI Calculator

Calculate the true return on your AI investment. Compare labor costs saved against API fees to find net benefit and payback period.

Include salary, benefits, overhead
Use the Monthly Spend Estimator to calculate this
One-time: dev time, tooling, testing
Monthly net savings
$0
Payback period
Labor saved (monthly)
API cost (monthly)
Time saved per task
Total hours saved/mo
Efficiency gain
Annual benefit
Configure inputs to see your ROI analysis.

Model Comparison

Side-by-side cost comparison for any two models at your exact workload.

Model A
per request
Model B
per request
Monthly cost (A)
Monthly cost (B)
Monthly difference
Annual difference
Cheaper by
Select two models to compare.

Context Window Cost

Understand the true cost of large context windows, document ingestion, and long conversations.

~750 words ≈ 1,000 tokens
Cost per conversation
$0.0000
Total context at end
Avg input per turn
Total input tokens
Total output tokens
Daily cost
Monthly cost
Context grows fast. In a 10-turn conversation, turn 10 includes ALL previous messages in the input. Long system prompts and documents are repeated on every turn — a major hidden cost.

Batch Processing Cost

Compare real-time API calls vs batch processing. Batch APIs (like Anthropic's) offer 50% discounts for async workloads.

Anthropic Batch API: 50%. OpenAI Batch API: 50%.
Real-time cost
Batch cost
You save
Total input tokens
Total output tokens
Real-time per request
Batch per request
Savings %
Best for: Bulk data enrichment, document processing, large-scale inference, overnight jobs. Batch APIs accept the same payloads but run async — results within 24h. Not suitable for real-time user interactions.