What is the cheapest Claude model?

claude-haiku-4-5 is the cheapest Claude model at $0.80/$4 per million input/output tokens, about 73% cheaper than claude-sonnet-4-6 ($3/$15). It supports the full 200K context window and all API features (tool use, vision, streaming, caching). Use it for classification, summarization, and any task where Sonnet-level quality is not required.

Claude API Pricing 2026 — Complete Developer Guide

Complete Claude API pricing guide for 2026: all model prices, prompt caching savings, batch API discount, context window costs, and how to estimate your bill. With Python examples.

Claude API pricing is straightforward once you understand the three levers: model tier, prompt caching, and the Batch API. Here's everything you need to estimate and minimize your bill.

2026 Claude API pricing table

Prompt caching prices

Batch API: 50% discount for async workloads

Cost estimation in Python

Prompt caching example — production chatbot

Cost comparison: models by task type

Cost optimization checklist

Frequently asked questions

Model	Input ($/M tokens)	Output ($/M tokens)	Context	Best for
claude-haiku-4-5	$0.80	$4	200K	High-volume classification, short responses
claude-sonnet-4-6	$3	$15	200K	General production workloads
claude-opus-4-7	$15	$75	200K	Complex reasoning, highest quality

Model	Cache write ($/M)	Cache read ($/M)	Savings vs normal input
claude-haiku-4-5	$1.00	$0.08	90% on reads
claude-sonnet-4-6	$3.75	$0.30	90% on reads
claude-opus-4-7	$18.75	$1.50	90% on reads

Model	Batch input ($/M)	Batch output ($/M)
claude-haiku-4-5	$0.40	$2
claude-sonnet-4-6	$1.50	$7.50
claude-opus-4-7	$7.50	$37.50

Task	Typical tokens	Haiku cost	Sonnet cost	Opus cost
Classify a tweet (in/out)	100 / 10	$0.00012	$0.00045	$0.00225
Summarize article (in/out)	800 / 200	$0.0014	$0.0054	$0.027
Code review 500 lines	3K / 1K	$0.0064	$0.024	$0.12
Analyze 50-page PDF	40K / 2K	$0.04	$0.15	$0.75
Analyze 150K-token codebase	150K / 5K	$0.14	$0.525	$2.625
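
The per-task figures above follow directly from the per-million prices. Here is a minimal sketch of that arithmetic, assuming the prices listed in this guide are current. The `PRICES` dictionary and the `request_cost` helper are illustrative names, not part of any Anthropic SDK.

```python
# Prices in USD per million tokens, copied from the pricing table in this guide.
# Verify them against the Anthropic pricing page before relying on the output.
PRICES = {
    "claude-haiku-4-5": {"input": 0.80, "output": 4.00},
    "claude-sonnet-4-6": {"input": 3.00, "output": 15.00},
    "claude-opus-4-7": {"input": 15.00, "output": 75.00},
}


def request_cost(model: str, input_tokens: int, output_tokens: int,
                 batch: bool = False) -> float:
    """Return the USD cost of one request.

    batch=True applies the 50% Batch API discount described in this guide.
    """
    price = PRICES[model]
    cost = (input_tokens * price["input"]
            + output_tokens * price["output"]) / 1_000_000
    return cost * 0.5 if batch else cost


if __name__ == "__main__":
    # Code review of 500 lines: roughly 3K input tokens and 1K output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 3_000, 1_000):.4f}")
```

Passing `batch=True` halves the result, which is a quick way to see what an asynchronous workload would cost on the same model.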

How much does the Claude API cost in 2026?

As of 2026: claude-haiku-4-5 is $0.80/$4 per million input/output tokens. claude-sonnet-4-6 is $3/$15. claude-opus-4-7 is $15/$75. With prompt caching, cached input tokens cost 90% less. The Batch API cuts all prices 50% for async workloads. Always verify current rates on the Anthropic pricing page as prices change.

Does Anthropic have a free tier for the Claude API?

Anthropic does not advertise a permanent free tier for the production API. New accounts may get a small credit to test the API. For budget-conscious development, use claude-haiku-4-5 (cheapest model) and the Batch API (50% discount). Claude.ai (the web interface) has a free plan but it does not give API access.

What is prompt caching and how much does it save?

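Prompt caching stores frequently used context blocks (system prompts, documents, examples) server-side. On claude-sonnet-4-6, requests that hit the cache pay $0.30 per million cached input tokens instead of $3, a 90% reduction. Writing a block to the cache costs $3.75 per million tokens, a 25% premium over normal input, so caching pays for itself from the first reuse. Output tokens are still billed at the normal rate ($15 per million). Essential for production chatbots, RAG pipelines, and any app that reuses a long system prompt.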
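
To see how the savings build up over many requests, the following sketch compares the cost of resending the same prefix with and without caching, using the claude-sonnet-4-6 prices from this guide. It assumes every request after the first is a cache hit, which will not hold if the cache entry expires between calls. The `prefix_cost` function and the 5,000-token prompt are hypothetical.

```python
# claude-sonnet-4-6 prices in USD per million tokens, taken from this guide.
INPUT = 3.00        # normal (uncached) input
CACHE_WRITE = 3.75  # the first request writes the block to the cache
CACHE_READ = 0.30   # later requests read the block back


def prefix_cost(prefix_tokens: int, requests: int, cached: bool) -> float:
    """USD cost of sending the same prefix on `requests` calls.

    Assumption: with caching on, the first call is a cache write and
    every later call is a cache read (no expiry, no misses).
    """
    if requests <= 0:
        return 0.0
    if not cached:
        return prefix_tokens * requests * INPUT / 1_000_000
    write = prefix_tokens * CACHE_WRITE / 1_000_000
    reads = prefix_tokens * (requests - 1) * CACHE_READ / 1_000_000
    return write + reads


if __name__ == "__main__":
    # Hypothetical chatbot: 5,000-token system prompt, 1,000 requests.
    plain = prefix_cost(5_000, 1_000, cached=False)
    cached = prefix_cost(5_000, 1_000, cached=True)
    print(f"uncached: ${plain:.2f}  cached: ${cached:.2f}")
```

A single request is slightly more expensive with caching because of the write premium. From the second request onward the cached path is cheaper.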

How do I estimate my Claude API bill?

Multiply your expected input tokens by the input price and your output tokens by the output price, then add the two. As a rule of thumb, 1 token ≈ 0.75 words. A 1,000-word user message plus a 500-word system prompt ≈ 2,000 input tokens; a 500-word response ≈ 667 output tokens. With claude-sonnet-4-6: 2,000 input × $0.000003 = $0.006, plus 667 output × $0.000015 ≈ $0.010, for a total of about $0.016 per call. Use the Claude Cost Calculator for detailed estimates.
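
The estimate above can be turned into a rough monthly projection. This is a sketch only: the 0.75 words-per-token ratio is the rule of thumb from this guide, real token counts vary with the text, and the traffic figure of 1,000 calls per day is a made-up example. The function names are illustrative.

```python
WORDS_PER_TOKEN = 0.75  # rule of thumb from this guide; real tokenization varies


def words_to_tokens(words: int) -> int:
    """Approximate a token count from a word count."""
    return round(words / WORDS_PER_TOKEN)


def monthly_bill(input_words: int, output_words: int, calls_per_day: int,
                 input_price: float, output_price: float,
                 days: int = 30) -> float:
    """Estimate a monthly bill in USD.

    Prices are USD per million tokens. The result ignores prompt caching
    and the Batch API discount.
    """
    input_tokens = words_to_tokens(input_words)
    output_tokens = words_to_tokens(output_words)
    per_call = (input_tokens * input_price
                + output_tokens * output_price) / 1_000_000
    return per_call * calls_per_day * days


if __name__ == "__main__":
    # 1,500 input words and 500 output words per call on claude-sonnet-4-6
    # ($3/$15 per million tokens), at a hypothetical 1,000 calls per day.
    print(f"${monthly_bill(1_500, 500, 1_000, 3.00, 15.00):.2f} per month")
```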

Is there a Claude API trial or sandbox?

New Anthropic accounts may receive a small API credit for testing. The API itself has no separate sandbox: you use the production endpoint with test keys. Unlike OpenAI, Anthropic does not have a 'playground' tier separate from the API; use the Claude.ai web interface for interactive testing, then move to the API for integration.
