Claude API Text Summarization Python (2026 Working Examples)

Working Python code to summarize text, documents, and URLs with the Claude API in 2026. Covers bullet-point summaries, extractive key points, long-document chunking, and structured JSON output.

Claude produces high-quality summaries out of the box — no fine-tuning required. The key is a precise system prompt that specifies length, format, and perspective. Below are working patterns for the most common summarization use cases.

Minimal summarization

Bullet-point key points

Structured JSON summary

Long-document map-reduce summarization

Webpage summarization (with BeautifulSoup)

Batch summarization (50% cost, async)

Model comparison for summarization

Frequently asked questions

Model	Cost per 1K tokens (in/out)	Context	Best for
claude-haiku-4-5-20251001	$0.00025 / $0.00125	200K	High-volume bulk summarization
claude-sonnet-4-6	$0.003 / $0.015	200K	Nuanced documents, structured output
claude-opus-4-7	$0.015 / $0.075	200K	Complex research papers, board reports

What is the best Claude model for summarization?

For most summarization tasks, `claude-haiku-4-5-20251001` gives the best cost/quality trade-off — it is 5-10× cheaper than Sonnet and produces excellent summaries for documents under 50K tokens. Use `claude-sonnet-4-6` for nuanced long documents where quality is critical.

How long of a document can Claude summarize?

Claude Sonnet 4.6 and Opus 4.7 support 200K token context windows — roughly 150,000 words or a full novel. For longer documents, use a map-reduce chunking approach: summarize each chunk independently, then ask Claude to synthesize the chunk summaries.

How do I get a structured summary (bullet points, JSON)?

Specify the output format in your system prompt: 'Return a JSON object with keys: summary (2-3 sentences), key_points (list of 5 strings), sentiment (positive|neutral|negative).' Claude follows structured output instructions reliably, especially with `claude-sonnet-4-6`.

Can Claude summarize a URL or webpage?

Claude cannot fetch URLs directly — you must extract the text first. Use `requests` + `BeautifulSoup` to scrape the page text, then pass it to Claude. For PDFs, use `pypdf` or the Claude Files API.

How do I summarize many documents in bulk?

Use the Batch API (`client.beta.messages.batches.create`) for bulk summarization. It processes up to 10,000 requests asynchronously at 50% cost reduction compared to real-time API calls. Ideal for nightly document pipelines.

Claude API Text Summarization (Python)