How much does Claude 3.5 Sonnet cost vs Claude Sonnet 4.6?

Claude Sonnet 4.6 costs $3.00 per million input tokens and $15.00 per million output tokens, which has remained the stable price point for Anthropic's highly efficient mid-tier workhorse.

How does prompt caching work on the Claude API?

Anthropic Claude supports prompt caching on all models. By adding a cache breakpoint to your system prompt or messages list, subsequent calls that hit the cache pay only 10% of the normal input token cost (a 90% discount).

Guide·June 2026

Anthropic Claude API Pricing Guide: Opus 4.8, Sonnet, & Haiku

Complete guide to Claude API pricing in 2026. Review pricing for Claude Opus 4.8, Claude Sonnet 4.6, Claude Haiku 4.5, and prompt caching discounts. Sourced from official docs.

Reviewed June 6, 2026 · Figures verified against official sources, June 6, 2026

Anthropic's Claude lineup is renowned for its high quality in coding, reasoning, and writing. The pricing structure is segmented cleanly across three model size families. Below is the live pricing table for the Anthropic Claude API family, sourced dynamically from our index.

Model ID	Input / 1M	Output / 1M	Cached Input / 1M	Max Tokens
Loading Claude models...

Unlocking Massive Savings with Prompt Caching

Anthropic was a pioneer in offering **Prompt Caching** on public APIs. If you have long, static system instructions, documents, or context history, you can tag them to be cached. This yields significant savings:

90% Off Input Cost: Cached input reads cost only 10% of the standard input rate (e.g. $0.50/1M for Opus 4.8 instead of $5.00/1M).
Fast Retrieval: Caching drastically reduces first-token latency (time-to-first-token) for long documents, which is ideal for real-time applications.
Simple API Integration: Simply add `"cache_control": {"type": "ephemeral"}` to block headers in your messages list.

For cross-provider comparison, read the global AI Model Pricing Comparison, or calculate custom usage via the Cost Calculator.

Frequently asked

Does Anthropic support Batch pricing?

Yes. Anthropic supports Message Batches via `/v1/messages/batches`. This provides a flat **50% discount** on input and output tokens for asynchronous jobs completing within 24 hours, matching OpenAI's batch discount.

Anthropic Claude API Pricing Guide: Opus 4.8, Sonnet, & Haiku

Unlocking Massive Savings with Prompt Caching

Frequently asked

OpenAI API Pricing Guide.

DeepSeek vs OpenAI Pricing.