Anthropic Claude API Pricing Guide: Opus 4.8, Sonnet, & Haiku

Complete guide to Claude API pricing in 2026. Review pricing for Claude Opus 4.8, Claude Sonnet 4.6, Claude Haiku 4.5, and prompt caching discounts. Sourced from official docs.

· Figures verified against official sources, June 6, 2026

Anthropic's Claude lineup is renowned for its high quality in coding, reasoning, and writing. The pricing structure is segmented cleanly across three model size families. Below is the live pricing table for the Anthropic Claude API family, sourced dynamically from our index.

Model ID Input / 1M Output / 1M Cached Input / 1M Max Tokens
Loading Claude models...

Unlocking Massive Savings with Prompt Caching

Anthropic was a pioneer in offering **Prompt Caching** on public APIs. If you have long, static system instructions, documents, or context history, you can tag them to be cached. This yields significant savings:

  • 90% Off Input Cost: Cached input reads cost only 10% of the standard input rate (e.g. $0.50/1M for Opus 4.8 instead of $5.00/1M).
  • Fast Retrieval: Caching drastically reduces first-token latency (time-to-first-token) for long documents, which is ideal for real-time applications.
  • Simple API Integration: Simply add `"cache_control": {"type": "ephemeral"}` to block headers in your messages list.

For cross-provider comparison, read the global AI Model Pricing Comparison, or calculate custom usage via the Cost Calculator.

Frequently asked

Does Anthropic support Batch pricing?

Yes. Anthropic supports Message Batches via `/v1/messages/batches`. This provides a flat **50% discount** on input and output tokens for asynchronous jobs completing within 24 hours, matching OpenAI's batch discount.