Release Timeline · Updated June 1, 2026

AI model release timeline

Every major frontier and open-weight model release, newest first. Dates, pricing, and the key thing that changed. Updated same-day when new models ship.

Verified against official announcements No paid placements — every entry from official source

2026

May 28, 2026
Claude Opus 4.8 Frontier
Anthropic · Review →
$5/$25 per 1M tokens standard; $10/$50 Fast Mode. SWE-Bench Pro 69.2% (up from 64.3% on 4.7). Major alignment and honesty improvements. API: claude-opus-4-8.
May 19, 2026 (GA)
Gemini 3.5 Flash Frontier
Google · Review →
$1.50/$9.00 per 1M; free API tier. 1M token context, 64K output. 289 tok/s throughput — one of the fastest frontier models. Announced at Google I/O 2026.
April 2026
Claude Opus 4.7 Frontier
Anthropic · Review →
$5/$25 per 1M. SWE-Bench Pro 64.3%. Now superseded by 4.8 but still fully supported.
April 24, 2026
GPT-5.5 Frontier
OpenAI · Review →
$5/$30 per 1M standard; $2.50/$15 batch. 1.05M context, 128K output. Built for agentic coding and computer use. Successor to GPT-5.
April 24, 2026
DeepSeek V4-Pro & V4-Flash Open (MIT)
DeepSeek · Review →
V4-Pro: $0.435/$0.87 per 1M; cache hit $0.003625. V4-Flash: $0.14/$0.28 — cheapest commercial API. 1M context, 384K output. MIT license. Self-hosted or API.
April 22, 2026
Qwen 3.6 Open (Apache 2.0)
Alibaba · Review →
Apache 2.0. Dense 27B and MoE 35B-A3B variants. 262K native context (~1M extended). Strong multilingual — 201 languages. API: ~$0.30/$0.90 per 1M cloud.
April 20, 2026
Kimi K2.6 Open (Modified MIT)
Moonshot AI · Review →
$0.95/$4.00 per 1M; cached $0.16. 256K context. 1T total / 32B active MoE. Good multilingual performance.
April 2026
Grok 4.3 Frontier
xAI · Review →
$1.25/$2.50 per 1M; cached $0.20. 1M context. Real-time data via X. Very cheap output tokens for agentic loops.
April 2026
Mistral Medium 3.5 Open (Modified MIT)
Mistral
$1.50/$7.50 per 1M. Public preview. 256K context. Multimodal + reasoning. Self-host on 4 GPUs.
February 2026
Gemini 3.1 Pro Frontier
Google · Review →
$2/$12 per 1M (above 200K: $4/$18). 1M context, 64K output. Deepest reasoning in the Gemini family. Replaced Gemini 3 Pro.

2025

December 2, 2025
Mistral Large 3 Open (Apache 2.0)
Mistral · Review →
$0.50/$1.50 per 1M. Apache 2.0 — true open commercial license. 675B total / 41B active MoE. 256K context. API id: mistral-large-2512.
November 2025
Claude Haiku 4.5 Small
Anthropic · Review →
$1/$5 per 1M. 200K context. 145 tok/s. High-volume routing and classification tier.
September 2025
Claude Sonnet 4.6 Mid
Anthropic · Review →
$3/$15 per 1M. 200K context. 95 tok/s. The production daily-driver between Haiku and Opus.
August 7, 2025
GPT-5 Frontier
OpenAI · Review →
$1.25/$10 per 1M. 400K context, 128K output. The everyday OpenAI production pick at a rational price point.
April 5, 2025
Llama 4 Scout & Maverick Open (Community)
Meta · Review →
Free under Meta's Community License. Scout: 10M token context, 17B/109B; Maverick: 1M context, 17B/400B, 128 experts. Self-hosted at no API cost.
January 2025
Phi-4 Small
Microsoft
MIT license. 14B parameter dense model. 16K context. Built for edge/local inference. Punches above its weight on MMLU and math.
→ Deprecation tracker — which models are ending → Changelog — recent data updates → Full model rankings with benchr Rating → Recent releases — editorial coverage
benchr dispatch

New-model coverage the day it ships. This timeline stays current only because of that discipline.