What is the newest AI model in 2026?

As of June 2026, Claude Opus 4.8 is the most recently released frontier model (May 28, 2026), followed by Gemini 3.5 Flash (May 19, 2026 GA at Google I/O). See the timeline on this page for the full chronological list.

Which AI models were released in 2026?

In 2026 (through June): Claude Opus 4.8 (May 28), Gemini 3.5 Flash (May 19), Claude Opus 4.7 (April), GPT-5.5 (April 24), DeepSeek V4-Pro and Flash (April 24), Kimi K2.6 (April 20), Qwen 3.6 (April 22), Grok 4.3 (April), Gemini 3.1 Pro (February), and Mistral Medium 3.5 (April).

Release Timeline · Updated June 1, 2026

AI model release timeline

Every major frontier and open-weight model release, newest first. Dates, pricing, and the key thing that changed. Updated same-day when new models ship.

Verified against official announcements No paid placements — every entry from official source

2026

May 28, 2026

Claude Opus 4.8 Frontier

Anthropic · Review →

$5/$25 per 1M tokens standard; $10/$50 Fast Mode. SWE-Bench Pro 69.2% (up from 64.3% on 4.7). Major alignment and honesty improvements. API: claude-opus-4-8.

May 19, 2026 (GA)

Gemini 3.5 Flash Frontier

Google · Review →

$1.50/$9.00 per 1M; free API tier. 1M token context, 64K output. 289 tok/s throughput — one of the fastest frontier models. Announced at Google I/O 2026.

April 2026

Claude Opus 4.7 Frontier

Anthropic · Review →

$5/$25 per 1M. SWE-Bench Pro 64.3%. Now superseded by 4.8 but still fully supported.

April 24, 2026

GPT-5.5 Frontier

OpenAI · Review →

$5/$30 per 1M standard; $2.50/$15 batch. 1.05M context, 128K output. Built for agentic coding and computer use. Successor to GPT-5.

April 24, 2026

DeepSeek V4-Pro & V4-Flash Open (MIT)

DeepSeek · Review →

V4-Pro: $0.435/$0.87 per 1M; cache hit $0.003625. V4-Flash: $0.14/$0.28 — cheapest commercial API. 1M context, 384K output. MIT license. Self-hosted or API.

April 22, 2026

Qwen 3.6 Open (Apache 2.0)

Alibaba · Review →

Apache 2.0. Dense 27B and MoE 35B-A3B variants. 262K native context (~1M extended). Strong multilingual — 201 languages. API: ~$0.30/$0.90 per 1M cloud.

April 20, 2026

Kimi K2.6 Open (Modified MIT)

Moonshot AI · Review →

$0.95/$4.00 per 1M; cached $0.16. 256K context. 1T total / 32B active MoE. Good multilingual performance.

April 2026

Grok 4.3 Frontier

xAI · Review →

$1.25/$2.50 per 1M; cached $0.20. 1M context. Real-time data via X. Very cheap output tokens for agentic loops.

April 2026

Mistral Medium 3.5 Open (Modified MIT)

Mistral

$1.50/$7.50 per 1M. Public preview. 256K context. Multimodal + reasoning. Self-host on 4 GPUs.

February 2026

Gemini 3.1 Pro Frontier

Google · Review →

$2/$12 per 1M (above 200K: $4/$18). 1M context, 64K output. Deepest reasoning in the Gemini family. Replaced Gemini 3 Pro.

2025

December 2, 2025

Mistral Large 3 Open (Apache 2.0)

Mistral · Review →

$0.50/$1.50 per 1M. Apache 2.0 — true open commercial license. 675B total / 41B active MoE. 256K context. API id: mistral-large-2512.

November 2025

Claude Haiku 4.5 Small

Anthropic · Review →

$1/$5 per 1M. 200K context. 145 tok/s. High-volume routing and classification tier.

September 2025

Claude Sonnet 4.6 Mid

Anthropic · Review →

$3/$15 per 1M. 200K context. 95 tok/s. The production daily-driver between Haiku and Opus.

August 7, 2025

GPT-5 Frontier

OpenAI · Review →

$1.25/$10 per 1M. 400K context, 128K output. The everyday OpenAI production pick at a rational price point.

April 5, 2025

Llama 4 Scout & Maverick Open (Community)

Meta · Review →

Free under Meta's Community License. Scout: 10M token context, 17B/109B; Maverick: 1M context, 17B/400B, 128 experts. Self-hosted at no API cost.

January 2025

Phi-4 Small

Microsoft

MIT license. 14B parameter dense model. 16K context. Built for edge/local inference. Punches above its weight on MMLU and math.

→ Deprecation tracker — which models are ending → Changelog — recent data updates → Full model rankings with benchr Rating → Recent releases — editorial coverage