Pricing data
Per-token pricing for closed-source models comes directly from each provider's official pricing page (Anthropic, OpenAI, Google, Mistral, DeepSeek). For open-weight models served by third-party inference providers, pricing references that inference provider's own published rates where relevant. When pricing changes, the article is updated and a changelog entry is added.
Benchmark scores
Benchmark numbers are sourced from the benchmark maintainers' published leaderboards. SWE-bench Verified scores come from swebench.com. LMSYS Arena scores come from lmarena.ai. ARC-AGI scores come from arcprize.org. When a provider publishes a model's score on a benchmark before it appears on the official leaderboard, the provider's published figure is used with attribution.
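The sourcing rule above can be sketched as a small selection function. This is a minimal illustration, not the site's actual tooling: the names `CitedScore` and `pick_score` are hypothetical, and the assumption that an official leaderboard figure takes precedence once it exists is an inference from the rule, not something stated on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CitedScore:
    """A benchmark figure together with where it came from (hypothetical type)."""
    value: float
    source: str       # "leaderboard" or "provider"
    attribution: str  # text shown next to the number


def pick_score(
    leaderboard: Optional[float],
    provider: Optional[float],
    leaderboard_site: str,
    provider_name: str,
) -> Optional[CitedScore]:
    """Choose which figure to cite for one model on one benchmark.

    Assumption: the official leaderboard figure wins whenever it exists;
    the provider's own figure is a fallback and is always attributed.
    """
    if leaderboard is not None:
        return CitedScore(leaderboard, "leaderboard", leaderboard_site)
    if provider is not None:
        return CitedScore(provider, "provider", f"reported by {provider_name}")
    return None  # nothing published yet, so no number is shown


# Placeholder values for illustration only, not real scores.
early = pick_score(None, 50.0, "swebench.com", "ExampleLab")
later = pick_score(48.5, 50.0, "swebench.com", "ExampleLab")
```

In this sketch, `early` carries the provider's figure with an explicit "reported by" attribution, while `later` switches to the leaderboard figure once one is available.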
Capability ratings
Where this site assigns capability ratings (coding, reasoning, writing, vision, long context, multilingual) on a 0–100 scale, the ratings are synthesized from three inputs: the model's documented benchmark performance on relevant evaluations, capability claims in the model's release notes, and observed behavior in published comparisons. They are synthesized reference figures, not scores from an original evaluation run by this site.
Recommendations and use-case fit
Recommendations about which model fits which workload are based on the documented capabilities and pricing structure of each model, not on original benchmark testing performed by this publication. Where personal experience is referenced in an article, it is described as such, and the testing setup is shown.
Update cadence
Pricing tables are checked against provider documentation when articles are revised. Model release and deprecation events are added to articles within several days of the announcement. All model data is systematically re-verified before major article revisions; there is no fixed weekly or monthly cycle.
Corrections and disputes
If you find a number, date, or attribution that does not match the primary source, send a note to corrections@benchr.org. Material corrections are noted on the corrections page and in the article changelog.