Benchmark profile

United States of America Mathematical Olympiad 2026 (USAMO 2026)

The premier US mathematical olympiad competition, featuring proof-based problems that require deep mathematical insight and rigorous argumentation at the highest competition level.

Data verified July 23, 2026

Top models on USAMO 2026 — July 23, 2026

As of July 23, 2026, Claude Mythos 5 leads the USAMO 2026 leaderboard with 97.6% , followed by Claude Opus 4.8 (96.7%) and MiniMax M3 (85.7%).

1Closed

Claude Mythos 5

Anthropic

claude-mythos-5

97.6%

Overall 83.93Context 1M+

2Closed

Claude Opus 4.8

Anthropic

claude-opus-4-8

96.7%

Overall 78.34Context 1M

3Open

MiniMax M3

MiniMax

minimax-m3

85.7%

Overall 69.75Context 1M

3 modelsMath10% of category scoreCurrentUpdated July 23, 2026

Leaderboard (3 models)

Score

Claude Mythos 5Anthropic · Closed

97.6%

Claude Opus 4.8Anthropic · Closed

96.7%

MiniMax M3MiniMax · Open weight

85.7%

According to BenchLM.ai, Claude Mythos 5 leads the USAMO 2026 benchmark with a score of 97.6%, followed by Claude Opus 4.8 (96.7%) and MiniMax M3 (85.7%). The scores show moderate spread, with meaningful differences between the top tier and mid-tier models.

3 models have been evaluated on USAMO 2026. The benchmark falls in the Math category. This category carries a 5% weight in BenchLM.ai's overall scoring system. Within that category, USAMO 2026 contributes 10% of the category score, so strong performance here directly affects a model's overall ranking.

About USAMO 2026

Year

2026

Tasks

6 proof-based problems

Format

Mathematical proof construction

Difficulty

International olympiad level

USAMO represents the highest tier of US math competitions, serving as the selection exam for the International Mathematical Olympiad team. Problems require full proofs rather than just numerical answers. Mythos Preview scored 97.6%, GPT-5.4 scored 95.2%, Gemini 3.1 Pro scored 74.4%.

United States of America Mathematical Olympiad

BenchLM freshness & provenance

Version

USAMO 2026 2026

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

Current

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

FAQ

What does USAMO 2026 measure?

The premier US mathematical olympiad competition, featuring proof-based problems that require deep mathematical insight and rigorous argumentation at the highest competition level.

Which model scores highest on USAMO 2026?

Claude Mythos 5 by Anthropic currently leads with a score of 97.6% on USAMO 2026.

How many models are evaluated on USAMO 2026?

3 AI models have been evaluated on USAMO 2026 on BenchLM.

Compare Top Models on USAMO 2026

Claude Mythos 5 vs Claude Opus 4.8 Claude Opus 4.8 vs MiniMax M3

Last updated: July 23, 2026 · BenchLM version USAMO 2026 2026

Choose a model with this week’s evidence

Join 2,000+ readers for ranking moves, pricing changes, and the claims that still need proof.

One email each week. Unsubscribe anytime.