Which Benchmarks will OpenAI show results from GPT-5 on, when it is announced?
15
1.1kṀ3182
2026
94%
SimpleQA
87%
SWE-Bench
83%
GPQA
71%
HumanEval
67%
ARC-AGI-2
48%
MMLU
38%
MATH
35%
Big-Bench-Hard
31%
MGSM
19%
DROP
8%
GSM8K

Some flexibility on variations of specific benchmarks. eg SWE-Bench-Hard would resolve SWE-Bench YES.

  • Update 2025-05-11 (PST) (AI summary of creator comment): The benchmarks must be those that GPT-5 is benchmarked against by OpenAI.

Must be on roughly the same day / during / around the time of the announcement. If there are several announcements over multiple days, all those times are acceptable for the purpose of this market.

Get
Ṁ1,000
to start trading!
© Manifold Markets, Inc.TermsPrivacy