
What will be the highest score achieved on SWE-Bench Verified in 2025?
25
Ṁ1kṀ4.8kresolved Jan 3
1H
6H
1D
1W
1M
ALL
100%91%
70-85 inclusive
1.0%
<70
8%
>85
https://openai.com/index/introducing-swe-bench-verified/
https://www.swebench.com/
Highest performance reported before 2026. Any run on https://www.swebench.com/ counts. Large AI company reported numbers count whether or not they're listed on swebench.com Other claimed scores will generally not be counted unless verified by a third party.
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
🏅 Top traders
| # | Trader | Total profit |
|---|---|---|
| 1 | Ṁ666 | |
| 2 | Ṁ265 | |
| 3 | Ṁ114 | |
| 4 | Ṁ102 | |
| 5 | Ṁ53 |
Sort by:
@JacobPfau Does Introducing Codex resolve <70 NO? Very annoyingly they don't give a number, but in the plot codex-1 pass@1 is clearly above 70%.
@SanghyeonSeo Don't see an option to resolve individual options, IIRC there are two types of multiple choice questions
People are also trading
Related questions
Best SWE-Bench Pro public score by June 30, 2026
What will be the highest score on the SWE-bench pro private set before 2027?
68.0
Top SWE-Bench Pro score by Jan 1, 2027?
78.3
What will be the best GSOBench score by Dec 31, 2026?
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
10/29/27
When will SWE-bench be solved?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
AI resolves at least X% on SWE-bench without any assistance, by 2028?
Will Claude Sonnet 5 exceed 85% on SWE-bench verified?
16% chance
Will Anthropic’s next Sonnet model exceed 83% on SWE-bench verified?
50% chance
