
What will be the highest score achieved on SWE-Bench Verified in 2025?
15
1kṀ12692026
1H
6H
1D
1W
1M
ALL
8%
<70
31%
70-85 inclusive
61%
>85
https://openai.com/index/introducing-swe-bench-verified/
https://www.swebench.com/
Highest performance reported before 2026. Any run on https://www.swebench.com/ counts. Large AI company reported numbers count whether or not they're listed on swebench.com Other claimed scores will generally not be counted unless verified by a third party.
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
Sort by:
@JacobPfau Does Introducing Codex resolve <70 NO? Very annoyingly they don't give a number, but in the plot codex-1 pass@1 is clearly above 70%.
@SanghyeonSeo Don't see an option to resolve individual options, IIRC there are two types of multiple choice questions
People are also trading
Related questions
What will be the best performance on SWE-bench Verified by December 31st 2025?
Top SWE-Bench Verified score in 2025?
85.0
Top Multi-SWE-bench score in 2025?
47.5
Will SotA on PaperBench (Code-Dev) surpass 75% in 2025?
40% chance
When will SWE-bench be solved?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
AI resolves at least X% on SWE-bench without any assistance, by 2028?
What will be the best score (5/5 reliability) on ZeroBench by December 31st 2025?
What will be the best score on Cybench by December 31st 2025?
BIG-bench accuracy 75% #3: Will SOTA for a single model on BIG-bench pass 75% by the start of 2026?
86% chance