happened to see this because I'm following the market (on future ones you can @ mods and someone will grab it)
I can resolve - can you link me to the evidence please?
sure! feel free to reply to this comment thread with the screenshot when more can be resolved and I'll take care of it
It appears that while Devin gets really good scores on SWE-bench (14%), it's misleading. They don't test on SWE-bench; they test on a small subset of SWE-bench which contains only pull requests.
@firstuserhere seeing a new pfp is so disorienting 😅 and it's nice that you're back
anyone with access to Devin will be able to test on SWE Bench, right?
@firstuserhere Do you have any info beyond what was posted on their blog?
"Devin was evaluated on a random 25% subset of the dataset. Devin was unassisted, whereas all other models were assisted (meaning the model was told exactly which files need to be edited)."
- https://www.cognition-labs.com/introducing-devin
This sounds exactly like how they tested GPT-4.
"GPT-4 is evaluated on a random 25% subset of the dataset."
So to me that's valid and fair. The wording on the blog implies Cognition ran the benchmark themselves. I could understand waiting for independent verification, although it might be too cost-prohibitive for others to run, so we might wait forever in that case.
@firstuserhere Yeah, I'd love a source for the "only pull requests" claim. My impression was that it's a random 25% subset.
@Nikola The SWE-Bench dataset is pull requests. Any random subset is only pull requests.
SWE-bench is a dataset that tests systems' ability to solve GitHub issues automatically. The dataset collects 2,294 Issue-Pull Request pairs from 12 popular Python repositories. Evaluation is performed by unit test verification using post-PR behavior as the reference solution.
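For anyone curious what a "random 25% subset" of that would look like in practice, here's a rough sketch using the HuggingFace datasets library. The dataset ID and field names are my assumptions (I believe it's published as princeton-nlp/SWE-bench, but I haven't double-checked), so treat this as illustrative, not an official eval script:

```python
# Rough sketch: sample a random 25% subset of SWE-bench, like the blog describes.
# Assumes the dataset is published as "princeton-nlp/SWE-bench" on HuggingFace
# and has the fields shown below -- both are unverified assumptions.
from datasets import load_dataset

swe_bench = load_dataset("princeton-nlp/SWE-bench", split="test")  # ~2,294 issue-PR pairs

subset_size = len(swe_bench) // 4  # random 25% subset, as in the Devin/GPT-4 evals
subset = swe_bench.shuffle(seed=0).select(range(subset_size))

for task in subset.select(range(3)):
    # Each task pairs a GitHub issue with the PR that fixed it; the agent only
    # sees the issue, and is graded by the repo's unit tests after applying its patch.
    print(task["repo"], task["instance_id"])
    print(task["problem_statement"][:200], "...")
```

The point being: every instance in the dataset is an issue-PR pair, so "only pull requests" isn't a narrowing, it's just what SWE-bench is.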
From https://www.cognition-labs.com/blog
We evaluated Devin on SWE-bench, a challenging benchmark that asks agents to resolve real-world GitHub issues found in open source projects like Django and scikit-learn.
Devin correctly resolves 13.86%* of the issues end-to-end, far exceeding the previous state-of-the-art of 1.96%. Even when given the exact files to edit, the best previous models can only resolve 4.80% of issues.
We plan to publish a more detailed technical report soon—stay tuned for more details.
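If I'm doing the math right, a 25% subset of the 2,294 tasks is roughly 573 instances, so 13.86% works out to about 79 issues resolved end-to-end.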