Top Multi-SWE-bench score in 2025?

24

Ṁ10kṀ45k

resolved Jan 30

Resolved

20 - 39%

1H

6H

1D

1W

1M

ALL

100%87%

20 - 39%

0.9%

0 - 19%

7%

40 - 59%

3%

60 - 79%

1.8%

80 - 100%

SWE-bench is a great AI benchmark, but it is Python-only. Multi-SWE-bench is the same thing with multiple programming languages: C, C++, Java, JavaScript, TypeScript, Go, Rust.

Claude 3.7 Sonnet based agent achieved a score of 19% in 2025-03-29, which is currently the best score. The score will be rounded. ("Rounding half up" to be exact, see Rounding.)

The resolution will be primarily from the official leaderboard, but other announcements from reputable organizations will be considered.

See also /SG/top-swebench-verified-score-in-2025

Market context

Get

1,000

to start trading!

🏅 Top traders

#	Trader	Total profit
1		Ṁ4,452
2		Ṁ1,266
3		Ṁ863
4		Ṁ450
5		Ṁ319

People are also trading

Best SWE-Bench Pro public score by June 30, 2026

Top SWE-Bench Pro score by Jan 1, 2027?

What will be the highest score on the SWE-bench pro private set before 2027?

When will SWE-bench be solved?

What will be the best GSOBench score by Dec 31, 2026?

In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?

AI resolves at least X% on SWE-bench WITH assistance, by 2028?

AI resolves at least X% on SWE-bench without any assistance, by 2028?

Will Claude Sonnet 5 exceed 85% on SWE-bench verified?

Which LLM Maker will hold the top Safety Score for Spiral-Bench on https://eqbench.com/spiral-bench.html on Jan1, 2027?

Sort by:

@SanghyeonSeo this can either N/A or resolve 20-39% right? (No updates, but MopenHands + Gemini-2.5-Pro is listed at 21.62)

@draaglom cc @mods

It stopped being measured

Have you tried gemini 2.5 pro experimental on it yet?

@ian The leaderboard on the website shows something with Gemini 2.5 Pro at 21.62%:

https://multi-swe-bench.github.io/#/

(Not sure what Mopenhands is...)

@TimothyJohnson5c16 thanks!

People are also trading

Best SWE-Bench Pro public score by June 30, 2026

Top SWE-Bench Pro score by Jan 1, 2027?

What will be the highest score on the SWE-bench pro private set before 2027?

When will SWE-bench be solved?

What will be the best GSOBench score by Dec 31, 2026?

In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?

AI resolves at least X% on SWE-bench WITH assistance, by 2028?

AI resolves at least X% on SWE-bench without any assistance, by 2028?

Will Claude Sonnet 5 exceed 85% on SWE-bench verified?

Which LLM Maker will hold the top Safety Score for Spiral-Bench on https://eqbench.com/spiral-bench.html on Jan1, 2027?

Related questions

Best SWE-Bench Pro public score by June 30, 2026

Top SWE-Bench Pro score by Jan 1, 2027?

What will be the highest score on the SWE-bench pro private set before 2027?

When will SWE-bench be solved?

What will be the best GSOBench score by Dec 31, 2026?

In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?

AI resolves at least X% on SWE-bench WITH assistance, by 2028?

AI resolves at least X% on SWE-bench without any assistance, by 2028?

Will Claude Sonnet 5 exceed 85% on SWE-bench verified?

Which LLM Maker will hold the top Safety Score for Spiral-Bench on https://eqbench.com/spiral-bench.html on Jan1, 2027?

© Manifold Markets, Inc.•Terms•Privacy