By what factor will the cost for SotA SWE-agents drop from 2024 to 2025?
Plus
9
Ṁ871Jul 2
5%
<2x
8%
<10x
9%
<50x
21%
<250x
57%
>=250x
Algorithmic progress can be measured by reduction in cost to achieve equivalent performance. SWE-bench-lite is a popular benchmark for measuring scaffolded-LLM SWE capabilities.
By what factor will the cost of SWE-bench-lite SoTA drop between mid 2024-2025? Mid-2024 SotA is 43% costing $2,700 (per the devs), so this question will resolve Yes on the answer which most tightly bounds the reduction in cost to achieve 43% on July 1, 2025.
E.g. if in June 2025, 43% on SWE-lite costs $500 then that'd be a 5.4x reduction and the question would resolve (2) "<10x".
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Related questions
Related questions
Is Sam Altman right that we will see AI agents materially change the output of companies in 2025?
28% chance
Will OpenAI be in the lead in the AGI race end of 2026?
52% chance
Will we reach "weak AGI" by the end of 2025?
27% chance
Will AI resolve P vs NP by 2050?
44% chance
Will some U.S. software engineers be negatively affected financially due to AI by end of 2025?
65% chance
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2026?
65% chance
Will AI be Recursively Self Improving by mid 2026?
25% chance
How much will AI advances impact EA research effectiveness, by 2030?
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2027?
72% chance
AI resolves at least X% on SWE-bench WITH assistance, by 2028?