Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?
120
แน1kแน35kresolved Feb 3
Resolved
YES1H
6H
1D
1W
1M
ALL
50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

See also:
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
๐ Top traders
| # | Trader | Total profit |
|---|---|---|
| 1 | แน1,425 | |
| 2 | แน881 | |
| 3 | แน680 | |
| 4 | แน631 | |
| 5 | แน310 |
People are also trading
Sort by:
boughtแน250YES