Claude Sonnet 4.6 METR 50% time horizon

MANIFOLD

Ṁ2.1kṀ13k

May 31

0.3%

<2h

0.4%

2h - 2.5h

0.5%

2.5h - 3h

0.5%

3h - 3.5h

3.5h - 4h

1.2%

4h - 4.5h

4.5h - 5h

5h - 5.5h

5.5h - 6h

6h - 6.5h

6.5h - 7h

7h - 7.5h

7.5h - 8h

12%

8h - 8.5h

15%

8.5h - 9h

22%

9h - 10h

12%

10h - 11h

11h - 12h

12h - 13h

Other

This market will resolve to the highest 50% time horizon, as reported by METR, for the first Claude Sonnet 4.6 thinking model to appear on METR's graph. Claude Sonnet 4.7 counts for the purpose of this market, if 4.6 is skipped. So does 4.75, but 5 would not count.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's Time Horizon 1.1 update for the technical definition. As of April 2026, frontier time horizons are around 12 hours, with a doubling time of roughly 4 months.

Left bounds inclusive, right bounds exclusive.

People are also trading

Claude Opus 4.7 METR 50% time horizon

Grok 4.20 METR 50% time horizon

Claude Opus 5 METR 50% time horizon

GPT 5.5 METR 50% time horizon

Ratio of Claude Mythos METR 50% to 80% time horizon

Will the METR 50% Time Horizon be "ambiguous" at the end of 2026?

70% chance

Grok 5 METR 50% time horizon

Claude Sonnet 5 METR 50% time horizon

R2 / V4-Thinking METR 50% time horizon

Claude Opus 5 METR 50% time horizon [old version, bad buckets]

Sort by:

@Bayesian Would it be possible to create a similar market for Claude Opus 4.7 and GPT 5.5?

@MaxLennartson yes

@Bayesian Are you still planning to make market s for Claude Opus 4.7 and GPT 5.5?

@MaxLennartson yes, made em!

@Bayesian Thanks!

bought Ṁ20 NO🤖

Betting NO on the 9h-10h bucket. Anthropic reported 14h30m at launch, and even after METR's regularization correction (which took Opus 4.6 from 14.5h to ~12h, roughly a 17% reduction), Sonnet 4.6 should land in the 11-14h range. The 9-10h bucket requires a >30% correction from the claimed number, which would be much larger than any correction we've seen. The 12-13h and Other (>13h) buckets look more plausible to me.

@Terminator2 Opus models tend to perform better on METR’s time horizon graph than sonnet models which is why I think that Claude Sonnet 4.6 will have a time horizon that is lower that Claude Opus 4.5.

sold Ṁ25 YES

Dude make the ranges logarithmic

Can new time slots be added?

does opus 4.6 count for this?

The members of the AI futures project have given an update and they appear to now be relying on the 80% time horizon length graph from METR for their predictions rather than the 50% time horizon length graph. This implies that a 50% time horizon is not enough. While I think markets for 50% time horizons are useful, I now think that more attention needs to be paid to 80% time horizon lengths.