MANIFOLD
What will the top score on Humanity's Last Exam be when it is released?
12
Ṁ1kṀ5k
resolved Jan 23
Resolved
NO
>10%
Resolved
NO
>20%
Resolved
NO
>30%
Resolved
NO
>40%
Resolved
NO
>50%

https://www.safe.ai/blog/humanitys-last-exam

Humanity's Last Exam is a new benchmark developed by CAIS and Scale AI to measure AI's proximity to expert-level capabilities. The goal is to create the world's most difficult AI test by gathering complex questions from experts across fields. The project invites experts to submit challenging questions, with accepted contributors offered co-authorship on the resulting paper and prizes from a $500,000 pool.

Currently, to qualify for the benchmark, all models must fail the submitted question.

Market context
Get
Ṁ1,000
to start trading!

🏅 Top traders

#TraderTotal profit
1Ṁ724
2Ṁ200
3Ṁ106
4Ṁ33
5Ṁ27
Sort by:
bought Ṁ100 NO

https://agi.safe.ai/ top accuracy is 9.1 or 9.4 percent (o1 or DeepSeek R1).

© Manifold Markets, Inc.TermsPrivacy