What will be true of DeepSeek's r2 model?
16
1.3kṀ1318
Mar 17
86%
Some version of it will score better on GPQA than o1-preview (73% pass@1)
62%
Its base model will be DeepSeek-V3
48%
It will score >= 78% on LiveBench when first added (R1 = 71.5% on March 12)
45%
DeepSeek will report it scores >= 65% on SWE-Bench Verified
45%
It will have a variant with more parameters than DeepSeek-r1 (not necessarily active)
45%
Manifold thinks it is better than expected in a poll asking if it is better or worse than expected
42%
It will support a longer context length than DeepSeek-r1
10%
Reaches Highest Arena Score on Chatbot Arena

everything N/As if a model called R2 created by DeepSeek is not released in 2025

Get
Ṁ1,000
to start trading!
Sort by:
bought Ṁ40 NO

Seeing rumors about a March 17 release

© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules