By EOY 2025, will the model with the lowest perplexity on Common Crawl will not be based on transformers?
33
1kṀ11k
Dec 31
10%
chance
17

If perplexity on Common Crawl is not available for models, I will use other benchmarks as a surrogate. This will inherently be a judgement process. If a model has not been announced by EOY 2025 and no benchmarks have been posted publicly, it will not be counted for the purpose of this market.

"Based on transformers" for the purpose of this question will be anything with multi-headed self-attention that feeds into an MLP.

  • Update 2025-04-10 (PST) (AI summary of creator comment): Clarification on what constitutes 'based on transformers':

    • deepseek-style MLA with MoE is considered as based on transformers.

    • All current models, except for SSMs and LSTMs, are assumed to fall under the category of based on transformers.

    • The status of RWKV remains open for discussion.

Get
Ṁ1,000
to start trading!
Sort by:
bought Ṁ5,000 NO

deepseek-style MLA with MoE counts as "based on transformers" in my mind. So do all the current models that I'm aware of outside of SSMs and LSTMs. I'm willing to be convinced either way on RWKV.

predictedNO

A MoE of transformer model still counts as "based on transformers" right?

predictedNO

Just to be clear here, if we think that it will be based on transformers, then we should vote No?

predictedNO

@jonsimon correct

@jonsimon wish they didn't not write the title with not so many double negatives

predictedNO

@ConnorMcCormick oh yeah that's definitely confusing people. We'll, better for us who do understand it :)

and thanks for the exit gigacasting! $1 -> $68 in five minutes

predictedNO

there are few situations where you want to buy a new market from 50% to 99.2%, you're just giving free money away, use limit oreders pls

lmao i beat michael's bot in buying shares

predictedNO

giving me a solid +2892.5% profit for m$1

i'mt not entirely sure what NO means on this market though

@jacksonpolack The API only refreshes the data every 15 seconds, so if you're quick on the draw, it's totally doable.

predictedNO

i'mt not entirely sure what NO means on this market though

Lol, me neither.

© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules