MANIFOLD
Which of these 10 things will GPT-5 do? (@flowersslop)
29
Ṁ1kṀ3.4k
resolved Nov 3
Resolved
NO
10. It must not make any dumb, viral mistakes in week one. No 9.9–9.11, no “how many Rs in strawberry” type errors. Nothing memeably stupid.
Resolved
NO
6. Better or overhauled UI. Not just current ChatGPT with GPT-5 slapped in. @flowersslop wants it to feel new.
Resolved
NO
8. Normal people who don’t care about AI must feel that GPT-5 is a lot better. @flowersslop wants to see demand explode.
Resolved
N/A
1. @flowersslop must prefer its output for any prompt at least 80% of the time vs GPT-4o. No exceptions.
Resolved
N/A
2. It must be able to create an acceptable Doodle Jump clone, end to end, no bugs or flaws, and it must look genuinely beautiful — at least 3 out of 5 tries.
Resolved
N/A
3. Image generation v2 must look at least as good as Midjourney v7, while being at least as intelligent as 4o imagegen. +1 Bonus if it’s uncensored.
Resolved
N/A
4. Avm must be way better. Sesame level minimum.
Resolved
N/A
5. Some kind of better personalization that isn’t just sloppy memory or static custom instructions. Something that actually feels personal and cool, like midjourney customization.
Resolved
N/A
7. Needs a new gimmick that’s cool or fun or useful. Something fresh. Like avatars, proactive texting, or anything novel that no other LLM provider is doing yet.
Resolved
N/A
9. All features that make GPT-5 unique or cool must be available instantly via ChatGPT Plus. Only exception tolerated is a GPT-5 Pro

See @Flowers ’s tweet:

If 0 to 3 of these are met, it’s terrible and I’ll say so publicly. AI was a bubble. AGI timelines get pushed back 3+ years.

If 4 to 6 things are met, it’s a slight disappointment, but im still happy, they are still on track.

If 7 or 8 things are met, then im really happy. It actually deserves to be called GPT-5. Solid work.

If 9 or 10 things are met, I’m really, really excited. This means AGI could actually be close. OpenAI proves it’s still number one without a doubt.

Resolves to Flowers’ judgement of which of these happened vs didnt happen when GPT-5 comes out. If I am not able to get access to Flowers’ judgement on these, they may resolve N/A or I may resolve them anyway if they are particularly unambiguous.

Market context
Get
Ṁ1,000
to start trading!

🏅 Top traders

#TraderTotal profit
1Ṁ58
2Ṁ39
3Ṁ33
4Ṁ14
5Ṁ10
Sort by:

I’ll resolve these if Flowers wants to comment on how the more subjective options are to resolve but until then they’re N/A

bought Ṁ5 YES

I'm not sure how doodle jump clone is supposed to be beautiful. Is the model expected to generate both code and assets? Does it have to do it in a single prompt?

bought Ṁ40 NO

@ProjectVictory mb, I'll let flowers be the judge of that. if that's not acceptable feel free not to bet but I don't want to force my own interpretation of it on them

@Bayesian I won't be betting on that one since they don't set a compute budget or talk about turns. If they're expecting it zero shot from a prompt, it seems really implausible in part because recent OpenAI models have stopped and asked for clearer instructions before proceeding at a much higher frequency. If it is allowed to saturate the context then it seems very possible

© Manifold Markets, Inc.TermsPrivacy