What will happen during the fourth run of Claude Plays Pokemon?
59
4.6kṀ19k
Jun 22
98.8%
Claude obtains 1 gym badge by step 20000
90%
Claude's current team has at least 3 Pokémon by step 30000.
72%
Claude catches Clefairy
68%
Claude enters Rock Tunnel, surpassing its progress in any previous run
67%
Another model defeats the Champion before Claude (in a run started after Claude 4 was released)
65%
Tumbles is late to pay back a loan
64%
Claude obtains a Bicycle
62%
Claude picks Dome Fossil (again)
62%
Claude obtains 3 gym badges by step 50000
61%
Any member of Claude's team learns Dig
60%
Claude's starter is lower level than another party member by step 100000.
58%
Claude adds 18 or more Pokemon to his Pokedex (surpassing his completion from the previous run)
57%
Claude evolves SPIKE into Nidoking
57%
Claude enters Mt. Moon by step 6000.
54%
Claude reaches Vermilion City by step 30000
50%
Claude reaches Cerulean City by step 20000
49%
Claude obtains HM05 Flash
48%
Claude catches Spearow
45%
Claude buys a Magikarp
43%
Claude obtains HM01 Cut by step 39000

https://www.twitch.tv/claudeplayspokemon

Claude Plays Pokemon is a Twitch stream where the AI chatbot Claude attempts to beat Pokemon Red. Once the game is reset, all remaining answers resolve NO, even if the stream continues with a new game.

I am N/Aing anything that is annoying to resolve. If I have to pore over multiple days of twitch VODs to figure out which way an answer resolves, I am not going to bother.

Changes to the harness between 3.7's runs and this one: https://docs.google.com/document/d/e/2PACX-1vRIsu2pLI21W4KjfYbN13or8E-8cvJYw570wGMEp4UQU63ZhEh9FPGgj2ark8Yk7Vyrtt9MWq3jnn4h/pub


Some relevant milestones from the second run:

  • Reached Pewter City between steps 5000-5500

  • Escaped Mt. Moon between steps 20000-25000

  • Reached Vermilion City between steps 30500-32000

  • Obtained HM01 Cut between steps 55000-60000

  • Defeated Surge around step 61000

  • Obtained HM05 Flash around step 100000? (Unsure)

Get
Ṁ1,000
to start trading!
Sort by:

This version might just be worse than the previous one imo

bought Ṁ100 YES

@Balasar claude went the right way to escape the forest immediately after you posted this. please keep posting

@SaviorofPlant Claude will definitely not beeline through Mt. Moon, find the Silph Scope, and develop a mental model of the Safari Zone good enough to beat it. Also, it definitely won’t beat the champion.

opened a Ṁ100 NO at 62% order

I like the new personality

opened a Ṁ250 YES at 10% order

Put a moderately sized YES order at 10% if anyone wants to call me at these odds

@Balasar I think Safari Zone is impossible without dev help

@SaviorofPlant I agree the odds are closer to 3% or 4% but it's a trifling sum and so much more fun to root for progress

@Balasar I assume Gemini's run where it received multiple dev hints doesn't count?

@SaviorofPlant Ah no, its the only model I was referring to with that, so it does count.

bought Ṁ40 NO

@Balasar I can N/A then, or edit it to add "in a run started after Claude 4 released"

@SaviorofPlant sure, just edit it, they basically started at the same time

I am retroactively changing this market to be about Claude's fourth run (after the bugfix reset) instead of the third run. This is against the letter of the description, but it would be mildly annoying to recreate the market and the new version would probably get less traders.

If you are unhappy with this decision, feel free to managram me the mana that you would have lost had everything resolved NO. (I'd offer refunds but I don't think anyone placed big bets expecting a reset before I closed the market, lmk if wrong)

Briefly closing in case dev restarts the run

bought Ṁ150 YES

@SaviorofPlant If the run is restarted early, will existing bets still resolve accordingly?

@UnspecifiedPerson The clear implication of the description is that everything resolves NO and I have to recreate the market. Given we're at such an early stage, I might just change the title of the market to "fourth run" and reopen this market, even though that's clearly not what the description says to do

bought Ṁ20 NO

@SaviorofPlant I would strongly prefer that you follow what's written in the description

@Robincvgr hopefully it's just a stream restart. If I recreate the market, the new version will probably get less traders

We can do an unofficial poll. Like this comment if you think I should just reopen this market with the title changed. Like Robin's comment if you think I should resolve everything NO and make a new market

@SaviorofPlant Ah fuck claude was just reset

@SaviorofPlant as someone who stands to benefit if everything resolves NO, i think it is just stupid to consider this a new run lol

@ointment like cmon it was reset like 100 steps in due to a bug, no one was betting based on the assumption that that would count. just reopen imo

@ointment Ehhh, I personally actually did bet 1M No on the Rock Tunnel option mostly based on the possibility that it would reset very early. (I stand to lose >150M overall if everything's resolved No, but I still generally prefer markets resolve as close to the plain interpretation of the description as possible.)

@traders Voice your opinions now, seems like we are evenly split here

@jim has spoken

© Manifold Markets, Inc.TermsPrivacy