Will RL work for LLMs "spill over" to the rest of RL by 2026?
Ṁ1k · Ṁ1.4k · resolved Jan 6
Resolved NO
RL is important for training LLMs, and it seems likely that the major LLM groups will invest significantly more in RL this year. Will any of the advances they make be:
1. Published (any publication that allows the research to be used elsewhere counts; it does not have to be a paper)
2. A significant advance for the rest of RL
For example, a new version of PPO that is close to SOTA for agents in Atari environments would resolve this YES.
What counts as a "significant advance" is mostly subject to my inscrutable whims, but the bar is aimed more at cool research than at important results. Think "very exciting to see at a conference" rather than "revolutionizes the field".
This question is managed and resolved by Manifold.
🏅 Top traders
| # | Trader | Total profit |
|---|---|---|
| 1 | | Ṁ454 |
| 2 | | Ṁ252 |
| 3 | | Ṁ5 |
| 4 | | Ṁ0 |
Related questions
Will LLMs become a ubiquitous part of everyday life by June 2026?
90% chance
Will there be a major breakthrough in LLM continual learning before 2027?
45% chance
Will an LLM improve its own ability along some important metric well beyond the best trained LLMs before 2026?
14% chance
Will LLMs Daydream by EOY 2026?
17% chance
Will there be any major breakthrough in LLM continual learning before 2029?
87% chance
Will a frontier-level diffusion LLM exist by 2028?
30% chance
Will there be any major breakthrough in LLM continual learning before 2028?
75% chance
Will there be any major breakthrough in LLM continual learning before 2030?
89% chance
Will the highest-scoring LLM on Dec 31, 2026 show <10% improvement over 2025's best average benchmark performance?
72% chance
Will the most advanced LLM stop being from a US-based company any time before 2030?
34% chance