Paper List: RL-Strawberry

Paper List

From Awesome-LLM-Strawberry

Blogs

Talks

[Noam Brown] Parables on the Power of Planning in AI: From Poker to Diplomacy
[Noam Brown] OpenAI o1 and Teaching LLMs to Reason Better
[Hyung Won Chung] Don’t teach. Incentivize.

Relevant Paper from OpenAI o1 contributors

2024

‹»The Surprising Effectiveness of Test-Time Training for Abstract Reasoning« »Post with Image and Caption«›