I asked: As we scale up training and inference compute for reasoning models, will they show: A) Strong general logical reasoning skills that work across most logically-bound tasks, B) Some generalization to other logic tasks, but perhaps requiring domain-specific retraining, or C) Minimal generalization?

Finbarr Timbers
Ex-DeepMind
Artfintel
DeepMind
Artfintel
DeepMind

Gwern Branwen
Independent Researcher
Gwern
Gwern

Jacob Buckman
Founder of Manifest AI
Manifest AI
Google Brain
Manifest AI
Google Brain

Karthik Narasimhan
Co-author of GPT paper
Sierra
OpenAI
Sierra
OpenAI

Near
Independent
Independent
Independent

Nathan Lambert
Post-Training Lead at Allen AI
Allen AI
Allen AI

Pengfei Liu
Shanghai Jiao Tong University
SJTU
SJTU

Ross Taylor
Led reasoning at Meta AI
Meta AI
Meta AI

Shannon Sands
Nous Research
Nous Research
Nous Research

Steve Newman
Co-founder of Google Docs
AI Soup
Google Docs
AI Soup
Google Docs

Tamay Besiroglu
Co-founder of Epoch AI
Epoch AI
Epoch AI

Teortaxes
Independent
Independent
Independent

Xeophon
Independent
Independent
Independent

Chris Barber
Creator of this
Independent
Independent
Follow-up Questions
Tamay suggested a good follow up would be to "elicit ideas for experiments that they would expect to turn out one way conditional on "Weak transfer" and another if "Strong transfer" is correct" – let me know if you have ideas.
Questions
- What follow up questions should I ask?
- What's the even more important question to ask?
To answer, tag or DM me at @chrisbarber or email me at [email protected]
Thank you to Amir Haghighat, Arun Rao, Ash Bhat, Avery Lamp, Charlie Songhurst, Connor Mann, Daniel Kang, Dhruv Singh, Eric Jang, Ethan Beal-Brown, Finbarr Timbers, Flo Crivello, Griffin Choe, Gwern, Herrick Fang, Jacob Buckman, James Betker, Jay Hack, Josh Singer, Julian Michael, Katja Grace, Karthik Narasimhan, Logan Graham, Matt Figdore, Mike Choi, Nathan Lambert, Nicholas Carlini, Nitish Kulkarni, Pengfei Liu, Rick Barber, Robert Nishihara, Robert Wachen, Rohit Krishnan, Ron Bhattacharyay, Ross Taylor, Shannon Sands, Spencer Greenberg, Steve Newman, Tamay Besiroglu, Teknium, Teortaxes, Tim Shi, Tim Wee, Tyler Cowen, and Xeophon.