Reinforcement Learning from Human Feedback

89 points by onurkanbkrc 8 hours ago · 6 comments · 1 min read

dang 3 hours ago

Related. Others?

Last time I saw Nathan say something about the book, he was actively working on the next version and looking for feedback. Check his socials.

klelatti 8 hours ago

Web version with links, etc:

dang 3 hours ago

Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.
