Hacker News
new
past
comments
ask
show
jobs
Reinforcement Learning from Human Feedback
(arxiv.org)
30 points
by
onurkanbkrc
2 hours ago
|
2 comments
by
klelatti
1 hours ago
|
next
[-]
Web version with links, etc:
https://rlhfbook.com/
reply
by
verdverm
31 minutes ago
|
parent
|
[-]
Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials
reply
by
iisweetheartii
1 hours ago
|
prev
|
[-]
[dead]
reply