Reinforcement Learning from Human Feedback

upvote

Reinforcement Learning from Human Feedback

(arxiv.org)

30 points

by onurkanbkrc2 hours ago |

upvote

by klelatti1 hours ago|

[-]

Web version with links, etc:

https://rlhfbook.com/

reply

upvote

by verdverm31 minutes ago|

[-]

Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials

reply

upvote

by iisweetheartii1 hours ago|

[-]

[dead]

reply