You could say he's also learning from human feedback