Hacker News
by Tomte 3 hours ago
by tekne 2 hours ago
I believe the raw pretrained models do make these errors; we then train them out with reinforcement learning.
by Tomte 29 minutes ago
That's interesting! Do you have a paper or blog post at hand that shows examples of raw and RL'ed output?