Hacker News
by jaccola 10 hours ago
by muzani 9 hours ago
That's why the model starts with "You're absolutely right!" It's not to flatter the user. It's a cheap way to steer the response into a space where it actually uses the correction.
by mike_hearn 5 hours ago
People have researched pause tokens for this exact reason.
by staminade 9 hours ago
What do you think chain of thought reasoning is doing exactly?
by lijok 10 hours ago
You’re conflating training and inference.