Hacker News
new
past
comments
ask
show
jobs
points
by
moffkalast
2 days ago
|
comments
by
Der_Einzige
2 days ago
|
[-]
We got an oral at ICLR for calling out how shit samplers like top_p and top_k are. Use min_p!
reply
by
moffkalast
2 days ago
|
parent
|
[-]
True yep, I wish more people benchmarked models with more representative sampler settings and then took the average of 5 or 10 responses.
reply