Hacker News
new
past
comments
ask
show
jobs
points
by
visarga
7 hours ago
|
comments
by
nosuchthing
1 hours ago
|
[-]
LLMs can't access the training data that's less than the statistically most common token, so they use a random jitter.
With that randomness comes statistically irrelevant results.
reply