Hacker News
new
past
comments
ask
show
jobs
points
by
willXare
21 hours ago
|
comments
by
lloyd-christmas
20 hours ago
|
[-]
I thought the same thing when I started using locals, but the reality is that - for a given context depth - the token generation speed doesn't change whether it's 128 or 8000, it just lengthens the benchmark run time.
reply