Hacker News
by simonw 2 hours ago
rahimnathwani 1 hour ago
Looking forward to next time. I hope you'll mention speculative decoding and multi-token prediction (MTP) :)
It would support your point about the performance of 20GB local models.