Hacker News
new
past
comments
ask
show
jobs
points
by
shay_ker
19 hours ago
|
comments
by
zargon
18 hours ago
|
[-]
They're using the term speculative decoding but doing MTP. It's the same thing as Nemotron, but Google removed the MTP heads from the original safetensora release. (They were not removed from the LiteRM format.)
reply