upvote
That paper is cited in the 'introduction' and 'background' sections. This paper is improving by removing some bottlenecks.
reply
Seems like they focus on improving the drafter and the verification policy so speculation keeps producing net speedups rather than wasted verification work at deepseek scale.
reply