Hacker News
new
past
comments
ask
show
jobs
points
by
ako
8 hours ago
|
comments
by
byzantinegene
3 hours ago
|
[-]
we're already doing that, it's called distillation and how models like deepseek are trained.
reply