Hacker News
new
past
comments
ask
show
jobs
points
by
danw1979
9 hours ago
|
comments
by
macleginn
3 hours ago
|
[-]
They are poor at generalising from a small number of examples; this is why the real generalisation power is achieved in pre-training.
reply