Hacker News
new
past
comments
ask
show
jobs
points
by
hadlock
9 hours ago
|
comments
by
zozbot234
8 hours ago
|
[-]
10 minutes a day or 15 minutes a day is what the inference workload is like on fairly small models. Once you start streaming in weights from SSD, things slow down quite a bit and become quite power hungry.
reply