Hacker News
new
past
comments
ask
show
jobs
points
by
janalsncm
233 days ago
|
comments
by
ollin
233 days ago
|
[-]
Mostly 1xA10 (though I switched to 1xGH200 briefly at the end, lambda has a sale going). The network used in the post is very tiny, but I had to train a really long time w/ large batch to get somewhat-stable results.
reply