Hacker News
ramgine, 19 hours ago
segmondy, 17 hours ago
You can run it today on that 12 GB VRAM 3060, though I would suggest getting two 3090s. Use the cmoe option: it keeps the attention and routing tensors on the GPU and offloads the rest to system memory. Try it now and see how it performs.
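To make the split concrete, here is a rough sketch of the idea, not the actual implementation. It assumes the cmoe option refers to llama.cpp's --cpu-moe flag and that the model uses GGUF-style tensor names; the regex and the tensor names below are illustrative assumptions, and place_tensor is a hypothetical helper.

```python
import re

# Assumed pattern for MoE expert weights in GGUF-style tensor names.
# This is an illustration of the split, not code taken from any tool.
EXPERT_PATTERN = re.compile(r"\.ffn_(up|down|gate)_(ch|)exps")


def place_tensor(name: str) -> str:
    """Return "CPU" for expert weights and "GPU" for everything else."""
    return "CPU" if EXPERT_PATTERN.search(name) else "GPU"


# Hypothetical tensor names for one layer of an MoE model.
tensors = [
    "blk.0.attn_q.weight",         # attention
    "blk.0.attn_k.weight",         # attention
    "blk.0.ffn_gate_inp.weight",   # router
    "blk.0.ffn_up_exps.weight",    # experts
    "blk.0.ffn_down_exps.weight",  # experts
    "blk.0.ffn_gate_exps.weight",  # experts
]

placement = {name: place_tensor(name) for name in tensors}
for name, device in placement.items():
    print(f"{device}  {name}")
```

The expert weights make up most of an MoE model's size but only a few experts are used per token, which is why leaving them in system memory while the small, always-used attention and router tensors stay in VRAM can still give usable speed.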
rnewme, 17 hours ago
Should work, yes.