Hacker News
by Jabrov 21 hours ago
zozbot234 16 hours ago
Tensor parallelism is not useful on consumer platforms with slow interconnects, unless compute is very limited and you prioritize lower latency over throughput. Pipeline parallelism (and potentially expert parallelism) is more workable.
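To put rough numbers on that, here is a back-of-envelope sketch of per-token communication during decoding. It assumes a Megatron-style tensor-parallel layout (two all-reduces per transformer layer in the forward pass), a ring all-reduce, and fp16 activations. The function names and the model dimensions are hypothetical, chosen only for illustration, and the estimate ignores per-message latency, which on a slow link tends to make the tensor-parallel case look even worse.

```python
def tensor_parallel_cost(hidden_size, num_layers, num_devices, bytes_per_value=2):
    """Estimate (sync_points, bytes_sent_per_device) for one decoded token.

    Assumption: two all-reduces per layer (after attention, after the MLP),
    each over one hidden-size activation vector, using a ring all-reduce
    in which every device sends 2 * (n - 1) / n times the payload.
    """
    payload = hidden_size * bytes_per_value
    sync_points = 2 * num_layers
    ring_factor = 2 * (num_devices - 1) / num_devices
    return sync_points, sync_points * ring_factor * payload


def pipeline_parallel_cost(hidden_size, num_devices, bytes_per_value=2):
    """Estimate (sync_points, total_bytes_sent) for one decoded token.

    Assumption: one point-to-point transfer of a hidden-size activation
    vector at each boundary between consecutive pipeline stages.
    """
    payload = hidden_size * bytes_per_value
    sync_points = num_devices - 1
    return sync_points, sync_points * payload


# Hypothetical model: hidden size 4096, 32 layers, split across 2 devices.
tp_syncs, tp_bytes = tensor_parallel_cost(4096, 32, 2)
pp_syncs, pp_bytes = pipeline_parallel_cost(4096, 2)
print(f"tensor parallel:   {tp_syncs} syncs, {tp_bytes:.0f} bytes per token")
print(f"pipeline parallel: {pp_syncs} syncs, {pp_bytes:.0f} bytes per token")
```

Under these assumptions, tensor parallelism needs 64 synchronization points per token against 1 for pipeline parallelism, and moves 64 times as many bytes. The tradeoff is that pipeline stages work on a single token one after another, so pipelining helps throughput with many requests in flight but does not cut single-request latency.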