by petu, 3 hours ago
by redman25, 2 hours ago
Exactly. Compare MoE with MoE and dense with dense; otherwise it's apples and oranges.
by swalsh, 3 minutes ago
It's coding to coding. I couldn't care less how the model is architected; I only care how it performs in a real-world scenario.