Hacker News
new
past
comments
ask
show
jobs
points
by
gyrovagueGeist
16 hours ago
|
comments
by
ac29
16 hours ago
|
[-]
This 35B-A3B model is 4-5x cheaper than Haiku though, suggesting it would still be cheaper to outsource inference to the cloud vs running locally in your example
reply