The Mac is very feeble compared to the big iron that the providers run, so performance will be much lower. Also, many companies would prefer that their engineers work on domain problems instead of on novel LLMs.

by gizajob1 hours ago|

I meant “roll your own” LLM for use, not building new ones.

by throwaw122 hours ago|

Because local models that run well in 128 GB of RAM are still not SOTA. Yes, Qwen is amazing, but neither Qwen 27B nor 35B can outperform Opus 4.6. So why increase rework for your engineers even more when you can pay slightly more and always use SOTA, at least until others figure out best practices for running SOTA models locally?

by nektro1 hours ago|

SOTA models cannot remotely fit in 128 GB.
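
For a rough sense of the arithmetic behind "fits in 128 GB", here is a back-of-the-envelope sketch. It counts weights only (no KV cache, activations, or OS overhead), so real usage will be higher. The 27B and 35B sizes come from the thread; the 1000B figure is a purely hypothetical stand-in, since parameter counts for frontier models are not public, and the function names are made up for illustration.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes).

    One billion parameters at 8 bits each take about 1 GB, so the result
    is params_billion * bits_per_weight / 8.
    """
    return params_billion * bits_per_weight / 8


def fits_in_ram(params_billion: float, bits_per_weight: int, ram_gb: float = 128.0) -> bool:
    """True if the weights alone fit in ram_gb (ignores KV cache and overhead)."""
    return weight_memory_gb(params_billion, bits_per_weight) <= ram_gb


if __name__ == "__main__":
    # 27B and 35B are the sizes mentioned in the thread;
    # 1000B is a hypothetical large model, not a real published figure.
    for params in (27, 35, 1000):
        for bits in (16, 8, 4):
            gb = weight_memory_gb(params, bits)
            verdict = "fits" if fits_in_ram(params, bits) else "does not fit"
            print(f"{params}B @ {bits}-bit: ~{gb:.1f} GB of weights, {verdict} in 128 GB")
```

Under these assumptions a 27B or 35B model fits comfortably even at 16-bit, while a model in the hypothetical 1000B range would not fit even at 4-bit.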