Local models just don't seem that useful to me for these particular tasks yet. The most recent versions of Codex and Claude Opus are the first models I've found particularly useful in a "real engineering" context that isn't just vibe coding.
Google's TurboQuant might help address this, but it also might just widen the gap even further.
I am far toward the skeptical end when it comes to the generative AI side of ML tools though, so do weigh my opinion accordingly.