I have this theory that the bloat will grow to the full extent possible. OpenClaw already has this; the OpenEye or whatever comes along another day, with better models, will have 3 million lines of code. The possibilities you mention will not come to fruition the way you'd like them to, because speed is preferred over building better things, and to hell with maintainability.
Eventually these things will become a ton of black boxes, and the only option will be to rewrite them from scratch with another next-gen LLM. Lots of costly busywork, and it will all take time.