Broadly speaking, we see people experiment with this architecture a lot often with a great deal of success. A few other approaches would be an agent orchestrator architecture with an intent recognition agent which routes to different sub-agents.
Obviously there are endless cases possible in production and best approach is to build your evals using that data.
Architecturally focusing on Episodic memory with feedback system.
This training is retrieved next time when something similar happens
https://github.com/rush86999/atom/blob/main/docs/EPISODIC_ME...