This is the best way forward long term. We won't have frontier performance, but at least the models will be aligned with us instead of refusing us or sabotaging us.
I've also debated having a frontier model for planning only, and then feeding plan to smaller offline models.