You end-up spending at least 5x the amount of tokens for maybe prediction machine to find a discontinuity?
I would say a way better approach is 1.123x to generate code + tests + passing analysis tools + human review + 1x "simplify as much as possible", than letting the snake its own tail without boundaries.