Like a good programming language, a good harness offers a better affordance for getting stuff done.
Even if we put correctness aside, tooling that saves time and tokens is going to be very valuable.
It's completely understandable that prompting in better/more efficient means would produce different results.