The ability to assess the user's goal and tune the inference accordingly is there. So, what methods are there to tell if a provider has their thumb on the scale?
Plodding along a longer path to the same goal would burn more tokens without necessarily a decrease in solution quality. Maybe a few more docs.