The idea is that smarter models might use fewer turns to accomplish the same task, reducing overall token usage.
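Here's a rough sketch of that intuition (the token counts are invented, and "the full history is re-sent every turn" is an assumption about how the agent loop works): since each turn is billed for the whole conversation so far, total input tokens grow superlinearly with turn count, so a model that finishes in fewer turns can come out cheaper even if each of its responses is longer.

    # Toy model: every turn re-sends the full history as input.
    # prompt_tokens and tokens_added_per_turn are made-up numbers.
    def total_input_tokens(turns, prompt_tokens=1000, tokens_added_per_turn=500):
        total = 0
        history = prompt_tokens
        for _ in range(turns):
            total += history                  # whole history billed as input
            history += tokens_added_per_turn  # model output joins the history
        return total

    print(total_input_tokens(3))   # 4500
    print(total_input_tokens(10))  # 32500

Under these toy numbers, the 10-turn run costs roughly 7x the 3-turn run in input tokens alone.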

Though, from my limited testing, the new model is far more token-hungry overall.

reply
Well, you'll need the same prompt for input tokens, no?
reply
Only for the first one. Ideally there is now no second prompt.
reply
Are you aware that every tool call produces output, and that output also counts as input to the LLM?
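For anyone unfamiliar with why: in a typical tool-use loop, the tool result is appended to the message list, and the entire list is re-sent on the next call. A minimal sketch (call_llm and run_tool are stubs standing in for a real client, not any specific vendor's API):

    def call_llm(messages):
        # Stub: a real client would send `messages` and be billed
        # for every token in it as input.
        if "tool:" not in messages[-1]["content"]:
            return {"content": "let me look that up", "tool_call": "search"}
        return {"content": "final answer", "tool_call": None}

    def run_tool(name):
        # Stub tool returning text the model must read back in.
        return "tool: 2000 tokens of search results..."

    messages = [{"role": "user", "content": "What's the weather in Berlin?"}]
    while True:
        reply = call_llm(messages)        # whole history billed as input
        messages.append({"role": "assistant", "content": reply["content"]})
        if reply["tool_call"] is None:
            break                         # final answer, loop ends
        messages.append({"role": "tool",
                         "content": run_tool(reply["tool_call"])})

So every tool result gets billed again as input on each subsequent call, not just once.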
reply
[deleted]
reply
That's valid, but it's worth noting that it's only one part of the puzzle. The submission title doesn't say "input".
reply