undefined

points

[-]

> This is interesting to me because reducing context & token usage is in the user's best interest but not in the financial interest of AI vendors.

AI vendors still need to compete with each other both in terms of token cost and competency. An agent that is costly and less effective by wasting tokens is less competitive.

by Jgrubb8 hours ago|

prev|

[-]

The tokens are still being burnt, they're just doing so in a parallel dimension from the users main context window.

by ajmurmann6 hours ago|

parent|

[-]

It's true that the initial tool response still has the same amount of tokens but it doesn't keep dragged along in the longer-lived top context.

by ViewTrick10028 hours ago|

parent|

prev|

[-]

The real benefit is being able to use a cheaper, but good enough, model with a specific system prompt dedicated to that task.