upvote
I'm talking about per-request model remember? With extensive prompt you realistically can have one request every 10 minutes because the agent will be busy for at least 10 minutes executing it. They aren't rate limiting that.
reply
Each tool call requires a new request no?

The harness receives a response, has to parse out the tool call, execute it and then start a new request with the tool call result.

reply
>Each tool call requires a new request no?

Nope, not unless you are doing steering.

Each new prompt = new request, but tool calls don't count.

reply