upvote
yes it is 213 tok/s single stream (so per user)
reply
that 213 wasn't achieved when saturated though. was probably more like 30 tps per stream when doing 2.6k tps.
reply
So per subagent*.
reply
*per stream, I guess is more accurate than either?
reply