Claude Code usage is definitely token-based; it's been discussed extensively on Hacker News and in the related GitHub threads. A single large context cache miss can easily take half your usage in one request... "max" just means more reasoning tokens. I've also run out of usage during a single request in CoWork. It's definitely token-based.