This may come as a shock, but there are LLMs not authored by Anthropic, and when we take measurements we may want them to be comparable across providers.