upvote
You misunderstand the "test" here to mean programming, rather than test against the model's capabilities.
reply