https://en.wikipedia.org/wiki/Generative_adversarial_network
https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu...
In class you'd probably want a rule saying at least one LLM should be able to figure out the answer, but in a head-to-head I'm not sure how to solve it.
On the other hand then maybe a good strategy would be to write questions that the LLM just happen to have in a nich dataset in its training ”what did user5455 say to user6835?”
Nevermind my idea.