If on a website, rank the results; present the 'how I worked it out' info for the best spotters (and you could interview them). Keep the answers secret for a few weeks, then reveal them in a way that the game is still playable.
It's repeatable, every few months you could interview new experts (or the old ones again), get new models.
Kinda like the critical thinking version of images of a pelican on a bike.
I'm also interested in the broader impact of using LLMs in place of web search for general Q&A when we want 'to know things'. It's pretty clear the way LLMs are being used for knowledge acquisition now is often less accurate while 'feeling' more certain. Even if we set aside explicit hallucinations, I suspect it's still less accurate.
It's not particularly helpful; you could easily have done the 5 minutes of work.
I went back in their comment history before LLMs existed and found comments where they claim to be a doctor and sound like they know what they are taking about. I’m not a doctor but my wife and many of our friends are, so I know what they sound like.
But as far as trust goes, Hacker News has historically been a fairly high trust community. LLMs have the potential to change this dynamic, but I don’t think encouraging people to assume that every post is an LLM is helpful. I don’t think a community with that level of distrust is possible, and at that point we should just all walk away.