Hacker News
by simianwords, 19 hours ago
by camgunz, 9 hours ago
Hallucination benchmarks accept "I don't know" as an answer, which Haiku gave at least some of the time. Other benchmarks corroborate this:
https://suprmind.ai/hub/ai-hallucination-rates-and-benchmark...
by rattray, 14 hours ago
I've been very curious about that too. I wonder if it's actually much better at admitting when it doesn't know something, because it thinks it's a "dumber model". But I haven't played with this at all myself.
by jwpapi, 19 hours ago
The hallucination benchmark is hallucinating