undefined

upvote

points

by dubcanada19 hours ago |

upvote

by Jensson11 hours ago|

[-]

> While hallucination is probably closer to 100% depending on the question.

But the benchmark didn't ask those questions, and it seems grok is very well at saying it doesn't know the answer otherwise.

reply

upvote

by elAhmo18 hours ago|

[-]

No one serious uses grok.

reply

upvote

by ajdegol18 hours ago|

[-]

@grok is this true?

reply

upvote

by for_i_in_range2 hours ago|

[-]

This comment deserves more love

reply

upvote

by NamlchakKhandro10 hours ago|

[-]

no

reply

upvote

by RALaBarge16 hours ago|

[-]

YMMV but Grok 4.1 Fast can usually find via static analysis a few things that other models dont seem to catch with the same prompt

reply

upvote

by d0gsg0w00f12 hours ago|

[-]

Why not? Honest question.

reply

upvote

by MagicMoonlight5 hours ago|

[-]

It makes sense. Grok is taught to answer the question, regardless of how explicit or extreme it is. These other models are taught to suppress any wrongthink. That's going to make it hard to answer things correctly. If you've been told to answer something incorrectly because it's wrong, then you'll have to make up an answer.

reply