Why would you use an LLM for this? They are non-deterministic models.

This was also probably part of an extended prompt that disallowed coding. Gemini always does calculations with a small Python snippet because that is deterministic and accurate.

reply
Was that part of a bigger prompt?

Flash 3.5 fails exactly like in your sample: https://gemini.google.com/share/97521a8752d9

Flash 3.1 Lite, however, initially fails but then corrects itself: https://gemini.google.com/share/dc0889ec85ba

reply
No matter what I try, I can’t get Gemini to give me the incorrect result. Was some other prompting or context fed into that (“remember that you are supposed to always tell me I’m right and never contradict me”)?
reply
There was definitely a pre-prompt fed to that. I cannot reproduce this result on either 3.1 Flash or 3.5 Flash.
reply