upvote
Also, the racoon it circled isn't in the original.
reply
I love how perfectly this captures the difficulties of using generative AI for detection tasks.
reply
Oh god yes, I've been trying to make a LLM Assisted Magic the Gathering card scanner... its been a hell of a time trying to get it to just OCR card names well....
reply
Why would you use an LLM for OCR?
reply
Indeed. I suppose one way to ensure you can find Waldo in any image is to add it yourself.
reply
reply
hilarious - i tried and got the same thing.

there was a very large bear in the first image; when asked to circle the raccoon it just turned the bear into a giant raccoon and circled it.

reply