upvote
Indeed, Gemini really is incredible at image analysis. Yesterday I pointed it at some sloppy handwritten notes and asked it to add up the numbers in the right column, and it did it no problem. I've also used it to find out what TV show or actor is on screen, and various other things. It's quite impressive.
reply
Gemini pretty clearly has the best underlying model, and the worst RL and post-training of the lot.
reply
gemini models are also fantastic at understanding non spoken sounds
reply