upvote
Just because it's called Gemini doesn't mean that it's somehow automatically as comparable with the frontier of small models as well, does it?
reply
All Gemini models sit around the frontier, especially if you go to smaller sizes. Google is actually more invested into efficiency than size unlike some of the other big providers.
reply
Do you have any benchmark details on the on-device Gemini models? I haven't found a lot of public information on these.
reply
Sources for your claim that the model being downloaded to Android/Chrome is Gemini instead of Gemma. Other than downloading the bin file myself and analyzing it lol.
reply
How about Google itself?

https://developer.chrome.com/docs/ai/prompt-api

>With the Prompt API, you can send natural language requests to Gemini Nano in the browser.

reply
Thanks. Looks like the current Gemini Nano is actually a separate model with the Gemma 3n architecture that has been distilled from Gemini 2.5 Flash[1].

Also, the next version of Gemini Nano will be based directly on Gemma 4 (so not distilled, not Gemini at all except for the name)[2].

So no, it's not a frontier model. Those don't run on your phone or in your browser.

[1]: https://developer.android.com/blog/posts/ml-kit-s-prompt-api...

[2]: https://android-developers.googleblog.com/2026/04/AI-Core-De...

reply
Oh, now I see your problem. You confused the pareto frontier with the pure scale frontier. They are very much not the same.

Also, distillation is how most of these smaller models are made from the biggest models. That process largely defines the frontier along most of the curve.

reply
[delayed]
reply
deleted
reply