Hacker News
new
past
comments
ask
show
jobs
points
by
TZubiri
10 hours ago
|
comments
by
sorenjan
9 hours ago
|
[-]
Google recently released their paper "Image Generators are Generalist Vision Learners" about exactly this. They fine tuned Nano Banana pro into what they call Vision Banana which can do segmentation etc.
https://arxiv.org/abs/2604.20329
reply