Hacker News
new
past
comments
ask
show
jobs
points
by
WarmWash
15 hours ago
|
comments
by
gruez
14 hours ago
|
[-]
I thought all the recent models are "multimodal"? Is the image part just sticking an image recognizer in front of the text model?
reply