Hacker News
new
past
comments
ask
show
jobs
points
by
not_that_d
12 hours ago
|
comments
by
stingraycharles
7 hours ago
|
[-]
You do realize that today’s multi-modal LLMs are actually able to understand images? They’re tokenized very much like language is.
reply