upvote
They are not, in fact, intelligent.
reply
Because image models at the basic level are just text tokens in, image tokens out. You'd need an agentic process on top to come up with a strategy, review output, try again, and so on.

I believe Nano Banana and gpt-image-2 have a little of this going on, but it's like asking a model to one-shot some code vs having an agentic harness with tools do it. Even the most basic agent can produce better code than ChatGPT can.

reply
Because the LLM is more or less hardcoded to just pass "create image" style prompts to a separate model, possibly with some embellishment.
reply
Nobody asked it to!
reply
If it’s asked to generate an image, it should to everything in its powers to make the image good.
reply
> it should do everything in its powers

That's a scary thought.

Hey Claude, why haven't you finished yet? ... Because the human I'm holding hostage hasn't finished the drawing yet.

reply
LLMs have no concept of what makes the output "good". Or to put it another way, if the LLM generates an image with jumbled numbers it's because that was the most likely output, hence it was a "good" image according to its weights.
reply
You don’t know what you don’t know
reply
Part of the problem is that it isn't the LLM making the image directly itself, it's the LLM repeatedly prompting edits for a separate edit diffusion model. The Gemini reasoning summary shows part of this. The style of some of the images makes it also clear that it uses an Imagen 4 derived diffusion model underneath.
reply
[flagged]
reply
Every decent human artist knows to draw a sketch before painting something.
reply
Of course many, even most, painters do sketch what they intend to paint, likely that's the predominant technique.

But it's not universally true, particularly among artists working in the last 100 years or so. Certainly Jackson Pollock (whether one regards his work as good or not) didn't sketch out how he was going to distribute paint onto canvas. Another example is Morris Luis (and other "stain painters") who didn't sketch out how he applied paint to canvas.

You're comment is largely correct, just pointing out that more than a few "decent artists" didn't (or don't) work that way.

reply
Humans even have the creativity to come up with sketching.

Models don't have intelligence, even less so creative thinking.

reply
Exactly, that’s my point.
reply