upvote
Agreed! A lot of people were also using ZiT as a refiner downstream to help with some of the more problematic visual aspects of the original Qwen-Image.

I'm really looking forward to running the unified model through its paces.

reply
Something I am skeptical about Z-Image is that it uses Gemma which is imo a weak LLM.

If I were to guess, I would say that Z-Image’s life is shorter than it initially appeared. Even as a refiner which are just workarounds for model issues.

reply
Note that Qwen Image 1.0 (2512) wasted ~8B weights on timestep embedding. Both Z-Image / FLUX.2 series corrected that.
reply