It's not perfect though - I've personally found CC's VL to be worse than others such as Gemini, but it's nice to have it completely self-contained.
This project desperately needs a "What does this do differently?" section because automated LLM browser screenshot diffing has been a thing for a while now.
So... bypassing the whole "sees what it actually looks like in the browser. It can’t tell if the layout is broken" point the parent commenter is talking about? Seems worse, not better.
All the power to you if you build a product out of this; I don't wanna be that guy who says Dropbox is dead because you can just set up FTP. But with Codex/Claude Code, I was able to achieve this very result just from prompting.