I wonder if a nice middle ground would be:
- recording the playwright behind the scenes and storing
- trying that as a “happy path” first attempt to see if it passes
- if it doesn’t pass, rebuilding it with the AI and vision models
Best of both worlds. The playwright is more of a cache than a test