upvote
That sounds weird. Why does it need a video feed? The computer can already generate an accessibility tree, same as Playwright uses it for webpages.
reply
So that it can utilize gui and interfaces designed for humans. Think of video editing program for example.
reply
Yes. GUIs expose an accessibility tree.
reply
Not all of them do, and not all of the ones that do expose enough to be useful to the AI.
reply
I feel like a legion of blind computer users could attest to how bad accessibility is online. If you added AI Agents to the users of accessibility features you might even see a purposeful regression in the space.
reply
> controlling a computer via a video feed of the display, and controlling it with the mouse and keyboard.

I guess that's one way to get around robots.txt. Claim that you would respect it but since the bot is not technically a crawler it doesn't apply. It's also an easier sell to not identify the bot in the user agent string because, hey, it's not a script, it's using the computer like a human would!

reply
oh hell no haha maybe with THEIR login hahaha
reply