Hacker News
new
past
comments
ask
show
jobs
points
by
withinrafael
14 hours ago
|
comments
by
theturtletalks
12 hours ago
|
[-]
I’d also checkout midscene, you can set the model and UI-TARS works but you can also use qwen vision models and it works.
reply