Um, kind of, but not really: it’s a mix of UX and actual measurements of what tasks it can do. Also, UX is virtually the same thing: scaled quantitative surveys and preference metrics. Again, it’s just benchmarking, done carefully and with best practices.
Imagine unironically starting your comment with "Um" in 2026.
As opposed to your incredibly useful contribution to this thread, thanks!
You don't have to imagine!