Um, kind of, but not really. It’s a mix of UX and actual measurements of what tasks it can do. Also, UX is virtually the same thing: scaled quantitative surveys and preference metrics. Again, it’s just benchmarking, and it’s done carefully and with best practices.
Imagine unironically starting your comment with "Um" in 2026.
As opposed to your incredibly useful contribution to this thread, thanks!
You don't have to imagine!