Claude 4 Opus: https://youtu.be/J7omabtqnBM?t=193
Qwen 3.6 35B A3B: https://youtu.be/gVU-DQeqkI0?t=215
Qwen 3.6 produced far more working functionality than Claude 4 Opus did.
Obviously, just one test of a single one-shot prompt of a silly toy OS, but yeah, this particular test shows Qwen 3.6 running locally dramatically outperforming Claude 4 Opus, which was a frontier model a year ago.