The machine itself is basically useless for any kind of real-time inference, no matter what the marketing page claims, but I still use it for prototyping LLM integrations and running comparisons across MoE models.
If the alternatives to the Framework Desktop weren't so poorly built, I might swap it out for a local machine that has more RAM but comparable performance on models like gpt-oss-20b (around 70 tok/s).