upvote
Yes. This is known. They added neural accelerators, aka Tensor core equivalent, in the GPU. This will make prompt processing competitive vs similar class GPUs.
reply
It would probably be worth finding a more friendly way to market this, but it's a reasonable / accurate way to say it.

The prompt processing sped up.

Not the output generation.

M4 was notoriously slow at this compared to DGX etc.

reply