upvote
They make custom chips with a model's weights and parameters "hard-coded" which allows for much, much faster inference.
reply