upvote
No because the base model from which the distilled or quantized models are derived is larger.
reply