upvote
it enables models larger than was previously possible.
reply
No because the base model from which the distilled or quantized models are derived is larger.
reply