upvote
> Quality of answers from quantized models is noticeable worse than using the full model.

This is the very reason I've heard I shouldn't use Alibaba!

reply