upvote
I would very much like not to have to download 22 GB for some inference capability that is way worse than API calls both in terms of quality and speed.

I would rather pay money than seeing this thing running in my browser that only prints 5 tps on high-end consumer hardware.

reply