You don't have to run a model from VRAM, or even from a sizeable amount of RAM. These choices only ever make sense when serving the model at scale, to hundreds of simultaneous users or more.
512GB unified memory macs are available, with the ram upgrade costing a few grand.