If you're targeting end user devices then a more reasonable target is 20GB VRAM since there are quite a lot of gpu/ram/APU combinations in that range. (orders of magnitude more than 128GB).
[1]: https://www.jeffgeerling.com/blog/2025/increasing-vram-alloc...