However I think it's fair to say the cost is roughly linear in the number of users other than that.
There may be some aspects which are not quite linear when you see multiple users submitting similar queries... But I don't think this would be significant.
As for LLM, there is probably some cost constant added once it can fit on a single GPU, but should probably be almost linear.