The evidence that per-token inference _is not_ subsidized is... a quote or two from Dario and Sam Altman
as far as we know there's no evidence that they can produce any profits at all
The open-weight models will have a steady race to the bottom on inference costs just by dint of competition between providers. They aren’t at the frontier yet, but they are rapidly eating the flash market.