You can use the fact that LLMs predict P(next token | existing tokens) to losslessly and efficiently compress arbitrary token sequences. This idea is closely related to arithmetic coding.
Many things about the process are similar, so there's some analogy, but it just isn't the same.