upvote
This is incorrect, prompt processing is compute bound.
reply
LLMs are bound by both and depends on the hardware which factor is higher.
reply
This is only true for some parts of the time cost function.
reply