upvote
You can run these on a turing machine. At what point is it not worth it? At some point the energy to generate each token matters. We often seen token per second. I think a missing metric is tokens per kilowatt. That is what really matters.
reply
It’ll work but yield a token per minute. With ancient servers the throughput is the limiting aspect not mem size
reply