Hacker News
new
past
comments
ask
show
jobs
points
by
sbinnee
11 hours ago
|
comments
by
girvo
10 hours ago
|
next
[-]
Output is what the compute is used for above all else; costs more hardware time basically than prompt processing (input) which is a lot faster
reply
by
tokenmaxxinej
9 hours ago
|
prev
|
[-]
input tokens are processed at 10-50 times the speed of output tokens since you can process then in batches and not one at a time like output tokens
reply