Hacker News
by cold_harbor, 2 hours ago
by zozbot234, 2 hours ago
That's also a game changer for local inference. It unlocks long contexts, batched inference and storing the KV cache to disk on ordinary consumer platforms.
by vitorsr, 22 minutes ago
Yes. The discount was most likely a "post-market trial" of how efficiently the caching works for the new-generation models.
by hmaddipatla, 1 hour ago
[dead]