Hacker News
new
past
comments
ask
show
jobs
points
by
ssijak
10 hours ago
|
comments
by
x_may
10 hours ago
|
[-]
KV cache compression, so how much memory the model needs to use for extending its context. Does not affect the weight size.
reply