by ssijak 10 hours ago

by x_may 10 hours ago

It's KV cache compression, so it reduces how much memory the model needs as its context grows. It does not affect the size of the weights.
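To make the distinction concrete, here is a rough back-of-the-envelope sketch. The model dimensions (32 layers, 32 KV heads, head dimension 128, fp16) and the 4x compression ratio are made-up assumptions for illustration, not figures from the linked work, and `kv_cache_bytes` is a hypothetical helper.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len,
                   bytes_per_elem=2, compression=1.0):
    """Approximate KV cache size for one sequence.

    The leading 2 counts one key tensor plus one value tensor per layer.
    `compression` is an assumed ratio (1.0 means uncompressed).
    """
    raw = 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem
    return raw / compression


# Hypothetical model shape, chosen only for illustration.
LAYERS, KV_HEADS, HEAD_DIM = 32, 32, 128

for seq_len in (4_096, 32_768, 131_072):
    plain = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, seq_len)
    packed = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, seq_len, compression=4.0)
    print(f"{seq_len:>7} tokens: {plain / 2**30:6.2f} GiB -> {packed / 2**30:6.2f} GiB")
```

The cache grows linearly with context length, which is the part compression shrinks. The weights are a fixed cost that does not appear in this formula at all.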
