Super HN
New
Show
Nvidia releases 8B model with learned 8x KV cache compression
(huggingface.co)
4 points by alecco 1 hour ago