Super HN
NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models
(arxiv.org)
6 points by chrsw 7 hours ago