Super HN - Super Hacker News

NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models (arxiv.org) 6 points by chrsw 7 hours ago