Microsoft BitNet 1.58-bit LLM 2B4T released
huggingface.co

Along with the ParetoQ paper from Meta (https://arxiv.org/abs/2502.02631), the concept of low-bit LLMs seems to be gaining traction. Has anyone experimented with this in production? I'm aware of a few pre-transformer-era companies that focused on applying low-bit quantization to CNNs.