Show HN: HIGGS – new SOTA data-free LLM quantization

huggingface.co

3 points by om8 8 months ago · 0 comments · 1 min read

My colleagues and I wrote a paper on HIGGS and integrated the method into the Hugging Face transformers library.

It is both more accurate and faster than NF4.

Compressed Hugging Face models are available for everyone to try: https://huggingface.co/collections/ISTA-DASLab/higgs-675308e...
