TurboQuant: Redefining AI efficiency with extreme compression

research.google

15 points by davidbarker a month ago · 1 comment

Reubend a month ago

This looks great, but I'm wondering how effective this would be for full model weights rather than just the KV cache. Their paper only gives results for the KV cache use case, which strikes me as strange since the algos are claimed to be near optimal.
