Addition is All You Need for Energy-efficient Language Models

huggingface.co

7 points by datapalo 2 years ago · 2 comments

akrymski 2 years ago

Fantastic result, on par with another similar effort: https://arxiv.org/pdf/2406.02528

It seems to me that we've stumbled upon GPU-heavy matrix multiplication as the way to build deep neural nets, and have only scratched the surface of alternative methods, such as Tsetlin Machines, hyperdimensional vectors, etc., that are actually optimized for current CPU architectures.