Galton Board Softmax O(N2) Replacement
github.comFor some time I have been trying to replace the very costly attention pass in LLMs.
Here is my current attempt at fixing things.
This is applicable beyond LLMs, but that is certainly an important use case.
Description, Ready to use Code and Interactive Educational materials inside.