A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly github.com 27 points by monax 2 months ago · 0 comments Reader PiP Save No comments yet.