monax Karma 563 Created 6 years ago Recent Submissions 1. ▲ A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly (github.com) 27 points · 1 month ago · 0 comments All submissions on HN · View profile on HN