pidtom
Karma: 5
Created: 29 days ago

Recent Submissions
1. Skipping 90% of KV dequant work speeds up LLM decode by 22% (github.com)
   1 point · 29 days ago · 0 comments