xaskasdf Karma 165 Created 1 day ago Recent Submissions 1. ▲ Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU (github.com) 367 points · 1 day ago · 93 comments All submissions on HN · View profile on HN