1-bit inference of 0.8m param gpt running inside 8192 bytes of sram https://t.co/nu4UUkKFwz

1 min read Original article ↗

Don't miss what's happening

People on X are the first to know.