PowerInfer-2: Fast Large Language Model Inference on a Smartphone
powerinfer.aiArxiv Paper: https://arxiv.org/abs/2406.06282
Previous submission (paper link submitted): https://news.ycombinator.com/item?id=40646450
Arxiv Paper: https://arxiv.org/abs/2406.06282
Previous submission (paper link submitted): https://news.ycombinator.com/item?id=40646450