Kimi introduces Attention Residuals: 1.25x compute performance at <2% overhead arxiv.org 9 points by nekofneko 3 months ago · 0 comments