Why do LLMs attend to the first token? arxiv.org 2 points by adhi01 a year ago · 1 comment Reader PiP Save maytc a year ago Curious if the authors had a chance to look at the Softpick paper? https://arxiv.org/abs/2504.20966