Why do LLMs attend to the first token? (arxiv.org)
2 points by adhi01 7 months ago · 1 comment

maytc 7 months ago
Curious if the authors had a chance to look at the Softpick paper? https://arxiv.org/abs/2504.20966