Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult (simonwillison.net)
1 point by gingersnap a month ago · 1 comment

ChrisArchitect a month ago
More discussion: https://news.ycombinator.com/item?id=46037637