Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult (simonwillison.net)
1 point by gingersnap 2 months ago · 1 comment

ChrisArchitect 2 months ago
More discussion: https://news.ycombinator.com/item?id=46037637