Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult simonwillison.net 1 points by gingersnap 5 months ago · 1 comment Reader PiP Save ChrisArchitect 5 months ago More discussion: https://news.ycombinator.com/item?id=46037637