Open-world evaluations for measuring frontier AI capabilities [pdf] (cruxevals.com)
2 points by randomwalker 2 months ago · 0 comments