Show HN: Verdict – model evals on your own data, not someone else's benchmark github.com 2 points by agunapal a month ago · 0 comments Reader PiP Save No comments yet.