Pulze AI Evals (github.com)
1 point by fbnbr a year ago · 1 comment

fbnbr (OP), a year ago:
Benchmark AI models on standard datasets like FinanceBench and MMLU.