monoid73
Karma: 8
Created: 1 year ago

Recent Submissions
1. Show HN: Open Operator Evals – real-world benchmarks for LLM web agents (github.com)
   3 points · 11 months ago · 1 comment