Show HN: Agentic Arena – 52 tasks implemented by Opus 4.5, Gemini 3, and GPT-5.1
arena.logic.incHow does one vote? The name of the model that made the game should be hidden.
Is there a leaderboard?
We put this together mostly just to do side-by-side comparisons, though you make a good point. It'd be fun to blind-vote on your favorite impl.