Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering arxiv.org 1 points by popey 5 days ago · 1 comment Reader PiP Save pqtr2 5 days ago Couldn't agree more. Coding benchmarks are just a score. Benchmark the harness.