FiberBundle
Karma: 956
Created: 7 years ago

Recent Submissions
1. SlopCodeBench: Benchmarking How Coding Agents Degrade over Long-Horizon Tasks (arxiv.org)
   2 points · 3 months ago · 0 comments