FiberBundle
Karma: 955
Created: 7 years ago

Recent Submissions
1. SlopCodeBench: Benchmarking How Coding Agents Degrade over Long-Horizon Tasks (arxiv.org)
   2 points · 1 month ago · 0 comments