tedsanders
- Karma
- 4,583
- Created
- 13 years ago
About
http://www.tedsanders.com/aboutRecent Submissions
- 1. ▲ Why SWE-bench Verified no longer measures frontier coding capabilities (openai.com)
- 2. ▲ METR estimates that GPT-5.2 has a 50%-time-horizon of around 6.6 hrs (twitter.com)