ag8
- Karma
- 1,893
- Created
- 6 years ago
About
runrl.comRecent Submissions
- 1. ▲ Claude Is Broken in Armenian (twitter.com)
- 2. ▲ Po.ta.to (po.ta.to)
- 3. ▲ Scaling pretraining affects RL sample efficiency (runrl.com)
- 4. ▲ Systematically generating tests that would have caught Anthropic's top‑K bug (theorem.dev)
- 5. ▲ Sampling at Negative Temperature (cavendishlabs.org)
- 6. ▲ Tinker (2b4fdb18.connectionism.pages.dev)
- 7. ▲ Training Qwen to answer briefly yet intelligently using feedback control (runrl.com)
- 8. ▲ Launch HN: RunRL (YC X25) – Reinforcement learning as a service (runrl.com)
- 9. ▲ Generating the Funniest Joke with RL (runrl.com)
- 10. ▲ Gravity Chess (gravity-chess.andrew.gr)