ag8
- Karma
- 1,997
- Created
- 6 years ago
About
runrl.comRecent Submissions
- 1. ▲ Gourmand Syndrome (en.wikipedia.org)
- 2. ▲ guys why does armenian completely break Claude (twitter.com)
- 3. ▲ Sampling at negative temperature (cavendishlabs.org)
- 4. ▲ Perfectly Replicating Coca Cola [video] (youtube.com)
- 5. ▲ Po.ta.to (po.ta.to)
- 6. ▲ Scaling pretraining affects RL sample efficiency (runrl.com)
- 7. ▲ Systematically generating tests that would have caught Anthropic's top‑K bug (theorem.dev)
- 8. ▲ Tinker (2b4fdb18.connectionism.pages.dev)
- 9. ▲ Training Qwen to answer briefly yet intelligently using feedback control (runrl.com)
- 10. ▲ Launch HN: RunRL (YC X25) – Reinforcement learning as a service (runrl.com)