Settings

Theme

ag8

Karma
1,893
Created
6 years ago

About

runrl.com

Recent Submissions

  1. 1. Claude Is Broken in Armenian (twitter.com)
  2. 2. Po.ta.to (po.ta.to)
  3. 3. Scaling pretraining affects RL sample efficiency (runrl.com)
  4. 4. Systematically generating tests that would have caught Anthropic's top‑K bug (theorem.dev)
  5. 5. Sampling at Negative Temperature (cavendishlabs.org)
  6. 6. Tinker (2b4fdb18.connectionism.pages.dev)
  7. 7. Training Qwen to answer briefly yet intelligently using feedback control (runrl.com)
  8. 8. Launch HN: RunRL (YC X25) – Reinforcement learning as a service (runrl.com)
  9. 9. Generating the Funniest Joke with RL (runrl.com)
  10. 10. Gravity Chess (gravity-chess.andrew.gr)

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection