Settings

Theme

ag8

Karma
1,997
Created
6 years ago

About

runrl.com

Recent Submissions

  1. 1. Gourmand Syndrome (en.wikipedia.org)
  2. 2. guys why does armenian completely break Claude (twitter.com)
  3. 3. Sampling at negative temperature (cavendishlabs.org)
  4. 4. Perfectly Replicating Coca Cola [video] (youtube.com)
  5. 5. Po.ta.to (po.ta.to)
  6. 6. Scaling pretraining affects RL sample efficiency (runrl.com)
  7. 7. Systematically generating tests that would have caught Anthropic's top‑K bug (theorem.dev)
  8. 8. Tinker (2b4fdb18.connectionism.pages.dev)
  9. 9. Training Qwen to answer briefly yet intelligently using feedback control (runrl.com)
  10. 10. Launch HN: RunRL (YC X25) – Reinforcement learning as a service (runrl.com)

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection