starzmustdie

Karma: 3
Created: 4 years ago

Recent Submissions

  1. A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE) (github.com)
  2. Reasoning Gym: Procedural Dataset Generation for Reinforcement Learning (github.com)
  3. Show HN: Word Game Bench – evaluating language models on word puzzles (wordgamebench.github.io)
  4. Show HN: Answers to Chip Huyen's ML Interview Questions (github.com)
