starzmustdie
- Karma: 7
- Created: 4 years ago
Recent Submissions
1. Show HN: #1 On This Day (onthisday-theta.vercel.app)
2. A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE) (github.com)
3. Reasoning Gym: Procedural Dataset Generation for Reinforcement Learning (github.com)
4. Show HN: Word Game Bench – evaluating language models on word puzzles (wordgamebench.github.io)
5. Show HN: Answers to Chip Huyen's ML Interview Questions (github.com)