starzmustdie

Karma: 7
Created: 4 years ago

Recent Submissions

1. ▲ Show HN: #1 On This Day (onthisday-theta.vercel.app) 18 points · 2 months ago · 1 comment
2. ▲ A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE) (github.com) 1 point · 5 months ago · 0 comments
3. ▲ Reasoning Gym: Procedural Dataset Generation for Reinforcement Learning (github.com) 1 point · 1 year ago · 0 comments
4. ▲ Show HN: Word Game Bench – evaluating language models on word puzzles (wordgamebench.github.io) 1 point · 1 year ago · 0 comments
5. ▲ Show HN: Answers to Chip Huyen's ML Interview Questions (github.com) 3 points · 2 years ago · 0 comments

All submissions on HN · View profile on HN