kcorbitt
- Karma: 6,252
- Created: 12 years ago
About
Hi, my name is Kyle Corbitt. Currently I'm working on openpipe.ai. Previously I worked at YC and Google.
personal site: corbt.com email: kyle@ above. I respond to emails.
Recent Submissions
- 1. Show HN: RULER – Easily apply RL to any agent (openpipe.ai)
- 2. Everything I know about reward hacking (openpipe.ai)
- 3. Show HN: ART – a new open-source RL framework for training agents (github.com)
- 4. ART·E: how we built an email research agent that beats o3 (openpipe.ai)
- 5. Using GRPO to Beat o1, o3-mini and R1 at “Temporal Clue” (openpipe.ai)
- 6. Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results (openpipe.ai)