t55
- Karma
- 892
- Created
- 2 years ago
About
ML researcherRecent Submissions
- 1. ▲ Target Policy Optimization (arxiv.org)
- 2. ▲ Show HN: Kilroy – Knowledge base for teams using Claude Code (github.com)
- 3. ▲ Procedural Reasoning Datasets (github.com)
- 4. ▲ In Defence of Gary Marcus (reubenadams.substack.com)
- 5. ▲ Reasoning Gym – Procedural RL reasoning datasets (github.com)
- 6. ▲ ChatGPT Agent [video] (youtube.com)
- 7. ▲ ReasoningGym: Reasoning Environments for RL with Verifiable Rewards (arxiv.org)
- 8. ▲ Show HN: Rehearsal.so, Duolingo for Public Speaking (rehearsal.so)
- 9. ▲ End-to-End Vision Tokenizer Tuning (arxiv.org)
- 10. ▲ YC Interview Mock Practice (rehearsal.so)