t55
- Karma
- 897
- Created
- 2 years ago
About
ML researcherRecent Submissions
- 1. ▲ Procedural Reasoning Datasets (github.com)
- 2. ▲ In Defence of Gary Marcus (reubenadams.substack.com)
- 3. ▲ Reasoning Gym – Procedural RL reasoning datasets (github.com)
- 4. ▲ ChatGPT Agent [video] (youtube.com)
- 5. ▲ ReasoningGym: Reasoning Environments for RL with Verifiable Rewards (arxiv.org)
- 6. ▲ Show HN: Rehearsal.so, Duolingo for Public Speaking (rehearsal.so)
- 7. ▲ End-to-End Vision Tokenizer Tuning (arxiv.org)
- 8. ▲ YC Interview Mock Practice (rehearsal.so)
- 9. ▲ D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning (dllm-reasoning.github.io)
- 10. ▲ Are LLMs more than autocomplete? AI Debate (rehearsal.so)