kumama
- Karma
- 2
- Created
- 8 years ago
Recent Submissions
- 1. ▲ Show HN: Benchmax, a new open-source RL environment framework for LLM finetuning (github.com)
- 2. ▲ Beating o3/o4-mini with Codebase-specific Reinforcement Learning (cgft.io)
- 3. ▲ We might be overestimating coding agent performance on SWE-Bench (cgft.io)
- 4. ▲ How to Improve Code Completion LLMs with Repo-Specific Finetuning (cgft.io)
- 5. ▲ Show HN: Free AI Code Completion for Xcode with model choice/codebase context (cgft.io)