kumama
- Karma
- 3
- Created
- 9 years ago
Recent Submissions
- 1. ▲ What is reinforcement learning finetuning (youtube.com)
- 2. ▲ RAG to riches: synthetic data for training RAG agents (cgft.io)
- 3. ▲ rag not lag: rl for fast agentic retrieval (cgft.io)
- 4. ▲ Show HN: Benchmax, a new open-source RL environment framework for LLM finetuning (github.com)
- 5. ▲ Beating o3/o4-mini with Codebase-specific Reinforcement Learning (cgft.io)
- 6. ▲ We might be overestimating coding agent performance on SWE-Bench (cgft.io)
- 7. ▲ How to Improve Code Completion LLMs with Repo-Specific Finetuning (cgft.io)
- 8. ▲ Show HN: Free AI Code Completion for Xcode with model choice/codebase context (cgft.io)