kashifr
- Karma: 749
- Created: 15 years ago
About
https://github.com/kashif https://twitter.com/krasul
Recent Submissions
- 1. Distilling 100B+ Models 40x Faster with TRL (huggingface.co)
- 2. Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries (huggingface.co)
- 3. Transformers V5 is out! (github.com)
- 4. The Smol Training Playbook: The Secrets to Building World-Class LLMs (huggingface.co)
- 5. Unlocking On-Policy Distillation for Any Model Family (huggingface.co)
- 6. Transformers 4.55 New OpenAI GPT OSS (github.com)
- 7. Smollm3: Smol, multilingual, long-context reasoner LLM (huggingface.co)
- 8. Epic vs. Apple (twitter.com)
- 9. AIMO (AI Math Olympiad) progress prize winning solution (huggingface.co)
- 10. MaPO: A reference-free alignment technique for diffusion models (mapo-t2i.github.io)