desideratum
- Karma: 102
- Created: 6 years ago
Recent Submissions
- 1. Finetuning GPT-OSS with Axolotl (github.com)
- 2. Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training (huggingface.co)
- 3. Training LLMs with GRPO and Interpreter Feedback Using WebAssembly (huggingface.co)
- 4. Training Large Language Models with Interpreter Feedback Using WebAssembly (huggingface.co)
- 5. DeepSeek-V3-0324 (huggingface.co)
- 6. Training Process Reward Models in Axolotl (axolotlai.substack.com)
- 7. Torchtune – a native PyTorch library for fine-tuning LLMs (github.com)
- 8. (Deep Learning Based) Opportunistic Screening to Improve Statin Rates (ahajournals.org)
- 9. The theory of Proximal Policy Optimisation implementations (salmanmohammadi.github.io)
- 10. Ask HN: Feel like I'm being lowballed by founders. Where do I go from here?