xcodevn
- Karma
- 313
- Created
- 10 years ago
Recent Submissions
- 1. ▲ Implementing DeepSeek R1's GRPO algorithm from scratch (github.com)
- 2. ▲ The LLM pre-training data wall (substack.com)
- 3. ▲ PodcastLM: An open-source AI podcast creator (github.com)
- 4. ▲ Scaling up self-attention inference (neuralblog.github.io)
- 5. ▲ Scaling up self-attention inference (neuralblog.github.io)
- 6. ▲ Letter from Professors Bengio, Hinton, Lessig, & Russell (safesecureai.org)
- 7. ▲ Logit Prisms: Decomposing Transformer Outputs for Mechanistic Interpretability (neuralblog.github.io)
- 8. ▲ Exploring MLP neurons inside Llama3 model (neuralblog.github.io)