veryluckyxyz
- Karma
- 548
- Created
- 11 years ago
Recent Submissions
- 1. ▲ Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph (huggingface.co)
- 2. ▲ Hidden drivers of HRM's performance on ARC-AGI (arcprize.org)
- 3. ▲ Set Block Decoding Is a Language Model Inference Accelerator (arxiv.org)
- 4. ▲ Deep Think with Confidence (jiaweizzhao.github.io)
- 5. ▲ A Batch Size and Token NUM- BER Agnostic Learning Rate Scheduler (arxiv.org)
- 6. ▲ Easily Understand Rdma Technology (naddod.com)
- 7. ▲ Model Merging in Pre-Training of Large Language Models (arxiv.org)
- 8. ▲ Understanding Perception and Reasoning Through Model Merging (arxiv.org)
- 9. ▲ Building and better understanding vision-language models (2024) (huggingface.co)
- 10. ▲ HF smolagents computer-agent demo (huggingface.co)