matt_d
- Karma: 19,844
- Created: 12 years ago
Recent Submissions
- 1. SSV: Sparse Speculative Verification for Efficient LLM Inference (arxiv.org)
- 2. Characterizing Real-World Bugs in Tile Programs for Automated Bug Detection (arxiv.org)
- 3. Characterization of machine learning compilers for LLM inference on NVIDIA GPUs (link.springer.com)
- 4. Chip design from the bottom up – Reiner Pope [video] (youtube.com)
- 5. LT2: Linear-Time Looped Transformers (charlesdddd.github.io)
- 6. Event Tensor: A Unified Abstraction for Compiling Dynamic Megakernel (arxiv.org)
- 7. PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Apps (arxiv.org)
- 8. CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs (arxiv.org)
- 9. [RFC] Open Access to Standards Documents – LLVM Project (discourse.llvm.org)
- 10. Curly braces: An evolution of UNIX and C (thalia.dev)