galeos
- Karma
- 817
- Created
- 11 years ago
Recent Submissions
- 1. ▲ 1.58bit LLM Optimised Tensor Core (github.com)
- 2. ▲ BitNet 1.58bit GPU Inference Kernel (github.com)
- 3. ▲ Microsoft beat H200 Deepseek inference with MI300 (techcommunity.microsoft.com)
- 4. ▲ Modular's CUDA alternative is ready (eetimes.com)
- 5. ▲ BitNet b1.58 2B4T Technical Report (arxiv.org)
- 6. ▲ Microsoft BitNet 1.58bit LLM 2B4T released (huggingface.co)
- 7. ▲ Mi300 Huggingface (huggingface.co)
- 8. ▲ Bitnet.cpp: Efficient Inference for 1.58bit LLMs (arxiv.org)
- 9. ▲ Matryoshka Quantization (arxiv.org)
- 10. ▲ 1-Bit AI Infrastructure (arxiv.org)