ruihangl
- Karma
- 16
- Created
- 2 years ago
Recent Submissions
- 1. ▲ XGrammar: Efficient, Flexible and Portable Structured Generation for LLM (github.com)
- 2. ▲ High-Throughput Low-Latency LLM Serving with MLCEngine (blog.mlc.ai)
- 3. ▲ Universal LLM Deployment Engine with ML Compilation (blog.mlc.ai)
- 4. ▲ Run Llama2-70B in Web Browser with WebGPU Acceleration (webllm.mlc.ai)