zhwu

Karma: 31
Created: 3 years ago

Recent Submissions

1. ▲ A collection of reproducible LLM inference engine benchmarks: SGLang vs. vLLM (github.com) 1 point · 9 months ago · 0 comments
2. ▲ Efficient GPU Resource Management for ML Workloads Using SkyPilot, Kueue on GKE (github.com) 2 points · 1 year ago · 0 comments
3. ▲ New Recipe: Serving Llama-2 with VLLM's OpenAI-Compatible API Server (github.com) 1 point · 2 years ago · 0 comments
4. ▲ Train Your Own Vicuna on Llama-2 (github.com) 3 points · 2 years ago · 0 comments
5. ▲ Guide on fine-tuning your own Vicuna on Llama-2 (twitter.com) 9 points · 2 years ago · 0 comments
6. ▲ Serving LLM 24x Faster on the Cloud with VLLM and SkyPilot (blog.skypilot.co) 12 points · 2 years ago · 1 comment
7. ▲ Biologists are moving to the clouds with SkyPilot from UC Berkeley (twitter.com) 5 points · 2 years ago · 0 comments

All submissions on HN · View profile on HN