m4r1k
- Karma
- 444
- Created
- 3 years ago
Recent Submissions
- 1. ▲ 1M Tokens/s: Scaling Qwen 3.5 27B on 96 B200 GPUs with vLLM (medium.com)
- 2. ▲ Scaling Inference to Billions of Users and AI Agents (medium.com)
- 3. ▲ He Had Dangerous Delusions. ChatGPT Admitted It Made Them Worse (wsj.com)