charles_irl
- Karma: 359
- Created: 1 year ago
About
Building useful technology out of large neural networks. https://modal.com
Recent Submissions
1. Cutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpoint (modal.com)
2. How to Achieve Serverless GPUs (modal.com)
3. Three types of LLM workloads and how to serve them (modal.com)
4. Host overhead is killing your inference efficiency (modal.com)
5. Quantized Float Exposed (quant.exposed)
6. Against SQL (2021) (scattered-thoughts.net)
7. Length-extension attacks are still a thing (00f.net)
8. The future of Python web services looks GIL-free (blog.baro.dev)
9. Lexical differential highlighting instead of syntax highlighting (wordsandbuttons.online)
10. CReact – JSX for the Cloud (github.com)