Settings

Theme

NicoConstant

Karma
40
Created
2 months ago

Recent Submissions

  1. 1. Real-time LLM Inference on Standard GPUs: 3k tokens/s per request (blog.kog.ai)
  2. 2. Kog AI – Building a Real-Time Inference Stack on AMD Instinct GPUs [video] (youtube.com)

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection