Show HN: Self hosting a modern LLM stack
github.comHave you tried running really large models with this? And how are you managing allocating resources ?
Have you tried running really large models with this? And how are you managing allocating resources ?