Ask HN: If I cancel Codex today, what's the next best local inference agent?

8 points by Bulbasaur2015 11 hours ago · 3 comments


better place to ask over /r/LocalLLaMA

bigyabai 11 hours ago

For local inference? It entirely depends on what your hardware is.

JojoFatsani 4 hours ago

Check llmfit

verdverm 10 hours ago

OpenCode + vLLM. The model will depend on your hardware, but OpenCode also has a killer $10/month plan with quotas for some top-tier open-weight models.

I'm using qwen3.6 on a DGX Spark. llama.cpp has prompt cache bugs for qwen/gemma models (among others being reported). I use my OpenCode-go sub when I want a bigger / more capable model.
