1
People are constantly debating which LLM is better for writing Elixir code, so I decided to compare the three state-of-the-art models from Google, OpenAI, and Anthropic to see which one would be better at designing a medium-size feature for a medium-size project.
The project is ReqLLM, a wonderful new LLM library by @mikehostetler, and the feature is adding image generation support, the first part of Add Image Generation and Audio Transcription Support · Issue #14 · agentjido/req_llm · GitHub.
I used Gemini 3 Pro, GPT 5.2, and Claude Opus 4.5 in the gemini, codex, and claude code CLIs, respectively, with the same prompt. After each model wrote a plan, I asked each (in a separate session) to compare the three plans. The results are here: ReqLLM image support plans · GitHub
Bottom line:
- My ranking of the plans is GPT 5.2 > Opus 4.5 > Gemini 3 Pro, and each of the three models agreed with this assessment.
- Arguably, GPT’s is the only correct plan: while Opus’s plan works, it essentially introduces a parallel response-parsing infrastructure and would make it hard or impossible to extend the image support going forward (add streaming, etc.); see the sketch after this list.
- For some reason, Claude likes to write big implementation chunks as part of its plan.
- Gemini’s is the least concrete and least accurate plan (and also uses the wrong image generation endpoints, for some reason).
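To make the “parallel infrastructure” point concrete, here is a deliberately simplified Elixir sketch. The module names, function names, and JSON shapes are hypothetical, not ReqLLM’s actual API; it only illustrates why extending one clause-based decode path (roughly the GPT-style plan) is easier to build on than adding a separate image-only path (roughly the Opus-style plan).

```elixir
# Hypothetical modules for illustration only; not ReqLLM's real API.

defmodule MyProvider.Decode do
  # One shared decode path: text and image parts flow through the same
  # clause-based parser, so streaming and usage handling stay shared.
  def decode_part(%{"type" => "text", "text" => text}), do: {:text, text}

  # Extending the existing path: one extra clause for image parts.
  def decode_part(%{"type" => "image", "data" => b64, "media_type" => mime}),
    do: {:image, Base.decode64!(b64), mime}

  def decode_part(other), do: {:unknown, other}
end

defmodule MyProvider.ImageDecode do
  # The "parallel infrastructure" alternative: an image-only parser that
  # duplicates response handling and cannot reuse the shared plumbing.
  def decode_image_response(%{"images" => images}) do
    Enum.map(images, fn %{"data" => b64} -> Base.decode64!(b64) end)
  end
end
```

In the first version, supporting streaming or a new content type means adding clauses to one function; in the second, every such change has to be duplicated in the image-only path.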
This matches my experience working with Claude Code and Codex daily: while Claude Code has nicer output, more features (like parallel/background execution), and runs faster, Codex is much, much more thorough and most often generates higher-quality code.
Also, the “/review” function in Codex is underrated. My current workflow is to always run a “/review” on code written by me, Claude, or another Codex session. It excels at finding very subtle edge cases and bugs introduced by the latest patch.
egeersoz 2
GPT is really bad with Elixir in my experience. I regularly run experiments where I ask multiple models the same question (about design or troubleshooting a bug) and GPT is consistently bottom tier. It’s also slow as hell. Not sure why people like it as a coding assistant.
I used to use it for product management to build domain expertise but Gemini 3 is better at that now.
> each of the three models agreed with this assessment.
This should really say “each of these models generated text which said that they agreed.”
If you worded your question slightly differently, the models would write something else. They are just text generators; they can’t “agree”.
Technically, your brain is just a biological text generator too - it doesn’t “agree” in any literal sense. It processes inputs and produces outputs based on patterns, associations, and prior experiences, much like an LLM does (albeit through vastly different mechanisms). The feeling of “agreeing” is just part of the narrative your mind constructs around its outputs.
I’m going to have to agree to disagree on this.
LLMs and brains don’t work the same way at all.
Fair enough - you’re right that LLMs and brains work very differently under the hood. But the point isn’t about the mechanism; it’s about the illusion of agency. When you say you “agree”, that feeling emerges from neural processes you’re not consciously controlling - just like an LLM’s response emerges from its training and inputs. Neither system “chooses” in a philosophical sense; both produce outputs shaped by prior data. So while the architectures differ wildly, the claim that only humans can truly “agree” might say more about how we interpret our own cognition than about what’s actually happening.
vkryukov 7
Interesting. I do concur that it is slow as hell; that’s why I often reach for Claude Code when I need to code something that I believe will be relatively straightforward and doesn’t need any “depth”.
However, in my experience of using Codex daily, it produces, albeit slowly, much higher-quality code. You can hopefully see this for yourself in the design documents each model produced.
With my workflow, it doesn’t matter that much that Codex is slow: I’m usually running 2-3 sessions in parallel, and/or reviewing the code that one of the agents produces to either approve and commit it, ask clarifying questions, or steer the model in a different direction. And 2-3 is about the number of different topics I can effectively work on at the same time. So the longer work time doesn’t result in a real slowdown in practice, most of the time.
vkryukov 8
You are technically right, but I personally don’t find such descriptions of how LLMs work useful. My own mental model is of an “AI person”, with some reasoning abilities, encyclopedic knowledge, and an occasional tendency to hallucinate.
You might say that I anthropomorphize a tensor, but so what? The mental model above is more useful in predicting what LLMs will produce, and how best to use them to generate useful code, than the alternative you propose, at least to me.