Settings

Theme

Realworld benchmark between Codex 5.3 and Opus 4.6

swe-agi.com

4 points by hongbo_zhang a month ago · 3 comments

Reader

alontorres a month ago

I do feel like the latest codex 5.2 and 5.3 have been really excellent in coding and have been giving opus a good fight. I still prefer Opus 4.6 as my daily driver but specifically for coding tasks I think codex 5.3 is the best, especially when considering value for money.

  • hongbo_zhangOP a month ago

    Another thing I like about codex 5.3 is that its CLI support queueing the message directly without using third party plugins. And it can run weeks without any issues, the CC used to have memory issues and stackoverflows.

hongbo_zhangOP a month ago

This is the benchmark between the latest models on a new programming language to avoid overfitting. Latest models are quite good over generalization to new languages, they can write tens of thousands of lines of code in one prompt that just works.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection