Settings

Theme

AI Stupid Level: Independent monitoring fluctuations in AI model performance

aistupidlevel.info

3 points by SubiculumCode 20 days ago · 1 comment

Reader

SubiculumCodeOP 20 days ago

I just stumbled on this site which monitors the stability of performance of various models like ChatGPT codex 4.3, etc. Some models seem to fluctuate in performance, probably by dynamic reallocations of compute budgets, etc. Fairly interesting stuff, and gives credence to the idea that the same model performs differently on different days, and some models e.g. Chat GPT Codex 5.2 are more consistent than newer models e.g. Chat GPT 5.4

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection