GPT-5.2-high LMArena scores released, OpenAI falls from #6 to #13

lmarena.ai

13 points by reed1234 a month ago · 3 comments


_nub3 a month ago

The cookie banner conflicts with Cloudflare's anti-bot check.

Site is unusable.

reed1234OP a month ago

While GPT-5.2 scores well on benchmarks, human preference is important for OpenAI’s consumer-focused products.

  • aeonfox a month ago

    The Arena Overview section is heavily biased towards languages. grok-4.1-thinking is worse than claude-opus-4-5-20251101-thinking-32k on every non-language metric by a large margin but somehow ranks higher overall, maybe because Opus is way worse at Spanish and Korean?
