curioussquirrel

Karma: 243
Created: 9 months ago

About

Multilingual LLM evals

Recent Submissions

  1. Have we made a unicorn? Continuous SVG-pelican style benchmark (havewemadeaunicorn.com)
  2. How well do LLMs work outside English? We tested 8 models in 8 languages [pdf] (info.rws.com)
  3. Claude Opus 4.7 API removes sampling parameters (platform.claude.com)
  4. psmux: Terminal multiplexer for Windows – tmux alternative (github.com)
