curioussquirrel
- Karma
- 243
- Created
- 9 months ago
About
Multilingual LLM evalsRecent Submissions
- 1. ▲ Have we made a unicorn? Continuous SVG-pelican style benchmark (havewemadeaunicorn.com)
- 2. ▲ How well do LLMs work outside English? We tested 8 models in 8 languages [pdf] (info.rws.com)
- 3. ▲ Claude Opus 4.7 API removes sampling parameters (platform.claude.com)
- 4. ▲ psmux: Terminal multiplexer for Windows – tmux alternative (github.com)