stared
- Karma: 12,686
- Created: 13 years ago
About
A curious being, doctor of sorcery. Posts, projects & resume at: https://p.migdal.pl/
Now: benchmarking AI at: https://quesma.com/blog/
Previously: co-founder & CTO at https://quantumflytrap.com/
Recent Submissions
- 1. Dogs, Walks, and Mindfulness (2014) (trudymorgancole.wordpress.com)
- 2. Compare harnesses not models: Blitzy vs. GPT-5.4 on SWE-Bench Pro (quesma.com)
- 3. Finding Widespread Cheating on Popular Agent Benchmarks (debugml.github.io)
- 4. Compare harnesses not models: Blitzy vs. GPT-5.4 on SWE-Bench Pro (quesma.com)
- 5. Śmigus-Dyngus (Wet Monday) – A Celebration Today in Poland and Ukraine (en.wikipedia.org)
- 6. Emotion Concepts and Their Function in a Large Language Model (transformer-circuits.pub)
- 7. A map that glows with the vocabulary of water (waterdata.usgs.gov)
- 8. Buddha-Dhamma for Inquiring Minds (suanmokkh.org)