firasd
- Karma
- 3,972
- Created
- 12 years ago
About
https://www.linkedin.com/in/firasdhttp://twitter.com/firasd
firasd at gmail
Recent Submissions
- 1. ▲ Strangerbench: A benchmark for AI forecasting after training cut-off dates (github.com)
- 2. ▲ Sam Altman's blind spot on AI model power (vibesbench.substack.com)
- 3. ▲ Dullness and Disbelief: The 2026 AI Regression (vibesbench.substack.com)
- 4. ▲ Claude output silently rewritten by Anthropic (github.com)
- 5. ▲ Anthropic silently rewriting Claude punctuation output in API (github.com)
- 6. ▲ AI sycophancy panic (github.com)
- 7. ▲ Vibesbench: Prompts to track conversational regression in AI models (github.com)
- 8. ▲ Stockfish shows Morphy and Fischer didn't sacrifice their queens (github.com)
- 9. ▲ Vibesbench: A Multi-Turn Conversational AI Benchmark (github.com)