zone411
- Karma
- 4,257
- Created
- 15 years ago
About
https://twitter.com/LechMazur10 LLM benchmarks: https://github.com/lechmazur/
https://www.linkedin.com/in/lech-mazur-69b70493/
Advameg (City-data.com) founder and CEO. AI startup founder.
Author: AI melody songwriting assistant https://melodies.ai
Author: Accurate COVID-19 county-by-county neural net case prediction model based on most data.
Recent Submissions
- 1. ▲ LLM Position Bias Benchmark: Swapped-Order Pairwise Judging (github.com)
- 2. ▲ Show HN: Buyout Game Benchmark: Multi-Agent Bargaining, Transfers, and Takeovers (github.com)
- 3. ▲ LLM Persuasion Benchmark: Multi-Turn Persuasion Between Models (github.com)
- 4. ▲ Show HN: LLM Debate Benchmark (github.com)
- 5. ▲ Show HN: LLM Sycophancy Benchmark: Opposite-Narrator Contradictions (github.com)