dial481
- Karma
- 4
- Created
- 2 months ago
Recent Submissions
- 1. ▲ Show HN: Proposal for a real long-term AI memory benchmark (penfieldlabs.substack.com)
- 2. ▲ Milla Jovovich's MemPalace Claims 100% on LoCoMo. Its Benchmarks.md Disagrees (penfieldlabs.substack.com)
- 3. ▲ LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers (github.com)