dial481

Karma: 4
Created: 2 months ago

Recent Submissions

  1. Show HN: Proposal for a real long-term AI memory benchmark (penfieldlabs.substack.com)
  2. Milla Jovovich's MemPalace Claims 100% on LoCoMo. Its Benchmarks.md Disagrees (penfieldlabs.substack.com)
  3. LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers (github.com)
