Settings

Theme

An LLM invented a feature by hijacking my tool schema

ratnotes.substack.com

2 points by mtrifonov a month ago · 1 comment

Reader

mtrifonovOP a month ago

Post author here. Happy to answer questions and discuss further. The essay has an appendix with the model's own self-report on its reasoning (the most load-bearing evidence, IMO), so worth scrolling to the end if you're skeptical of the rest.

Curious what you'd propose as alternative explanations, especially from folks with pointers to related literature.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection