Adding verification steps to AI agents made them worse. I tested it 29 times (substack.com)

> What works... External mechanical checks
This is what I expected verification to mean going in.
For some reason, "verification" in this article means "try to convince the AI even harder than before."
I first tried to make the agent follow concrete paths, but no tricks worked. I wanted to achieve it that way because I wanted universal "rules" that would be followed without any additional CLI tools or hooks. It didn't work out as expected. I needed this for my project, so now I enforce more things through a CLI that the agent can invoke directly, which "guides" it more forcefully.
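To make the "external mechanical check" idea concrete: instead of prompting the model to verify its own work, the agent is told to run a command that either passes or fails. Below is a minimal sketch of such a gate in Python; the function name, the file-based rules, and the "Python files must compile" check are all illustrative assumptions, not something from the article or this thread.

```python
"""Sketch of a hypothetical 'gate' CLI an agent runs before declaring a task done.

The check is mechanical: it returns a (possibly empty) list of failures,
and a wrapper can turn that into a nonzero exit code the agent observes.
"""
import pathlib
import subprocess
import sys


def run_gate(paths):
    """Return a list of human-readable failure messages (empty means pass)."""
    failures = []
    for p in map(pathlib.Path, paths):
        if not p.exists():
            failures.append(f"{p}: missing")
            continue
        # Example mechanical rule: Python files must at least byte-compile.
        if p.suffix == ".py":
            result = subprocess.run(
                [sys.executable, "-m", "py_compile", str(p)],
                capture_output=True,
                text=True,
            )
            if result.returncode != 0:
                failures.append(f"{p}: {result.stderr.strip()}")
    return failures
```

The point is that the feedback loop is an exit code and an error message, not a persuasive system prompt; a wrapper script would `sys.exit(1)` when the list is non-empty so the agent is forced to react to a concrete failure rather than to an instruction it can ignore.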