Settings

Theme

AI hallucinates. How do you keep it from fucking up automations?

4 points by Gioppix 3 months ago · 5 comments · 1 min read


Every time I build simple automations LLMs find a way to screw up something. At the end of the day I still have to manually review critical actions (emails, sms, invoices...). Why bother automating then? How do you manage it?

storystarling 3 months ago

I found the only way to make this work reliably is to treat the LLM as a fallible component inside a state machine rather than the controller. I've been using LangGraph to enforce structured outputs and run validation checks before any side effects happen. If the output doesn't match the schema or business logic it just retries or halts. It seems like a lot of boilerplate initially but it is necessary if you want to trust the system with actual invoices.

downboots 3 months ago

https://en.wikipedia.org/wiki/Bernoulli_trial

nik282000 3 months ago

If you have to manually validate everything then what did you save by using an LLM? DIY and know it will work the first time.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection