Settings

Theme

Evaluating AI agents: Real-world lessons from building agentic systems at Amazon

aws.amazon.com

3 points by bpedro 2 months ago · 1 comment

Reader

lumpilumpi 2 months ago

I get the justification but I found it hard to understand how the actual evaluation at each step is carried out. For example, is there any calibration to some human gold standard involved or is the AI evaluating the AI without calibration/oversight?

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection