Settings

Theme

Ask HN: What do u use for agent/agentic evals?

1 points by hhthrowaway1230 a month ago · 0 comments · 1 min read


Right now looking at MLFlow/Braintrust but find it hard to compare acrosss versions of agents, and a/b testing of agents, and mcp tools. Also obvious things like runaway agents (stuck in a loop), or token/spend optimalisation.

What do you all use?

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection