Settings

Theme

Show HN: Meta-agent: self-improving agent harnesses from live traces

github.com

14 points by essamsleiman 2 months ago · 0 comments · 1 min read

Reader

We built meta-agent: an open-source library that automatically and continuously improves agent harnesses from production traces.

Point it at an existing agent, a stream of unlabeled production traces, and a small labeled holdout set.

An LLM judge scores unlabeled production traces as they stream.

A proposer reads failed traces and writes one targeted harness update at a time, such as changes to prompts, hooks, tools, or subagents. The update is kept only if it improves holdout accuracy.

On tau-bench v3 airline, meta-agent improved holdout accuracy from 67% to 87%.

We open-sourced meta-agent. It currently supports Claude Agent SDK, with more frameworks coming soon.

Try it here: https://github.com/canvas-org/meta-agent

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection