Show HN: CATArena – Evaluating LLM agents via dynamic enviroment interactions github.com 3 points by jinqueeny a month ago · 0 comments Reader PiP Save No comments yet.