WebRL: Training LLM Web Agents via Self-Evolving Online Reinforcement Learning arxiv.org 23 points by theredsix a year ago · 1 comment Reader PiP Save HellsMaddy a year ago Repo seems to be here: https://github.com/THUDM/WebRL