Open-Source LLM Helps Safeguard Text Generation Prompts and Responses
corp.roblox.comRoblox’s LLM is currently outperforming popular LLM guardrail models on standard benchmarks. Roblox open-sourced both the LLM weights and the RoGuard-Eval benchmarking dataset.