GitHub - arthur-ai/arthur-engine: Make AI work for Everyone - Monitoring and governing for your AI/ML

3 min read Original article ↗

The Arthur Engine provides a complete service for developing, monitoring, and governing your AI/ML workloads using popular open-source technologies and frameworks. It is a tool designed for:

  • Enforcing guardrails in your LLM Applications and Generative AI Workflows
    • Configurable metrics for real-time detection of PII or Sensitive Data leakage, Hallucination, Prompt Injection attempts, Toxic language, and other quality metrics
  • Building, Observing and Governing Agentic Applications
    • Collect and analyze OpenInference traces from any agentic workflow or LLM application
    • Run continuous evaluations on live traces to catch regressions and quality issues automatically
    • Manage, version, and iterate on prompts across your applications
    • Run experiments to compare prompt variants and measure their impact on quality metrics
    • Evaluate and monitor Retrieval-Augmented Generation (RAG) pipelines end-to-end
  • Evaluating and Benchmarking Machine Learning models (requires the Arthur Platform)
    • Support for a wide range of evaluation metrics (e.g., drift, accuracy, precision, recall, F1, and AUC)
    • Tools for comparing models, exploring feature importance, and identifying areas for optimization
    • For LLMs/GenAI applications, measure and monitor response relevance, hallucination rates, token counts, latency, and more
  • Extensibility to fit into your application's architecture
    • Native support for custom metrics and extensible API

Quickstart

Claude Code users

Paste this prompt directly into Claude Code — no installation needed:

Fetch https://raw.githubusercontent.com/arthur-ai/arthur-engine/refs/heads/main/integrations/claude-code-skills/arthur-onboard/SKILL.md, save it to ~/.claude/skills/arthur-onboard/SKILL.md (create the directory if it doesn't exist), read the saved file, and follow its instructions.

Everyone else

  1. Run the engine installer with the below command:

Mac

bash <(curl -sSL https://get-genai-engine.arthur.ai/mac)

Windows

iex (iwr -Uri "https://get-genai-engine.arthur.ai/win" -UseBasicParsing).Content
  1. Instrument your agents for evaluations and LLM guardrailing by referencing the examples:

https://github.com/arthur-ai/arthur-engine/tree/dev/genai-engine/examples

Arthur Engine

Arthur Platform Free Version

To unlock the full capabilities of the Arthur Platform, sign up and get started for free.

  • Custom dashboards
  • Alerts and notifications
  • Configurable webhook that can trigger any workflow
  • Agent discovery
  • Governance

Arthur Platform

Arthur Platform Enterprise Version

The enterprise version of the Arthur Platform provides better performance, additional features, and capabilities, including custom enterprise-ready guardrails + metrics, which can maximize the potential of AI for your organization.

Key features:

  • State-of-the-art proprietary evaluation models trained by Arthur's world-class machine learning engineering team
  • Air-gapped deployment of the Arthur Engine (no dependency to Hugging Face Hub)
  • Optional on-premises deployment of the entire Arthur Platform
  • Support from the world-class engineering teams at Arthur

To learn more about the enterprise version of the Arthur Platform, reach out!

Contributing

  • Join the Arthur community on Discord to get help and share your feedback.
  • To make a request for a bug fix or a new feature, please file a GitHub issue.
  • To make code contributions, please review the contributing guidelines.
  • Thank you!