GitHub - Not-Diamond/awesome-ai-model-routing: A curated list of awesome approaches to AI model routing

awesome-ai-model-routing

A curated list of awesome solutions and research in AI model routing. Other awesome lists can be found in the awesome-awesomeness list.

Intelligent AI model routing

Tools and software solutions for AI model routing (ordered alphabetically). * indicates that an entry is open source.

  • Martian: Dynamically routes requests to the best LLM in real-time.
  • Neutrino AI: Intelligently routes each query to the best-suited LLM for the prompt.
  • Not Diamond: An AI model router that automatically determines which LLM is best-suited to respond to any query.
  • notdiamond-0001*: Automatically determines whether to send queries to GPT-3.5 or GPT-4.
  • OpenRouter: Routes each prompt to Llama 3 70B Instruct, Claude 3.5 Sonnet (self-moderated), or GPT-4o.
  • Pulze AI KNN router*: A minimal server that generates a ranked list of targets for a query based on its k-nearest semantic neighbors. Written in Go.
  • Requesty: Configurable smart routing algorithms to optimize your performance, cost or latency.
  • RoRF: SOTA open-source pairwise router based on a random forest architecture.
  • RouteLLM*: A framework for serving and evaluating LLM routers.
  • Semantic Router*: Route inputs to different models using semantic embeddings.
  • Unify: Improve quality, cost and speed by routing to the perfect model and provider for each individual prompt.
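
Several entries above route by semantic similarity: a query is compared with previously seen queries, and the targets that handled its nearest neighbors are ranked. The following is a minimal sketch of that idea, not the implementation of any listed project. The `embed` function is a toy bag-of-words stand-in for a real embedding model, and the example queries and target names (`code-model`, `general-model`, `reasoning-model`) are hypothetical.

```python
import math
from collections import Counter, defaultdict


def embed(text):
    """Toy stand-in for a semantic embedding: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical history: (past query, target that answered it best).
EXAMPLES = [
    ("write a python function to sort a list", "code-model"),
    ("fix this python bug in my function", "code-model"),
    ("summarize this news article", "general-model"),
    ("translate this sentence to french", "general-model"),
    ("prove this theorem about prime numbers", "reasoning-model"),
]


def rank_targets(query, k=3):
    """Rank targets by summed similarity over the k nearest past queries."""
    q = embed(query)
    scored = [(cosine(q, embed(text)), target) for text, target in EXAMPLES]
    # Keep the k most similar examples (stable sort, ties keep list order).
    neighbors = sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]
    totals = defaultdict(float)
    for similarity, target in neighbors:
        if similarity > 0:  # ignore neighbors with no overlap at all
            totals[target] += similarity
    return sorted(totals, key=totals.get, reverse=True)


if __name__ == "__main__":
    print(rank_targets("write a python function to reverse a list"))
```

A real router would replace `embed` with a learned embedding model and store the example vectors in a nearest-neighbor index, but the ranking step stays the same: sum the similarities per target and sort.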

AI model routing papers

Research papers on AI model routing (ordered by recency)

Contributing

Your contributions are always welcome!