Show HN: Neutrino – a router that dynamically routes queries to the best LLM
neutrinoapp.comHey HN! I’m Ricardo from Neutrino AI (https://www.neutrinoapp.com). I’m excited to show you all our model router, which lets you intelligently route queries to the best-suited LLM for the prompt.
Problem:
- We want to solve the problem of balancing cost and accuracy between models like GPT-3.5 and 4, and also using the best models for specific tasks, like Claude for safety, creative writing, fine tuned models for domain-specific tasks, etc.
Key Features:
- Maximize response quality while optimizing for costs and latency
- Concurrently generate and compare responses across different closed and open-source models
- Automatically sample and evaluate responses, improving routing performance over time
You can use it with the OpenAI SDK or with LangChain by just changing the api base and api key to point to Neutrino and the model name to your own router ID
Would welcome any and all feedback!
No comments yet.