Selection Guide
Choosing the right reranker
For Maximum Accuracy
Choose top-performing models like
or
Voyage Rerank 2.5. These models deliver the highest accuracy scores and are ideal for production applications where answer quality is paramount.
Best for:
- • Customer-facing chatbots
- • High-stakes decision support
- • Complex technical documentation
For Self-Hosting
Open-source models like
and bge-reranker-v2-m3 offer excellent performance with full control over deployment. These models can be hosted on your infrastructure, ensuring data privacy and cost control.
Best for:
- • Data privacy requirements
- • High-volume applications
- • Custom fine-tuning needs
For Low Latency
and
offer the fastest response times at around 595-603ms average latency, making them ideal when response time is critical for your use case.
Best for:
- • Real-time chat applications
- • Mobile applications
- • High-concurrency scenarios
For Multilingual Support
and
excel at cross-lingual reranking, handling queries and documents in multiple languages. Check individual model pages for specific language support details.
Best for:
- • International applications
- • Multilingual documentation
- • Cross-language search