Settings

Theme

Hybrid search vs. Token pooling benchmarking

blog.colivara.com

3 points by jonathan-adly a year ago · 1 comment

Reader

jonathan-adlyOP a year ago

Hi HN!

We benchmarked two ways to improve the latency in RAG workflows with a multi-vector setup. Hybrid search using Postgres native capabilities and a relatively new method of token pooling. Token pooling unlocked up to 70% faster latency with <1% performance cost.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection