Rank Fusion for improved code context in RAG

tabby.tabbyml.com

3 points by wsxiaoys 2 years ago · 1 comment

Reader

wsxiaoysOP 2 years ago

Fun fact: We've implemented binary embedding search [1] without the need for a specialized vector database. Instead, dimensional tokens like 'embedding_0_0', 'embedding_1_0' are created and being built into the tantivy index [2].

We're satisfied with the quality and performance this approach yields, while still keep Tabby embed everything into a single binary.

[1] My binary vector search is better than your FP32 vectors: https://blog.pgvecto.rs/my-binary-vector-search-is-better-th...

[2] Tantivy: https://github.com/quickwit-oss/tantivy

Settings

Rank Fusion for improved code context in RAG

Keyboard Shortcuts