ColiVara

2 min read Original article ↗

ColiVara is a State of the Art Retrieval API - with a delightful developer experience.

Make your RAG application
10x Smarter

ColiVara has state of the art retrieval performance on both text and visual documents. Whether it's complex financial reports,technical diagrams, or data-rich tables, our advanced vision models see and understand your documents just like a human would. Say goodbye to broken layouts, missed context, and OCR limitations.

colivara.com

pip install colivara_py

# Import and initialize ColiVara
from colivara_py import ColiVara

rag_client = ColiVara()

# Upload a document to the default collection
document = rag_client.upsert_document(
  name="sample_document",
  url="https://example.com/sample.pdf",
  metadata={"author": "John Doe"}
)
results = rag_client.search(query="machine learning")
print(results) # top 3 pages with the most relevant information

About ColiVara

Colivara is a state of the art retrieval API that allows you to store, search, and retrieve documents based on their visual embeddings.

Documents are visually rich structures that convey information through text, as well as tables, figures, page layouts, and charts. While legacy document retrieval systems exhibit good performance on query-to-text matching, they struggle to pass visual cues efficiently to large language models, hindering their performance on practical document retrieval applications such as Retrieval Augmented Generation.

It is a web-first implementation of the ColPali paper using ColQwen2 as the LLM model. It works exactly like RAG from the end-user standpoint - but using vision models instead of chunking and text-processing for documents.

ColiVara Use Cases

Logo 01

State of the Art Retrieval. Delightful developer experience

Planet

Plans that match your needs

No matter how many documents you have - our pricing is simple, transparent and adapts to the size of your usage.

Create your next project with ColiVara