GitHub - Poll-The-People/awesome-rag: awesome-rag: a collection of awesome thing related to Retrieval-Augmented Generation

17 min read Original article ↗

Awesome Retrieval‑Augmented Generation (RAG)

Awesome

CustomGPT.ai

Proudly sponsored by CustomGPT.aiJoin the Slack community

CustomGPT.ai, no-code platform for building enterprise-grade RAG applications. Citation-backed answers, no hallucinations. With SOC-2 Type II security, GDPR compliance, and support for over 1400 document formats and 92 languages.

Retrieval‑Augmented Generation (RAG) equips language models with fresh, domain‑specific knowledge by fetching external context at inference time. This list is a one‑stop catalogue of every major RAG‑related resource—tools, papers, benchmarks, tutorials, and more.

Only very short descriptions are provided when essential for clarity. PRs welcome!

Subreddit subscribers

Table of Contents

Open Source Tools

  • CustomGPT.ai - Open-source SDK for building custom RAG applications with enterprise-grade features
  • TrustGraph - Open-source enterprise-grade complete AI solution stack for data sovereignty
  • RAGFlow - Open-source RAG engine based on deep document understanding
  • R2R (RAG to Riches) - Advanced AI retrieval system with production-ready features
  • FastRAG - Research framework for efficient retrieval augmented generation
  • FlashRAG - Python toolkit for RAG research with 36+ datasets and 17+ algorithms
  • Verba - Open-source RAG application out of the box
  • Kotaemon - Clean, customizable RAG UI for document-based Q&A
  • Cognita - Open-source RAG framework for modular applications
  • GraphRAG - Microsoft's approach to RAG using knowledge graphs
  • Nano-GraphRAG - Compact GraphRAG solution with core capabilities
  • LangChain — Python/JS agents & chains
  • LangChain4j — JVM
  • LlamaIndex — Data loaders & indices
  • Haystack — Modular pipelines
  • Semantic Kernel — .NET & Python
  • DSPy — Declarative pipelines
  • Guidance — Prompt DSL
  • Flowise — No‑code builder
  • reag — Reasoning Augmented Generation
  • Danswer — Internal Q&A search
  • Neum — Creation and synchronization of vector embeddings at large scale
  • GPTCache — Embedding‑aware cache
  • Mastra The TypeScript AI agent framework. Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama
  • Letta (MemGPT) — Stateful apps
  • Swiftide - Fast, streaming indexing, query, and agentic LLM applications in Rust
  • LangGraph — Agentic DAGs
  • Ragna — RAG orchestration framework
  • SimplyRetrieve - Lightweight chat AI platform featuring custom knowledge.

Embedding Models & Libraries

Proprietary Tools

Vendor Examples

Other Tools

  • LangFuse: Open-source tool for tracking LLM metrics, observability, and prompt management.
  • Ragas: Framework that helps evaluate RAG pipelines.
  • LangSmith: A platform for building production-grade LLM applications, allows you to closely monitor and evaluate your application.
  • Hugging Face Evaluate: Tool for computing metrics like BLEU and ROUGE to assess text quality.
  • Weights & Biases: Tracks experiments, logs metrics, and visualizes performance.

Vector DBs & Search Engines

Pick a vector db - GUIDE

  • Weaviate - Open-source vector database with GraphQL interface
  • Qdrant - High-performance vector similarity search engine
  • Milvus - Open-source vector database for scalable similarity search
  • Chroma - Open-source embedding database for LLM applications
  • Pinecone - The vector database
  • Elasticsearch (vector) - distributed search and analytics engine
  • OpenSearch - Open source distributed and RESTful search engine
  • Vespa - AI + Data, online
  • PGVector - PostgreSQL extension for vector similarity search
  • Redis Stack Search - Searching and querying Redis data using the Redis Query Engine
  • ClickHouse Vectors
  • Oracle AI Vector Search
  • TiDB Vector - semantic similarity searches across various data types
  • ScaNN - ScaNN (Scalable Nearest Neighbors) is a method for efficient vector similarity search at scale
  • Lantern.dev - open-source Postgres vector database
  • Azure Cosmos DB: Globally distributed, multi-model database service with integrated vector search.
  • Couchbase: A distributed NoSQL cloud database.
  • LlamaIndex: Employs a straightforward in-memory vector store for rapid experimentation.
  • Neo4j: Graph database management system.
  • Redis Stack: An in-memory data structure store used as a database, cache, and message broker.
  • SurrealDB: A scalable multi-model database optimized for time-series data.

Research Papers and Surveys

More - RAG Research Papers Collection - Curated list from ICML, ICLR, ACL

RAG Survey 2022

RAG Survey 2023

RAG Survey 2024

RAG Approaches and Architectures

Frameworks

  • LangChain - Framework for building LLM applications with chaining capabilities
  • LlamaIndex - Framework for connecting custom data sources to LLMs
  • Haystack - End-to-end framework for building production-ready LLM applications
  • DSPy - Framework for programming language models with automatic optimization
  • Dify - Open-source LLM app development platform with RAG pipeline
  • Semantic Kernel - Microsoft's SDK for developing Generative AI applications
  • Flowise - Drag & drop UI to build customized LLM flows
  • Cognita: Open-source RAG framework for building modular and production ready applications.
  • Verba: Open-source application for RAG out of the box.
  • Mastra: Typescript framework for building AI applications.
  • Letta: Open source framework for building stateful LLM applications.
  • Swiftide: Rust framework for building modular, streaming LLM applications.
  • CocoIndex: ETL framework to index data for AI, such as RAG; with realtime incremental updates.

RAG Techniques and Methodologies

Multimodal RAG

Graph-based RAG

Retrieval Methods

Dense Retrieval

Sparse Retrieval

Hybrid Search

More here: All RAG Reranking (GitHub)

Other Techniques

Prompting Strategies

Chunking & Pre‑processing

Comparison Guides

Embeddings Models

Instruction Tuning & Optimization

Finetuning and Training

Response Quality, and Hallucination

Security and Privacy Considerations

Evaluation Metrics and Benchmarks

  • RAGAS (Retrieval-Augmented Generation Assessment) - Reference-free evaluation framework with component-level metrics
  • TruLens - Comprehensive evaluation and tracking for LLM applications
  • DeepEval - Open-source evaluation framework for LLMs
  • Arize Phoenix - Open-source observability platform
  • RAGBench - 100k examples across 5 industry domains
  • BeIR - Benchmark for zero-shot evaluation of information retrieval
  • MTEB - Massive Text Embedding Benchmark
  • ARES - Automated Evaluation of RAG Systems
  • RGB Benchmark - implementation for Benchmarking Large Language Models in Retrieval-Augmented Generation
  • LlamaIndex RAG eval - Evaluation and benchmarking are crucial in developing LLM applications

Blogs

RAG Benchmark 2023

RAG Benchmark 2024

Advantages and Disadvantages

Performance, Cost & Observability

Cost Calculators

RAG Fine-tuning

Knowledge‑Graph / Structured RAG

Libraries and SDKs

Key Concepts

Educational Content

Courses and Tutorials

Blogs and Articles

Newsletters & Forums

  • ragaboutit - A blog and newsletter focused specifically on RAG news, tutorials, and insights, making it a dedicated resource for staying up-to-date.
  • r/LangChain
  • r/rag - Reddit communities for practical discussions, troubleshooting, and sharing projects. These are valuable for seeing what challenges other developers are facing in real-time.

Talks and Conferences

Influential Researchers and Influencers

Latest Trends 2024-2025

  • RAG-as-a-Service market at $1.2B (2024)
  • Projected 49.1% CAGR through 2030
  • On-device RAG for privacy

Community Resources

Reddit

Discord

GitHub Communities

Existing Collections

Contributing

Contributions are welcome! Please read the contribution guidelines before submitting a pull request.

License

This collection is licensed under MIT.