RAG

Retrieval-Augmented Generation pipelines and techniques

1. Retrieval-Augmented Generation (WIP)
   RAG pipeline: retrieval, augmentation, and generation
2. Document Parsing & Extraction (WIP)
   Libraries for parsing documents and extracting structured data (Docling, LangExtract) for RAG pipelines
3. Web Scraping & Crawling (WIP)
   Tools for scraping, crawling, and extracting web data for AI pipelines (Spider, Playwright, Crawl4AI)
4. Lexical Search (TF-IDF & BM25) (WIP)
   Statistical keyword matching, term frequency, and the BM25 ranking function
5. RAG Evaluation Metrics (WIP)
   Core metrics for evaluating retrieval and generation quality in RAG pipelines
6. RAG Evaluation Frameworks (WIP)
   Tools for automating RAG evaluation: Ragas, DeepEval, and TruLens
7. Advanced Retrieval & Routing (WIP)
   Semantic routing, hybrid search (BM25 + vector), and filtering strategies
8. Query Expansion & Reranking (WIP)
   HyDE, query rewriting, cross-encoders, and Corrective RAG (CRAG)
9. GraphRAG (WIP)
   Combining knowledge graphs with RAG for multi-hop reasoning and global context retrieval
10. Vision & Late Interaction RAG (WIP)
    ColBERT, ColPali, MUVERA, and vision-based RAG (VisRAG) for multi-modal document retrieval
11. Enterprise RAG Platforms (WIP)
    End-to-end open-source RAG platforms: MaxKB, Dify, and FastGPT
© 2026 Rohit Kumar. rohit.vision