RAGFlow
Open-source RAG engine built on deep document understanding — handles tables, figures, and complex layouts in PDFs that chunk-based RAG misses. Includes a visual DAG pipeline editor for building extraction and retrieval workflows. One of the fastest-growing RAG projects of 2025 with 40K+ GitHub stars.
LlamaIndex
Framework specialized in data ingestion, indexing, and retrieval for LLM applications. The go-to for complex RAG pipelines.