GraphRAG

A RAG variant that retrieves from a knowledge graph rather than a flat vector store, enabling multi-hop reasoning across connected entities.

What Is GraphRAG?

GraphRAG (Graph Retrieval-Augmented Generation) is an advanced variant of RAG that replaces or supplements the flat vector store with a knowledge graph as the retrieval backend. Instead of finding the most similar text chunks by embedding distance, GraphRAG traverses a graph of entities and relationships to gather structured, interconnected context before passing it to the LLM.

Microsoft Research popularized the term with their 2024 paper, but the core concept — using graph structure to improve retrieval quality — had been explored in enterprise knowledge management for years prior.

Why Standard RAG Falls Short

Standard RAG works well when the answer lives inside a single document chunk. It struggles when:

The answer requires combining facts from multiple sources.
Questions involve relationships between entities ("Which suppliers serve both Customer A and Customer B?").
Context requires traversing chains of reasoning across several steps.

These are exactly the scenarios where a graph structure shines.

How GraphRAG Works

A typical GraphRAG pipeline involves several stages:

Ingestion: Documents are parsed and entities are extracted, forming nodes in the graph.
Relationship extraction: Co-occurring or logically related entities are connected with labeled edges.
Community detection (optional): Graph clustering algorithms identify tightly connected entity communities and generate summaries.
Query-time retrieval: At inference, the user query is mapped to relevant entities, and the graph is traversed to collect a subgraph of related facts.
Context assembly: The retrieved subgraph (nodes, edges, summaries) is serialized into text and injected into the LLM prompt.
Generation: The LLM produces an answer grounded in the retrieved graph context.

Multi-Hop Reasoning

The defining capability of GraphRAG is multi-hop retrieval — following a chain of edges to answer a question that no single node can answer alone.

Example: "Who is the CEO of the company that acquired the startup founded by the author of this paper?"

Hop 1: Author → founded → Startup
Hop 2: Startup → acquired by → Company
Hop 3: Company → CEO → Person

Each hop is a simple edge traversal. The combined path yields the answer.

Trade-offs

Strength: Superior for relational, cross-document, and reasoning-heavy queries.
Weakness: Requires significant upfront investment in entity extraction, graph construction, and schema design.
Cost: Graph construction is computationally expensive compared to simple chunking and embedding.
Latency: Multi-hop traversal adds query-time complexity.