Retriever

System retrieval published

Also known as: Retrieval Component, Search Component

Definition

A component that takes a query and returns relevant documents or passages from a corpus. The retriever is the "search" part of RAG—it determines what context the LLM sees. Retrievers can use various strategies: sparse (keyword/BM25), dense (vector similarity), or hybrid (combining both).

What this is NOT

Not the same as a vector database (the DB stores vectors; the retriever queries it)
Not the embedding model (the model creates vectors; the retriever uses them)
Not the full RAG pipeline (retriever is one component)

Alternative Interpretations

Different communities use this term differently:

llm-practitioners

An abstraction over document retrieval, typically implemented as a class or function that accepts a query string and returns ranked documents. Frameworks like LangChain and LlamaIndex provide retriever interfaces.

Sources: LangChain BaseRetriever documentation, LlamaIndex retriever documentation

information-retrieval

The first stage of a retrieval pipeline that efficiently narrows a large corpus to a candidate set, which may then be reranked by a more expensive model.

Sources: Information retrieval literature, Two-stage retrieval patterns

Examples

LangChain VectorStoreRetriever querying Pinecone
A hybrid retriever combining Elasticsearch BM25 with vector search
A self-query retriever that parses 'papers from 2023' into a date filter
An ensemble retriever merging results from multiple indexes

Counterexamples

Things that might seem like Retriever but are not:

The vector database itself (that's storage, not the retrieval interface)
The embedding model (that creates vectors, doesn't retrieve)
The LLM that generates the final response

Relations

requires retrieval-augmented-generation (Retriever is the R in RAG)
overlapsWith vector-search (Vector search is one retrieval strategy)
overlapsWith hybrid-search (Hybrid retrievers combine strategies)
produces context-window (Retrieved documents become context)

Implementations

Tools and frameworks that implement this concept:

Arize AI secondary
Elasticsearch secondary
Haystack primary
Jina AI secondary
LlamaIndex primary
Voyage AI secondary