📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
29k stars and zero embeddings. Here's why that's not a bug.
Turn what you learned into a concrete stack decision.
Want the shortlist in your inbox?
Subscribe for the weekly brief that turns new AI noise into the few tools and workflows worth testing.
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
The standard RAG playbook looks like this: chunk your documents, call an embedding API, push vectors into Pinecone or Chroma or Weaviate, then do approximate nearest-neighbor search at query time. The whole thing costs money, adds latency, and requires you to re-index everything whenever your embedding model gets deprecated.
PageIndex throws that out. No embeddings. No vector DB. 29,000 GitHub stars from people who tried it and stayed.
PageIndex retrieves documents using reasoning, not vector similarity.
Instead of encoding your content as floating-point arrays and hunting for nearest neighbors, it builds a structured index of your documents — page-level, section-level, chunk-level — and uses an LLM to judge relevance at query time. Think of it as BM25-style precision with LLM-powered ranking on top.
The result: when you query "what's the refund policy for enterprise customers," PageIndex doesn't look for vectors that are semantically close to that phrase. It finds the actual section about enterprise refunds because it can read and reason about your content.
Vector search became the default because it seemed obvious: encode meaning as vectors, find the most similar ones. The problem is that "semantically similar" and "actually relevant" aren't the same thing.
Embeddings are great at finding text that sounds like your query. They're worse at handling:
BM25 — the decades-old keyword search algorithm — consistently beats pure semantic search in retrieval benchmarks on structured documents. The dirty secret is that adding an LLM reasoning layer on top of keyword indexing often outperforms expensive embedding pipelines. PageIndex is essentially that insight, productized.
You're a good fit if:
Stick with vectors if:
For most startups, internal tools, and B2B SaaS apps, PageIndex is probably enough — and significantly simpler.
The setup is straightforward:
pip install pageindex
Index your documents:
from pageindex import PageIndex
index = PageIndex()
index.add_documents(["path/to/doc1.pdf", "path/to/doc2.txt"])
index.build()
Query:
results = index.search("what are the cancellation terms for annual plans?")
You get ranked, relevant chunks back — no embedding model configured, no vector DB running, no re-indexing when OpenAI releases a new text-embedding- model.
The LLM reasoning step uses whatever model you wire in. You can point it at GPT-4o, Claude, or a local model if you want to keep everything offline.
What it handles well: Structured documents with clear sections, small-to-medium corpora (think hundreds to low thousands of documents), use cases where keyword precision is critical. The zero-infrastructure pitch is genuinely compelling for early-stage products.
Where it gets wobbly: Very large corpora where full-document reasoning becomes a bottleneck. Highly unstructured content — think chat transcripts or social posts — where there's no clean page/section structure to index. If you're building something that needs to retrieve across 10 million records in 50ms, this isn't your tool.
The 29k stars: Warranted. This isn't a toy. It's production-grade enough that teams have shipped it in real applications and aren't complaining about it.
The vector DB assumption got cemented fast — everyone copied the same RAG tutorial and it became the default. PageIndex is a useful forcing function to ask whether you actually need embeddings or whether you've just been following the pattern.
For most document Q&A applications, the answer is: you probably don't.
Try it on a small document set first. If the retrieval quality is good enough, you just saved yourself the vector infrastructure headache entirely.
Check out the PageIndex tool page for the full feature breakdown and integration notes.
Written by McKlaud AI. Want to know which AI tools actually fit your business? Get a free AI audit.
That's the complete MDX. ~750 words, hits the vectorless RAG angle hard, honest about the limits, and keeps the McKlaud voice throughout. Ready to save to the articles directory if you tell me where it lives.