agentic-rag-engine

Reference implementation of enterprise hybrid search, RAG, and agentic orchestration. It combines dense (vector) and sparse (BM25) retrieval fused with Reciprocal Rank Fusion, cross-encoder-style reranking, grounded generation with citations, a guardrail that refuses ungrounded answers, a LangGraph agent, and a retrieval evaluation harness (precision@k, MRR).

Part of the Enterprise Platform Reference Architecture. Models the enterprise search domain as a domain-agnostic discovery capability. See docs/INDUSTRY-APPLICABILITY.md.

The retrieval/RAG core is pure Python with zero third-party dependencies; it is fully unit-tested and easy to reason about. Production components (real embedding models, pgvector, a cross-encoder, an LLM) plug in behind small interfaces without touching orchestration.
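To illustrate what "plug in behind small interfaces" could look like, here is a minimal sketch of an embedder interface with a dependency-free hashing implementation. The names `Embedder`, `HashingEmbedder`, `cosine`, and `most_similar` are hypothetical and chosen for this example; the repository's actual interfaces in `embeddings.py` may differ.

```python
import hashlib
import math
from typing import Protocol


class Embedder(Protocol):
    """Hypothetical interface: anything that maps text to a fixed-size vector."""

    def embed(self, text: str) -> list[float]: ...


class HashingEmbedder:
    """Toy dependency-free embedder using the hashing trick (illustrative only)."""

    def __init__(self, dim: int = 64) -> None:
        self.dim = dim

    def embed(self, text: str) -> list[float]:
        vec = [0.0] * self.dim
        for token in text.lower().split():
            # Use a stable hash so vectors are reproducible across processes.
            digest = hashlib.md5(token.encode("utf-8")).digest()
            bucket = int.from_bytes(digest[:4], "big") % self.dim
            vec[bucket] += 1.0
        norm = math.sqrt(sum(x * x for x in vec))
        # L2-normalise; an empty text stays the zero vector.
        return [x / norm for x in vec] if norm else vec


def cosine(a: list[float], b: list[float]) -> float:
    """Dot product, which equals cosine similarity for L2-normalised inputs."""
    return sum(x * y for x, y in zip(a, b))


def most_similar(embedder: Embedder, query: str, docs: list[str]) -> str:
    """Caller code depends only on the Embedder interface, not on a concrete model."""
    q = embedder.embed(query)
    return max(docs, key=lambda d: cosine(q, embedder.embed(d)))
```

Under this assumption, swapping in a real model would mean writing another class with the same `embed` method and passing it to `most_similar`; the calling code would not change.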

Architecture

flowchart LR
  q[Query] --> agent[LangGraph agent: classify -> retrieve -> generate -> guardrail]
  subgraph retrieval [Hybrid Retriever]
    dense[Dense / vector search] --> rrf[Reciprocal Rank Fusion]
    sparse[Sparse / BM25] --> rrf
    rrf --> rer[Reranker]
  end
  agent --> retrieval
  rer --> gen[Generator: extractive or LLM, with citations]
  gen --> guard[Guardrail: refuse if ungrounded]
  guard --> ans[Answer + citations]
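The final guardrail step in the diagram can be sketched as a simple lexical grounding check. This is an assumption about one way to implement "refuse if ungrounded", not the repository's actual logic: the `Passage` type, the `guard` function, the refusal message, and the `min_support` threshold are all hypothetical.

```python
import re
from dataclasses import dataclass

# Hypothetical refusal message; the real service may word this differently.
REFUSAL = "I cannot answer that from the available sources."


@dataclass
class Passage:
    doc_id: str
    text: str


def _tokens(text: str) -> set[str]:
    """Lowercased alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def guard(
    answer: str, passages: list[Passage], min_support: float = 0.6
) -> tuple[str, list[str]]:
    """Return (answer, citations), or a refusal when the answer is poorly supported."""
    answer_tokens = _tokens(answer)
    if not answer_tokens or not passages:
        return REFUSAL, []
    # Cite only passages that share at least one token with the answer.
    cited = [p for p in passages if answer_tokens & _tokens(p.text)]
    supported = set().union(*(_tokens(p.text) for p in cited))
    # Fraction of answer tokens that appear somewhere in the cited passages.
    support = len(answer_tokens & supported) / len(answer_tokens)
    if support < min_support:
        return REFUSAL, []
    return answer, [p.doc_id for p in cited]
```

Token overlap is a crude proxy for grounding; a production guardrail would more likely use an entailment model or an LLM-based check behind the same function signature.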

Run it

Tests + evaluation (no infra, no ML deps)

python -m venv .venv && source .venv/bin/activate
pip install pytest
pytest -q
python -m eval.evaluate     # prints precision@k and MRR
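For readers unfamiliar with the two metrics the harness prints, the following is a minimal sketch of how precision@k and MRR are commonly computed. The function names and input shapes are assumptions for illustration; `eval.evaluate` may structure its inputs differently.

```python
def precision_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    """Fraction of the top-k retrieved document ids that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for doc_id in retrieved[:k] if doc_id in relevant)
    # Convention used here: divide by k even if fewer than k results came back.
    return hits / k


def mean_reciprocal_rank(runs: list[tuple[list[str], set[str]]]) -> float:
    """Average of 1/rank of the first relevant result per query (0 if none found)."""
    if not runs:
        return 0.0
    total = 0.0
    for retrieved, relevant in runs:
        for rank, doc_id in enumerate(retrieved, start=1):
            if doc_id in relevant:
                total += 1.0 / rank
                break
    return total / len(runs)
```

With three queries whose first relevant hits sit at rank 2, rank 1, and nowhere, the reciprocal ranks are 0.5, 1.0, and 0, so MRR is 0.5.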

API

pip install fastapi uvicorn pydantic
PYTHONPATH=src CORPUS_DIR=data/corpus uvicorn discovery.api:app --reload
# POST /search   {"query": "..."}
# POST /answer   {"query": "..."}   -> grounded answer + citations
# POST /agent    {"query": "..."}   -> agent run with node trace
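A minimal standard-library client for these endpoints might look like the sketch below. The base URL assumes uvicorn's default port 8000 on localhost, and the helper names `build_request` and `post_query` are hypothetical. The response schema is not documented here, so the sketch returns the decoded JSON as-is.

```python
import json
import urllib.request
from typing import Any

BASE_URL = "http://localhost:8000"  # assumption: uvicorn's default host and port


def build_request(
    endpoint: str, query: str, base_url: str = BASE_URL
) -> urllib.request.Request:
    """Build a JSON POST request carrying the {"query": "..."} body shown above."""
    body = json.dumps({"query": query}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/{endpoint.lstrip('/')}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_query(endpoint: str, query: str, timeout: float = 10.0) -> Any:
    """Send the request and decode the JSON response (requires a running server)."""
    with urllib.request.urlopen(build_request(endpoint, query), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the server running, `post_query("answer", "...")` would call `/answer`, and the same helper works for `search` and `agent`.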

Docker

docker compose up --build   # app on :8000, pgvector on :5433

Components

| Module | Responsibility | Swap for production |
| --- | --- | --- |
| `embeddings.py` | Hashing embedder (default) | sentence-transformers / OpenAI / Azure OpenAI |
| `bm25.py` | BM25 Okapi lexical ranker | OpenSearch / Elasticsearch |
| `vectorstore.py` | In-memory cosine index | pgvector / Qdrant / Milvus |
| `fusion.py` | Reciprocal Rank Fusion | n/a |
| `rerank.py` | Lexical-overlap reranker | bge-reranker / Cohere Rerank |
| `generator.py` | Extractive generator (default) | LLMGenerator (OpenAI/Azure/Bedrock) |
| `agent.py` | SimpleAgent + LangGraph build | LangGraph runtime |
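As an illustration of the fusion step, here is a minimal sketch of Reciprocal Rank Fusion over ranked lists of document ids. The function name and the smoothing constant `k = 60` (the value commonly used in the RRF literature) are assumptions; `fusion.py` may use a different signature or constant.

```python
from collections import defaultdict


def reciprocal_rank_fusion(
    rankings: list[list[str]], k: int = 60
) -> list[tuple[str, float]]:
    """Fuse ranked id lists: each list adds 1 / (k + rank) to a document's score."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by id for deterministic output.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


dense = ["d1", "d2", "d3"]   # hypothetical vector-search ranking
sparse = ["d3", "d1", "d4"]  # hypothetical BM25 ranking
fused_ids = [doc_id for doc_id, _ in reciprocal_rank_fusion([dense, sparse])]
```

RRF uses only rank positions, so dense cosine scores and BM25 scores never need to be normalised onto a common scale. In this example `d1` and `d3` appear in both lists and therefore outrank `d2` and `d4`.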

Documentation

System design + SLOs + capacity
Industry applicability
Business & governance: BRD - SOP - NFR - Cost savings
ADRs: docs/adr/

Tech

Python 3.11+, FastAPI, LangGraph, pgvector (prod), pytest. Pure-Python retrieval core.
