RAG-Based Algorithm Question Answering System

Overview

This project is a Retrieval-Augmented Generation (RAG) system designed to answer Data Structures and Algorithms (DSA) theory questions using a local Large Language Model (LLM). Instead of relying on external APIs, the system performs semantic search over curated algorithm notes and generates context-aware answers in real time.

The goal of this project is to demonstrate applied backend + ML system design, including document ingestion, vector retrieval, and controlled LLM inference.

Key Features

End-to-end RAG pipeline using a local LLM (llama-cpp-python)
Vector-based semantic search with ChromaDB
Structured ingestion of algorithm theory and question datasets
Context-aware answer generation to reduce hallucinations
Interactive Streamlit UI for querying the system

Tech Stack

Language: Python
Vector Database: ChromaDB
LLM Inference: llama-cpp-python (local model)
Frontend: Streamlit
Embeddings: Sentence-transformer-based embeddings

System Architecture

Algorithm notes and question datasets are preprocessed and chunked
Text chunks are converted into embeddings and stored in ChromaDB
User submits a query via Streamlit UI
Relevant chunks are retrieved using semantic similarity search
Retrieved context is injected into the LLM prompt
Local LLM generates a concise, grounded answer
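
The retrieval and prompt-injection steps above can be illustrated with a minimal, standard-library-only sketch. It is not the code in ingest.py or app.py: the bag-of-words `embed` function is a toy stand-in for the sentence-transformer embeddings, the in-memory `store` list stands in for ChromaDB, and every function name, the chunk size, the sample notes, and the prompt wording are assumptions made for illustration. In the real pipeline the resulting prompt would be passed to the local model through llama-cpp-python.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=40):
    """Split text into chunks of at most max_words words (hypothetical chunk size)."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text):
    """Toy embedding: lowercase term counts. A real system would use a learned model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, store, k=2):
    """Return the k chunks whose vectors are most similar to the query vector."""
    q = embed(query)
    ranked = sorted(store, key=lambda item: cosine(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]


def build_prompt(query, context_chunks):
    """Inject retrieved chunks into the prompt so the answer stays grounded in them."""
    context = "\n".join(f"- {c}" for c in context_chunks)
    return (
        "Answer using only the context below. "
        "If the context is insufficient, say so.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    )


# Illustrative notes; the project's actual datasets are algorithms.txt and Questions.txt.
notes = [
    "Dijkstra's algorithm finds shortest paths from a source in graphs "
    "with non-negative edge weights.",
    "Kruskal's algorithm builds a minimum spanning tree by adding the "
    "cheapest edges that do not form a cycle.",
    "Counting sort sorts integers by counting occurrences of each key.",
]

# Ingestion: chunk each note and store (chunk, vector) pairs.
store = [(chunk, embed(chunk)) for note in notes for chunk in chunk_text(note)]

# Query time: retrieve the best-matching chunk and assemble the prompt.
query = "How does Kruskal build a minimum spanning tree?"
top_chunks = retrieve(query, store, k=1)
prompt = build_prompt(query, top_chunks)
print(prompt)
```

Restricting the model to the retrieved context, and telling it to say when that context is insufficient, is one common way to reduce hallucinations. The value of `k` trades off context coverage against prompt length.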

Dataset Coverage

Graph Algorithms: BFS, DFS, Dijkstra, Bellman-Ford, Floyd-Warshall
Minimum Spanning Tree: Kruskal, Disjoint Set Union
Sorting: Counting Sort, Radix Sort, Bucket Sort
Dynamic Programming vs Greedy
NP, NP-Complete, NP-Hard, P vs NP
Backtracking (N-Queens)
String Algorithms: KMP, Rabin-Karp, Huffman Coding

Project Structure

algorithms.txt → Core algorithm theory dataset
Questions.txt → Question-focused dataset
ingest.py → Text preprocessing, embedding, and vector storage
app.py → Backend RAG pipeline
ui_app.py → Streamlit-based user interface
requirements.txt → Dependencies
commands.txt → Execution instructions

Installation & Usage

pip install -r requirements.txt
python ingest.py
python app.py
streamlit run ui_app.py
