RAG Systems and Prompt Engineering — Practical Implementations

A collection of 11 end-to-end implementations covering RAG pipelines, semantic search, prompt engineering, and model evaluation using the Google Gemini API.

What this covers

RAG Pipeline — Full retrieval-augmented generation using Gemini embeddings and cosine similarity
RAG Triad Evaluation — Faithfulness, relevance, and context precision scoring to measure and reduce hallucinations
Semantic vs Keyword Search — BM25 keyword retrieval compared against vector embedding search
Prompt Engineering — Zero-shot, few-shot, and Chain-of-Thought comparisons with quantified output differences
Clinical Document Parsing — Extracting structured data from medical PDF reports using PyMuPDF
Synthetic Dataset Generation — Automated Q&A bank creation exported to Excel using openpyxl
LLM Parameter Analysis — Temperature and top_p comparisons for stochastic vs deterministic output

RAG pipeline architecture

Unstructured Data (PDF/TXT)
        |
   Text Chunking
        |
 Gemini Embedding-001
        |
  In-Memory Vector Store
        |
   User Query --> Query Embedding --> Cosine Similarity Search
                                              |
                                     Top-K Context Retrieval
                                              |
                                   Prompt: Context + Query
                                              |
                                    Gemini 1.5 Pro / Flash
                                              |
                                    Final Answer + RAG Triad Score

Tech stack

Python · Google Generative AI (Gemini SDK) · PyMuPDF · NumPy · openpyxl

Setup

git clone https://github.com/Soum-Code/GenAI-Lab-Experiments.git
cd GenAI-Lab-Experiments
pip install -r requirements.txt

Replace the API key placeholder in each script:

API_KEY = "your_gemini_api_key_here"

Get a free Gemini API key at aistudio.google.com.

Implementations

#	Focus	What it builds
1	LLM Parameters	Temperature and top_p comparison across generation tasks
2	QA Metrics	Automated toxicity, bias, and fluency scoring
3	Lexical Search	BM25 keyword retrieval and its limitations
4	Semantic Search	Vector embeddings with cosine similarity retrieval
5	Document AI	Clinical PDF parsing and structured data extraction
6	Dataset Generation	Synthetic Q&A export to Excel
7	RAG Pipeline	End-to-end documentation retrieval and answering
8	RAG Triad	Faithfulness, relevance, and context precision evaluation
11	Prompt Strategy	Zero-shot vs few-shot performance comparison
12	Chain-of-Thought	Reasoning path elicitation for logic tasks
13	Fine-Tuning Prep	Supervised fine-tuning data preparation logic

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
GenAI_Lab_Experiments		GenAI_Lab_Experiments
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Systems and Prompt Engineering — Practical Implementations

What this covers

RAG pipeline architecture

Tech stack

Setup

Implementations

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAG Systems and Prompt Engineering — Practical Implementations

What this covers

RAG pipeline architecture

Tech stack

Setup

Implementations

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages