Purpose

Develop a simple local AI workflow using the Retrieval-Augmented Generation (RAG) design pattern
Experiment with multiple open-weight AI models and assess performance
Gain exposure and hands-on experience with the latest AI tools

Build

RAG-Based LLM App

Takes a PDF document as input and lets the user ask questions about it via chat
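
The retrieval step behind this kind of app can be illustrated with a minimal, standard-library-only sketch. It is not the Langflow flow from this repository: the word-count "embedding", the function names (chunk_text, embed, cosine, retrieve, build_prompt), and the sample text are all assumptions made for illustration. A real flow would use an embedding model and a vector database instead.

```python
import math
import re
from collections import Counter


def chunk_text(text: str, size: int = 40) -> list[str]:
    """Split text into chunks of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase word counts instead of a real model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the question."""
    q = embed(question)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]


def build_prompt(question: str, context: list[str]) -> str:
    """Combine retrieved chunks and the question into one prompt for the LLM."""
    joined = "\n---\n".join(context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {question}"


if __name__ == "__main__":
    # Hypothetical sample text standing in for the extracted PDF content.
    sample_chunks = [
        "The warranty lasts two years from purchase.",
        "Shipping takes five business days.",
        "Returns are accepted within thirty days.",
    ]
    question = "How long is the warranty?"
    print(build_prompt(question, retrieve(question, sample_chunks, k=1)))
```

In the actual app, the text would come from the uploaded PDF and the resulting prompt would be sent to the local model.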

Tech Stack

AI Framework: Langflow
LLM: Gemma 4, served locally through Ollama
Vector Database: DataStax Astra DB

Architecture

Flow

Sample Chat Input / Output

Insights

Switching from qwen3.5:latest to gemma4:latest cut average response time by more than half
- Average of 50 s for qwen3.5 versus 20 s for gemma4, roughly 2.5x faster
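
A comparison like this could be reproduced with a small timing harness. The sketch below is an assumption about how one might measure it, not the method used for the numbers above: it calls Ollama's local HTTP API directly (default port 11434), so it times generation only and leaves out retrieval and Langflow overhead. The helper names ollama_generate and average_latency are hypothetical.

```python
import json
import time
import urllib.request
from statistics import mean
from typing import Callable

# Ollama's default local endpoint; assumed unchanged in this setup.
OLLAMA_URL = "http://localhost:11434/api/generate"


def ollama_generate(model: str, prompt: str) -> str:
    """Send one non-streaming generation request to a local Ollama server."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()
    request = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=300) as response:
        return json.load(response)["response"]


def average_latency(generate: Callable[[str], str], prompts: list[str]) -> float:
    """Return the mean wall-clock seconds `generate` takes per prompt."""
    if not prompts:
        raise ValueError("prompts must not be empty")
    timings = []
    for prompt in prompts:
        start = time.perf_counter()
        generate(prompt)
        timings.append(time.perf_counter() - start)
    return mean(timings)


# Example usage (requires a running Ollama server with both models pulled):
#   prompts = ["Summarize the document in one sentence."]
#   for model in ("qwen3.5:latest", "gemma4:latest"):
#       seconds = average_latency(lambda p, m=model: ollama_generate(m, p), prompts)
#       print(f"{model}: {seconds:.1f} s")
```

Running the same prompts against both models, ideally several times each, makes the averages comparable.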

Local Deployment

Clone the Git repository to your local environment
From the repository root, run: docker compose up
Open the local Langflow instance at: http://localhost:7860
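
The containers may take a while to start, so it can help to wait until the URL above responds before opening it. The following is a minimal sketch using only the Python standard library; wait_until_up is a hypothetical helper, and the timeout and polling interval are arbitrary assumptions.

```python
import time
import urllib.error
import urllib.request


def wait_until_up(
    url: str = "http://localhost:7860",
    timeout: float = 120.0,
    interval: float = 2.0,
) -> bool:
    """Poll `url` until the server answers or `timeout` seconds pass."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=5):
                return True
        except urllib.error.HTTPError:
            # The server answered, even if with an error status code.
            return True
        except OSError:
            # Connection refused or timed out: not ready yet, retry shortly.
            time.sleep(interval)
    return False


if __name__ == "__main__":
    print("Langflow is up" if wait_until_up() else "Langflow did not respond in time")
```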

