Skip to content

kdayno/gemma-flow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Gemma Flow Header

Purpose

  • Develop a simple local AI workflow using the Retrieval-Augmented Generation (RAG) AI design pattern
  • Experiment with multiple open-weight AI models and assess performance
  • Gain exposure and hands-on experience with the latest AI tools

Build

RAG-Based LLM App

  • Takes PDF document as input and allows user to ask questions about the document via chat

Tech Stack

  • AI Framework: LangFlow
  • LLM Model: Gemma 4 + Ollama
  • Vector Database: DataStax Astra DB

Architecture

Gemma Flow Architecture Diagram

Flow

Flow Diagram Image

Sample Chat Input / Output

Flow Diagram Image

Insights

  • Using model gemma4:latest over qwen3.5:latest improved response times significantly
    • Average of 50s for qwen3.5 verseus average of 20s for gemma4

Local Deployment

  1. Clone Git repository in local environment
  2. Execute: docker compose up
  3. Access local instance of Langflow at: http://localhost:7860

About

Simple RAG-based AI workflow

Resources

License

Stars

Watchers

Forks

Contributors