Cortex — Support Co-Pilot

Your personal Support Co-Pilot — 100% local, private, and always on-tone.

Built for QVAC Hackathon I — Unleash Edge AI · General Purpose Track · June 2026

Cortex is a production-ready desktop application that helps customer support agents draft fast, professional, perfectly on-tone replies — with zero cloud dependencies, zero API keys, and zero data leaving the machine.

All AI inference and RAG run entirely on-device via the @qvac/sdk — Tether's local inference engine. No OpenAI. No Anthropic. No external APIs of any kind. One laptop, full capability.

Track: General Purpose — runs on Apple Silicon Macs (16 GB RAM recommended, tested on M-series). The QVAC SDK handles model loading, streaming completions, embeddings, and RAG entirely on the local machine.

This project is open source. We believe great support tools should be transparent, customizable, and community-driven.

The Problem

Customer support agents in crypto and financial services handle sophisticated users and high-stakes tickets every day. Every reply must be:

Direct but polite — no fluff, no corporate-speak
Security- and compliance-first (TXID, 2FA, KYC, withdrawal verification)
Consistent with the brand's expert, no-nonsense voice
Fast — hundreds of tickets per shift

Today agents either write everything from scratch (slow, inconsistent) or use cloud-based AI tools (privacy risk, cost at scale, vendor lock-in).

Cortex solves this entirely on-device.

What Cortex Does

Chat Agent — paste a ticket, get a professional draft in seconds. Streaming output. One-click "Use as Response."
Grammar & Style — polish any draft to match the exact tone and professionalism standard
Smart Translate — EN ↔ ES ↔ FR ↔ PT ↔ DE ↔ IT ↔ ZH while preserving technical terminology
Response Templates — 6 pre-built templates for the most common ticket types (withdrawal issues, missing deposits, KYC, API problems, security concerns, general acknowledgements)
Local RAG — point to any folder of .md, .txt, or .pdf docs. QVAC embeds them locally. The agent cites sources in replies.
Full customization — editable system prompt, tone presets, extra instructions, behavior toggles. No code changes required.

Result: Every agent replies faster, with higher consistency and quality, while staying fully in control. And your customers' data never leaves the building.

QVAC SDK Integration

Cortex uses @qvac/sdk for all local AI operations:

Operation	QVAC feature used
LLM inference (chat, grammar, translate, templates)	`llamacpp-completion` via `loadModel` + `streamCompletion`
Knowledge base embeddings	`EMBEDDINGGEMMA_300M_Q4_0` via `loadModel`
RAG retrieval	`searchKnowledge` with locally indexed chunks
Model download + caching	QVAC registry (`LLAMA_3_2_1B_INST_Q4_0`, `QWEN3_*`)

The SDK runs in a Node.js sidecar (src-tauri/qvac-host.cjs) spawned by the Tauri Rust backend. This keeps @qvac/sdk (a Node/Bare runtime package) completely out of the webview bundle while giving the React UI full access to streaming completions and RAG via a clean IPC bridge.

Recommended models (all downloaded on first use, cached locally in ~/.qvac/models):

LLAMA_3_2_1B_INST_Q4_0 — default, ultra-light (~0.5–1 GB), fastest daily driver
QWEN3_1_7B_INST_Q4 — excellent instruction following
QWEN3_4B_INST_Q4_K_M — best quality/weight trade-off

Quick Start

Prerequisites

macOS (Apple Silicon recommended)
Rust (via rustup)
pnpm

Run in Development

git clone https://github.com/fran011245/cortex-support
cd cortex-support
pnpm install
pnpm tauri dev

On first chat send, Cortex auto-downloads the configured model via the QVAC registry (live progress shown in UI). Subsequent runs are instant from the local cache at ~/.qvac/models.

Production Build

pnpm tauri build

Produces:

src-tauri/target/release/bundle/macos/Cortex.app
src-tauri/target/release/bundle/dmg/Cortex_0.1.0_aarch64.dmg

Run Tests

pnpm test:run   # single pass
pnpm test       # watch mode
pnpm test:ui    # browser UI

⌨️ Keyboard Shortcuts

Shortcut	Action
`⌘,` / `Ctrl+,`	Open / close Settings
`⌘N` / `Ctrl+N`	New conversation
`⌘K` / `Ctrl+K`	Focus message input
`Enter`	Send message
`Shift+Enter`	New line in composer
`Esc`	Stop current generation

How to Customize the Agent (No Code Required)

All customization lives in Settings (⌘,). Changes apply instantly to new generations.

Agent System Prompt

Pre-filled with a strong default for crypto/fintech support. Fully editable. Restore default with one click.

Live Effective Prompt (transparency hero)

The Agent Prompt tab shows exactly what the model receives — base prompt + active Tone Rules + Extra Instructions — with a live token estimate and one-click copy. You always know what the agent "thinks."

Tone Rules & Style Presets

Presets: Professional (default), Concise, Detailed, Empathetic
Fine-grained toggles: full sentences, no emojis, direct-but-polite, prioritize security warnings
Max reply length slider

Extra Instructions

Free-form text appended to every prompt:

"Always mention the ticket ID at the top."
"For corporate clients, use last name only."
"Never promise specific timelines."

Knowledge Base (RAG)

Pick a folder of .md, .txt, .pdf files
QVAC embeddings index them locally (one click, no cloud)
Agent automatically pulls relevant context and cites sources
Toggle "Enable RAG" to activate

Export / Import

Export your full agent configuration (prompt + rules + model prefs + RAG path) as JSON. Share across the team or version-control it.

Architecture

Tauri 2 (Rust backend)
  └── spawns qvac-host.cjs (Node.js sidecar)
        └── @qvac/sdk — model loading, embeddings, streaming completions

React 19 frontend (webview)
  └── sends IPC commands to Rust → forwarded as NDJSON to qvac-host
  └── receives streaming tokens back via IPC events

Stack:

Desktop: Tauri 2 — native macOS .app / .dmg, tiny footprint
Frontend: React 19 + TypeScript + Vite + Tailwind v4 + shadcn/ui + Zustand
AI Engine: @qvac/sdk (all inference + embeddings + RAG — 100% local)
Persistence: Tauri Store plugin (settings) + localStorage (chat sessions)
Theme: Deep navy (#0A0F1C) + accent blue (#3B82F6) + glassmorphism

Why a Node sidecar? @qvac/sdk uses native addons and the Bare/Hypercore runtime — it cannot be bundled into a browser webview. Running it as a child process is the clean, stable solution. All communication is NDJSON over stdin/stdout, brokered by Tauri IPC.

Why Open Source?

Support work is high-stakes, high-context, and deeply human. The tools agents use should reflect that.

Transparency & Trust — Teams handling sensitive financial accounts need to audit exactly what the AI sees. 100% local + the Live Effective Prompt is table stakes.
No vendor lock-in — Every team has its own voice and policies. Cortex lets teams evolve the agent entirely through Settings. Fork, extend, or self-host anytime.
Respect for the craft — Support agents are experts. Cortex amplifies their expertise, it doesn't replace their judgment.

We're not trying to build the next big AI company. We're building a tool that makes excellent support work a little easier, clearer, and more consistent.

Contributing

We welcome contributions:

Bug reports and feature requests (open an issue)
New agent tools or UI improvements
Better onboarding or settings experience
Documentation, translations, example knowledge base folders
Cross-platform support (currently macOS / Apple Silicon)

Please open an issue first for bigger changes to align on the approach.

Demo

A ready-to-record demo script (2:30–3:00 min, hackathon-optimized) is in DEMO_VIDEO_SCRIPT.md.

Suggested flow: App launch → model load → paste a support ticket → streaming draft → "Use as Response" → Grammar tool → Response Templates → Settings customization → RAG folder setup.

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.github/workflows		.github/workflows
.vscode		.vscode
assets		assets
docs		docs
public		public
src-tauri		src-tauri
src		src
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
DEBUG_MODEL_LOADING.md		DEBUG_MODEL_LOADING.md
DEMO_VIDEO_SCRIPT.md		DEMO_VIDEO_SCRIPT.md
LICENSE		LICENSE
NEXT_PHASE_ANALYSIS.md		NEXT_PHASE_ANALYSIS.md
PRODUCT_FEATURES.md		PRODUCT_FEATURES.md
README.md		README.md
SESSION_2026-06-06_PHASE2_DONE.md		SESSION_2026-06-06_PHASE2_DONE.md
SESSION_2026-06-06_USAGE_STATS.md		SESSION_2026-06-06_USAGE_STATS.md
SESSION_MODEL_GUIDE_MAC.md		SESSION_MODEL_GUIDE_MAC.md
components.json		components.json
cortex-support@0.1.0		cortex-support@0.1.0
index.html		index.html
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tailwind.config.ts		tailwind.config.ts
tauri		tauri
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cortex — Support Co-Pilot

The Problem

What Cortex Does

QVAC SDK Integration

Quick Start

Prerequisites

Run in Development

Production Build

Run Tests

⌨️ Keyboard Shortcuts

How to Customize the Agent (No Code Required)

Agent System Prompt

Live Effective Prompt (transparency hero)

Tone Rules & Style Presets

Extra Instructions

Knowledge Base (RAG)

Export / Import

Architecture

Why Open Source?

Contributing

Demo

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Cortex — Support Co-Pilot

The Problem

What Cortex Does

QVAC SDK Integration

Quick Start

Prerequisites

Run in Development

Production Build

Run Tests

⌨️ Keyboard Shortcuts

How to Customize the Agent (No Code Required)

Agent System Prompt

Live Effective Prompt (transparency hero)

Tone Rules & Style Presets

Extra Instructions

Knowledge Base (RAG)

Export / Import

Architecture

Why Open Source?

Contributing

Demo

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages