🍵 matcha

Simple. Efficient. Deliberate. Never twice.

Anti-bloat engineering convention for AI coding agents.
6 agents · 6 commands · 16 rule sets · 3 lifecycle hooks · 12 platforms

Why matcha?

Every AI coding session starts the same: jump in, write code, realize mid-way you missed something obvious. matcha is a 6-checkpoint filter that runs before, during, and after every implementation — catching the things you'd catch yourself if you stopped to think.

No bloat. Just a deliberate gate between you and messy code.

Quick Setup

Three ways to install:

Method Works on Command

curl | bash

Any AI agent (auto-detects 12 platforms)

curl -fsSL https://raw.githubusercontent.com/plumpslabs/matcha/main/install.sh | bash

/plugin marketplace

Claude Code only

/plugin marketplace add https://github.com/plumpslabs/matcha
/plugin install matcha@plumpslabs-matcha

agy plugin

Antigravity CLI

agy plugin install https://github.com/plumpslabs/matcha

See QUICKSTART.md for the full 5-minute guide.

The 6-Checkpoint Filter

Every implementation passes through these gates:

🎯 Purpose  →  🔎 Reuse  →  🔍 Stack  →  🛠️ Implementation  →  🧹 Cleanup  →  ✅ Verify

#	Checkpoint	What it enforces
🎯	Purpose	5W1H — What/Why/Who/When/Where/How. Can't answer Why/How? → STOP.
🔎	Reuse	Before writing: search codebase for existing implementations. Never write what already exists.
🔍	Stack	Scan manifests, services, deps. Overlap found? → STOP. Report.
🛠️	Implementation	No hardcode. Explicit errors. One function. "Is there a simpler path?"
🧹	Cleanup	Done = working AND clean. Temp files, debug code, unused imports. Decision log.
✅	Verify	Auto-detect + run tests, typecheck, lint. Tests fail? → STOP and fix first.

Philosophy

matcha enforces universal engineering principles — language-agnostic, framework-agnostic, team-agnostic. Every rule traces back to one of these core ideas:

Principle	What it means	Applied in
Type-safe by default	No escape hatches, no `any` casting. Let the type system work for you.	Writing rules
CQS	Commands change state (return void). Queries return data (no side effects). Never both.	Writing rules
Idempotency	Mutations must be retry-safe. Use idempotency keys for payments, webhooks, and critical writes.	Writing rules
Validate at boundaries	Validate inputs at the outer layer (controller/handler), not deep in service logic. Fail before mutation, not after partial write.	Writing rules
Contract-first	Draft response shape / component props / API contract before implementation. No contract = no code.	Checkpoint 1 (5W1H)
Error shape consistency	All endpoints return errors in the same format. FE shouldn't guess error structure.	Writing rules
State origin awareness	Before adding state: is this server state, client state, or shared state? Prevents state management refactors.	Writing rules
Observability	Structured logging, no `console.log`. Every log line should be parsable and actionable.	Writing rules + PostToolUse hook
Pure functions first	Side effects at boundaries. Business logic should be pure — testable without mocks.	Writing rules
Fail fast	Validate config at startup, not at first use. Catch misconfig before it reaches production.	Config validation
Performance awareness	Watch for N+1 queries, O(n²+) loops, and unnecessary allocations in hot paths.	Performance rules + PostToolUse hook

Companion Tools

matcha pairs naturally with other tools from the same ecosystem:

🐻 Kuma — Runtime safety enforcement. MCP server that blocks dangerous operations before they execute. Complements matcha-shield. Use when handling sensitive data or production infrastructure.
🦊 Fennec — AI-native developer observability. MCP server that gives AI agents browser, terminal, and process visibility. Complements matcha's debugging and review checkpoints.

Together they form a complete stack: matcha 🍵 (philosophy) + kuma 🐻 (safety) + fennec 🦊 (observability).

Agents

Agent	Checkpoint	Tools	When to use
`matcha-planner`	🎯 Purpose	Read Grep Glob	Before starting features, refactoring, architecture (plan only, no exec)
`matcha-finder`	🔎 Reuse	Read Grep Glob Bash	Before writing any new code — reuse hunter
`matcha-auditor`	🔍 Stack	Read Grep Glob Bash	Stack audits, service overlap, health checks
`matcha-reviewer`	🛠️ + 🧹 + ✅	Read Grep Glob	Post-implementation review + verify tests passed
`matcha-cleaner`	🧹 Cleanup	Read Grep Glob Bash	Temp removal, debug code, unused imports
`matcha-debugger`	🎯 → ✅ (full)	Read Grep Glob Bash	Systematic debugging — one hypothesis at a time

Invoke: @matcha-reviewer or let the agent auto-route via description.

Slash Commands

Command	Description	Where
`/matcha:why`	5W1H check before starting a task	Claude, OpenCode
`/matcha:review`	Post-implementation review	Claude, OpenCode
`/matcha:audit`	Stack audit for overlaps & inefficiencies	Claude, OpenCode
`/matcha:observe\|enforce\|audit`	Set intensity level	Claude, OpenCode
`/matcha:status`	Session health + component availability	Claude, OpenCode
`node bin/matcha.js status`	Show version, platform, shield status	Cloned repo
`node bin/matcha.js why`	5W1H interactive check (piped input)	Cloned repo
`node bin/matcha.js audit`	Quick project stack audit	Cloned repo
`node bin/matcha.js verify`	Auto-run tests + typecheck + lint	Cloned repo

🛡️ Hooks System

matcha ships 3 lifecycle hooks that enforce philosophy deterministically — no prompt engineering, no guessing.

1. PreToolUse — `matcha-shield.js`

A deterministic safety gate that:

Blocks codebase modifications/commands before planning: In enforce (default) and audit modes, it blocks all write/edit tools and terminal execution commands until you create/update the 5W1H plan in .agents/matcha-plan.md (using a valid <matcha_gate> XML block with at least 15 characters per section).
Blocks dangerous commands before they reach the OS:

Blocked	Example
Root filesystem deletion	`rm -rf /`, `rm -rf ~`, `rm -rf .`
Permission abuse	`chmod 777`
Destructive git	`git push --force`
Database destruction	`DROP DATABASE`, `TRUNCATE TABLE`
Remote code execution	`curl \| bash`, `wget \| sh`
Disk corruption	Write to `/dev/sda`, `mkfs`, `dd` to block device
System commands	`shutdown`, `reboot`, `init 0`

Override: MATCHA_SHIELD_OFF=true (not recommended)

Architecture:

CLI mode — invoked by Claude Code PreToolUse hook, reads event from stdin, exits code 2 to block
Programmatic API — ESM export via beforeToolUse(event, context) for use in other platforms

Platform support:

Platform	Integration
Claude Code	`.claude/settings.json` — PreToolUse hook
OpenCode	`tool.execute.before` in `matcha.mjs` plugin
Qoder	`hooks.json` — `before:tool_use` event
Antigravity	`gemini-extension.json` — tool lifecycle

Dual-mode: Works both as a Claude Code hook (stdin/stdout) and programmatic API (ESM import).

2. PostToolUse — `matcha-post-write.js`

Automatically scans modified files after every Write/Edit for cleanup issues and returns additionalContext so the agent can self-correct.

Severity	Check	Pattern
🔴 Critical	Empty catch block	Error silently swallowed
🔴 Critical	Hardcoded credential	API key/secret/password/token
🟡 Minor	Debug log/statement	`console.log`, `print()`, `debugger`
🟡 Minor	TODO/FIXME	Unfinished work left in code
🟡 Minor	Unbounded query	`SELECT` without `LIMIT`
🟡 Minor	High OFFSET	Consider cursor pagination
🟡 Minor	Function in WHERE	`YEAR()`, `MONTH()`, `LOWER()` — index won't be used

How it works:

After Write/Edit tool completes, hook fires
Scans the modified file for 4 check types
If issues found → injects additionalContext into agent's reasoning chain
If clean → silent exit (zero overhead)

3. Stop — `matcha-stop.js`

Replaces the old prompt-based "surface 3 tips" instruction with a deterministic stop event hook that scans git diff for real issues.

Tip	Scan	Trigger
⚡ Efficiency	Debug code, empty catches, oversized functions	`git diff --unified=0`
🔎 Reuse	New functions count, import density	`git diff --diff-filter=AM`
🧹 Cleanup	Untracked files, temp artifacts	`git status --porcelain`

Why deterministic:

Old approach: "include 3 tips at the end" → LLM guesswork, inconsistent
New approach: scan actual git diff → factual tips every time

Platform fallback: For platforms without hook support (OpenCode), an inline End-of-Task checklist is injected via session.created system prompt.

Hook registration (.claude/settings.json):

{
  "hooks": {
    "PreToolUse": [{
      "matcher": "Bash",
      "hooks": [{
        "type": "command",
        "command": "node ${CLAUDE_PLUGIN_ROOT}/hooks/matcha-shield.js",
        "timeout": 5000
      }]
    }],
    "PostToolUse": [{
      "hooks": [{
        "type": "command",
        "command": "node ${CLAUDE_PLUGIN_ROOT}/hooks/matcha-post-write.js",
        "timeout": 3000
      }]
    }],
    "Stop": [{
      "hooks": [{
        "type": "command",
        "command": "node ${CLAUDE_PLUGIN_ROOT}/hooks/matcha-stop.js",
        "timeout": 5000
      }]
    }]
  }
}

Benchmarks

matcha ships with two benchmark modes:

⚖️ Compliance Score

Scans your project for matcha compliance issues:

Usage:
  node benchmark/matcha-bench.js            ← scan current dir
  node benchmark/matcha-bench.js ./src      ← scan specific dir
  node benchmark/matcha-bench.js --json     ← JSON output
  npm run benchmark                         ← via npm script
  npm run benchmark:json                    ← JSON via npm script

Scoring:

A+ (95-100): Excellent — matcha compliant
A (85-94): Good — minor issues
B (70-84): Needs attention
C (50-69): Poor — needs cleanup
F (0-49): Critical — major cleanup required

Checks: Debug logs, TODO/FIXME, empty catch blocks, hardcoded credentials, hardcoded URLs.

Baseline comparison: node benchmark/matcha-bench.js --json --baseline baseline.json

🤖 Agentic Benchmark

Two modes:

Static A/B/C — compares pre-written solutions for baseline, terse, and matcha:

node benchmark/matcha-bench.js --agentic
node benchmark/matcha-bench.js --agentic --json   # JSON output

Live A/B/C — spawns real Claude Code sessions for each arm (requires Claude Code CLI):

# Full live run — spawns Claude Code for each task × arm
node benchmark/agentic-runner.js

# Simulated mode (uses default solutions, no Claude needed)
node benchmark/agentic-runner.js --simulate

# Single arm or task
node benchmark/agentic-runner.js --arm matcha
node benchmark/agentic-runner.js --task email-validator

# Keep temp directories for inspection
node benchmark/agentic-runner.js --keep

# Also accessible via matcha-bench.js
node benchmark/matcha-bench.js --agentic-live
node benchmark/matcha-bench.js --agentic-live --simulate

How it works:

For each task, creates an isolated temp directory
Writes the task specification as a prompt file
matcha arm — injects .claude/CLAUDE.md with core matcha rules into the temp project
Spawns claude --print with the spec as input
Waits for solution.js to be generated (120s timeout)
Runs the task's test.js against the generated solution
Measures LOC, tracks correctness, cleans up

Arms:

Arm	Description	Purpose
baseline	Standard implementation	Control — no conventions
terse	Just "be brief"	Control — mere brevity vs structural rules
matcha	Full matcha conventions	Test subject

Scoring per task:

✅ Correctness gate — output matches expected results
🛡️ Adversarial gate — handles edge cases (null, empty, type mismatches) without crashing
📐 LOC & tokens — code volume comparison across all 3 arms

Tasks (7 total):

Type	Tasks	What it tests
🧪 Surgical	Email validator, Debounce, CSV sum, FizzBuzz, Array flatten	Single-function precision — does matcha reduce code without breaking correctness?
🏗️ Over-build	Date formatter, Log level filter	Real feature tickets that tempt over-engineering — does matcha prevent unnecessary abstractions?

The terse arm is the key control: if matcha beats terse on correctness + adversarial while using less code, it proves matcha rules work — the improvement isn't just because the agent writes shorter code.

Sample output:
── Email Validator ──
  ✅ baseline   12 LOC, ~ 53 tok
  ✅ terse       4 LOC, ~ 35 tok  (-8 LOC, -17%)
  ✅ matcha      3 LOC, ~ 30 tok  (-9 LOC, -19%)  🛡️

═══ Summary ═══
                  baseline    terse    matcha
Total LOC:             52       37        27
Total tok:            ~278     ~187      ~158
Correct:              5/7      5/7       5/7
Adversarial:          5/7      5/7       7/7

  📐 matcha vs baseline: -25 LOC (-48%)
  📐 terse  vs baseline: -15 LOC (-29%)
  🎯 matcha beats terse by 10 LOC — rules > mere brevity

Language Rules

matcha ships per-language coding standards for your tech stack:

Language / Framework	Files	Auto-activates when editing
Common	`matcha-common`	Always (testing, git, conventions)
Redis	`matcha-redis`	All files (caching patterns)
Tailwind CSS	`matcha-tailwind`	`.css`, `.tsx`, `.jsx`, `.html`, `*.vue`
TypeScript/JS	`matcha-typescript`	`.ts`, `.tsx`, `.js`, `.jsx`
React	`matcha-react`	`.tsx`, `.jsx`
Next.js	`matcha-nextjs`	`.tsx`, `.ts`
TanStack	`matcha-tanstack`	`.ts`, `.tsx`
Angular	`matcha-angular`	`*.ts`
NestJS	`matcha-nestjs`	`*.ts`
Nuxt	`matcha-nuxt`	`.vue`, `.ts`
Go	`matcha-go`	`*.go`
Python	`matcha-python`	`*.py`
PHP	`matcha-php`	`*.php`
Java	`matcha-java`	`*.java`
React Native	`matcha-react-native`	`.tsx`, `.jsx`

Rules are available on all platforms with adapter-specific formatting (Cursor .mdc, Kiro steering, Windsurf rules, etc.).

Platform Adapters

matcha adapts to 12 platforms, each with its own file format and lifecycle model.

Platform	Files	Key Features
Claude Code	`.claude/agents/`, `.claude/commands/`, `.claude/settings.json`	6 agents + 6 commands + 3 lifecycle hooks + skill
OpenCode	`.opencode/agents/`, `.opencode/plugins/matcha.mjs`	6 agents + 6 commands + lifecycle plugin (`tool.execute.before`, `session.created`)
Cursor	`.cursor/rules/matcha-*.mdc` (20 files)	4 scoped rules: core (alwaysApply), cleanup (globs), audit (manual), review (manual) + 15 language rules + 1 combined legacy
Windsurf	`.windsurfrules` (root) + `.windsurf/rules/*.md`	Root `.windsurfrules` read by Cascade AI + 16 per-language `.md` rules
Kiro	`.kiro/steering/matcha*.md` (17 files)	`inclusion: always` for core, `inclusion: manual` for dev/review modes
Cline / Roo Code	`.clinerules/matcha*.md`	Per-language `.clinerules` files
OpenClaw	`.openclaw/skills/matcha/SKILL.md`	Skill file only
Qoder	`.qoder/`	AGENTS.md + agents + rules + shield hook
Qwen Code	`.qwen/`	QWEN.md + skill + settings.json
Codebuff / agy	`.agents/` (or global)	agents + commands + rules + skill (universal format)
Antigravity CLI	`GEMINI.md` + `gemini-extension.json`	GEMINI.md convention + deprecated extension manifest
Any / None	`.agents/` (auto-created)	Universal format — agents + rules + commands + skill

Adapter Details

Cursor — Scoped .mdc files:

matcha-core.mdc — alwaysApply: true, under 200 words, loaded on every request
matcha-cleanup.mdc — globs: ["**/*.{ts,tsx,js,jsx,py,go,java,php}"], auto-triggered on source files
matcha-audit.mdc — alwaysApply: false, on-demand via @matcha-auditor
matcha-review.mdc — alwaysApply: false, on-demand via @matcha-reviewer
15 language .mdc files with globs scoped to respective file types

Kiro — Steering files:

matcha.md — inclusion: always (loaded every turn), trimmed to core rules (<200 words)
dev-mode.md, review-mode.md, language files — inclusion: manual (invoked as needed)
Uses native Kiro inclusion mode instead of Cursor's alwaysApply/globs

OpenCode — Plugin with lifecycle hooks:

session.created — injects matcha system prompt every session
tool.execute.before — shield enforcement for dangerous commands
3 slash commands: matcha:why, matcha:audit, matcha:review
No tool.execute.after needed (cleanup handled via inline instructions)

Antigravity CLI / Gemini:

GEMINI.md — convention file with core rules + intensity + agent list
gemini-extension.json — deprecated (kept for backward compat until v3.0.0), migration guide in GEMINI.md

Intensity Levels

Level	Behavior
observe	Tips only. No blocking.
enforce	Full 6-checkpoint filter. Default.
audit	Enforce + mandatory cleanup. Everything flagged.

Set with /matcha:audit or /matcha:observe in supported platforms.

Communication

When matcha flags something:

🍵 matcha: [TITLE]

Observation: [what was found]
Why it matters: [impact]
Options:
  A) [option] — [trade-off]
  B) [option] — [trade-off]

Recommendation: [which and why]

✍️ Writing Style

matcha ships a writing style guide (rules/common/writing-style.md) with 10 rules derived from matcha's own DNA — not borrowed or copied. Every rule traces back to matcha's core philosophy: Simple, Efficient, Deliberate, Never twice, 5W1H.

The 10 Rules

Rule	Principle	Domain	Mechanical Detection
RULE-01: Direct Sentences	Efficient	Commits, docs, comments	✅ Filler phrases (`in order to`, `due to the fact that`)
RULE-02: Comments = Why, Not What	Deliberate	Code comments	— (via agent system prompt)
RULE-03: Actionable Error Messages	Actionable	Error messages	— (via agent system prompt)
RULE-04: Conventional Commits	Clean	Commit messages	✅ Vague commit (`WIP`, `fix bug`, `update`)
RULE-05: Concrete, Not Abstract	Never twice	All docs	— (via agent system prompt)
RULE-06: Active Voice	Simple	All docs	✅ Passive voice (`was done`, `has been implemented`)
RULE-07: PR with 5W1H	5W1H	PR descriptions	— (via agent system prompt)
RULE-08: No Dead Buzzwords	Simple	All docs	✅ Buzzwords (`leverage`, `cutting-edge`, `synergy`)
RULE-09: Single Source of Truth	Never twice	Documentation	— (via agent system prompt)
RULE-10: Tone Casual-Direct	matcha tone	All communication	— (via agent system prompt)

Enforcement

Writing quality is enforced through two layers:

Layer 1 — System prompt (soft): The rules/common/writing-style.md file is auto-loaded as a common rule on all platforms. The agent reads it at session start and follows the rules during generation.

Layer 2 — PostToolUse hook (deterministic): The matcha-post-write.js hook scans modified .md, .txt, and COMMIT_EDITMSG files for mechanically detectable violations:

Check	Detects	Format
🔵 Filler phrases	`in order to`, `due to the fact that`, `it is important to note that`	🟢 Writing style
🔵 Passive voice	`was done`, `has been implemented`, `will be processed`	🟢 Writing style
🔵 Dead buzzwords	`leverage`, `cutting-edge`, `synergy`, `paradigm shift`	🟢 Writing style
🔵 Vague commit	`fix bug`, `WIP`, `update`, `changes` (on COMMIT_EDITMSG only)	🟢 Writing style

Philosophy

These writing rules are not copy-pasted from other projects. Every rule is born from the matcha DNA:

RULE-01 (Direct Sentences) ← Efficient: filler = bloat, just like code bloat
RULE-02 (Comment = Why) ← Deliberate: code = what, comment = why
RULE-06 (Active Voice) ← Simple: shorter, clearer
RULE-09 (Single Source of Truth) ← Never twice: information must not be duplicated
RULE-07 (PR with 5W1H) ← 5W1H: aligned with the Purpose checkpoint

Tone: casual-direct with light sarcasm. Not stiff-formal (no "Dear Sir/Madam"), not too informal (no "yo dude"). Target: chatting with a senior engineer you respect.

Scope

Domain	Applicable	Enforcement
Commit messages	✅	PostToolUse hook + system prompt
Code comments	✅	System prompt
PR descriptions	✅	System prompt
Error messages	✅	System prompt
README/docs	✅	PostToolUse hook + system prompt
External documentation	❌ (use judgment)	—
Regulatory/compliance	❌ (use judgment)	—

CLI (from cloned repo)

node bin/matcha.js status      Show version, platform, components
node bin/matcha.js why          5W1H interactive check (piped input supported)
node bin/matcha.js audit        Quick project stack audit
node bin/matcha.js verify       Auto-detect + run tests, typecheck, lint
node bin/matcha.js help         Show usage

Supports: npm test, jest, vitest, pytest, go test, cargo test, phpunit, mvn test, gradle test, make test.

Install via:

curl -fsSL https://raw.githubusercontent.com/plumpslabs/matcha/main/install.sh | bash

Contributing

Contributions are welcome! If you want to add support for a new platform, update language guidelines, or run local benchmarks and tests, please see our CONTRIBUTING.md guide.

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
.agents		.agents
.claude-plugin		.claude-plugin
.claude		.claude
.clinerules		.clinerules
.cursor		.cursor
.github/workflows		.github/workflows
.kiro/steering		.kiro/steering
.openclaw/skills/matcha		.openclaw/skills/matcha
.opencode		.opencode
.windsurf/rules		.windsurf/rules
benchmark		benchmark
bin		bin
commands		commands
docs		docs
hooks		hooks
public		public
rules		rules
scripts		scripts
skills/matcha		skills/matcha
tests		tests
.gitignore		.gitignore
.windsurfrules		.windsurfrules
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile.bench		Dockerfile.bench
GEMINI.md		GEMINI.md
LICENSE		LICENSE
QUICKSTART.md		QUICKSTART.md
QWEN.md		QWEN.md
README.md		README.md
UPGRADE.md		UPGRADE.md
gemini-extension.json		gemini-extension.json
install.sh		install.sh
package-lock.json		package-lock.json
package.json		package.json
plugin.json		plugin.json
promptfooconfig.yaml		promptfooconfig.yaml
vitest.config.js		vitest.config.js

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

🍵 matcha

Why matcha?

Quick Setup

The 6-Checkpoint Filter

Philosophy

Companion Tools

Agents

Slash Commands

🛡️ Hooks System

1. PreToolUse — matcha-shield.js

2. PostToolUse — matcha-post-write.js

3. Stop — matcha-stop.js

Benchmarks

⚖️ Compliance Score

🤖 Agentic Benchmark

Language Rules

Platform Adapters

Adapter Details

Intensity Levels

Communication

✍️ Writing Style

The 10 Rules

Enforcement

Philosophy

Scope

CLI (from cloned repo)

Contributing

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 11

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. PreToolUse — `matcha-shield.js`

2. PostToolUse — `matcha-post-write.js`

3. Stop — `matcha-stop.js`

Packages