Every conversation your agent has ever had — across every session, every project — is already in the database. It's just not looking.
OpenCode stores the full conversation history from every session your agent has ever run — messages, tool calls, tool outputs, reasoning traces. All of it. Not just the current session. Not just the current project. Every project on the machine. Even after compaction shrinks what the model can see, the original content stays in the database — just no longer visible to the agent.
This plugin gives the agent five tools to search and retrieve all of it on demand.
No new database. No embeddings. No summarization. No duplication. No overhead.
Just install the plugin. The agent gains access to its entire history.
Your agent solves a tricky build error. Twenty minutes later, compaction runs. An hour later, the same error shows up. The agent starts from zero — debugging something it already figured out, while the answer sits in the database it's connected to.
You built rate-limiting middleware in your API project last week. Now you need it in another project. The agent has no idea it ever existed — while the original implementation, the requirements discussion, the edge cases you worked through, all of it is sitting in the same database, in a session from a different project.
You're 200 tool calls and 3 compactions deep. The agent has drifted from your original request. Your exact words are gone from context. But they're not gone — they're in the database. The agent just can't see them.
The data already exists. This plugin removes the blindfold.
"We already fixed this."
recall({ query: "ECONNREFUSED retry", scope: "session" })
Agent finds its own solution from 2 hours ago. Doesn't re-derive it.
"It was in that other project."
recall_sessions({ scope: "global", search: "rate limit" })
recall_get({ sessionID: "...", messageID: "..." })
Finds the implementation from your API project. Reuses it instead of reinventing it.
"What did I originally ask for?"
recall_messages({ limit: 5, role: "user" })
Pulls up exact original requirements after 3 compactions. Checks its own work against what you actually said.
"What was that error?"
recall({ query: "TypeError", type: "tool", scope: "session" })
Gets the full stack trace from a tool output that got pruned. Doesn't re-run the failing command.
"Why did we decide on that approach?"
recall({ query: "chose postgres over", scope: "project", type: "reasoning" })
Recovers the reasoning behind an architectural decision from three sessions ago. Context that no summary captures.
"Find it even with a typo."
recall({ query: "prefiltr", match: "fuzzy", scope: "session" })
Fuzzy search finds prefilter even when the agent misremembers the exact spelling. Results ranked by relevance, not just recency.
"Which sessions touched this topic?"
recall({ query: "rate limiting", scope: "global", match: "smart", group: "session" })
4 sessions across 3 projects, each with hitCount and best representative snippet. One call to discover everywhere a topic came up.
Ranked fuzzy retrieval powered by Fuse.js. Three matching strategies:
| Mode | Behavior | Best for |
|---|---|---|
literal (default) |
Case-insensitive substring match | Exact terms, all scopes |
smart |
Fuzzy ranked search (threshold 0.3) | Uncertain wording, typos, separator differences |
fuzzy |
Looser fuzzy search (threshold 0.5) | Very approximate queries, exploratory search |
recall({ query: "rate limit middleware", match: "smart", scope: "project" })
Smart and fuzzy modes:
- Handle typos —
prefiltrfindsprefilter,ECONNREFUSDfindsECONNREFUSED - Normalize separators —
rate-limitmatchesrateLimitmatchesrate_limit - Rank by relevance — results scored 0–1 with structural boosts for exact phrases, full token coverage, reasoning traces, and recency
- Fall back gracefully — if smart/fuzzy finds nothing, literal search runs automatically
- Time-budget degradation — if ranking takes too long, returns prefilter-ranked results instead of timing out
- Explain mode — add
explain: trueto see scoring breakdowns viamatchReasons
Available across all scopes — "session", "project", and "global".
This is not a memory system. Memory is selective and curated. Recall is raw history retrieval — verbatim, exhaustive, on demand.
If you use a persistent memory system alongside this plugin, recall gives it source material. The agent searches history, follows promising hits with recall_get or recall_context, then stores only durable findings deliberately.
Good memory candidates: user preferences, project decisions, reusable root causes, environment facts, corrections, and approaches that clearly succeeded or failed. Do not store ephemeral session details, one-off commands, transient errors, or implementation minutiae.
opencode plugin opencode-session-recallOr add it to your opencode.json:
To disable cross-project search:
{
"plugin": [["opencode-session-recall", { "global": false }]],
}Five tools, designed around how agents actually navigate conversation history:
The primary tool. Full-text search across session titles, messages, tool outputs, tool-input commands and cwd values, reasoning, and subtask descriptions. Searches globally by default, or narrow to the current project or session.
Use before real work when prior history could change the approach: debugging, unexpected behavior, feature work, architecture or configuration changes, past commands, root causes, decisions, or "what did we do last time?" questions.
Do not call it for every request. Skip it for trivial commands, local file/code inspection, simple edits with complete context, and questions that do not benefit from prior conversations. Use code search for the current codebase.
recall({ query: "authentication", scope: "project" })
recall({ query: "error", type: "tool", scope: "session" })
recall({ query: "JWT", sessionID: "ses_from_another_project" })
recall({ query: "rate limit", match: "smart", scope: "session", group: "session" })
recall({ query: "prefiltr", match: "fuzzy", scope: "session", explain: true })
recall({ query: "unauthorized", expand: "context", window: 1 })
recall({ query: "auth failure", expand: "context", window: "auto" })
recall({ query: "migration", last: "7d", directory: "/workspace/project" })
recall({ query: "release notes", from: "30d ago", to: "now" })
recall({ query: "legacy config", before: "2026-01-01" })
recall({ query: "deploy", directory: "/workspace/project", fallback: true })
recall({ query: "npm test", type: "tool", toolName: "bash" })
First call guidance: omit sessions unless you need a hard scan cap; the default scans all eligible sessions subject to configured and provider limits. Use match: "smart" for topic discovery, naming variants, and likely typos. Use group: "session" for broad discovery. Add expand: "context" or expand: "message" when you already know you need evidence from the top hit. Reserve literal matching for exact errors, commands, function names, or file paths.
| Param | Default | Description |
|---|---|---|
query |
required | Text to search for |
scope |
"global" |
"session", "project", or "global" |
match |
"literal" |
"literal", "smart", or "fuzzy" |
explain |
false |
Include scoring metadata in results |
sessionID |
— | Target a specific session (overrides scope); blank values are ignored |
type |
"all" |
"text", "tool", "reasoning", or "all" |
role |
"all" |
"user", "assistant", or "all" |
before/after |
— | Time bounds as ms epoch or dates like "2026-01-01"; nonpositive numbers are ignored |
last |
— | Recent-history lower bound like "2h", "7d", or "3w" |
from/to |
— | Explicit time bounds like "365d ago" and "now" |
since/until |
— | Compatibility relative filters; prefer last, from/to, or before/after in new calls |
directory |
— | Filter sessions by exact directory or descendant path |
fallback |
false |
With directory, fill remaining results from same project/worktree and then global history |
toolName |
— | Exact tool-name filter; only valid with type: "all" or type: "tool" |
expand |
"none" |
"none", "context", or "message"; inlines bounded evidence from top results |
expandResults |
1 |
Number of results to expand; oversized values are clamped to the safe maximum |
window |
3 |
Messages on each side for expand: "context", or "auto"; oversized values are clamped to plugin maxWindow |
expandBudgetMessages |
30 |
Total context messages to inline across expanded results |
expandBudgetChars |
30000 |
Total expanded text budget |
width |
200 |
Snippet size (50–1000 chars) |
sessions |
all eligible | Optional max sessions to scan, capped by plugin maxSessions |
title |
— | Filter by session title substring; blank values are ignored |
group |
"part" |
"part" or "session" — when "session", collapses results by session (one entry per session with the best-scoring or most-recent hit as representative, plus hitCount) |
results |
10 |
Max results to return |
Blank optional filters are ignored. Malformed optional time filters are ignored or normalized with warnings when safe. toolName is exact; if unsure, run a broad type: "tool" search first and inspect returned toolName values. Combining time bounds is allowed only when the resulting time window is valid.
Result entries can include these fields. Ranked-mode fields are present for smart/fuzzy matches; evidence fields explain why any result matched.
| Field | Description |
|---|---|
score |
Relevance score (0–1, higher is better) |
matchMode |
Which strategy produced this result |
matchedTerms |
Query tokens found in the candidate |
matchReasons |
Scoring breakdown (only when explain: true) |
hitCount |
Number of part-level hits in this session (only when group: "session") |
source |
"message", "title", "tool", or "reasoning" |
why |
Compact match explanation: matched fields, terms, confidence, recency |
directoryRelevance |
"exact", "project", "global", or "unknown" |
titleMatch |
Title evidence when a session title matched |
Response metadata:
| Field | Description |
|---|---|
loadErrorCount |
Number of scanned sessions that failed to load; omitted when zero |
loadErrors |
Sample load failures; use this to distinguish load failures from zero matches |
matchMode |
"smart", "fuzzy", or "literal" (if fell back) |
degradeKind |
"none", "time", "budget", or "fallback" |
group |
"part" or "session" — echoes back the grouping applied |
expanded |
Bounded context or message entries when expand is not "none" |
warnings |
Safe downgrades, clamped options, expansion caps, fallback broadening |
suggestions |
Actionable next steps for empty or weak searches |
coverage |
Sessions/messages/parts searched, skipped reasons, and limiting factors |
nearMisses |
Closest searched sessions for empty results when cheap to report |
Expanded entries inline full message structure, but large text/output/error fields are truncated with a [truncated by recall expansion] marker to keep recall responses bounded. If context expansion exceeds message or text budgets, recall returns the base hits plus as much expansion as fits and reports the cap in warnings; it does not hard-fail a successful base search because expansion was too large.
Get the full content of a specific message, including all parts. Tool outputs are returned in their original form, even if they were pruned from context. Use after recall finds something interesting.
recall_get({ sessionID: "ses_abc", messageID: "msg_def" })
Get a window of messages around a specific message. After recall finds a match, see what was asked before it and what happened after. Supports symmetric and asymmetric windows.
recall_context({ sessionID: "ses_abc", messageID: "msg_def", window: 3 })
recall_context({ sessionID: "ses_abc", messageID: "msg_def", before: 1, after: 5 })
Returns hasMoreBefore/hasMoreAfter so the agent knows if it's at a boundary.
Paginated message browsing. Walk through a session chronologically, read the beginning, check the most recent messages, or filter by role. Also supports content filtering to combine search and pagination.
recall_messages({ limit: 5, role: "user", reverse: true })
recall_messages({ sessionID: "ses_abc", offset: 10, limit: 10 })
recall_messages({ query: "npm", role: "user", reverse: true })
Defaults to the current session. Pagination metadata includes total, hasMore, and offset.
Blank sessionID or query values are treated as omitted.
List sessions by title. Use this for lightweight recent-session browsing or recency checks. For topical discovery, prefer recall; it searches titles and content together and labels title-only hits.
recall_sessions({ scope: "project", search: "auth" })
recall_sessions({ scope: "global", search: "deployment" })
Blank search values are treated as omitted.
| Option | Type | Default | Description |
|---|---|---|---|
primary |
boolean |
true |
Register tools as primary (available to all agents) |
global |
boolean |
true |
Allow cross-project search via scope: "global" |
Advanced limits (all have sensible defaults):
| Option | Default | Description |
|---|---|---|
concurrency |
3 |
Parallel session loads |
maxSessions |
unlimited | Hard max sessions per search; caps recall.sessions and directory-filter broad listing |
maxResults |
50 |
Max results per search |
maxSessionList |
100 |
Max sessions in listing |
maxMessages |
50 |
Max messages per browse |
maxWindow |
10 |
Max context window size |
defaultWidth |
200 |
Default snippet width |
When OpenCode compacts a session, it doesn't delete anything. Tool outputs get a compacted timestamp and are replaced with placeholder text in the LLM's context — but the original data stays in the database. Messages before a compaction boundary are skipped when building the LLM context — but they're still there.
This plugin reads all of it through the OpenCode SDK:
- No direct database queries, no separate storage
- Zero setup — no embeddings to generate, no indexes to build, no data to sync
- Eligible sessions scanned newest-first with bounded concurrency;
maxSessionsis the hard safety cap - Respects abort signals for long-running searches
- Cross-project search enabled by default (disable with
global: false) - Smart and fuzzy ranking works across all scopes — session, project, and global
When match is "smart" or "fuzzy", the search goes through a multi-stage ranking pipeline:
- Candidate construction — Messages are scanned newest-first. Session titles and each part's searchable text are extracted and tokenized. Per-session and global budgets cap the candidate pool.
- Prefiltering — Cheap lexical gate using exact substring, quoted phrase, token overlap, and bounded edit-distance (Levenshtein ≤ 1 for tokens ≥ 4 chars). Only candidates with at least one match survive.
- Normalization — Surviving candidates get full stage-2 normalization (camelCase splitting, separator normalization, whitespace collapse) for Fuse.js field matching.
- Fuse.js ranking — Weighted search across primary text (0.65), project directory (0.20), session title (0.10), and tool name (0.05). Returns all matches above the mode threshold.
- Structural re-ranking — Fuse scores are adjusted with deterministic boosts (exact phrase, full token coverage, reasoning traces, error text, user role, recency) and penalties (weak single-token fuzzy, poor coverage).
- Snippet selection — Token-density sliding window finds the most relevant excerpt from the raw text.
The entire pipeline runs within a 2-second post-fetch time budget. If the pre-Fuse stage alone exceeds 1.5 seconds, Fuse.js is skipped and prefilter-ranked results are returned with degradeKind: "time". If the full pipeline completes but exceeds the total budget, Fuse-ranked results are still returned but marked as time-degraded.
See CONTRIBUTING.md for architecture details, module guide, and development setup.
MIT
{ "plugin": ["opencode-session-recall"], }