Eidetic

Long-term memory for Claude Code that knows when memories go bad.

Session 1:  "Never mock the database in tests"
Session 2:  *mocks the database*
Session 50: *still mocking the database*

That's the Day 1 problem — Claude forgets everything between sessions. Every memory tool solves this.

But there's a harder problem nobody talks about:

Session 35: "Use validate_key() for all API checks"
            *validate_key() works great, gets saved to memory*

Session 50: *validate_key() was renamed to check_auth() two weeks ago*
            *Claude confidently injects the old rule*
            *Agent gets WORSE, not better, from its own memory*

That's the Day 60 problem — after 500+ memory files, stale knowledge actively hurts the agent. More memory = worse performance. No existing tool detects this.

Eidetic solves both.

git clone https://github.com/LARIkoz/eidetic.git && cd eidetic && bash install.sh

One command. Zero external dependencies for core. Works immediately.

What It Does

Problem	How Eidetic solves it
MEMORY.md caps at 200 lines (only a fraction visible)	Smart compression: all 160 rules in same token budget
Keyword search only	Hybrid FTS5 + vector search (e5-large, ~100 languages)
Forgets between sessions	Auto-extracts decisions, failures, patterns at session end
Knowledge piles up as duplicate files	Compounds — updates existing memories instead of creating new ones
Stale memories poison the agent	Drift detection — flags broken wikilinks, age staleness, confidence escalation
Agent-created memories reinforce hallucinations	Self-referential discount — agent-extracted = 0.5x weight
All memories treated equally	Evidence tiers — validated > observed > hypothesis
Can't search code	Tree-sitter parses functions/classes into searchable chunks

How It Works

                SESSION START (~350ms warm)
                        |
                Reindex (FTS5) + Code Index (tree-sitter) + Vector Embed
                        |
                Assemble Context (160 rules + project + recent)
                        |
                Write to ~/.claude/rules/ (auto-loaded, no size cap)
                        |
                Agent starts KNOWING everything


                     MID-SESSION
                        |
                /memory-recall "query"  or  MCP memory_search
                        |
                FTS5 + e5 vector (forced for non-English) -> RRF merge
                        |
                Calibrated confidence (two-signal gate) + drift warnings


                     SESSION END (~5s, async)
                        |
                Extract signals (Haiku primary, Codex fallback)
                        |
                Compound: update existing memory OR create new
                        |
                Tag: agent-extracted = 0.5x weight

Compound Ranking

Every result is ranked by:

score = relevance x evidence x source x freshness x status

evidence:   validated = 1.0    observed = 0.7    hypothesis = 0.4
source:     user-created = 1.0  agent-extracted = 0.5  system = 0.3
freshness:  < 30 days = 1.0    > 30 days = 0.5    (a drift finding overrides: stale 0.5x, broken link 0.8x)
status:     current = 1.0      resolved/fixed = 0.75   superseded/deprecated = 0.35   archived = 0.25

Keyword hits also carry a match-quality factor. A validated, recent, current, human-created memory always outranks an old, unverified, agent-extracted, or superseded guess.

Install

git clone https://github.com/LARIkoz/eidetic.git
cd eidetic
bash install.sh

Requirements: bash, python3, sqlite3 (pre-installed on macOS/Linux).

Optional upgrades:

pip install fastembed                    # semantic search (e5-large ONNX, ~2.2GB model)
pip install tree-sitter tree-sitter-python tree-sitter-javascript tree-sitter-bash  # code search

Core works without pip installs — degrades to FTS5-only search and skips code indexing. Rollback: bash ~/.claude/memory-system/bin/rollback.sh

Updates

Background update check every 6 hours. When available:

Eidetic update available (a1b2c3d). Run: bash ~/.claude/memory-system/bin/update.sh

Updates preserve databases, rules, and hooks — only code files are replaced.

MCP Server

Works with Cursor, Windsurf, Cline, and any MCP-compatible agent:

{
  "mcpServers": {
    "eidetic": {
      "command": "python3",
      "args": ["~/.claude/memory-system/mcp_server.py"]
    }
  }
}

7 tools: memory_search, memory_search_detail, memory_serendipity, memory_health, memory_reindex, memory_lint, export_vault.

Key Features

Drift Detection (v2.5)

The feature that makes Eidetic different. Three checks, 24h throttle, zero file mutations:

Check	What it catches	Threshold
Wikilink drift	`[[validate-key]]` referenced but file renamed/deleted	Immediate
Age staleness	Project memory untouched for 30+ days	30d project, 60d status, 90d reference
Confidence escalation	3+ agent-extracted updates, 0 human confirmation	3 events

Drift findings penalize ranking: broken wikilink = 0.8x, stale = 0.5x, confidence escalation = 0.3x. Auto-resolve when the problem disappears.

Smart Token Compression (v1.3)

160 feedback rules in ~6,800 tokens. Previously only ~57 fit in MEMORY.md. Keyword clustering groups related rules; tiered display shows important rules in full, low-priority as name only.

Hybrid Search (v2.0, v5.1)

FTS5 for keywords (~50ms). Vector search (multilingual-e5-large, 1024-dim) as fallback for semantic queries. Cross-language by design — Russian queries find English rules. v5.1 replaced the old MiniLM-384 embedder with e5-large: RU-paraphrase recall@3 went 25% → 67% (measured). Results merged via Reciprocal Rank Fusion. If every candidate is weak, reports No confident results instead of surfacing noise.

Two-signal confidence gate (v5.1). e5 compresses scores, so a true cross-lingual match (~0.83 cosine) is indistinguishable from topical noise (~0.83) by cosine alone. A vector-only hit reaches actionable confidence only with lexical corroboration (shared query anchors) — high recall, no false confidence. A model/dim stamp on the vector store also guards against silent embedder drift.

Progressive search keeps broad queries compact. Use --detail <id> to fetch full content when a candidate looks relevant.

Code-Aware Search (v2.2)

Tree-sitter parses .py, .js, .ts, .tsx, .sh — every function and class becomes searchable by name or purpose.

Knowledge Compounding

Before creating a new memory, searches for existing ones on the same topic. Found? Updates it, adds history. Not found? Creates new file. 50 sessions = 50 refined rules, not 500 duplicate files.

Obsidian Vault Export (v4.0)

eidetic export-vault ~/my-vault/

Quality gate filters 640+ memory files down to a validated subset. Template formatting, verified wikilinks, auto-MOC, graph colors. Optional --polish for LLM-rewritten human-readable cards.

Performance

Metric	Value
Session start (warm)	~350ms
Session start (cold)	~15s (e5-large ONNX load)
FTS reindex (1083 files)	~0.3s
Full vector embed (7.8K)	~1h one-time (e5 CPU); incremental = seconds
FTS5 search	~50ms
Vector query (e5)	~32ms
Signal extraction	~$0.002/session (Haiku)
Index size	31MB (FTS5) + 35MB (vectors, 1024-dim)
External dependencies	zero for core (e5 model optional)

Comparison

What only Eidetic does

Based on 40-repo competitive analysis (May 2026):

Feature	Why it matters
Drift detection	Catches stale memories before they poison the agent
Compounding	Updates existing knowledge instead of creating duplicates
Self-referential discount	Agent guesses can't reinforce themselves into "facts"
Evidence tiers	Proven rules always outrank unverified guesses
Code search	"Where is the rate-limit handler?" actually works
Zero-dep core	No Docker, no npm, no pip for basic usage

When to use what

Your situation	Best choice
Want Claude Code to remember AND detect stale knowledge	Eidetic
Want largest community, web UI, multi-agent	claude-mem (large community project)
Need shared memory across Cursor + Claude + Copilot	engram
Already using Obsidian, want simple integration	lucasrosati's setup
Need heavy multilingual semantic search	memsearch

Design Philosophy

Inspired by Luhmann's Zettelkasten, Tiago Forte's Second Brain, and Karpathy's AI wiki concept.

Core principles:

Files are truth. Indexes are derived. If the DB dies, rebuild from markdown in <1s.
Every memory must be findable without knowing where it is. Cross-project search is the default.
Critical rules must never become invisible. The system guarantees all behavioral rules reach the agent.
The system maintains itself — or it dies. The AI agent does all indexing, linking, searching, compounding. Human curates when they want to, not because they have to.

Safety

Atomic writes — tempfile + os.replace(), crash-safe
Backup/restore — auto-backup before reindex, auto-restore on failure
Lock serialization — fcntl lock file via bin/lock_runner.py
Anti-injection — prefix-validated signal extraction blocks transcript noise from becoming memory
Graceful degradation — missing index falls back to head -200 MEMORY.md
Rollback — one command, <5 seconds

Roadmap

Shipped: v1.0 FTS5 + signals + compounding, v1.3 token compression (2.17x), v2.0 hybrid search, v2.2 code search (tree-sitter), v2.5 drift detection, v4.0-4.2 Obsidian vault export + LLM polish, v4.3 lifecycle signals, v5.0 progressive search, v5.0.1 lifecycle Phase B, v5.1 e5-large embedder + two-signal precision gate + model-drift guard (RU recall@3 25% → 67%).

Next: distribution (pip package, docs polish).

Planned: v6 — truth-maintenance: supersession + contradiction detection as a typed-edge graph — memory that resolves its own contradictions and doesn't rot. Plus session-transcript search and a cross-encoder reranker.

Full version history: CHANGELOG.md

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 116 Commits
.github		.github
bin		bin
examples/demo-vault		examples/demo-vault
hooks		hooks
skill		skill
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh
llms.txt		llms.txt
mcp_server.py		mcp_server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Eidetic

What It Does

How It Works

Compound Ranking

Install

Updates

MCP Server

Key Features

Drift Detection (v2.5)

Smart Token Compression (v1.3)

Hybrid Search (v2.0, v5.1)

Code-Aware Search (v2.2)

Knowledge Compounding

Obsidian Vault Export (v4.0)

Performance

Comparison

What only Eidetic does

When to use what

Design Philosophy

Safety

Roadmap

License

About

Uh oh!

Releases 5

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Eidetic

What It Does

How It Works

Compound Ranking

Install

Updates

MCP Server

Key Features

Drift Detection (v2.5)

Smart Token Compression (v1.3)

Hybrid Search (v2.0, v5.1)

Code-Aware Search (v2.2)

Knowledge Compounding

Obsidian Vault Export (v4.0)

Performance

Comparison

What only Eidetic does

When to use what

Design Philosophy

Safety

Roadmap

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages