MindBank

Persistent graph memory for AI agents.

Hybrid search · Temporal versioning · Local embeddings · Per-project isolation

⚡ One-Command Install

curl -sSL https://raw.githubusercontent.com/spfcraze/MindBank/main/scripts/setup.sh | bash

Dashboard: http://localhost:8095

Note: The setup script creates a .env file with all required configuration. The API server (mindbank-api) and MCP server (mindbank-mcp) are started together via scripts/start.sh.

What is MindBank?

MindBank is a persistent memory layer that lives between your AI agents and raw conversation history. Instead of losing context every time a session ends, MindBank remembers what matters — decisions, facts, problems, preferences — and surfaces them when relevant.

Think of it as a second brain for AI: structured, searchable, and relationship-aware.

The Problem

Every AI conversation starts from zero. Ask Claude to refactor a service, and next session it has forgotten the design decisions. Ask it about a bug, and the context from three weeks ago is gone. You're re-explaining, re-contextualizing, re-deciding.

The Solution

MindBank remembers:

Decisions → "We chose PostgreSQL over MongoDB for ACID compliance"
Facts → "The API base URL is https://api.internal/v2"
Problems → "Rate limiter fails under >1000 req/s"
Preferences → "User prefers functional React patterns"
Questions → "Should we migrate to gRPC?" (unanswered, tracked)

...and connects them into a graph so your AI can follow relationships: "This decision depends on that problem, which relates to this project."

Why MindBank?

Feature	What you get
🔍 Hybrid Search	Full-text + semantic vector search combined with Reciprocal Rank Fusion. Find "auth bug" even if you wrote "authentication failure"
📁 Temporal Versioning	Never lose history. Every update creates a new version. See what changed, when, and why
🌱 100% Local	Embeddings via Ollama (nomic-embed-text). No API keys, no cloud, no data leaves your machine
🔗 Graph Relationships	Nodes connected by typed edges (`contains`, `relates_to`, `depends_on`, `decided_by`, etc.)
🖼️ Per-Project Isolation	Auto-namespace by working directory. `~/project-a` and `~/project-b` have separate memory graphs
⚡ Wake-Up Context	Pre-computed snapshot of important memories served on session start
🔮 Observer Perspective	Trace causal precursors, detect blind spots, measure knowledge coverage — understand why a decision was made
🤖 MCP Native	Works with Claude Desktop, Claude Code CLI, and Hermes Agent out of the box

Quick Start

Option 1: One-liner (recommended)

curl -sSL https://raw.githubusercontent.com/spfcraze/MindBank/main/scripts/setup.sh | bash

Option 2: Manual

# 1. Clone
git clone https://github.com/spfcraze/MindBank.git ~/mindbank
cd ~/mindbank

# 2. Run the setup script (creates .env, starts Postgres, builds binaries)
bash scripts/setup.sh

# 3. Start Ollama + pull model
curl -fsSL https://ollama.ai/install.sh | sh
ollama pull nomic-embed-text

# 4. Start MindBank (API + MCP)
bash scripts/start.sh

# 5. Verify
curl http://localhost:8095/api/v1/health
curl http://localhost:8096/mcp
# → {"status":"ok","postgres":"connected","ollama":"connected"}

Open http://localhost:8095 for the dashboard.

Dashboard

The web UI gives you:

Dashboard — Stats, namespace breakdown, activity feed, node creation, search
Questions — Unanswered queries tracked for future reference
Edges — Browse and filter graph relationships
Tools — Auto-connect, clean orphans, import/export, batch operations, data quality analyzer
Graph 2D — Interactive force-directed visualization
Brain 3D — Immersive 3D graph exploration
Observer — Causal precursor tracing, blind spot detection, knowledge coverage analysis

Lifecycle Event Capture

MindBank captures detailed session lifecycle events via hooks:

Event Types

Event	Description	Payload
`session_start`	Session begins	`{model, timestamp}`
`user_prompt_submit`	User sends message	`{prompt}`
`pre_tool_use`	Before tool execution	`{tool, command, args}`
`post_tool_use`	After tool execution	`{tool, result, error}`
`stop`	Session ends	`{duration_seconds, exit_code}`

Storage

Events are stored as JSON files in ~/.hermes/sessions/events/<session_id>/. Each event becomes a node in the graph with temporal_next edges linking the sequence.

Visualization

Event nodes appear in Brain3D as colored dots:

Green: session_start
Blue: user_prompt_submit
Amber: pre_tool_use
Teal: post_tool_use
Red: stop

How It Works

Memory Model

Node (type: decision)
├── label: "Use PostgreSQL for primary store"
├── content: "Chose Postgres over MongoDB because..."
├── namespace: "my-project"
├── importance: 0.92
└── version: 3 (2 previous versions preserved)

    Edge (decided_by)
    └─── connects to Node "Load testing results"

    Edge (depends_on)
    └─── connects to Node "ACID requirements"

Search Pipeline

User query
    ↓
[Hybrid Search]
  ├── FTS: tsvector ts_rank_cd (keyword matching)
  └── Vector: pgvector HNSW (semantic similarity)
    ↓
[Reciprocal Rank Fusion]
    ↓
[Graph Boost]
  └── Edges to ranked nodes get boost
    ↓
Results ranked by combined score

Importance Scoring

Nodes are automatically scored by 5 factors:

Factor	Weight	What it measures
Recency	30%	How recently accessed/updated
Frequency	25%	How often referenced
Connectivity	20%	Number of edges (hub nodes)
Explicit	15%	User-set importance
Type	10%	Decisions and problems weigh more

🔮 Observer Perspective: Causal Intelligence

MindBank doesn't just store memories — it understands their causal structure. The Observer Perspective system treats every memory as an observable event and traces backward to find its domain of dependence: the set of precursors that had to exist for this memory to be possible.

Why This Matters

Traditional search gives you what you know. The Observer Perspective tells you why you know it:

"We chose PostgreSQL" → Because load testing showed MongoDB failed at 10k req/s
"Use connection pooling" → Because we learned from an outage where connections exhausted
"JWT for auth" → Depends on the decision to go stateless, which depends on scaling requirements

Without this, AI agents repeat decisions without understanding their rationale. With it, they inherit the full reasoning chain.

Core Concepts

Concept	Definition
Domain of Dependence	All causal precursors of a node — everything that had to happen for this memory to exist
Critical Depth	The shallowest depth at which 90% of total influence is captured. If critical depth is 2, you only need to look 2 hops back to understand almost everything that matters
Coverage	Fraction of a node's immediate edges that have upstream precursors. 100% = fully explained; low = orphan decisions
Influence Modes	Ranked list of precursors by influence score (weighted by edge strength and decayed by depth)
Blind Spots	Automatically detected gaps: unresolved contradictions, missing supporting evidence, orphan decisions

How It Works

User asks: "Why did we choose PostgreSQL?"
    ↓
[Hybrid Search] finds node: "Use PostgreSQL for primary store"
    ↓
[Dependence Trace] backward BFS through causal edges:
  depth 1: "Load testing results" (decided_by)
  depth 1: "ACID requirements" (depends_on)
  depth 2: "Financial data integrity" (depends_on)
  depth 2: "MongoDB failure at 10k req/s" (learned_from)
    ↓
[Analysis]
  Critical Depth: 2 (90% of influence within 2 hops)
  Coverage: 100% (all immediate edges have precursors)
  Blind Spots: none
  Influence Modes:
    1. "Load testing results" (score: 0.85, depth: 1)
    2. "ACID requirements" (score: 0.72, depth: 1)
    3. "MongoDB failure at 10k req/s" (score: 0.51, depth: 2)

Dependence-Aware Search & Q&A

Both search and Q&A support opt-in dependence expansion:

// Search with causal context
{"name": "mindbank_search", "arguments": {
  "query": "database choice",
  "dependence_expansion": true
}}

// Ask with supporting evidence
{"name": "mindbank_ask", "arguments": {
  "query": "why postgres over mongodb",
  "dependence_expansion": true
}}

When enabled, MindBank:

Runs hybrid search to find the most relevant node
Traces backward up to 2 hops through causal edges
Appends the top precursors (up to limit/4) to the result set
Returns both the answer and its supporting evidence chain

This is disabled by default for backward compatibility. Agents must explicitly opt-in.

Direct Dependence Trace

For deep causal analysis, use the dedicated dependence tool:

{"name": "mindbank_dependence", "arguments": {
  "query": "database configuration",
  "max_depth": 3
}}

Returns:

Critical Depth: How far back you need to go to capture 90% of influence
Coverage %: How well-explained the seed node is
Influence Modes: Ranked precursors with scores and depths
Blind Spots: Detected gaps (unresolved contradictions, missing evidence)
Graph Stats: Node and edge counts in the dependence subgraph

Edge Types Used for Causal Tracing

The dependence system follows these edge types backward:

Edge Type	Direction	Meaning
`contains`	A → B	A contains/is about B
`relates_to`	A ↔ B	A is related to B (symmetric)
`depends_on`	A → B	A depends on B (B is prerequisite)
`learned_from`	A → B	A was learned from experience B
`decided_by`	A → B	Decision A was informed by B
`produced`	A → B	A produced outcome B
`supports`	A → B	A supports/evidences B
`contradicts`	A → B	A contradicts B (triggers blind spot detection)
`tested_by`	A → B	A was empirically validated by B
`invalidated_by`	A → B	A was disproven/invalidated by B
`derived_from`	A → B	A was derived from B
`assumed`	A → B	A assumes B as a premise
`superseded_by`	A → B	A was replaced/superseded by B
`refined_by`	A → B	A was refined/improved by B
`merged_into`	A → B	A was merged into B
`created_by`	A → B	A was created by agent B
`reviewed_by`	A → B	A was reviewed by agent B
`executed_by`	A → B	A was executed by agent B
`failed_due_to`	A → B	A failed because of B
`incompatible_with`	A → B	A is incompatible with B
`precondition_for`	A → B	A is a precondition for B

Connect Your AI Agent

Hermes Agent (Recommended)

MindBank connects to Hermes via MCP (Model Context Protocol) over HTTP:

# 1. Start MindBank (API + MCP)
cd ~/mindbank && bash scripts/start.sh

# 2. Hermes auto-detects the MCP server at http://localhost:8096/mcp
#    (configured in ~/.hermes/config.yaml)

Claude Desktop / Claude Code CLI

bash scripts/install-plugin.sh

Detects and configures:

Claude Desktop → ~/.config/claude/claude_desktop_config.json
Claude Code CLI → ~/.claude/mcp.json

Or use flags:

bash scripts/install-plugin.sh --all
bash scripts/install-plugin.sh --claude-desktop --hermes

MCP Tools Available to Agents

Tool	Description
`mindbank_store`	Save facts, decisions, questions, preferences
`mindbank_search`	Hybrid FTS + semantic search (opt-in causal precursors)
`mindbank_ask`	Natural language query → structured context (opt-in supporting evidence)
`mindbank_snapshot`	Get wake-up context on session start
`mindbank_neighbors`	Graph traversal (connected nodes)
`mindbank_dependence`	Trace causal precursors — understand why a decision exists

Dependence Expansion: Both search and ask support an optional dependence_expansion flag. When enabled, MindBank traces backward from the top result through depends_on, learned_from, decided_by, produced, and supports edges to surface the supporting evidence that led to a decision or fact. This gives your AI agent the full causal chain, not just the conclusion.

Unified Mining Pipeline

MindBank automatically mines knowledge from your Hermes sessions and .md cron logs:

# Run full pipeline (called automatically by update.sh)
python3 scripts/unified_scheduler.py

# Or run individual miners:
python3 scripts/md_miner.py --dry-run        # Preview .md log mining
python3 scripts/session_miner.py --mine-all  # Mine sessions + MEMORY.md
python3 scripts/node_fixer.py --dry-run    # Preview node repairs

.md Log Mining

Hermes cron jobs generate .md logs at ~/.hermes/cron/output/. The md_miner.py extracts only the ## Response section, ignoring injected skill dumps. Each response is classified (decision/fact/problem/advice/preference/project) and stored as a MindBank node with skill provenance metadata.

Skill Query

Query nodes by the skill that produced them:

curl "http://127.0.0.1:8095/api/v1/nodes?skill=gap-analysis&limit=5"

API Overview

Nodes

# Create
POST /api/v1/nodes
{"label":"API Rate Limit","type":"problem","content":"...","namespace":"my-project"}

# List (with filters)
GET /api/v1/nodes?namespace=my-project&type=decision&limit=50

# List by skill (skill provenance)
GET /api/v1/nodes?skill=gap-analysis&limit=5

# Get (bumps access count)
GET /api/v1/nodes/{id}

# Update (creates temporal version)
PUT /api/v1/nodes/{id}

# History
GET /api/v1/nodes/{id}/history

# Neighbors (graph traversal)
GET /api/v1/nodes/{id}/neighbors?depth=2

Search

# Full-text
GET /api/v1/search?q=authentication&limit=10

# Semantic
POST /api/v1/search/semantic
{"query":"how do we handle auth","limit":10}

# Hybrid (best of both)
POST /api/v1/search/hybrid
{"query":"rate limiter bug","limit":10}

# Dependence trace
POST /api/v1/analyze/dependence
{"query":"database choice","max_depth":3}

Snapshots

# Get wake-up context
GET /api/v1/snapshot

# Rebuild
POST /api/v1/snapshot/rebuild

Architecture

┌─────────────────────────────────────────────────────────────────────┐
│                    MindBank Architecture                     │
└─────────────────────────────────────────────────────────────────────┘

  💻 Dashboard (static/)          🤖 MCP Server (stdio)
         │                              │
         └──────────────────────────────────┘
                        │
              🔗 Go HTTP API (chi router)
                        │
       ┌──────────────────────────────────────────┐
       │                                              │
   💾 PostgreSQL 16+                          🔬 Ollama
   • pgvector (HNSW)                          • nomic-embed-text:v1.5
   • tsvector (FTS)                           • 768 dims
   • Temporal tables                          • Local, offline
   • Recursive CTEs                           • 4 concurrent semaphores
   • Dependence analysis (BFS)                • Embedding client

Key Design Decisions

Temporal versioning — valid_from/valid_to columns, never DELETE. Full audit trail.
Dual history path — Fast latest view + complete version chain.
Semaphore-bounded embeddings — Max 4 concurrent Ollama requests to prevent overload.
Typed errors — BUSY (retry), UNAVAILABLE (wait), BAD_QUERY (don't retry).
Rate limiting — 100 req/min per IP with chi middleware.
Observer Perspective — Recursive CTE BFS for causal precursor tracing, influence scoring with depth decay, and blind spot detection.

Configuration

Variable	Default	Description
`MB_PORT`	`8095`	HTTP server port
`MB_DB_DSN`	`postgres://mindbank:mindbank@localhost:5434/mindbank?sslmode=disable`	PostgreSQL connection string
`MB_POSTGRES_PASSWORD`	`mindbank`	Postgres password (used by docker-compose)
`MB_OLLAMA_URL`	`http://localhost:11434`	Ollama API endpoint
`MB_EMBED_MODEL`	`nomic-embed-text`	Embedding model
`MB_LOG_LEVEL`	`info`	debug / info / warn / error
`MB_API_KEY`	(none)	Require API key for all endpoints
`MCP_HTTP_PORT`	`8096`	MCP server HTTP port
`MCP_TRANSPORT`	(stdio)	Set to `http` for HTTP mode

Development

# Build
make build        # API server
make build-mcp    # MCP server

# Run
make run          # Build + start Postgres + start API
make stop         # Stop everything

# Quality
make test         # Run Go tests
make vet          # Run go vet
make health       # Quick health check

# Update
make update       # Check GitHub + auto-update

Node Types

MindBank natively understands these memory types:

Type	Use for	Example
`decision`	Architecture choices, tech stack	"Use Go for the API"
`fact`	Technical facts, config values	"Redis runs on port 6379"
`problem`	Known issues, bugs, limitations	"Webhook delivery is unreliable"
`preference`	Style guides, user preferences	"Prefer table-driven tests"
`advice`	Recommendations, best practices	"Use connection pooling"
`project`	Top-level project containers	"MindBank v2 refactor"
`question`	Unanswered queries (tracked)	"Should we add caching?"
`concept`	Domain concepts, definitions	"Idempotency"
`topic`	Thematic groupings	"Authentication"
`person`	Team members, stakeholders	"Sarah owns the frontend"
`event`	Milestones, incidents	"v1.0 launch"

License

MIT — free for personal and commercial use.

Built with ❤️ for AI agents that deserve to remember.

Report Bug · Request Feature · Releases

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
bin		bin
brain3d-physics		brain3d-physics
cmd		cmd
docs		docs
internal		internal
migrations		migrations
plugins/memory/mindbank		plugins/memory/mindbank
scripts		scripts
tests/integration		tests/integration
web/dashboard		web/dashboard
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
DISTRIBUTION_README.md		DISTRIBUTION_README.md
LICENSE		LICENSE
Makefile		Makefile
QUICKSTART.md		QUICKSTART.md
README.md		README.md
README_GITHUB.md		README_GITHUB.md
RELEASE_NOTES.md		RELEASE_NOTES.md
RELEASE_NOTES_v0.5.0.md		RELEASE_NOTES_v0.5.0.md
VERSION		VERSION
docker-compose.yml		docker-compose.yml
go.mod		go.mod
go.sum		go.sum
mindbank		mindbank
mindbank-api		mindbank-api
mindbank-mcp		mindbank-mcp

Folders and files

Latest commit

History

Repository files navigation

MindBank

⚡ One-Command Install

What is MindBank?

The Problem

The Solution

Why MindBank?

Quick Start

Option 1: One-liner (recommended)

Option 2: Manual

Dashboard

Lifecycle Event Capture

Event Types

Storage

Visualization

How It Works

Memory Model

Search Pipeline

Importance Scoring

🔮 Observer Perspective: Causal Intelligence

Why This Matters

Core Concepts

How It Works

Dependence-Aware Search & Q&A

Direct Dependence Trace

Edge Types Used for Causal Tracing

Connect Your AI Agent

Hermes Agent (Recommended)

Claude Desktop / Claude Code CLI

MCP Tools Available to Agents

Unified Mining Pipeline

.md Log Mining

Skill Query

API Overview

Nodes

Search

Snapshots

Architecture

Key Design Decisions

Configuration

Development

Node Types

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 34

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages