Summary
The index file codedb.snapshot is written into the indexed project's root directory, where it shows up in git status and can be accidentally committed. It is large - in one real project the root codedb.snapshot is 22.8 MB.
Evidence
- A
codedb.snapshot (22.8 MB) sits at the root of an actively-developed repo after normal codedb use.
- In an automated harness, a plain
git add -A && git diff swept codedb.snapshot into the captured patch (it began with diff --git a/codedb.snapshot ... Binary files differ), corrupting the diff. Workaround required adding it to .git/info/exclude per checkout.
Impact
- Pollutes
git status; easy to commit a multi-MB binary by accident.
- Breaks any tooling that diffs/snapshots the working tree.
Suggested direction
- Store the index outside the working tree (e.g. an XDG cache dir keyed by repo path), or
- Write it to a
.codedb/ directory and auto-append codedb.snapshot/.codedb/ to the repo's .gitignore (or .git/info/exclude) on first index.
Found via an independent SWE-bench Lite token-efficiency benchmark: identical agent (`claude -p`, Sonnet 4.6) and tasks, only the tool surface differs - native Read/Grep/Edit vs codedb MCP tools. N=51 paired instances; full harness + data available.
Summary
The index file
codedb.snapshotis written into the indexed project's root directory, where it shows up ingit statusand can be accidentally committed. It is large - in one real project the rootcodedb.snapshotis 22.8 MB.Evidence
codedb.snapshot(22.8 MB) sits at the root of an actively-developed repo after normal codedb use.git add -A && git diffsweptcodedb.snapshotinto the captured patch (it began withdiff --git a/codedb.snapshot ... Binary files differ), corrupting the diff. Workaround required adding it to.git/info/excludeper checkout.Impact
git status; easy to commit a multi-MB binary by accident.Suggested direction
.codedb/directory and auto-appendcodedb.snapshot/.codedb/to the repo's.gitignore(or.git/info/exclude) on first index.Found via an independent SWE-bench Lite token-efficiency benchmark: identical agent (`claude -p`, Sonnet 4.6) and tasks, only the tool surface differs - native Read/Grep/Edit vs codedb MCP tools. N=51 paired instances; full harness + data available.