estado	Completed

Sync & Indexing

backscroll sync reads Claude Code session files, extracts the conversation, strips noise, and indexes everything into a local SQLite database for fast full-text search.

CLI Usage

backscroll sync --path ~/.claude/sessions              # Index all sessions
backscroll sync --path ~/.claude/sessions --include-agents  # Include subagent sessions

Flags

Flag	Description
`--path <DIR>`	Directory containing session files (default: configured `session_dir`)
`--include-agents`	Include subagent sessions (excluded by default)

Session File Format

Claude Code stores each conversation as a JSONL file — one JSON record per line. Each record has a type field (user, assistant, summary, etc.) and a message object containing the actual content.

Backscroll extracts only user and assistant records. Everything else — summaries, metadata, tool calls, tool results — is skipped.
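
As an illustration, the extraction step can be sketched in Python. This is a minimal sketch, not Backscroll's actual code: the function name `extract_messages` is hypothetical, and the shape of `message.content` (either a plain string or a list of blocks with `type` and `text` keys) is an assumption that the description above does not confirm.

```python
import json
from typing import Iterable, Iterator, Tuple

KEPT_TYPES = {"user", "assistant"}  # every other record type is skipped


def extract_messages(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (role, text) pairs for user and assistant records in JSONL lines."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        record = json.loads(line)  # a malformed line raises json.JSONDecodeError
        if not isinstance(record, dict) or record.get("type") not in KEPT_TYPES:
            continue
        message = record.get("message")
        content = message.get("content", "") if isinstance(message, dict) else ""
        if isinstance(content, list):
            # Assumed block shape: keep text blocks, drop tool calls and results.
            content = "\n".join(
                block.get("text", "")
                for block in content
                if isinstance(block, dict) and block.get("type") == "text"
            )
        if isinstance(content, str) and content:
            yield record["type"], content
```

Records of type `summary`, and any non-text blocks inside a kept record, simply never reach the output.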

Noise Filtering

Raw session messages contain machine-generated content injected by the system. Backscroll strips all of this before indexing, producing a clean corpus of human ↔ assistant dialogue.

Filtered patterns:

Pattern	Description
`<system-reminder>...</system-reminder>`	Context injected by the system
`<task-notification>...</task-notification>`	Background task status updates
`<caveat>...</caveat>`	Local command caveats
`<local-command-stdout>...</local-command-stdout>`	Hook and command output
`<command-name>...</command-name>`	Command metadata tags
`<command-message>...</command-message>`	Command message tags
`<command-args>...</command-args>`	Command argument tags
`Caveat: ...` (line prefix)	Caveat prefix lines
`Base directory: ...` (line prefix)	Base directory lines
`Request interrupted`	Partial responses (entire message dropped)

After filtering, if a message is empty, it is discarded entirely.
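
The filtering rules above could be approximated with a few regular expressions. The sketch below is illustrative only: `clean_message` is a hypothetical name, and treating `Request interrupted` as a substring match anywhere in the message is an assumption about how that rule is applied.

```python
import re
from typing import Optional

# Paired tags listed in the table above.
NOISE_TAGS = [
    "system-reminder", "task-notification", "caveat", "local-command-stdout",
    "command-name", "command-message", "command-args",
]
TAG_RE = re.compile(
    r"<(%s)>.*?</\1>" % "|".join(re.escape(tag) for tag in NOISE_TAGS),
    re.DOTALL,  # tag bodies may span several lines
)
# Whole lines that start with one of the listed prefixes.
LINE_PREFIX_RE = re.compile(r"^(?:Caveat:|Base directory:).*$\n?", re.MULTILINE)


def clean_message(text: str) -> Optional[str]:
    """Return the message with noise removed, or None if it should be dropped."""
    if "Request interrupted" in text:  # assumption: substring match
        return None
    text = TAG_RE.sub("", text)
    text = LINE_PREFIX_RE.sub("", text)
    text = text.strip()
    return text or None  # messages left empty after filtering are discarded
```

For example, `clean_message("<system-reminder>ctx</system-reminder>Fix the bug")` returns `"Fix the bug"`, while a message consisting only of a `<command-name>` tag returns `None`.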

Incremental Sync

Backscroll computes a SHA-256 hash for each session file and stores it in the database alongside the indexed content. On subsequent syncs, the hash is compared — only files whose content has changed are re-processed.

This makes repeated syncs fast: the first run indexes everything, subsequent runs skip unchanged files.
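
The hash comparison can be sketched as follows. The table name `file_hashes`, its columns, and the function names are hypothetical, chosen only for illustration. Backscroll's real schema is not described here.

```python
import hashlib
import sqlite3
from pathlib import Path


def file_sha256(path: Path) -> str:
    """Hash the file in chunks so large sessions are not loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def record_if_changed(conn: sqlite3.Connection, path: Path) -> bool:
    """Return True (and store the new hash) if the file is new or has changed."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS file_hashes "
        "(path TEXT PRIMARY KEY, sha256 TEXT NOT NULL)"  # hypothetical schema
    )
    new_hash = file_sha256(path)
    row = conn.execute(
        "SELECT sha256 FROM file_hashes WHERE path = ?", (str(path),)
    ).fetchone()
    if row is not None and row[0] == new_hash:
        return False  # unchanged: the caller can skip re-processing
    conn.execute(
        "INSERT OR REPLACE INTO file_hashes (path, sha256) VALUES (?, ?)",
        (str(path), new_hash),
    )
    conn.commit()
    return True
```

A sync loop under these assumptions would call `record_if_changed` for each session file and re-index only those for which it returns `True`.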

Project Detection

Each indexed file is assigned to a project. Resolution order:

1. `sessions-index.json`: Claude maintains a `sessions-index.json` file mapping session UUIDs to project paths. If an entry is found, the project name is taken from the last path component.
2. Directory structure fallback: If no index entry exists, the project name is inferred from the parent directory structure (the directory above `sessions/` or `subagents/`).
3. Default: If neither method resolves, the project is set to "unknown".
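
The resolution order above might look like this in Python. Several details are assumptions made for the sketch: the index is treated as a flat JSON object of `{session_uuid: project_path}`, the session UUID is taken to be the file name without its extension, and `detect_project` is a hypothetical name. The real layout of `sessions-index.json` may differ.

```python
import json
from pathlib import Path
from typing import Optional


def detect_project(session_file: Path, index_file: Optional[Path] = None) -> str:
    """Resolve a project name: index lookup, then directory fallback, then default."""
    # 1. Index lookup (assumed format: {"<session uuid>": "<project path>"}).
    if index_file is not None and index_file.is_file():
        index = json.loads(index_file.read_text(encoding="utf-8"))
        project_path = index.get(session_file.stem) if isinstance(index, dict) else None
        if isinstance(project_path, str) and Path(project_path).name:
            return Path(project_path).name  # last path component

    # 2. Fallback: name of the directory above the nearest sessions/ or subagents/.
    for parent in session_file.parents:
        if parent.name in ("sessions", "subagents") and parent.parent.name:
            return parent.parent.name

    # 3. Default when neither method resolves.
    return "unknown"
```

With these assumptions, `detect_project(Path("/home/u/myproj/sessions/abc.jsonl"))` falls through to the directory rule and returns `"myproj"`.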

Subagent Sessions

Claude Code spawns subagent sessions in /subagents/ subdirectories. These are nested conversations that tend to be noisy and implementation-focused. They are excluded by default to keep the index focused on primary conversations.

Use --include-agents to index them alongside primary sessions.
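
A file-discovery step that honours this default could be sketched as below. The `.jsonl` extension and the function name `iter_session_files` are assumptions. The only rule taken from the text above is that anything under a `subagents` directory is skipped unless explicitly included.

```python
from pathlib import Path
from typing import Iterator


def iter_session_files(root: Path, include_agents: bool = False) -> Iterator[Path]:
    """Yield session files under root, skipping subagents/ unless requested."""
    for path in sorted(root.rglob("*.jsonl")):  # assumed file extension
        parent_dirs = path.relative_to(root).parts[:-1]
        if not include_agents and "subagents" in parent_dirs:
            continue  # nested subagent conversation, excluded by default
        yield path
```

Passing `include_agents=True` mirrors the effect described for `--include-agents`.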

Exit Codes

Code	Meaning
`0`	Sync completed successfully
`1`	Error (missing directory, permission denied, parse failure)
