papersmith

Docker-only PDF autonamer.

It watches a folder, OCRs PDFs with Surya, asks Docker Ollama for a filename, and renames PDFs to:

YYYYMMDD-title.pdf

It does not create sidecar folders in the watched directory. If a PDF cannot be renamed confidently, it is left in place and the reason is logged to stdout.

Run

Default watched folder:

~/Library/Mobile Documents/com~apple~CloudDocs/docs

Start the stack:

docker compose up --build

Watch logs:

docker compose logs -f watcher

Useful watcher events:

renamed
needs_review
process_failed
retry_scheduled
skip_failed_retry_limit

Options

No config is required for the default setup.

To watch a different folder, create .env:

cp .env.example .env

Then set:

WATCH_DIR_HOST=/absolute/path/to/pdfs

Failed OCR attempts are retried rather than suppressed permanently. The defaults are:

MAX_PROCESS_ATTEMPTS=3
FAILED_RETRY_DELAY_SECONDS=300
OCR_MAX_PAGES=3
SURYA_TIMEOUT_SECONDS=1200
OLLAMA_TIMEOUT_SECONDS=1200
SURYA_RENDERED_FALLBACK_MAX_DIMENSION=1600
SURYA_DIRECT_OCR_MAX_PAGE_DIMENSION_POINTS=1600
SURYA_MEM_LIMIT=8g
SURYA_MEMSWAP_LIMIT=20g

OCR_MAX_PAGES limits OCR to the first N pages before naming. Set it to 0 to OCR every page.

SURYA_TIMEOUT_SECONDS and OLLAMA_TIMEOUT_SECONDS control how long the watcher waits for OCR and filename inference. The defaults are 20 minutes.

SURYA_MEMSWAP_LIMIT is Docker's total memory plus swap allowance for the Surya container, not swap alone. Docker Desktop must also have enough memory and swap enabled in its resource settings for this to help.

The default model is qwen3:4b, which fits typical Docker Desktop memory limits. To use another model:

OLLAMA_MODEL=qwen3:1.7b

One File

Process one PDF mounted inside Docker at /watch:

docker compose run --rm watcher --once /watch/example.pdf

Dry run:

docker compose run --rm -e DRY_RUN=true watcher --once /watch/example.pdf

Notes

Everything runs in Docker: watcher, OCR, and Ollama. This is tidy but slower than host-native GPU/MPS OCR or inference.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
surya_service		surya_service
watcher		watcher
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

papersmith

Run

Options

One File

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

papersmith

Run

Options

One File

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages