GitHub - thoherr/maki: Finally my own asset management software that (hopefully) does exactly what i want

A command-line digital asset manager built in Rust, designed for large collections of images, videos, and other media spread across multiple storage devices.

Key Features

Content-addressable storage — files identified by SHA-256 hash, enabling deduplication and integrity verification
Multi-volume support — manage assets across external drives, NAS, and offline storage (terabytes scale)
Variant grouping — automatically groups related files (RAW + JPEG + XMP) into a single asset by filename stem
Recipe management — tracks processing sidecars from CaptureOne, Lightroom/XMP, RawTherapee, DxO, and ON1
EXIF/XMP extraction — camera metadata, keywords, ratings, color labels, and descriptions extracted at import
Bidirectional XMP sync — rating, tag, description, and label changes written back to .xmp recipe files
Preview generation — thumbnails for images, RAW files (via dcraw/LibRaw), videos (via ffmpeg), and info cards for audio/documents
Integrity verification — detect bit rot and corruption by re-hashing files against stored checksums
Stacks — group burst shots and similar-scene images into collapsible stacks, showing only the "pick" in the browse grid
Hierarchical tags — tree-structured keywords with Lightroom lr:hierarchicalSubject interop
Saved searches & collections — smart albums (dynamic queries) and static albums (curated lists)
AI auto-tagging — zero-shot image classification using SigLIP vision-language models (ViT-B/16-256 or ViT-L/16-256) for automated tag suggestions, visual similarity search via stored embeddings, and natural language image search via text: filter (MAKI Pro)
Face recognition — detect faces with YuNet, generate ArcFace embeddings, auto-cluster into people groups, and manage named people across your catalog (MAKI Pro)
Interactive shell — maki shell REPL with named variables ($picks = search "rating:5"), tab completion, session defaults, .maki script files, and source for script composition
Web UI — browser-based interface with search, inline editing, batch operations, keyboard navigation, lightbox viewer, dark mode, grid density controls, calendar heatmap, faceted sidebar, visual similarity stroll page, per-tag review of AI suggestions before applying, and OS integration (reveal in Finder, open terminal)
Flexible output — JSON on all commands, custom format templates, quiet mode for scripting

Quick Start

cargo build --release

# Initialize a catalog
maki init

# Register a storage volume
maki volume add "Photos 2024" /Volumes/PhotosDrive

# Import files
maki import /Volumes/PhotosDrive/Photos/

# Search and browse
maki search "tag:landscape rating:4+"
maki stats --all

# Start the web UI
maki serve
# Open http://127.0.0.1:8080

# Or use the interactive shell
maki shell
# photos> $picks = search "rating:5 date:2024"
# photos [picks=38]> export --target /tmp/best $picks

Commands

44 commands covering setup, import, search, editing, maintenance, and more:

init · volume add/list/combine/split/rename/remove/set-purpose · import · delete · export · contact-sheet · describe · search · show · preview · edit · tag · group · split · auto-group · auto-tag · embed · faces · stack · duplicates · dedup · generate-previews · create-sidecars · relocate · update-location · verify · sync · sync-metadata · refresh · cleanup · writeback · stats · backup-status · fix-roles · fix-dates · fix-recipes · rebuild-catalog · migrate · saved-search · collection · serve · doc · licenses · shell

Global flags: --json, --log, --verbose, --debug, --time. Run maki --help or maki <command> --help for usage.

See the Command Reference for detailed documentation of every command, or the Search Filters Reference for the 20+ filter types available in maki search.

Architecture

The system uses a two-tier storage model:

YAML sidecar files are the source of truth for all metadata (human-readable, diffable, never lost)
SQLite catalog is a derived index for fast queries (rebuildable from sidecars via maki rebuild-catalog)

Assets live on media volumes (external drives, NAS) while the catalog stays local with enough data (index + thumbnails) to browse without media mounted.

See doc/architecture-overview.md for the system design and doc/component-specification.md for detailed component specs.

Documentation

The full User Manual covers:

User Guide — workflow-oriented guides from setup through maintenance
Reference Guide — man-page style docs for every command, filter, and config option
Developer Guide — REST API, module reference, and build/test instructions

Configuration is documented in the Configuration Reference. All settings live in maki.toml at the catalog root; every field is optional with sensible defaults.

External Tools (Highly Recommended)

dcraw or LibRaw (dcraw_emu) — RAW file preview extraction
ffmpeg — video thumbnail extraction
curl — model file download for AI auto-tagging (only needed with --features ai) and VLM image descriptions (maki describe)

Install on macOS: brew install libraw ffmpeg curl Install on Linux: sudo apt install libraw-bin ffmpeg curl (Debian/Ubuntu) Install on Windows: winget install LibRaw.LibRaw Gyan.FFmpeg cURL.cURL or scoop install libraw ffmpeg curl

When missing, RAW and video files get an info card preview instead. maki prints a warning on first use when a tool is not found.

AI Auto-Tagging (MAKI Pro)

Download the MAKI Pro binary or build with cargo build --features pro to enable AI-powered commands. This uses SigLIP vision-language models (via ONNX Runtime) for zero-shot image classification against a configurable tag vocabulary. Two models are available: ViT-B/16-256 (~207 MB, default) and ViT-L/16-256 (~670 MB, higher accuracy). Select with --model or [ai] model in maki.toml. Model files are downloaded from HuggingFace on first use. Commands: maki auto-tag for tag suggestion/application, maki embed for batch embedding generation, maki search "similar:<id>" for visual similarity search, and maki search "text:\"sunset on the beach\"" for natural language image search. The web UI includes a Stroll page (/stroll) for graph-based visual exploration — pick an asset, see its nearest visual neighbors arranged radially, click through to explore connections. The "Suggest tags" and "Auto-tag" buttons also store embeddings opportunistically. Similarity search uses an in-memory index for sub-millisecond results at any scale. See the Configuration Reference for [ai] settings.

GPU acceleration (macOS): The pre-built macOS Pro binary includes CoreML support for hardware-accelerated inference on Apple Silicon (Neural Engine) and Intel Macs (Metal). Falls back to CPU automatically when CoreML is unavailable. Configure via [ai] execution_provider in maki.toml ("auto", "cpu", "coreml"). When building from source on macOS, add ai-gpu: cargo build --features pro,ai-gpu.

Face Recognition (MAKI Pro)

Download the MAKI Pro binary or build with cargo build --features pro. Uses two ONNX models: YuNet for face detection (bounding boxes + 5-point landmarks, ~230 KB) and ArcFace ResNet-100 FP32 for recognition (512-dim embeddings, ~261 MB). Each detected face is aligned to a canonical 112×112 template via a 5-point similarity transform before embedding — matching InsightFace's reference preprocessing.

CLI workflow: detect faces → cluster into groups → name people:

maki faces download                                    # download YuNet + ArcFace models
maki faces detect --query "type:image" --apply         # detect + embed faces
maki faces cluster --apply                             # group similar faces (HAC, default threshold 0.35)
maki faces people                                      # list unnamed person groups
maki faces name <person-id> "Alice"                    # name a person

Diagnostic commands for tuning clustering:

maki faces similarity --query <scope>           # histogram of pairwise similarities
maki faces dump-aligned --query <scope>         # save aligned 112x112 crops for visual check
maki faces clean --apply                        # delete unassigned orphan faces
maki faces status                               # show per-model face counts and stale-embedding warnings

Web UI: /people page with person gallery and per-cluster browse links (works for both named people and unnamed clusters), asset detail face chips with assign/unassign, browse filter by faces: and person: that persists across pagination.

Data persistence: Face records (including recognition_model stamp for migration safety), people, and embeddings live in both SQLite and files (YAML + binary) for rebuild resilience.

Search filters: faces:any / faces:none / faces:N / faces:N+ (face count), person:<name> (matches name or person UUID). Config: [ai] face_cluster_threshold (default 0.35), [ai] face_min_confidence (default 0.7).

VLM Image Descriptions (MAKI Pro)

Download the MAKI Pro binary or build with cargo build --features pro to enable VLM-powered image descriptions. Works with any OpenAI-compatible API server (Ollama, LM Studio, vLLM).

ollama pull qwen2.5vl:3b                              # download a VLM
maki describe "description:none" --apply                 # describe undescribed assets
maki describe "date:2024-06" --mode tags --apply          # suggest tags via VLM
maki describe --mode both --volume "Photos" --apply       # both in one pass
maki import --describe /Volumes/Photos/NewShoot/        # auto-describe during import

Web UI: "Describe" button on asset detail page, batch "Describe" in browse toolbar. VLM availability is detected at server startup.

Auto-describe on import: maki import --describe generates descriptions for newly imported assets. Enable permanently with [import] descriptions = true in maki.toml. Silently skips if VLM endpoint is not available.

Config: [vlm] section in maki.toml — endpoint, model, max_tokens, temperature, timeout, mode, prompt, concurrency. CLI flags override config. Set concurrency to process multiple assets in parallel during batch describe. See the Configuration Reference for [vlm] settings.

Technology

Rust, SQLite, clap, axum, askama, htmx. See Cargo.toml for the full dependency list.

Requirements

Rust 2021 edition (stable)
macOS, Linux, or Windows

License

Licensed under the Apache License, Version 2.0. See NOTICE for attribution.

Name		Name	Last commit message	Last commit date
Latest commit History 1,105 Commits
.cargo		.cargo
.github/workflows		.github/workflows
.idea		.idea
doc		doc
scripts		scripts
src		src
templates		templates
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
about.hbs		about.hbs
about.toml		about.toml
deny.toml		deny.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Key Features

Quick Start

Commands

Architecture

Documentation

External Tools (Highly Recommended)

AI Auto-Tagging (MAKI Pro)

Face Recognition (MAKI Pro)

VLM Image Descriptions (MAKI Pro)

Technology

Requirements

License

About

Uh oh!

Releases 29

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Key Features

Quick Start

Commands

Architecture

Documentation

External Tools (Highly Recommended)

AI Auto-Tagging (MAKI Pro)

Face Recognition (MAKI Pro)

VLM Image Descriptions (MAKI Pro)

Technology

Requirements

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 29

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages