From dff2d123077183d4b703660081767b68682b0430 Mon Sep 17 00:00:00 2001
From: Andrii Ryzhkov
Date: Sun, 5 Apr 2026 13:51:57 +0200
Subject: [PATCH 1/2] Update README with current models and versioning docs

---
 README.md | 40 +++++++++++++++++++++++++++++++---------
 1 file changed, 31 insertions(+), 9 deletions(-)

diff --git a/README.md b/README.md
index d6ea022..4ae4743 100644
--- a/README.md
+++ b/README.md
@@ -6,19 +6,23 @@ Currently targets the ONNX backend. The pipeline is designed to support addition
 
 ## Models
 
-| Model | Task | Description |
-|---------------------------------------------------------------------------------|-------------|-----------------------------------------------|
-| [`denoise-nafnet`](models/denoise-nafnet/README.md) | denoise | NAFNet denoiser trained on SIDD dataset |
-| [`denoise-nind`](models/denoise-nind/README.md) | denoise | UNet denoiser trained on NIND dataset |
-| [`mask-object-segnext-b2hq`](models/mask-object-segnext-b2hq/README.md) | mask-object | SegNext ViT-B SAx2 HQ for masking |
-| [`upscale-bsrgan`](models/upscale-bsrgan/README.md) | upscale | BSRGAN 2x and 4x blind super-resolution |
+| Model | Task | Description |
+|-------------------------------------------------------------------------------|---------|-------------------------------------------------|
+| [`denoise-nind`](models/denoise-nind/README.md) | denoise | UNet denoiser trained on NIND dataset |
+| [`depth-da2-small`](models/depth-da2-small/README.md) | depth | Depth Anything V2 Small for depth masking |
+| [`embed-openclip-vitb32`](models/embed-openclip-vitb32/README.md) | embed | OpenCLIP ViT-B/32 text/image embeddings |
+| [`mask-object-sam21-base-plus`](models/mask-object-sam21-base-plus/README.md) | mask | SAM 2.1 Hiera Base Plus for interactive masking |
+| [`mask-object-sam21-small`](models/mask-object-sam21-small/README.md) | mask | SAM 2.1 Hiera Small for interactive masking |
+| [`mask-object-sam21-tiny`](models/mask-object-sam21-tiny/README.md) | mask | SAM 2.1 Hiera Tiny for interactive masking |
+| [`mask-object-segnext-b2hq`](models/mask-object-segnext-b2hq/README.md) | mask | SegNext ViT-B SAx2 HQ for semantic masking |
+| [`upscale-bsrgan`](models/upscale-bsrgan/README.md) | upscale | BSRGAN 2x and 4x blind super-resolution |
 
 ## Repository structure
 
 ```
 pyproject.toml   Project configuration, dependency groups, CLI entry point
 darktable_ai/    Python package (CLI + pipeline orchestration)
-vendor/          Git submodules (NAFNet, nind-denoise, SegNext)
+vendor/          Git submodules (nind-denoise, sam2, SegNext)
 samples//        Sample images organized by task
 output/          Build output: ONNX models + config.json (gitignored)
 temp/            Downloaded checkpoints (gitignored)
@@ -34,7 +38,7 @@ models/
 
 Requires [uv](https://docs.astral.sh/uv/) and Python 3.11–3.12.
 
-Dependencies are managed through [dependency groups](https://docs.astral.sh/uv/concepts/dependency-groups/) in `pyproject.toml`. The base package only needs `click` and `pyyaml`. ML dependencies are split into groups — one per model plus a shared `core` group — so you only install what you need. Use `uv sync --group ` to install a specific group, or `--group all-models` for everything.
+Dependencies are managed through [dependency groups](https://docs.astral.sh/uv/concepts/dependency-groups/) in `pyproject.toml`. The base package only needs `click` and `pyyaml`. ML dependencies are split into groups – one per model plus a shared `core` group – so you only install what you need. Use `uv sync --group ` to install a specific group, or `--group all-models` for everything.
 
 ## Setup
 
@@ -71,18 +75,36 @@ uv run dtai validate denoise-nind  # Validate ONNX output
 uv run dtai package denoise-nind   # Create .dtmodel archive
 uv run dtai demo denoise-nind      # Run demo on sample images
 
+# Generate versions.json
+uv run dtai versions
+
 # Evaluate a model
 uv sync --group eval
 uv run dtai eval mask mask-object-segnext-b2hq --limit 5
 ```
 
+## Versioning
+
+Each model has a `version` field in its `model.yaml` that gets written to `config.json`. The `dtai versions` command generates `output/versions.json` – a manifest mapping model IDs to their current versions:
+
+```json
+{
+  "models": {
+    "denoise-nind": "1.0",
+    "mask-object-sam21-small": "1.0"
+  }
+}
+```
+
+This file is published as a GitHub release asset alongside `.dtmodel` packages. darktable fetches it to check for model updates without downloading full packages.
+
 ## Demos
 
 Each model includes a `demo.py` script that runs inference on sample images from `samples//`. Models that require per-image input (e.g. point prompts for object segmentation) define `image_args` in their `model.yaml`.
 
-Output images are saved to `models//output/`.
+Output images are saved to `output/-demo/`.
 
 ## Model selection criteria

From 34aa3010875f719a3f8cb0f58eaf976bdee8d868 Mon Sep 17 00:00:00 2001
From: Andrii Ryzhkov
Date: Wed, 8 Apr 2026 15:06:28 +0200
Subject: [PATCH 2/2] Remove DA2

---
 README.md | 1 -
 1 file changed, 1 deletion(-)

diff --git a/README.md b/README.md
index 4ae4743..f99275b 100644
--- a/README.md
+++ b/README.md
@@ -9,7 +9,6 @@ Currently targets the ONNX backend. The pipeline is designed to support addition
 | Model | Task | Description |
 |-------------------------------------------------------------------------------|---------|-------------------------------------------------|
 | [`denoise-nind`](models/denoise-nind/README.md) | denoise | UNet denoiser trained on NIND dataset |
-| [`depth-da2-small`](models/depth-da2-small/README.md) | depth | Depth Anything V2 Small for depth masking |
 | [`embed-openclip-vitb32`](models/embed-openclip-vitb32/README.md) | embed | OpenCLIP ViT-B/32 text/image embeddings |
 | [`mask-object-sam21-base-plus`](models/mask-object-sam21-base-plus/README.md) | mask | SAM 2.1 Hiera Base Plus for interactive masking |
 | [`mask-object-sam21-small`](models/mask-object-sam21-small/README.md) | mask | SAM 2.1 Hiera Small for interactive masking |