AI Colmap Camera Tracking

Automated pipeline for camera tracking and scene reconstruction using COLMAP and NeRF-compatible formats. Processes video inputs, performs sparse 3D reconstruction, undistorts footage, and exports camera data for use in Houdini or NeRF training.

Features

Automated Workflow: Batch processes multiple video files.
Frame Extraction: Uses FFmpeg to extract frames from input videos.
Feature Extraction & Matching: Utilises COLMAP for feature extraction and sequential matching.
Sparse Reconstruction: Uses COLMAP Global Mapper (requires COLMAP 4.0+).
NeRF Conversion: Converts COLMAP data to transforms.json (NeRF format).
Undistortion: Expands the undistorted canvas to preserve all valid pixels while keeping the focal length unchanged. Optionally crops to the original canvas size (--crop).
Houdini Integration: Automatically generates a Houdini .hip scene with the reconstructed point cloud and animated camera. Converts COLMAP/OpenCV coordinates to Houdini's Y-up world, and correctly handles sensor size, canvas expansion, and principal-point offset.

Prerequisites

uv — Python package and environment manager.
FFmpeg — video processing.
COLMAP 4.0+ — feature extraction, matching, and Global Mapper reconstruction.
Houdini (hython) — required only for Houdini scene generation.
Vocabulary tree (optional) — required only when using --loop for loop detection. Download vocab_tree_faiss_flickr100K_words32K.bin (≈9 MB) from the COLMAP project (https://demuc.de/colmap/) and place it in the repo root, or pass its path via --vocab_tree_path. The file is intentionally not tracked in git.

Python Dependencies

Dependencies are declared in pyproject.toml and managed by uv. To create the virtual environment and install all dependencies:

uv sync

Optional: For automatic object masking in colmap2nerf.py, PyTorch and Detectron2 are required.

Usage

The main entry point is run_autotracker.py.

uv run python run_autotracker.py <input_videos_dir> <output_dir> [options]

Graphical User Interface (GUI)

A PySide6-based GUI is available for a more user-friendly experience. It wraps the run_autotracker.py script and provides a real-time log of the processing steps.

Launching the GUI

uv run gui_autotracker.py

The GUI allows you to browse for input/output directories, adjust processing scales, select COLMAP camera models, and toggle advanced settings like loop detection or Houdini path configuration.

A Copy Command button builds the equivalent run_autotracker.py invocation from the current settings, copies it to the clipboard, and echoes it into the log — useful when you want to configure visually but run from a terminal (or paste into a batch script).

Arguments

Argument	Default	Description
`input_videos_dir`	—	Directory containing source video files (`.mp4`, `.mov`, …)
`output_dir`	—	Directory for all output data
`--scale`	`0.5`	Image scaling factor applied before processing
`--overlap`	`12`	Sequential matching overlap (number of frames)
`--skip-houdini`	off	Skip Houdini `.hip` generation
`--hfs`	—	Path to Houdini installation directory (e.g. `C:\Program Files\Side Effects Software\Houdini 20.0.625`). If omitted, `hython` must be in `PATH`.
`--multi-cams`	off	Treat each video as a separate camera (useful for multi-device shoots)
`--acescg`	off	Convert input from ACEScg to sRGB before processing
`--lut`	—	Path to a `.cube` LUT file for colour-space conversion
`--mask`	—	Path to a directory containing per-frame masks
`--focal_length_mm`	—	Lens focal length in mm (e.g. `24`). Locks COLMAP to this value instead of estimating it.
`--sensor_width_mm`	`36.0`	Physical sensor width in mm. Used together with `--focal_length_mm`. Common values: full-frame=36.0, ARRI LF=36.7, Super35=24.89, MFT=17.3
`--crop`	off	Keep original canvas size during undistortion instead of expanding it. Houdini focal length and aperture remain at exact physical values (e.g. 20 mm / 36 mm).
`--camera_model`	`SIMPLE_RADIAL`	COLMAP camera model (e.g. `OPENCV`, `PINHOLE`, `SIMPLE_RADIAL`)
`--loop`	off	Enable loop detection in sequential matching
`--loop_period`	`5`	Loop detection period
`--loop_num_images`	`50`	Number of images considered per loop detection pass
`--vocab_tree_path`	`vocab_tree_faiss_flickr100K_words32K.bin`	Path to vocabulary tree for loop detection
`--extra_fe`	—	Extra COLMAP feature-extraction arguments. Accepts a JSON string or `.json` file path.
`--extra_sm`	—	Extra COLMAP sequential-matching arguments.
`--extra_ma`	—	Extra COLMAP global-mapper arguments.

Specifying Focal Length

When the shooting focal length is known, providing it improves reconstruction accuracy by preventing COLMAP from freely estimating it:

python run_autotracker.py ./videos ./output --focal_length_mm 24 --sensor_width_mm 36

Extra Arguments Example

Create a params.json:

{
    "SiftExtraction.peak_threshold": 0.01,
    "SiftExtraction.max_num_features": 8192
}

Then pass it:

uv run python run_autotracker.py ./in ./out --extra_fe params.json

Masking

The pipeline supports per-frame masks to exclude moving objects or unwanted regions from reconstruction.

Rules:

Auto-detection: For a video shot01.mp4, the script looks for a sibling directory named shot01_mask.
Custom root: --mask <path> looks for <video_name>_mask inside the specified path.
Filename format: PNG files named frame_000001.jpg.png. If frame_000001.png is found it is automatically renamed to match COLMAP requirements.

Example

uv run python run_autotracker.py ./videos ./output \
    --scale 0.5 \
    --focal_length_mm 20 \
    --sensor_width_mm 36 \
    --hfs "C:/Program Files/Side Effects Software/Houdini 20.0.625"

Batch Processing

batch_run.py processes multiple folders within a target directory. Per-folder settings can be defined in a batch_config.ini placed in the target directory.

Folder discovery

batch_run.py walks the target directory and treats each subfolder as a batch of videos to process. Two naming conventions are reserved:

Folders starting with . are skipped (hidden directories).
Folders ending in -output are skipped — they are reserved for results produced by batch_run.py itself (<folder>-output/).

If you want a folder named something-output to be processed, rename it.

Usage

uv run python batch_run.py <target_directory>

Configuration Format (`batch_config.ini`)

If a section name matches a folder name, its settings override the defaults for that folder.

[global]
scale = 0.5
hfs = C:/Program Files/Side Effects Software/Houdini 20.0.625

[shot_01]
scale = 0.8
camera_model = OPENCV
focal_length_mm = 24
sensor_width_mm = 36

[shot_02]
focal_length_mm = 20
sensor_width_mm = 24.89
acescg = true

Supported INI Keys:

Key	Type	Description
`scale`	float	Image scaling factor
`overlap`	int	Sequential matching overlap
`camera_model`	string	e.g. `OPENCV`, `PINHOLE`
`focal_length_mm`	float	Lens focal length in mm
`sensor_width_mm`	float	Physical sensor width in mm
`mask`	string	Path to mask directory
`lut`	string	Path to `.cube` LUT file
`hfs`	string	Path to Houdini installation
`crop`	bool	`true` / `false`
`multi_cams`	bool	`true` / `false`
`acescg`	bool	`true` / `false`
`skip_houdini`	bool	`true` / `false`
`loop`	bool	`true` / `false`
`loop_period`	int	Loop detection period
`loop_num_images`	int	Images per loop detection pass
`vocab_tree_path`	string	Path to vocabulary tree

Advanced Parameter Injection (INI Only)

Pass any COLMAP internal parameter using these prefixes:

fe.<Parameter> — injected into feature_extractor
sm.<Parameter> — injected into sequential_matcher
ma.<Parameter> — injected into global_mapper

Example:

[global]
fe.SiftExtraction.peak_threshold = 0.01
sm.SequentialMatching.min_num_matches = 20

Quick Start / Demo

run_demo_test.bat

Processes ./demo-test/walking-forest and outputs to ./demo-test/walking-forest-output. Edit the .bat file to point to your Houdini installation if hython is not in PATH.

Pipeline Steps

Frame extraction — FFmpeg extracts frames from each input video.
Feature extraction & matching — COLMAP feature_extractor + sequential_matcher.
Sparse reconstruction — COLMAP global_mapper (requires COLMAP 4.0+).
Model export — Sparse model converted to TXT and PLY formats.
NeRF conversion — colmap2nerf.py generates transforms.json.
Undistortion — undistortionNerfstudioColmap.py removes lens distortion.
- Default: canvas is expanded to include all valid pixels; sensor_w/sensor_h are recorded so downstream tools can recover the physical focal length.
- --crop: keeps the original canvas size; Houdini focal length and aperture remain at their exact physical values (e.g. 20 mm / 36 mm).
Houdini scene — build_houdini_scene.py imports the point cloud and creates an animated camera with correct focal length, aperture, and principal-point offset. Both the camera and point cloud are converted from COLMAP/OpenCV space to Houdini's Y-up world via an Rx(180) rotation (flip Y and Z), so the scene appears upright and un-mirrored.

Scripts Overview

Script	Description
`run_autotracker.py`	Master script — orchestrates the full pipeline
`autotracker.py`	Core photogrammetry: FFmpeg, COLMAP feature extraction, matching, and Global Mapper
`colmap2nerf.py`	Converts COLMAP sparse model to `transforms.json`
`undistortionNerfstudioColmap.py`	Undistorts images; expands canvas or crops to original size
`restore_distortion.py`	Utility to apply or remove lens distortion from rendered images. Supports EXR via `--exr`
`build_houdini_scene.py`	Generates a `.hip` file with point cloud and animated camera
`batch_run.py`	Batch runner with per-folder INI configuration

Output Structure

For each processed video:

<output_dir>/
├── <video_name>_transforms.json   # NeRF camera poses (at the top level)
└── <video_name>/
    ├── images/                    # Extracted frames
    ├── sparse/                    # COLMAP sparse reconstruction
    ├── database.db                # COLMAP feature database
    ├── points3D.ply               # Point cloud
    ├── undistort/
    │   ├── images_undistorted/    # Undistorted frames
    │   └── transforms_undistorted.json
    └── <video_name>.hip           # Houdini project file

References

Inspired by: Video Link
Demo test video: Pexels — Tranquil Autumn Forest Walkway Path

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Colmap Camera Tracking

Features

Prerequisites

Python Dependencies

Usage

Graphical User Interface (GUI)

Launching the GUI

Arguments

Specifying Focal Length

Extra Arguments Example

Masking

Example

Batch Processing

Folder discovery

Usage

Configuration Format (`batch_config.ini`)

Advanced Parameter Injection (INI Only)

Quick Start / Demo

Pipeline Steps

Scripts Overview

Output Structure

References

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
demo-test/walking-forest		demo-test/walking-forest
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
autotracker.py		autotracker.py
batch_config.ini		batch_config.ini
batch_run.py		batch_run.py
build_houdini_scene.py		build_houdini_scene.py
colmap2nerf.py		colmap2nerf.py
gui_autotracker.py		gui_autotracker.py
pyproject.toml		pyproject.toml
restore_distortion.py		restore_distortion.py
run_autotracker.py		run_autotracker.py
run_demo_test.bat		run_demo_test.bat
undistortionNerfstudioColmap.py		undistortionNerfstudioColmap.py
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

AI Colmap Camera Tracking

Features

Prerequisites

Python Dependencies

Usage

Graphical User Interface (GUI)

Launching the GUI

Arguments

Specifying Focal Length

Extra Arguments Example

Masking

Example

Batch Processing

Folder discovery

Usage

Configuration Format (batch_config.ini)

Advanced Parameter Injection (INI Only)

Quick Start / Demo

Pipeline Steps

Scripts Overview

Output Structure

References

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Configuration Format (`batch_config.ini`)

Packages