Red-Black Graphs are a specific type of graph, a directed acyclic graph of interleaved binary trees. This data structure resulted from exploration of efficient representations for family history. This package presents and implements the underlying linear algebra as well as discusses some interesting applications.
This python module extends both scipy and numpy and also conforms to dockerutils
conventions for building and running docker images used in module development. There is a script in the bin
directory that can be used to setup the project for development or to prep for reading the notebook.
(bin/setup-project.sh). You will want to create an activate a virtual environment prior to running the script.
A research paper describing the linear algebra underlying Red-Black graphs as well as examples of application can be found in the Jupyter notebook, "Red Black Graph - A DAG of Multiple, Interleaved Binary Trees.ipynb". To access the notebook after you've setup the project for development, simply:
run-image notebookopen http://localhost:8888/lab
If you'd prefer to read hard copy, simply run:
`bin/generate-pdf.sh notebooks/Red Black Graph - A DAG of Multiple, Interleaved Binary Trees.ipynb`
A pdf file will be generated into the build/latex-{datestamped} directory.
Run the following:
# use crawl-fs to extract a sample data set from FamilySearch
pip install fs-crawler
crawl-fs -i <FamilySearch Ids to seed crawl> -o <output-directory> -b <name portion of output file>
# this will generate a <name>.vertices.csv and <name>.edges.csv file which can be ingested into a RedBlackGraph
pip install RedBlackGraph
# use rbgcf to generate both a simple form and cannonical form of a Red Black Graph (xlsx files)
rbgcf -f <directory and base name of vertices and edges file> -o <output-directory>
# Use excel to view output
RedBlackGraph uses the Meson build system (as of version 0.5.0, migrated from numpy.distutils).
- Python 3.10, 3.11, or 3.12
- Meson >= 1.2.0
- Ninja build tool
- Cython >= 3.0
- NumPy 1.26+ (including NumPy 2.x)
# Install build dependencies
pip install meson-python meson ninja cython numpy
# Build and install in development mode
pip install -e . --no-build-isolation
# Or build wheel
pip install build
python -m buildMeson-python editable installs require --no-build-isolation (the editable loader
needs a persistent build directory). A setup script handles this:
./bin/setup-uv.sh # CPU only
./bin/setup-uv.sh --gpu # Include CuPy for GPU support
source .venv/bin/activate
pytest tests/Or manually:
uv venv
uv pip install meson-python meson ninja cython tempita numpy
uv pip install -e ".[test,io]" --no-build-isolationIf you use uv, a convenience script is provided to create a fresh .venv, install build/test dependencies, perform an editable install using pip with --no-build-isolation, and install the test extra:
./bin/setup-uv.sh
source .venv/bin/activate
uv run -m pytestThe script expects:
uvon yourPATH- the
ninjabuild tool installed (e.g.sudo apt install ninja-buildon Debian/Ubuntu) - the
fs-crawlersubmodule present at./fs-crawler
The Meson build system compiles all C/C++ extensions and Cython modules automatically.
RedBlackGraph uses cibuildwheel to build wheels for multiple platforms and Python versions.
# Build wheels for current platform
./bin/build-wheels-cibuildwheel.sh
# Or use cibuildwheel directly
pip install cibuildwheel
cibuildwheel --platform auto --output-dir wheelhouseWheels are automatically built and published to PyPI when a version tag is pushed:
git tag -a v0.5.1 -m "Release version 0.5.1"
git push origin v0.5.1For detailed instructions, see:
- PyPI Publishing Guide - Complete guide for building and publishing
- Release Checklist - Quick reference for releases
redblackgraph.reference- a pure python implementation. This simple implementation is intended primarily for illustrative purposes.redblackgraph.matrixandredblackgraph.array- a Numpy C-API extension for efficient computation with the matrix multiplication operator, @, overloaded to support avos sum and product.redblackgraph.sparse_matrix- an optimized implementation built on scipy's sparse matrix implementation.redblackgraph.gpu- GPU-accelerated sparse operations using CuPy and inline CUDA kernels. Provides SpGEMM (sparse matrix multiply) and transitive closure on GPU using the AVOS semiring.
The GPU module (redblackgraph.gpu) provides two transitive closure algorithms:
- Repeated squaring (
transitive_closure_gpu) — works for any graph. Computes TC(A) = A + A² + A⁴ + ... via CUDA SpGEMM with all data GPU-resident between iterations. - Level-parallel DAG propagation (
transitive_closure_dag_gpu) — specialized for DAGs (triangular matrices). Processes vertices by topological level with full GPU parallelism within each level.
Benchmarked on synthesized family DAGs (lower-triangular adjacency matrices). CPU uses the Cython O(V+E+nnz) topological propagation algorithm.
| Vertices | NNZ | CPU-DAG (s) | GPU-Sqr (s) | GPU-DAG (s) | Best GPU/CPU |
|---|---|---|---|---|---|
| 442 | 1,226 | 0.0020 | 0.0039 | 0.0032 | 1.6x CPU |
| 1,326 | 3,728 | 0.0080 | 0.0055 | 0.0038 | 2.1x GPU |
| 2,144 | 5,932 | 0.0129 | 0.0059 | 0.0043 | 3.0x GPU |
| 4,701 | 13,103 | 0.0292 | 0.0073 | 0.0057 | 5.1x GPU |
| 11,012 | 30,536 | 0.0700 | 0.0125 | 0.0087 | 8.1x GPU |
| 21,162 | 58,486 | 0.1380 | 0.0268 | 0.0138 | 10.0x GPU |
GPU crossover is ~1,000 vertices. The DAG kernel nearly doubles the speedup over repeated squaring at scale.
pip install cupy-cuda12x # or cupy-cuda11x for older CUDAGPU features are optional — all functionality gracefully falls back to CPU when CuPy is unavailable.
from redblackgraph.gpu import CSRMatrixGPU, transitive_closure_dag_gpu
# Transfer sparse matrix to GPU
A_gpu = CSRMatrixGPU.from_cpu(my_sparse_matrix)
# Compute transitive closure
closure, diameter = transitive_closure_dag_gpu(A_gpu)
# Transfer result back to CPU
result = closure.to_cpu()Run python bench_closure.py to reproduce the benchmark on your hardware.