Modern, multiprocess-safe logging specifically engineered for High-Performance Computing (HPC) and Machine Learning (ML).
ML experiments and distributed training (like PyTorch DDP) present unique logging challenges:
- Log Storms: 128 identical lines when 128 GPUs log simultaneously.
- Multiprocess Safety: Corrupted log files when multiple processes write to the same file.
- Startup Consistency: Tracking which logs belong to which experiment run.
LogFlow solves these by being distributed-aware and framework-agnostic.
- High-Fidelity Logging: Provide a thread-safe and multiprocess-safe logging engine.
- Unified Observability: Standardize logging levels (TRACE, DEBUG, INFO, SUCCESS, WARNING, ERROR, CRITICAL).
- Auto-Infrastructure: Automatically create log directories if they do not exist.
- Global Configuration: Support XDG-standard configuration (~/.config/logflow/config.yaml).
- Log-Symmetry: Support automatic log file naming based on the active script/config name.
- Rich Integration: Provide beautiful, filtered console output via the Rich library.
- Framework Interception: Intercept standard library logging and redirect to LogFlow.
- Zero-Overhead Inactive Levels: Ensure that TRACE/DEBUG levels have minimal impact when disabled.
- Asynchronous Sinks: Support enqueued logging to prevent blocking the hot path.
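The "Framework Interception" goal above follows the common pattern of installing a handler on the root stdlib logger so that records emitted by third-party frameworks are captured in one place. A minimal, dependency-free sketch of that pattern (LogFlow forwards to Loguru; the list sink and class name here are illustrative, not LogFlow's actual internals):

```python
import logging

class InterceptHandler(logging.Handler):
    """Forward every stdlib LogRecord into an arbitrary sink.

    LogFlow forwards to Loguru; a plain list stands in for the
    sink here so this sketch stays dependency-free."""

    def __init__(self, sink):
        super().__init__()
        self.sink = sink

    def emit(self, record: logging.LogRecord) -> None:
        # getMessage() applies %-style arguments, matching what a
        # formatter would render.
        self.sink.append((record.levelname, record.getMessage()))

captured = []
# Replace the root handlers so framework loggers (torch, tensorflow, ...)
# that propagate to the root logger are intercepted too.
logging.basicConfig(handlers=[InterceptHandler(captured)],
                    level=logging.INFO, force=True)

logging.getLogger("torch").info("rank %d ready", 0)
```

Because child loggers propagate to the root by default, nothing needs to be registered per framework: any library that uses standard `logging` is picked up automatically.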
- Rank-Aware: Automatically filters console output to Rank 0 (supports SLURM, DDP, MPI).
- Multiprocess Safe: Uses Loguru's `enqueue=True` for thread/process safety.
- Startup Rotation: Archives old logs on script start, giving every run a fresh log file.
- Framework Interoperability: Automatically intercepts and formats logs from TensorFlow, PyTorch, JAX, and standard Python `logging`.
- Zero-Blocking: Non-blocking logging via background sinking.
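Rank-aware filtering boils down to reading the rank that the launcher exports into the environment. A sketch of how that detection might look; the environment variable names are the ones conventionally set by torchrun/DDP (`RANK`), SLURM (`SLURM_PROCID`), and Open MPI (`OMPI_COMM_WORLD_RANK`), but the probe order and helper name here are assumptions, not LogFlow's actual implementation:

```python
import os

def detect_rank() -> int:
    """Best-effort global-rank detection across common launchers.

    Checks torchrun/DDP, SLURM, and Open MPI environment variables
    in turn; falls back to 0 for single-process runs."""
    for var in ("RANK", "SLURM_PROCID", "OMPI_COMM_WORLD_RANK"):
        value = os.environ.get(var)
        if value is not None:
            return int(value)
    return 0

# LogFlow-style gating: console sink only on rank 0,
# while the file sink records every rank.
```

The fallback to 0 is what makes the same code work unchanged in a plain single-process run: with no launcher variables set, every process believes it is rank 0 and logs to the console.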
```bash
pip install git+https://github.com/Gearlux/logflow.git@main
```

```python
from logflow import get_logger, configure_logging

# Optional: customize levels and directories
configure_logging(log_dir="./experiment_logs", console_level="INFO")

logger = get_logger(__name__)
logger.info("Starting training loop...")
logger.debug("Hyperparameters: batch_size=32, lr=0.001")
logger.success("Model checkpoint saved!")
```

LogFlow supports a hierarchical configuration system that allows you to manage settings across different projects and environments.
Settings are resolved in the following order (highest to lowest):
- Function Arguments (passed to `configure_logging()`)
- Environment Variables (prefixed with `LOGFLOW_`)
- Local `logflow.yaml` / `logflow.yml`
- Local `pyproject.toml` (under `[tool.logflow]`)
- XDG User Config (`~/.config/logflow/config.yaml`)
- Defaults
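The precedence chain can be pictured as a layered merge: start from the defaults and let each higher-priority source overwrite only the keys it actually sets. A simplified sketch, omitting the YAML/TOML file layers; the function name, setting keys, and the `LOGFLOW_*`-to-key mapping are illustrative, not LogFlow's actual internals:

```python
import os

# Hypothetical defaults for the sketch.
DEFAULTS = {"log_dir": "./logs", "console_level": "INFO", "retention": 5}

def resolve_settings(args: dict) -> dict:
    """Merge sources from lowest to highest priority: explicit function
    arguments beat LOGFLOW_* environment variables, which beat defaults."""
    settings = dict(DEFAULTS)  # lowest priority
    # Environment layer: LOGFLOW_CONSOLE_LEVEL -> console_level, etc.
    for key, value in os.environ.items():
        if key.startswith("LOGFLOW_"):
            settings[key[len("LOGFLOW_"):].lower()] = value
    # Function arguments take top priority; None means "not passed".
    settings.update({k: v for k, v in args.items() if v is not None})
    return settings
```

Merging from lowest to highest priority keeps the logic simple: a later `update` only needs to know its own keys, never what the layers below contained.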
```toml
# pyproject.toml
[tool.logflow]
log_dir = "./custom_logs"
console_level = "DEBUG"
retention = 10
```

```yaml
# logflow.yaml
log_dir: "./experiment_logs"
file_level: "TRACE"
enqueue: true
rotation_on_startup: true
```

```bash
export LOGFLOW_DIR="/var/log/myapp"
export LOGFLOW_CONSOLE_LEVEL="ERROR"
```

For the best experience viewing LogFlow logs (especially interleaving logs from multiple ranks/workers), we recommend using lnav (The Log File Navigator).
lnav automatically detects LogFlow's timestamp format and can merge multiple log files into a single, chronological view.
```bash
# View all logs in the directory interleaved by time
lnav ./logs
```

For more information, see the lnav documentation.
LogFlow handles ranks automatically. No need to wrap your log calls in `if rank == 0:`.
```python
# In a torchrun or SLURM environment
from logflow import get_logger

logger = get_logger(__name__)

# Only shows up once in the console (Rank 0), but saved in the file for all ranks
logger.info("Initializing process group...")
```

MIT