Code-Guardian

A fast, modular CLI tool for scanning codebases to detect non-productive code.

Features
Installation
System Requirements
Performance Benchmarks
Usage
Advanced Usage
Supported Patterns
Output Formats
Architecture
Development
Documentation
Contributing
Branch Protection
License

Features

🔍 Pattern Detection: Scan for TODO, FIXME, and other customizable patterns
📊 Multiple Output Formats: Support for text, JSON, CSV, Markdown, and HTML
💾 Persistent Storage: SQLite-based scan history and comparison
⚡ High Performance: Parallel processing with Rust and Rayon
🏗️ Modular Architecture: Clean separation of concerns across crates
🌐 Distributed Scanning: Handle large codebases with distributed processing
🔄 Incremental Scanning: Efficient rescanning of changed files only
📈 Performance Benchmarking: Built-in benchmarks and optimization recommendations
🚀 Production Readiness: Checks and CI/CD integration for production environments
🛠️ Custom Detectors: JSON-configurable custom pattern detectors
⚙️ Advanced Scanning Options: Streaming, optimized, and metrics-based scanning
🏷️ Technology Stack Presets: Presets for web, backend, fullstack, mobile, and systems
🌍 Multi-Language Support: Scanning for Rust, JavaScript, TypeScript, Python, Go, Java, C#, PHP and 20+ other programming languages

Installation

From Source

git clone https://github.com/d-oit/code-guardian
cd code-guardian
cargo build --release

The binary will be available at target/release/code-guardian.

Using Cargo Install

cargo install code-guardian

This will download, compile, and install the binary to your Cargo bin directory (usually ~/.cargo/bin/).

System Requirements

Minimum Rust Version: 1.70.0 (Rust 2021 edition)
Supported Platforms: Linux, macOS, Windows
Memory: 50MB+ recommended for large codebases

Performance Benchmarks

Code-Guardian is optimized for speed and efficiency. Here are typical performance metrics:

Metric	Small Project (1k files)	Medium Project (10k files)	Large Project (100k files)
Scan Duration	~2.3 seconds	~18.7 seconds	~2.6 minutes
Memory Usage	~45MB	~67MB	~87MB
Throughput	~434 files/second	~535 files/second	~641 files/second

For detailed performance data and optimization recommendations, see Performance Benchmarks.

Usage

Scan a Directory

code-guardian scan /path/to/your/project

View Scan History

code-guardian history

Generate Reports

# Text format (default)
code-guardian report 1

# JSON format
code-guardian report 1 --format json

# HTML format
code-guardian report 1 --format html

Compare Scans

code-guardian compare 1 2 --format markdown

Advanced Usage

Custom Database Location

By default, scans are stored in data/code-guardian.db. You can specify a custom database path:

code-guardian scan /path/to/project --db /custom/path/my-scans.db
code-guardian history --db /custom/path/my-scans.db
code-guardian report 1 --db /custom/path/my-scans.db --format json

Piping and Redirecting Output

Redirect reports to files for further processing:

# Save HTML report to file
code-guardian report 1 --format html > scan-report.html

# Pipe JSON output to jq for filtering
code-guardian report 1 --format json | jq '.matches[] | select(.pattern == "TODO")'

# Export CSV for spreadsheet analysis
code-guardian report 1 --format csv > scan-results.csv

Automating Scans with Scripts

Create a bash script for regular scanning:

#!/bin/bash
# daily-scan.sh
PROJECT_DIR="/path/to/your/project"
DB_PATH="$HOME/code-guardian-scans.db"

echo "Running daily code scan..."
code-guardian scan "$PROJECT_DIR" --db "$DB_PATH"
SCAN_ID=$(code-guardian history --db "$DB_PATH" | tail -1 | awk '{print $2}' | tr -d ',')

echo "Generating reports..."
code-guardian report "$SCAN_ID" --db "$DB_PATH" --format html > "scan-$(date +%Y%m%d).html"
code-guardian report "$SCAN_ID" --db "$DB_PATH" --format json > "scan-$(date +%Y%m%d).json"

echo "Scan complete. Reports saved."

Comparing Scan Results Over Time

Track progress by comparing scans:

# Compare last two scans
LATEST_ID=$(code-guardian history | tail -1 | awk '{print $2}' | tr -d ',')
PREVIOUS_ID=$(code-guardian history | tail -2 | head -1 | awk '{print $2}' | tr -d ',')

code-guardian compare "$PREVIOUS_ID" "$LATEST_ID" --format markdown

Integrating with CI/CD

The project includes an enhanced CI/CD pipeline that combines the best features from multiple workflows:

Enhanced CI/CD Workflow (enhanced-ci.yml): Combines features from optimized-ci.yml, security.yml, performance.yml, and auto-fix.yml
Concurrency Controls: Prevents overlapping runs
Least Privilege Permissions: Enhanced security
Auto-fix Capabilities: Automatically fixes formatting and clippy issues
Comprehensive Testing: Cross-platform testing with incremental builds
Security Scanning: Cargo audit, deny, and security-focused clippy
Performance Benchmarking: Build time and binary size optimization
Coverage Thresholds: Enforces 82%+ test coverage

Example integration for scanning TODOs in CI:

# .github/workflows/enhanced-ci.yml
- name: Scan for TODOs
  run: |
    ./code-guardian scan . --db /tmp/scans.db
    SCAN_ID=$(./code-guardian history --db /tmp/scans.db | tail -1 | awk '{print $2}' | tr -d ',')
    COUNT=$(./code-guardian report "$SCAN_ID" --db /tmp/scans.db --format json | jq '.matches | length')
    if [ "$COUNT" -gt 10 ]; then
      echo "Too many TODOs found: $COUNT"
      exit 1
    fi

Benchmarking

Run performance benchmarks to assess scanning speed and receive optimization recommendations:

code-guardian benchmark --quick

Production Readiness Checks

Perform production readiness checks with configurable severity levels:

code-guardian production-check --severity high

Incremental Scanning

Efficiently rescan only changed files for faster subsequent scans:

code-guardian scan /path --incremental

Distributed Scanning

Distribute scanning across multiple processes for large codebases:

code-guardian scan /path --distributed

Supported Patterns

TODO: Tasks that need to be completed
FIXME: Code that needs to be fixed
HACK: Temporary workarounds
BUG: Known bugs
XXX: Critical issues
PANIC: Rust panic calls
UNWRAP: Rust unwrap calls
UNSAFE: Rust unsafe blocks
Custom Patterns: Define your own patterns via configuration files

Custom Detectors

Code-Guardian supports custom pattern detectors for detecting project-specific issues:

# Create example custom detectors
code-guardian custom-detectors create-examples

# Scan with custom detectors
code-guardian scan /path/to/project --custom-detectors custom_detectors.json

# List available custom detectors
code-guardian custom-detectors list

Custom detectors can detect security vulnerabilities, code quality issues, and more. See the Custom Detectors Guide for details.

Output Formats

text: Human-readable console output
json: Machine-readable JSON format
csv: Spreadsheet-compatible CSV format
markdown: Documentation-friendly Markdown tables
html: Web-friendly HTML tables

Architecture

The project follows a modular architecture with separate crates:

core: Scanning logic, pattern detection, custom detectors, distributed scanning, incremental scanning, performance optimization, enhanced configuration
storage: SQLite database operations, scan persistence, and migrations
output: Multiple output format support (text, json, csv, markdown, html)
cli: Command-line interface with handlers for scanning, reporting, comparisons, benchmarks, production usage, advanced features

Development

Building

cargo build

Testing

cargo test

Linting

cargo clippy

Formatting

cargo fmt

Documentation

Contributing

See CONTRIBUTING.md for detailed contribution guidelines.

Quick checklist:

Follow the guidelines in AGENTS.md
Keep modules under 500 lines of code
Maintain 82%+ test coverage
Use conventional commit messages

Branch Protection

To ensure code quality and security, this repository employs branch protection rules aligned with 2025 best practices. These include requiring 2 approvals for pull requests, signed commits, and passing all status checks (such as CI/CD, linting, and tests).

For detailed setup instructions, refer to BRANCH_PROTECTION_SETUP.md.

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Code-Guardian

Table of Contents

Features

Installation

From Source

Using Cargo Install

System Requirements

Performance Benchmarks

Usage

Scan a Directory

View Scan History

Generate Reports

Compare Scans

Advanced Usage

Custom Database Location

Piping and Redirecting Output

Automating Scans with Scripts

Comparing Scan Results Over Time

Integrating with CI/CD

Benchmarking

Production Readiness Checks

Incremental Scanning

Distributed Scanning

Supported Patterns

Custom Detectors

Output Formats

Architecture

Development

Building

Testing

Linting

Formatting

Documentation

Contributing

Branch Protection

License

Uh oh!

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Code-Guardian

Table of Contents

Features

Installation

From Source

Using Cargo Install

System Requirements

Performance Benchmarks

Usage

Scan a Directory

View Scan History

Generate Reports

Compare Scans

Advanced Usage

Custom Database Location

Piping and Redirecting Output

Automating Scans with Scripts

Comparing Scan Results Over Time

Integrating with CI/CD

Benchmarking

Production Readiness Checks

Incremental Scanning

Distributed Scanning

Supported Patterns

Custom Detectors

Output Formats

Architecture

Development

Building

Testing

Linting

Formatting

Documentation

Contributing

Branch Protection

License