Releases: sharpic/redact

v0.0.1

26 Jun 10:38

Initial release of redact.

A local command-line tool that redacts PII (names, emails, IDs, usernames) in Word, ODT, Excel, and PDF files. It replaces each value with a consistent pseudonym and writes a mapping file that can be used to restore the originals.
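
The idea of consistent pseudonyms plus a restore mapping can be sketched in a few lines. This is only an illustration, not the code in redact.py: the function names, the `EMAIL_001` pseudonym format, the email regex, and the JSON mapping layout are all assumptions made for the example, and it handles plain text rather than document files.

```python
import json
import re

# Hypothetical, simplified email pattern (not taken from redact_config.toml).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def redact_text(text, mapping):
    """Replace each email with a stable pseudonym.

    `mapping` is updated in place and maps pseudonym -> original value.
    """
    # Reverse lookup so a repeated value always gets the same pseudonym.
    reverse = {original: pseudonym for pseudonym, original in mapping.items()}

    def substitute(match):
        original = match.group(0)
        if original not in reverse:
            pseudonym = f"EMAIL_{len(reverse) + 1:03d}"
            reverse[original] = pseudonym
            mapping[pseudonym] = original
        return reverse[original]

    return EMAIL_RE.sub(substitute, text)


def restore_text(text, mapping):
    """Undo redact_text using the pseudonym -> original mapping."""
    # Longest pseudonyms first, so one pseudonym that is a prefix of
    # another cannot be replaced inside it.
    for pseudonym in sorted(mapping, key=len, reverse=True):
        text = text.replace(pseudonym, mapping[pseudonym])
    return text


mapping = {}
sample = "Contact ann@example.com or bob@example.org; ann@example.com replies fastest."
redacted = redact_text(sample, mapping)
restored = restore_text(redacted, mapping)

# The mapping could be written to disk in a form like this (layout assumed).
mapping_json = json.dumps(mapping, indent=2)
```

Because the same original always maps to the same pseudonym, the redacted text stays readable (it is still clear that two mentions refer to one person), and the mapping alone is enough to reverse the substitution.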

Files

  • redact.py — main script
  • redact_config.toml — pattern definitions
  • redact_tests.py — 116 tests

Usage

python3 redact.py <input_file>
python3 redact.py <input_redacted_file> --restore

Requirements

pip install python-docx odfpy openpyxl pdfplumber
pip install spacy && python3 -m spacy download en_core_web_sm
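
To confirm the packages above are importable before running the tool, a small check along these lines may help. It is a sketch, not part of the release: `missing_modules` is a hypothetical helper, and the list uses the import names that the pip packages install under (python-docx imports as `docx`, odfpy as `odf`). It does not check that the `en_core_web_sm` model has been downloaded.

```python
import importlib.util


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Import names for: python-docx, odfpy, openpyxl, pdfplumber, spacy.
required = ["docx", "odf", "openpyxl", "pdfplumber", "spacy"]

missing = missing_modules(required)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All required modules are importable.")
```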