GitHub - helanabi/colcat: A CLI tool for merging an arbitrary number of Excel/CSV files

Overview

A tool that combines messy Excel/CSV files into a clean, unified sheet

Demo

./colcat.py -m mapping.json sample_data/file*

Features

Takes an arbitrary number of Excel/CSV files
Automatic column alignment
Automatic schema union
Empty rows cleanup
Custom column mapping
Optional summary sheet
Optional source tracking column

Installation

Download the latest version
unzip colcat-0.1.0.zip
cd colcat-0.1.0
pip install -r requirements
chmod +x colcat.py
Use ./colcat.py -h for usage help

Usage

usage: colcat [-h] [-b] [-m JSON] [-n NAME] [-o OUTPUT] [-r] [-s] [-v]
              file [file ...]

cat(1) but for columns in Excel/CSV files

positional arguments:
  file                  input data files

options:
  -h, --help            show this help message and exit
  -b, --verbose
  -m, --mapping JSON    JSON file mapping column names
  -n, --sheet-name NAME
  -o, --output OUTPUT   output file name
  -r, --summary         add a summary sheet
  -s, --source          add a column for row source file
  -v, --version         show program's version number and exit

Column Mapping file

A JSON file mapping column names can be specified in the command line using the -m/--mapping option. Valid mapping files must have the following format:

{
  "Column1": ["alternative name1", ...],
  ...
}

Column1: this is the name that will be used in the output file, its case is conserved but column name matching is case-insensitive.
alternative name1: an arbitrary number of alternative names to Column1 can be specified in a list, these alternative names must be in lower case, in order to perform a case-insensitive matching.

See mapping.json for an example.

Exit codes

2: usage error
3: filesystem error
4: invalid input file
9: unknown error

LICENSE

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
sample_data		sample_data
CHANGELOG		CHANGELOG
LICENSE		LICENSE
README.md		README.md
colcat.py		colcat.py
demo.gif		demo.gif
mapping.json		mapping.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Demo

Features

Installation

Usage

Column Mapping file

Exit codes

LICENSE

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Overview

Demo

Features

Installation

Usage

Column Mapping file

Exit codes

LICENSE

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages