HothoLina/python-automation-csv-cleaner

Automated CSV Cleaning & Summary Generator

Overview

This project is a Python automation utility that cleans raw CSV files and generates structured outputs for analysis and reporting. The script keeps numeric values and missing data intact during cleaning and applies presentation formatting only to a separate display copy.

Features

  • Loads CSV files and reads column names from the header row automatically
  • Removes fully empty rows
  • Trims extra spaces in text columns
  • Preserves missing values for data integrity
  • Keeps numeric columns as numbers in the cleaned data (formatting is applied only to the display copy)
  • Generates:
    • Cleaned CSV file
    • Display-ready CSV with currency formatting
    • Summary report text file
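
The cleaning steps listed above could be sketched in pandas roughly as follows. This is a minimal illustration rather than the project's actual script: the function name `clean_csv`, the sample column names, and the in-memory sample data are all assumptions made for the example.

```python
import io

import pandas as pd


def clean_csv(source) -> pd.DataFrame:
    """Hypothetical helper: load a CSV and apply the cleaning steps."""
    # pandas treats the first row as the header by default.
    df = pd.read_csv(source)

    # Drop only the rows in which every column is missing.
    df = df.dropna(how="all").reset_index(drop=True)

    # Trim surrounding whitespace in non-numeric columns.
    # Missing values are passed through unchanged, and numeric columns are skipped.
    for col in df.columns:
        if not pd.api.types.is_numeric_dtype(df[col]):
            df[col] = df[col].map(lambda v: v.strip() if isinstance(v, str) else v)

    return df


# Sample input (made up for illustration): padded text, one empty row,
# and one row with a missing amount.
raw = io.StringIO(
    "name,amount\n"
    "  Alice ,10.5\n"
    ",\n"
    "Bob  ,\n"
)
cleaned = clean_csv(raw)
print(cleaned)
```

In this sketch the fully empty row is dropped, while the row with a missing amount is kept and its amount remains a missing value.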

Technologies Used

  • Python
  • Pandas
  • Google Colab

Output Files

  • cleaned_data.csv — cleaned, analysis-ready data
  • cleaned_data_display.csv — formatted for presentation
  • summary_report.txt — summary of cleaning operations
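
One way the three files might be produced from a cleaned DataFrame is sketched below. Only the file names come from this README. The helper names `format_currency` and `write_outputs`, the `$` symbol with two decimal places, the contents of the summary report, and the sample data are assumptions for illustration.

```python
import tempfile
from pathlib import Path

import pandas as pd


def format_currency(value) -> str:
    """Hypothetical formatter: missing values become an empty string."""
    # The "$" symbol and two decimal places are assumptions.
    return "" if pd.isna(value) else f"${value:,.2f}"


def write_outputs(df, rows_loaded, currency_cols, out_dir):
    """Hypothetical helper: write the cleaned, display, and summary files."""
    out_dir = Path(out_dir)

    # 1. Analysis-ready file: numeric columns are written as plain numbers.
    df.to_csv(out_dir / "cleaned_data.csv", index=False)

    # 2. Presentation file: format a copy so the cleaned frame is not modified.
    display = df.copy()
    for col in currency_cols:
        display[col] = display[col].map(format_currency)
    display.to_csv(out_dir / "cleaned_data_display.csv", index=False)

    # 3. Plain-text summary (the report layout here is an assumption).
    lines = [
        f"Rows loaded: {rows_loaded}",
        f"Rows kept: {len(df)}",
        f"Empty rows removed: {rows_loaded - len(df)}",
        "Missing values per column:",
    ]
    lines += [f"  {col}: {count}" for col, count in df.isna().sum().items()]
    (out_dir / "summary_report.txt").write_text(
        "\n".join(lines) + "\n", encoding="utf-8"
    )


# Sample data (made up for illustration), written to a temporary directory.
cleaned = pd.DataFrame({"name": ["Alice", "Bob"], "amount": [1234.5, None]})
out_dir = tempfile.mkdtemp()
write_outputs(cleaned, rows_loaded=3, currency_cols=["amount"], out_dir=out_dir)
```

Formatting a copy keeps the currency strings out of the analysis-ready file, so later calculations can still treat the amounts as numbers.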

Learning Outcomes

  • Data cleaning automation
  • Safe handling of numeric and text data
  • Separation of processing vs presentation
  • Real-world data preparation practices

Status

Completed ✅
Part of a learning portfolio focused on Cloud, Automation, Data, and AI integration.
