This project is a Python automation utility that cleans raw CSV files and generates structured outputs for analysis and reporting. The script focuses on safe data handling and clear documentation.
- Loads CSV files using headers automatically
- Removes fully empty rows
- Trims extra spaces in text columns
- Preserves missing values for data integrity
- Keeps numeric data unchanged internally
- Generates:
- Cleaned CSV file
- Display-ready CSV with currency formatting
- Summary report text file
- Python
- Pandas
- Google Colab
cleaned_data.csv— cleaned, analysis-ready datacleaned_data_display.csv— formatted for presentationsummary_report.txt— summary of cleaning operations
- Data cleaning automation
- Safe handling of numeric and text data
- Separation of processing vs presentation
- Real-world data preparation practices
Completed ✅
Part of a learning portfolio focused on Cloud, Automation, Data, and AI integration.