This repository contains implementations of several data science and data processing algorithms based on exercises from the book Algorithms for Data Science by Brian Steele, John Chandler, and Swarna Reddy.
The exercises focus on working with large real-world datasets and implementing techniques such as data reduction, aggregation, and similarity analysis.
Data reduction and dictionary-based processing on large datasets.
Analysis of election contribution datasets and aggregation of contributions by party and state.
Large dataset processing and statistical analysis using FEC contribution datasets.
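As a minimal sketch of the dictionary-based reduction and aggregation described above, the snippet below sums contribution amounts per (party, state) pair in one pass. The record layout of `(party, state, amount)` tuples, the function name `aggregate_contributions`, and the sample values are assumptions made for illustration. They do not reflect the actual FEC file format or the code in this repository.

```python
from collections import defaultdict


def aggregate_contributions(records):
    """Reduce (party, state, amount) records to totals keyed by (party, state).

    Hypothetical helper: the three-field record layout is an assumption,
    not the real FEC column layout.
    """
    totals = defaultdict(float)
    for party, state, amount in records:
        # One dictionary entry per (party, state) pair, so memory grows
        # with the number of distinct pairs rather than the number of records.
        totals[(party, state)] += amount
    return dict(totals)


# Made-up sample records for illustration only, not real FEC data.
sample = [
    ("DEM", "MT", 250.0),
    ("REP", "MT", 100.0),
    ("DEM", "MT", 50.0),
    ("REP", "TX", 500.0),
]

totals = aggregate_contributions(sample)
for (party, state), amount in sorted(totals.items()):
    print(party, state, amount)
```

Because `records` can be any iterable, the same function could consume a generator that reads a large file line by line, so the full dataset would not need to be held in memory.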
The assignments use the FEC election contributions datasets (2012–2014):