nateeatsrice/README.md

👋 Hi, I'm Nathaniel!

I'm currently based in Bethesda, MD, originally from NYC and Utah. I spent several years in Northern Virginia working as a Risk and Controls Analyst before pivoting into data science. That journey began with coursework at Georgetown University, which sparked a deeper interest in the math behind machine learning and led me to complete a Master’s in Applied Statistics at CSULB.

My current focus is on computer vision research, specifically improving object detection performance on class-imbalanced datasets using Generative AI. I'm especially interested in how synthetic data can be used to augment limited samples and enhance model robustness.

Before returning to school, I took some time to reset and reconnect with nature, completing two long-distance thru-hikes: the Arizona Trail (which was amazing until the rattlesnakes came out in April 🐍) and the Oregon Coast Trail 🐳 🌲.

This GitHub is where I share my research, side projects, and ongoing experiments in data science, deep learning, and computer vision.

📫 Let’s connect or collaborate! The easiest way to reach me is via email.

Pinned

  1. fraud-detection Public

    A complete workflow for detecting and managing multiple types of fraud risk within a synthetic transaction dataset.

    Jupyter Notebook

  2. data-pipeline Public

    A production-grade batch data engineering pipeline that ingests NYC TLC taxi trip data and NOAA weather data, transforms it through a medallion architecture, and produces feature tables for data sc…

    Python

  3. nyc-taxi-demand Public

    End-to-end MLOps platform for NYC taxi demand forecasting using Metaflow, AWS Batch (spot), EKS, MLflow, Terraform.

    Python

  4. dfast-stress-testing Public

    A production-grade ML project that replicates Dodd-Frank Act Stress Testing (DFAST) using Fannie Mae's publicly available Multifamily Loan Performance data.

    Jupyter Notebook