Skip to content
View Wilmar3752's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Wilmar3752

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Wilmar3752/README.md

Profile Views


About Me

  • M.Sc. in Statistics — Universidad del Valle, Colombia
  • Specialized in: survival analysis, predictive modeling, MLOps, and data engineering
  • Build end-to-end ML pipelines: from feature engineering and model training to REST APIs and containerized deployment
  • Currently working with: Databricks (medallion architecture), FastAPI, Docker, DVC
  • Also an instructor — teaching Statistics, Python, and Machine Learning at university level since 2019
  • Ask me about: statistical modeling, MLOps pipelines, goodness-of-fit testing, or Python/R packages

Tech Stack

Languages

Python R SQL Bash

ML & Data Science

scikit-learn Pandas NumPy XGBoost SciPy

MLOps & Infrastructure

Docker FastAPI DVC Databricks GitHub Actions

Data & Cloud

AWS PostgreSQL MongoDB

Apps & Visualization

Streamlit Gradio HuggingFace


Featured Projects

Project Description Tech
pdist Python package to automatically identify the best-fit probability distribution. Implements KS, Anderson-Darling & Chi-square goodness-of-fit tests, AIC/BIC criteria, and visualizations. Supports 9 continuous + 4 discrete distributions. Python · SciPy · Matplotlib
itseries R package for analyzing irregularly spaced stochastic processes — built during M.Sc. research in Statistics. R
car_predict End-to-end ML pipeline for car price prediction with model versioning, REST API, and containerized deployment. Python · DVC · FastAPI · Docker
ETL_scraper Automated ETL pipeline extracting vehicle data, processing with Python, and loading to AWS S3 via CI/CD. Python · AWS S3 · GitHub Actions
cluster-app Interactive web app for credit card customer segmentation using unsupervised learning, deployed on HuggingFace Spaces. Python · Gradio · HuggingFace
meli_scrapper Web scraper for all products published on Mercado Libre, containerized and deployed with CI/CD. Python · Docker · CI/CD
pptex Docker-based toolkit for generating LaTeX presentations and reports on any OS — no LaTeX installation required. Supports pdflatex, xelatex, lualatex, and watch mode. Docker · LaTeX · Shell

GitHub Stats

GitHub Streak

Activity Graph


Connect

LinkedIn Website Email

Pinned Loading

  1. survey-etl survey-etl Public

    This project extract data from survey123, transform and loads to database

    Python 1

  2. meli_scrapper meli_scrapper Public

    Web scrapping proyect for predict vehicle prices

    Python 20 1

  3. ETL_scraper ETL_scraper Public

    ETL for predict car prices in Colombia

    Jupyter Notebook 2

  4. car_predict car_predict Public

    Machine Learning Model to predict car prices

    Jupyter Notebook 3

  5. car_predict_app car_predict_app Public

    APP for ML model

    Python 4