Skip to content
View cuburt's full-sized avatar

Block or report cuburt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
cuburt/README.md

Whats poppin? πŸ‘‹

🧠 "AI Researcher" | Machine Learning Engineer | Generative AI Specialist

(quotes intentional, impostor syndrome included at no extra cost)

I’m a burnt-out programmer living at the edge of generative modeling and decision science, where GPUs scream, losses explode, and convergence is a suggestion. These days I’m obsessed with making SLMs/LLMs scale without crying, pushing Diffusion Transformers (DiT) past their U-Net era, and applying Bayesian methods so my models can say β€œI’m not sure” with confidence.


πŸš€ Technical Focus

  • LLMs & SLMs: Designing robust pipelines for Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) to align models with complex reasoning tasks.
  • Bayesian Forecasting: Leveraging Gaussian Process Regression (GPR) for uncertainty-aware predictions in high-stakes environments like infrastructure management.
  • Time-Series Hybridization: Combining classical VARIMA models with Deep Learning (MLP) for multi-variable forecasting.
  • Diffusion Transformers (DiT): Investigating scalable transformer-based backbones for latent diffusion, moving beyond U-Net architectures for higher-fidelity generation.
  • RAG Systems: Developing Multimodal RAG architectures capable of synthesizing insights from variant data sources, including structured tables, unstructured text, and visual assets, utilizing cross-modal embeddings for holistic retrieval.
  • Post-Training & Alignment: Engineering iterative workflows for model distillation and preference alignment to optimize Small Language Models (SLMs) for efficient, local, resource-constrained inference.

πŸ› οΈ Tech Stack

  • Languages: Python Jupyter Notebook Shell Script
  • AI/ML Frameworks: PyTorch Scikit-Learn Hugging Face
  • Infrastructure: Docker Git CUDA

πŸ“‚ Featured Projects

Project Description
gpr-slippage-forecasting A Bayesian framework using Gaussian Process Regression to predict and quantify uncertainty in infrastructure project slippage.
project-crimex A comprehensive analytical platform for crime data visualization and predictive modeling using hybrid ML approaches.
varima-nn Hybrid forecasting system combining Vector ARIMA with Neural Networks for multivariate data.
llm-wrappers A versatile chatbot framework supporting local inference and code interpretation.
llm-local-setup Optimized configurations and scripts for running high-performance local LLM inference and SLM training using CUDA/MPS.
cuburt/ai-toolkit Implementation of DPO and GRPO training loops for aligning Diffusion Transformer (DiT) to specific preferences.

πŸ“– Research Highlights

  • Gaussian Process Regression (GPR): Published work on using GPR for project slippage forecasting, providing probabilistic intervals for better project governance.
  • Diffusion Scalability: Researching the transition from convolutional denoising to attention-based denoising in latent spaces.
  • Tabular & Multimodal RAG: Developing methods to improve LLM reasoning over sparse, large-scale financial tables using semantic chunking.
  • LLM Scalability & Efficiency: Investigating quantization-aware training, KV-cache optimization, and distillation techniques to maintain high-reasoning performance in Small Language Models (SLMs) for edge deployment.
  • Bio-plausibility in ML: Exploring neural architectures inspired by predictive coding and synaptic plasticity to develop more efficient, lifelong learning systems that mimic biological processing.

πŸ“Š GitHub Stats

Cuburt's GitHub stats Top Langs

"Advancing AI through Bayesian uncertainty, Transformer scalability, and an unhealthy relationship with PyTorch."

Pinned Loading

  1. varima-nn varima-nn Public

    Multivariable Forecast with Vector ARIMA + MLP

    Jupyter Notebook

  2. project-crimex project-crimex Public

    Time Series Forecast of PH Regional Crime Rate using Gaussian Process Regression

    Python

  3. llm-wrappers llm-wrappers Public

    Multi-domain Chatbot. Code Interpreter. UI Critique

    Jupyter Notebook

  4. llm-local-setup llm-local-setup Public

    local language model setup with cuda and mps for inference and slm training

    Jupyter Notebook

  5. GAN GAN Public

    just GANs

    Jupyter Notebook

  6. ai-toolkit ai-toolkit Public

    Forked from ostris/ai-toolkit

    The ultimate training toolkit for finetuning diffusion models. With Diffusion-DPO and GRPO support

    Python