harsharajkumar/README.md

Harsha Raj Kumar


Portfolio LinkedIn Twitter Email


About

I'm a first-year MS Computer Science student at Vanderbilt University and a Research Assistant at the VU-BEAM Lab, where I work on self-supervised medical imaging using contrastive learning. My work sits at the intersection of research and production — I care about both the theory and actually shipping things.

My current research focus is DAC-Learn (Depth-Aware Contrastive Learning), a novel self-supervised beamforming method for ultrasound imaging evaluated on the PICMUS benchmark. Outside the lab, I build production-grade systems — an API gateway handling 25k+ req/s, a full crisis operating platform on PostGIS, and an AI agent evaluation environment.

Open to Summer 2026 internships — SDE, ML Engineering, and Research roles.


Research

  • DAC-Learn: Depth-Aware Contrastive Learning for ultrasound beamforming. A novel self-supervised method evaluated on the PICMUS benchmark against SimCLR, BYOL, CycleGAN, and supervised baselines. Ongoing, targeting a top-tier ML venue.
  • VU-BEAM Lab: 85% improvement in image contrast (CNR: 3.5+ dB) using SimCLR with a custom NT-Xent loss on 10,000+ unlabeled cardiac ultrasound images. Outperforms CycleGAN baselines by 20% on held-out clinical data.
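
For readers unfamiliar with the loss mentioned above, here is a minimal sketch of the standard NT-Xent objective used by SimCLR. It is not the custom variant from the lab work; the function names, the pure-Python vectors, and the default temperature are illustrative assumptions.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def nt_xent(z_a, z_b, temperature=0.5):
    """Mean NT-Xent loss over positive pairs (z_a[i], z_b[i]).

    z_a and z_b hold embeddings of two augmented views of the same batch.
    The temperature default is an assumed value, not one from the project.
    """
    n = len(z_a)
    z = list(z_a) + list(z_b)  # 2N embeddings in one list
    total = 0.0
    for i in range(2 * n):
        pos = (i + n) % (2 * n)  # index of the other view of sample i
        logits = [cosine(z[i], z[k]) / temperature
                  for k in range(2 * n) if k != i]
        # Numerically stable log-sum-exp over every other sample.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - cosine(z[i], z[pos]) / temperature
    return total / (2 * n)
```

The loss drops when the two views of each image sit close together and away from the other images in the batch, which is the property the contrastive pretraining relies on.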

Featured Projects

FernOS — Crisis OS for Nashville

Full-stack crisis platform built around a shared live PostGIS graph. Citizens get safe routing via a modified Dijkstra with time-decaying edge weights. Responders get crew sequencing, XGBoost-predicted circuit failures, and Claude-powered NLP triage, all synchronized in real time via Socket.io.
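
The routing idea can be sketched as follows. This is an illustration under stated assumptions, not the FernOS implementation: the edge tuple layout, the exponential half-life decay, and the names `edge_cost` and `safest_path` are all hypothetical. Costs are evaluated once at query time, so they stay non-negative and plain Dijkstra remains valid.

```python
import heapq
import math

def edge_cost(base, hazard, reported_at, now, half_life=600.0):
    """Base travel cost plus a hazard penalty that halves every `half_life` seconds."""
    age = max(0.0, now - reported_at)
    return base + hazard * 0.5 ** (age / half_life)

def safest_path(graph, source, target, now):
    """Dijkstra over edge costs evaluated at time `now`.

    graph: {node: [(neighbor, base, hazard, reported_at), ...]}
    Returns (total_cost, path), or (math.inf, []) if target is unreachable.
    """
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == target:
            break
        if d > dist.get(u, math.inf):
            continue  # stale heap entry
        for v, base, hazard, reported_at in graph.get(u, []):
            nd = d + edge_cost(base, hazard, reported_at, now)
            if nd < dist.get(v, math.inf):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if target not in dist:
        return math.inf, []
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return dist[target], path[::-1]
```

With a fresh hazard report on one street, the router detours around it; once the report has aged through several half-lives, the shorter street becomes preferable again.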

TypeScript, PostGIS, Socket.io, XGBoost, Claude API, FastAPI

Repo

High-Performance API Gateway

Production-grade Node.js gateway sustaining 25,000+ req/s with P99 latency under 15 ms. Includes a Google SRE error-budget model, distributed tracing with a waterfall flame-graph view, an EWMA PID adaptive rate limiter, a circuit-breaker state machine, and a 10-tab live admin dashboard. Deployed on AWS ECS.
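
A circuit-breaker state machine of the kind listed above can be sketched in a few lines. The gateway itself is Node.js; this Python version only illustrates the state transitions, and the class name, thresholds, and injectable `clock` are assumptions rather than the gateway's actual API.

```python
import time

class CircuitBreaker:
    """CLOSED -> OPEN after `max_failures` failures;
    OPEN -> HALF_OPEN once `reset_after` seconds have passed;
    HALF_OPEN -> CLOSED on success, back to OPEN on failure."""

    def __init__(self, max_failures=3, reset_after=5.0, clock=time.monotonic):
        self.max_failures = max_failures
        self.reset_after = reset_after
        self.clock = clock  # injectable so tests can control time
        self.state = "CLOSED"
        self.failures = 0
        self.opened_at = 0.0

    def allow(self):
        """Return True if a request may be forwarded upstream."""
        if self.state == "OPEN" and self.clock() - self.opened_at >= self.reset_after:
            self.state = "HALF_OPEN"  # probe traffic is allowed until a result is recorded
        return self.state != "OPEN"

    def record_success(self):
        self.state = "CLOSED"
        self.failures = 0

    def record_failure(self):
        self.failures += 1
        if self.state == "HALF_OPEN" or self.failures >= self.max_failures:
            self.state = "OPEN"
            self.opened_at = self.clock()
```

A caller checks `allow()` before forwarding a request and reports the outcome with `record_success()` or `record_failure()`.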

Node.js, Redis, Prometheus, Docker, AWS ECS

Repo

Paper2Story — IEEE Best Paper Award 🏆

Multi-model NLP pipeline across 7 fine-tuned transformers (including TinyLLaMA, Mistral 7B, and ProphetNet) with LoRA fine-tuning, instruction tuning, and W&B experiment tracking. Achieved a 15% ROUGE/BLEU improvement over baselines across 200+ documents. Awarded Best Paper at an IEEE international conference.
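
As background on the LoRA technique named above, the sketch below shows the low-rank update W + (alpha / r) · B · A and why it trains far fewer parameters than full fine-tuning. It uses plain Python lists instead of PyTorch, and the helper names and the 4096-wide example are illustrative assumptions, not details from the Paper2Story code.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def lora_weight(w, a, b, alpha):
    """Effective weight W + (alpha / r) * B @ A.

    a has shape (r, d_in) and b has shape (d_out, r); r is the LoRA rank.
    Only a and b are trained, w stays frozen.
    """
    r = len(a)
    delta = matmul(b, a)
    scale = alpha / r
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]

def trainable_params(d_out, d_in, r):
    """(LoRA parameters, full fine-tuning parameters) for one weight matrix."""
    return r * (d_in + d_out), d_out * d_in
```

For a hypothetical 4096 x 4096 projection at rank 8, `trainable_params(4096, 4096, 8)` gives 65,536 LoRA parameters against 16,777,216 for the full matrix.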

PyTorch, HuggingFace, LoRA, W&B, Mistral 7B

Repo

CleanOps OpenEnv

Realistic benchmark environment for evaluating AI agents on multi-step business data-cleaning workflows. The agent interacts with messy CRM, orders, and payments tables via typed env APIs and receives deterministic scores with reward shaping for partial credit. Includes task design, grading logic, tests, and a deployable server.
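
To make "deterministic scores with partial credit" concrete, here is one way such a grader could look. It is a hypothetical sketch, not the CleanOps grading logic: the `grade` function, the cell-level scoring, and the 0.1 penalty per leftover row are all assumptions.

```python
def grade(expected_rows, submitted_rows, key="id", extra_row_penalty=0.1):
    """Deterministic score in [0, 1].

    Partial credit: the fraction of expected non-key cells the agent got right.
    Rows the agent should have dropped cost `extra_row_penalty` each.
    """
    submitted = {row[key]: row for row in submitted_rows}
    expected_keys = {row[key] for row in expected_rows}
    correct = total = 0
    for row in expected_rows:
        got = submitted.get(row[key], {})
        for column, value in row.items():
            if column == key:
                continue
            total += 1
            correct += got.get(column) == value  # True counts as 1
    extra = len(set(submitted) - expected_keys)
    score = correct / total if total else 1.0
    return max(0.0, score - extra_row_penalty * extra)
```

Because the score depends only on the submitted table, the same agent output always receives the same reward, and a half-cleaned table earns more than an untouched one.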

Python, FastAPI, Agent Eval, Reward Shaping

Repo Demo

API Security Scanner — IEEE Published

Fine-tuned Code Llama 7B with QLoRA on a custom 10,000-sample dataset covering OWASP API Top 10 across 10+ languages. 3-stage pipeline: structural endpoint discovery → LLM inspection → fuzzy OpenAPI policy validation. Streamlit dashboard + CLI.
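
The third stage, fuzzy matching of discovered endpoints against an OpenAPI spec, could look roughly like this. It is a sketch using `difflib` from the standard library; the scanner's real matching rules are not described here, so the normalization, the 0.8 threshold, and the function names are assumptions.

```python
from difflib import SequenceMatcher

def normalize(path):
    """Lowercase, drop outer slashes, and collapse path parameters to `{}`."""
    segments = []
    for seg in path.lower().strip("/").split("/"):
        if seg.startswith(("{", ":")) or seg.isdigit():
            seg = "{}"  # treat {id}, :id and literal numeric ids alike
        segments.append(seg)
    return "/" + "/".join(segments)

def match_endpoint(discovered, spec_paths, threshold=0.8):
    """Return (best_spec_path, ratio); best_spec_path is None below `threshold`."""
    target = normalize(discovered)
    best, best_ratio = None, 0.0
    for candidate in spec_paths:
        ratio = SequenceMatcher(None, target, normalize(candidate)).ratio()
        if ratio > best_ratio:
            best, best_ratio = candidate, ratio
    return (best if best_ratio >= threshold else None), best_ratio
```

An endpoint that matches no documented path above the threshold, such as an undocumented debug route, is the kind of finding a policy check would flag.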

Python, QLoRA, Code Llama 7B, Streamlit, OWASP

Repo Paper

BridgeAI — Real-Time Dispatch

React Native + Node.js mutual-aid dispatch platform. Intake flow, volunteer matching, OTP-verified handoff, charity fallback, and Socket.io real-time sync across mobile clients. JWT-secured REST API with session-based auth.

React Native, Node.js, Socket.io, MongoDB, JWT

Repo


Stack

Languages

Python, TypeScript, Java, JavaScript, C++, SQL

AI / ML

PyTorch, TensorFlow, HuggingFace, scikit-learn, XGBoost, W&B

Backend & Infrastructure

Node.js, FastAPI, Redis, PostgreSQL, Docker, AWS, Prometheus



Publications & Awards

  • 🏆 IEEE Best Paper Award — Paper2Story: Multi-Model Narrative Generation Pipeline (2024)
  • 📄 IEEE Published: Hybrid API Vulnerability Detection System (2025)
  • 🔬 Ongoing Research — DAC-Learn: Depth-Aware Contrastive Learning for Ultrasound Beamforming (targeting top-tier ML venue)

Currently

  • 🔬 Developing DAC-Learn — novel depth-aware contrastive learning for ultrasound beamforming at VU-BEAM Lab
  • 🏗️ Building a distributed task queue and job-matching RAG pipeline
  • 📖 Grinding NeetCode 150 daily for SDE interviews
  • 🤝 Open to Summer 2026 internships — SDE, ML Engineering, Research


Pinned

  1. American-Sign-Language-Fingerspelling-Recognition (Public)

    Detect and translate American Sign Language (ASL) fingerspelling into text.

    Jupyter Notebook

  2. Deepfake-Detection (Public)

    This repository provides a deepfake detection toolkit leveraging CNNs and transfer learning. It guides you through building, training, and evaluating deepfake detection models. The code offers flex…

    Jupyter Notebook

  3. Paper2Story (Public)

    Paper2Story is a pipeline that converts complex research PDFs into concise summaries and then into engaging, easy-to-understand stories using LLMs. Ideal for students, educators, and content creato…

    Python

  4. graph-algorithms-benchmark (Public)

    A comprehensive Java project benchmarking classical graph algorithms (BFS, DFS, Dijkstra, Bellman-Ford, A*, Max Flow, Bipartite Check) on various graph sizes and densities. Includes result analysis…

    Java