I'm a third-year B.Tech Computer Science (Data Science) student at VIT Chennai (CGPA 9.23/10), where research and engineering meet on my keyboard every day. My world sits at the intersection of machine learning research, full-stack engineering, and cloud-native systems โ I publish papers and ship production code.
I authored a Springer peer-reviewed paper on AI-driven lung cancer prognosis (ICT4SD 2025), where I designed the Clinical Readiness Score (CRS) โ a novel evaluation framework using the Analytic Hierarchy Process (AHP) to weigh interpretability, efficiency, clinical validation, and accuracy across CNNs, RNNs, RF, and SVMs on the LIDC-IDRI and TCGA datasets.
When I'm not in research mode, I architect AI-powered systems like LifeMemory AI (RAG + LangGraph + pgvector) and cloud-native platforms on AWS โ and lead operations as Financial Lead at ACM-W VIT, having run 4 hackathons for 250โ300+ participants each.
- ๐ฌ Currently: building LifeMemory AI โ a privacy-first journal with multi-step LangGraph reasoning
- ๐ฑ Learning: advanced distributed systems, retrieval-augmented architectures, MLOps
- ๐ฏ Open to: research collaborations, ML/SWE internships, open-source contributions
- ๐ฌ Ask me about: RAG pipelines, AHP-based evaluation, AWS 3-tier architectures, pgvector
- โก Fun fact: I once analyzed ~4.8M UIDAI records and turned them into actionable district-level policy insights
๐ฌ Research ใป โก Engineering ใป โ๏ธ Cloud ใป ๐ Data Science ใป ๐ง LLM Systems
Four projects spanning AI-powered systems, cloud-native platforms, and data-science research at scale.
A privacy-first journaling platform that uses Retrieval-Augmented Generation to help users explore their own memories like a conversation with their past self.
- ๐งฌ RAG pipeline combining semantic search, metadata filtering, and temporal prioritization
- ๐ธ๏ธ Multi-step LangGraph reasoning โ intent classification โ retrieval โ synthesis
- โก Async FastAPI backend with PostgreSQL + pgvector for vector similarity at scale
- ๐ JWT + Supabase Auth + Row-Level Security (RLS) โ privacy by design, not afterthought
- ๐ณ Docker-deployed with structured logging and monitoring
A full-stack social book platform with hybrid AI recommendations and real-time community features.
- ๐ฏ Hybrid recommendation engine built on PostgreSQL + pgvector (content + collaborative)
- ๐ฌ Real-time chat via WebSockets โ book clubs that actually talk
- ๐ Open Library API integration for a dynamic, ever-growing catalog
- ๐ JWT auth via Supabase with secure token refresh
- ๐งฑ Zustand state management + Axios interceptors for clean, resilient client-server contracts
A 3-tier AWS-native real-time pair-programming platform with live video and multi-language code execution.
- ๐๏ธ 3-tier AWS architecture โ S3 (static) + CloudFront (CDN) + Elastic Beanstalk (compute)
- ๐๏ธ DynamoDB with GSI/LSI indexing for high-throughput, low-latency reads
- ๐ฅ WebRTC video conferencing + collaborative code editor with operational transforms
- ๐ JWT + Role-Based Access Control (RBAC) โ owner / editor / viewer permissions
- ๐ Multi-language code execution via secure external sandbox APIs
- ๐ AWS CloudWatch monitoring with custom metrics and alarms
District-level stress-zone analytics on ~4.8M Aadhaar enrolment + update records with a custom metric and policy framework.
- ๐ฆ Cleaned and analyzed ~4.8M records across enrolment + update datasets
- ๐งฎ Designed a novel Service Stress Ratio metric for district-level diagnosis
- ๐ Performed univariate, bivariate, and trivariate statistical analysis
- ๐ฒ Random Forest classifier for stress-level categorization
- ๐๏ธ Built a policy recommendation framework for resource allocation
- ๐ Trend & persistence analysis โ separating systemic issues from transient spikes
This work introduces the Clinical Readiness Score (CRS) โ a structured, multi-criteria evaluation metric for AI models in lung cancer diagnosis. CRS combines interpretability, efficiency, clinical validation, and accuracy, with weights assigned via the Analytic Hierarchy Process (AHP) and validated for consistency. We benchmarked CNNs, RNNs, Random Forest, and SVM on the LIDC-IDRI and TCGA datasets, evaluating AUC, F1-score, sensitivity, and specificity, and provide AHP weight distributions, sensitivity analysis, and CRS factor contributions to support clinical adoption.
๐ Methodology Flow
flowchart LR
A[LIDC-IDRI / TCGA<br/>Datasets] --> B[Preprocessing<br/>+ SMOTE]
B --> C{Model Training}
C --> D[CNN]
C --> E[RNN]
C --> F[Random Forest]
C --> G[SVM]
D & E & F & G --> H[Metrics<br/>AUC ยท F1 ยท Sensitivity ยท Specificity]
H --> I[AHP Weighting<br/>Interpretability ยท Efficiency<br/>Clinical Validation ยท Accuracy]
I --> J{{Clinical Readiness Score<br/>CRS}}
J --> K[Sensitivity Analysis<br/>+ Recommendations]
| ๐ Hackathons Led | ๐ฅ Participants Served | ๐ผ Sponsor Partnerships | ๐ Tenure |
|---|---|---|---|
| 4 | 400+ | Multiple | Active |
๐ The snake auto-regenerates daily and on every push via the GitHub Actions workflow at
.github/workflows/snake.yml.










