Nikelroid/README.md

Hi, I'm Nima Kelidari 👋

AI / ML Engineer · MS Computer Science (AI) @ USC

Reinforcement Learning · Computer Vision · Large-Scale MLOps

🌐 Portfolio • 📄 CV • 💼 LinkedIn • ✉️ Email


🚀 About Me

I'm a Master's student in Computer Science (AI) at USC, with a BS from Sharif University of Technology. I build agents that learn under uncertainty and ship them on infrastructure that scales.

  • 🔭 Researching: adversarial co-evolution of RL and VLM/LLM agents
  • 🛠️ Recently shipped: PPO agents for imperfect-information games, MoE steering at inference time, probing frameworks for speech transformers
  • 🌱 Learning: ROS, control theory, advanced MLOps
  • 🤝 Open to collaborate on: robotics simulation, medical imaging
  • 💬 Ask me about: PPO and offline RL, computer vision, MLOps pipelines on GCP/AWS

🛠️ Tech Stack

Languages

ML & Deep Learning

RL & Simulation: Stable-Baselines3 · PettingZoo · Gymnasium · Ollama · vLLM

Data

MLOps & Cloud

Storage & Systems


📂 Featured Projects

| Project | What it does | Stack |
| --- | --- | --- |
| Risk-Scaled Steering in MoE | Token-aware steering for MoE LLMs: 3D delta tensors that dynamically scale expert activations to improve safety at inference time. | vLLM · PyTorch · HF |
| Linguistic-Agnostic SER | Probing framework that measures how speech-emotion transformers encode paralinguistic vs. acoustic information across hidden layers. | PyTorch · HF |
| Adversarial Co-Evolution | Trains PPO agents against LLM opponents in imperfect-information card games via curriculum learning and knowledge distillation. | PPO · Ollama |
| Multi-Modal Sentiment Classification | Sentiment analysis over image-text conversations with time-dynamics exploration of multimodal cues. | PyTorch · Pandas |

Replace the last row's link with the real repo URL; the original pointed to a Google search.


📊 GitHub

Pinned

  1. adversarial-coevolution (Public)

    Adversarial Co-Evolution of RL and LLM Agents: A framework for training high-performance PPO agents against Large Language Models in Gin Rummy, utilizing curriculum learning and knowledge distillat…

    Python 2

  2. multimodal-sentiment-classification (Public)

    A multimodal deep learning framework that fuses visual features from EfficientNet and textual features from BERT to classify sentiment in image-text conversations using the MSCTD dataset.

    Jupyter Notebook 9 2

  3. anime-recommender-application (Public)

    An end-to-end MLOps project implementing a deep learning-based anime recommendation system with automated CI/CD deployment to Google Kubernetes Engine.

    Python

  4. image-compression-svd-fft (Public)

    A Python implementation of lossy image compression and reconstruction utilizing Singular Value Decomposition (SVD) and Fast Fourier Transform (FFT) techniques for optimal storage efficiency.

    Jupyter Notebook 3

  5. artist-explorer-app (Public)

    A comprehensive full-stack web and Android application for exploring artists and artworks using the Artsy API, featuring JWT authentication, MongoDB favorites, and an experimental AI assistant.

    HTML

  6. mysql-metadata-manager (Public)

    A Python-based GUI application using Tkinter and MySQL to reverse-engineer, manage, and export database schema metadata.

    Jupyter Notebook 3