Skip to content
View aminfadaei116's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report aminfadaei116

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aminfadaei116/README.md

Hi, I'm Amin Fadaeinejad

ML Researcher | Video Diffusion · 3D/4D Generation/Reconstruction · VLM

I am a Machine Learning Researcher at Huawei's 3D Vision Team. My current focus is on video diffusion models — specifically DiT-based architectures like Wan2.1, and HunyuanVideo for various tasks such as object removal, video inpainting, and model distillation. I work with both bidirectional (full-attention) and autoregressive generation paradigms, and I implement flow matching at the training level, not only as an inference scheduler.

On the 3D side, I continue to work with 3D and 4D Gaussian Splatting methods (4D-GS, SC-GS, SpacetimeGaussians) and feed-forward reconstruction pipelines (DUSt3R, MASt3R, InstantSplat). My earlier work at Ubisoft La Forge focused on geometry-aware texture synthesis and differentiable shading for digital avatars.

I completed my Master's at York University (BioMotion Lab), advised by Prof. Niko Troje and Prof. Marcus A. Brubaker.


Research Interests

  • Video Diffusion Models: DiT-based architectures, bidirectional and autoregressive generation, flow matching, distillation (consistency models, flow shortcuts), ControlNet-style conditioning with pose/depth/segmentation signals
  • 3D / 4D Reconstruction: 3D and 4D Gaussian Splatting, feed-forward reconstruction, novel view synthesis
  • Generative AI for Faces & Avatars: Talking head generation, face reenactment, geometry-aware texture generation, differentiable shading
  • Vision-Language Models: CLIP, SigLIP, Florence — used as conditioning or semantic supervision in generation pipelines
  • Other CV: Anomaly detection, face swapping

Selected Publications

  • Geometry-Aware Texture Generation for 3D Head Modeling with Artist-driven Control CVPRW 2025 | Project Page | Paper

  • MoSAR: Monocular Semi-Supervised Model For Avatar Reconstruction Using Differentiable Shading CVPR 2024 | Project Page | Paper

Additional work in video diffusion currently under review (2026).


Tech Stack

  • Languages: Python, C/C++, MATLAB
  • Deep Learning: PyTorch, TensorFlow, JAX, Keras
  • Video & 3D: Diffusers, 3DGS, DUSt3R / MASt3R, ControlNet, LoRA / adapter fine-tuning
  • Training Infrastructure: Multi-GPU distributed training with FSDP and DeepSpeed
  • Libraries: OpenCV, NumPy, Pandas, Scikit-learn
  • Tools: Git, LaTeX, Linux

Pinned Loading

  1. cohere-nlp-challenge cohere-nlp-challenge Public

    Sentence Embedding using Bert model, by using different NLP approaches such as Universal sentence embedding, Natural Language Inference (NLI), and Contrastive Approach based for sentence embedding.

    Jupyter Notebook 2

  2. implicit-regularization-in-relu-networks-paper-code implicit-regularization-in-relu-networks-paper-code Public

    Reproducing the results in "Implicit Regularization in ReLU Networks with the Square Loss" using Matlab

    MATLAB

  3. SajjadPSavoji/CitiBikeNYC SajjadPSavoji/CitiBikeNYC Public

    A deep analysis of the bike sharing network of New York City

    HTML 3 1

  4. ai-deployment-bootcamp ai-deployment-bootcamp Public

    Forked from VectorInstitute/ai-deployment

    Python

  5. fitness-app fitness-app Public

    Python 1

  6. Danial-Kord/CaseLogic Danial-Kord/CaseLogic Public

    EvenUp Hackathon

    Python 4