gradient-masking

Here are 3 public repositories matching this topic...

ishaaqdev / Staged-Embarrassment-Learning-SEL

The most compute-efficient training framework for ResNet architectures. 99.1% FLOPs reduction. 100% Local.

computer-vision pytorch resnet cifar10 curriculum-learning edge-ai green-ai sparse-training model-optimization efficient-deep-learning gradient-masking

Updated May 28, 2026
Jupyter Notebook

Hinedes / Proteus

Code for "Matryoshka Plasticity: Exploiting Nested Transformer Structure for Zero‑Overhead Continual Learning"

machine-learning deep-learning edge-computing continual-learning catastrophic-forgetting edge-ai pretraining llm matformer nested-transformers gradient-masking

Updated May 3, 2026
Python

ishaaqdev / Staged-Embarrassment-Learning

Staged Embarrassment Learning (SEL) is a bio-inspired framework for efficient deep learning. Inspired by a child’s rapid correction after a mistake, SEL uses dynamic gradient sparsity to focus compute on high-loss "embarrassing" samples. It achieves up to 99% FLOPs reduction, making it ideal for Edge AI.

computer-vision pytorch resnet cifar10 curriculum-learning edge-ai green-ai sparse-training model-optimization efficient-deep-learning gradient-masking

Updated Apr 23, 2026
Jupyter Notebook
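
The SEL description above mentions focusing compute on high-loss samples through gradient sparsity. As a rough illustration of that general idea only, here is a minimal sketch in plain Python. It is not taken from the repository: the 1-D linear model, the function names (`per_sample_losses`, `top_loss_mask`, `masked_gradient`) and the `keep_fraction` parameter are all assumptions made for this example.

```python
# Hypothetical sketch of loss-based gradient masking; not the SEL repository's code.

def per_sample_losses(w, b, xs, ys):
    """Squared error of a 1-D linear model w*x + b for each sample."""
    return [(w * x + b - y) ** 2 for x, y in zip(xs, ys)]


def top_loss_mask(losses, keep_fraction):
    """Return a 0/1 mask that keeps the highest-loss fraction of samples."""
    k = max(1, int(len(losses) * keep_fraction))
    ranked = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    keep = set(ranked[:k])
    return [1 if i in keep else 0 for i in range(len(losses))]


def masked_gradient(w, b, xs, ys, mask):
    """Average squared-error gradient over the unmasked samples only."""
    kept = sum(mask)
    grad_w = sum(m * 2 * (w * x + b - y) * x for m, x, y in zip(mask, xs, ys)) / kept
    grad_b = sum(m * 2 * (w * x + b - y) for m, x, y in zip(mask, xs, ys)) / kept
    return grad_w, grad_b


# Toy data (assumed): the last sample is badly fit by w=1, b=0.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [0.0, 1.0, 2.0, 9.0]

losses = per_sample_losses(1.0, 0.0, xs, ys)
mask = top_loss_mask(losses, keep_fraction=0.25)
grad_w, grad_b = masked_gradient(1.0, 0.0, xs, ys, mask)
```

In this toy version the mask only zeroes out contributions, so it saves no work. A practical implementation would presumably skip the backward pass for masked samples entirely, which is where any FLOPs reduction would come from.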
