ASL Classifier

Overview

This repository contains the implementation of a deep learning model for real-time interpretation of American Sign Language (ASL) from video data using a novel architecture that combines Transformer models with TensorFlow Lite. Our system processes both spatial and temporal features of ASL gestures, focusing on accessibility enhancements for the deaf and hard-of-hearing community.

Project Details

How to Run it (Kaggle)

Since the dataset is around 40 GBs, we import the data from kaggle in a kaggle notebook and suggest the same for reproducing the code.

Go to https://www.kaggle.com/ and create a new notebook.
In the notebook, go to Files section, and import the dl_model.ipynb notebook.
In the right side of the notebook, add input and serach for Google - Isolated Sign Language Recognition or add the dataset from here.
Run all cells! Done :)

Authors

Kunal Thadani
Sakshi Goenka
Soumili Nandi

Abstract

The project uses advanced deep learning techniques to interpret ASL by processing video data with a hybrid approach that integrates Transformers and custom attention mechanisms. By leveraging extensive data preprocessing and robust Attention models, our goal is to improve communication tools for inclusivity.

Problem Statement

Automated interpretation of sign language presents challenges such as gesture variability and limited data availability, leading to significant communication barriers. Our project addresses these by focusing on gesture recognition, variability and adaptability, and enriching training data.

Dataset

The dataset used is from the Google-hosted Kaggle competition "Isolated Sign Language Recognition". It includes landmark data obtained via MediaPipe, mapped to ASL signs, with extensive metadata for training robust models.

Architecture

The core architecture involves:

Landmark Models: Utilizing MediaPipe Holistic for generating comprehensive sets of landmarks.
Normalization and Dominant Hand Correction: Adjusting data based on the detected dominant hand.
Embedding Layers: Robust feature representation using dense layers.
Transformer Blocks: Handling sequence processing for gesture recognition.
Pooling and Classification: Final gesture classification using a softmax layer.

Methodology

Data Preparation: Normalization and adjustment based on the dominant hand.
Model Training: Employing techniques like random masking and data augmentation for robustness.
Validation and Evaluation: Continuous performance evaluation during training using a separate validation set.

Results

The model achieves a validation accuracy of approximately 74%, indicating strong potential for real-time applications in ASL interpretation.

Acknowledgments

Thanks to Google for providing the dataset on Kaggle.
Special thanks to previous seminal works by Starner, Pentland, Cui et al., and others in the field of gesture recognition.

Contributing

Contributions are welcome! For major changes, please open an issue first to discuss what you would like to change. Please ensure to update tests as appropriate.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
extra_output_files		extra_output_files
README.md		README.md
dl_model.ipynb		dl_model.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASL Classifier

Overview

Project Details

How to Run it (Kaggle)

Since the dataset is around 40 GBs, we import the data from kaggle in a kaggle notebook and suggest the same for reproducing the code.

Authors

Abstract

Problem Statement

Dataset

Architecture

Methodology

Results

Acknowledgments

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ASL Classifier

Overview

Project Details

How to Run it (Kaggle)

Since the dataset is around 40 GBs, we import the data from kaggle in a kaggle notebook and suggest the same for reproducing the code.

Authors

Abstract

Problem Statement

Dataset

Architecture

Methodology

Results

Acknowledgments

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages