Skip to content

Suprov007/Clip-To-Detect-PII

Repository files navigation

CLIP2PII2

Description

This project creates a reliable framework using CLIP to detect different types of Personally Identifiable Information (PII) in images in a zero-shot manner, allowing sensitive data to be recognized and highlighted without extra training(Zero Shot). This project also compares the results of CLIP, Siglip, and CLIPSeg on the same test cases.
The code is designed to be run on Google Colab.

Practical Use Cases

CLIP can be used to detect PII in scanned documents (IDs, invoices), medical records, and financial screenshots by identifying sensitive visual patterns like names, numbers, or signatures. It also helps flag personal data in social media images (e.g., boarding passes, IDs) and prevents workplace data leaks from shared dashboards or internal tools. Additionally, it can support anonymization in street or surveillance images by detecting faces and license plates

Getting Started

To get started with this project, follow the instructions below.

Running on Google Colab

  1. Open your web browser and go to Google Colab.
  2. Click on File > Open notebook.
  3. Select the GitHub tab and enter the repository URL: https://mygit.th-deg.de/ai-project-summer-24/clip2pii2.git.
  4. Choose the notebook you want to run (e.g., CLIP_&_CLIPSeg.ipynb) and open it.
  5. Follow the instructions within the notebook to run the code. #
  6. Preferable change the runtime of the notebook to T4.

Dependencies

The required dependencies for this project are listed in the notebook. To ensure you have all the necessary packages, run the code lines in the same order of the notebook cells.

Resources

https://github.com/merveenoyan/siglip/blob/main/clip_siglip.py

About

This project uses OpenAI CLIP to detect different types of Personally Identifiable Information in images in a zero-shot manner.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors