Skip to content

rprabhat/chart-digitizer

Repository files navigation

📊 Chart Digitizer

Extract data points from chart images using machine learning and computer vision.

Stars License

🎯 What It Does

Chart Digitizer extracts numerical data from chart images (bar charts, line charts, scatter plots) using:

  • Machine Learning - CNN-based object detection for accurate data point identification
  • Computer Vision - OpenCV and Tesseract OCR for legend and axis detection

🖥️ Demo

Flow

🚀 Quick Start

# Clone the repository
git clone https://github.com/rprabhat/chart-digitizer.git
cd chart-digitizer

# Run the Jupyter Notebook
jupyter notebook

📋 Requirements

  • Python 3.x
  • Jupyter Notebook
  • OpenCV
  • Tesseract OCR
  • TensorFlow/PyTorch

📚 Documentation

See the Jupyter Notebook for:

  • Random chart generator for training data
  • CNN model training
  • Data extraction pipeline

🔗 References

📄 License

Apache License 2.0 - See LICENSE

👤 Author

Prabhat Ranjan

About

Extract data points from chart images using machine learning - Jupyter Notebook

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors