PDF Summarization App using GPT-2

This project is a Python-based application that extracts text from PDF files and generates summaries using the GPT-2 model from Hugging Face's transformers library. The summaries are saved in a text file for easy access.

Features

Extracts text from PDF files.
Summarizes large chunks of text using GPT-2.
Saves the summary in a separate .txt file.

Requirements

Ensure you have the following libraries installed:

Python 3.8+
PyPDF2 for extracting text from PDFs
transformers for loading the GPT-2 model
torch for running GPT-2 model
PyTorch (see PyTorch installation guide)

You can install the necessary dependencies using pip:

pip install PyPDF2 transformers torch

Usage

Clone the repository or download the code.
Place the PDF you want to summarize in the project folder.
Open the script file and change the path of the PDF in the main section to point to your file.
Run the script: python summarize_app.ipynb

Known Issues

Large PDF files: The summarization works by breaking large PDFs into chunks. For extremely large PDFs, the process might take a while.
Padding Token Warning: GPT-2 doesn't have a native padding token. This is handled by setting the pad_token to eos_token, but it's worth noting in case of any unexpected behavior.

License

This project is licensed under the MIT License. Feel free to use it and modify it as needed.

Contributing

Feel free to open issues or submit pull requests if you would like to contribute to the project!

Thank you, Ashen

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.ipynb_checkpoints		.ipynb_checkpoints
lec		lec
README.md		README.md
summerize_app.ipynb		summerize_app.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF Summarization App using GPT-2

Features

Requirements

Usage

Known Issues

License

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PDF Summarization App using GPT-2

Features

Requirements

Usage

Known Issues

License

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages