


repylot/GithubScrapper

GitHub Scrapper for RePylot

GitHub Scrapper for RePylot is a web scraper for GitHub that generates datasets of code files, which are later used to fine-tune GPT-2. In its current state, it can efficiently extract Python scripts from repositories, which makes it useful for preparing training data for machine learning and NLP models.
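
To make the idea of "extracting Python scripts into a dataset" concrete, here is a minimal sketch. It is not taken from this repository's code: it assumes the repository has already been downloaded to a local folder, and the function names (`collect_python_files`, `write_jsonl`) and the JSON Lines output format are hypothetical choices made for illustration only.

```python
import json
from pathlib import Path


def collect_python_files(repo_root: Path) -> list:
    """Return one record per .py file found under repo_root.

    Hypothetical helper, not part of GithubScrapper itself.
    """
    records = []
    for path in sorted(repo_root.rglob("*.py")):
        if not path.is_file():
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except UnicodeDecodeError:
            # Skip files that are not valid UTF-8 rather than failing the run.
            continue
        records.append(
            {"path": path.relative_to(repo_root).as_posix(), "content": text}
        )
    return records


def write_jsonl(records: list, out_file: Path) -> None:
    """Write records as JSON Lines, one code file per line (assumed format)."""
    with out_file.open("w", encoding="utf-8") as fh:
        for record in records:
            fh.write(json.dumps(record) + "\n")
```

Calling `write_jsonl(collect_python_files(Path("some_repo")), Path("dataset.jsonl"))` would produce one line per Python file. A fine-tuning pipeline could then read that file line by line, though the format the actual project uses may differ.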



© 2024 RePylot Code Generator
