Find the best datasets for intermediate fine-tuning
-
Updated
May 4, 2025 - Jupyter Notebook
Find the best datasets for intermediate fine-tuning
Official minimal release for LED: dataset scoring and compact data subset selection.
Code and data repository for selection of datasets for FAIRification in Drug Discovery and Development.
Add a description, image, and links to the dataset-selection topic page so that developers can more easily learn about it.
To associate your repository with the dataset-selection topic, visit your repo's landing page and select "manage topics."