Gap-K%: Measuring Top-1 Prediction Gap for Detecting Pretraining Data

🔍 Overview

This repository provides the official implementation of Gap-K%: Measuring Top-1 Prediction Gap for Detecting Pretraining Data (ACL'26). It includes experimental code for Gap-K% along with several baseline methods on the WikiMIA and MIMIR datasets. For the Neighbor baseline experiments on the MIMIR benchmark, we use the implementation provided here: https://github.com/zjysteven/mimir

⚙️ Environment

Our experiments are conducted under the following environment:

Python 3.10
PyTorch 2.7.1
CUDA 12.6

After setting up PyTorch, install the remaining dependencies:

pip install -r requirements.txt

🔐 Hugging Face Access

Please log in to Hugging Face before running the scripts:

huggingface-cli login

📁 Dataset

We conduct experiments on WikiMIA and MIMIR:

WikiMIA original: swj0419/WikiMIA
WikiMIA paraphrased: zjysteven/WikiMIA_paraphrased_perturbed
MIMIR iamgroot42/mimir

🤖 Models

We conduct WikiMIA experiments on a diverse set of large language models:

Mamba: state-spaces/mamba-1.4b-hf
Pythia: EleutherAI/pythia-6.9b, EleutherAI/pythia-12b
LLaMA: huggyllama/llama-13b, huggyllama/llama-65b

Note: LLaMA-65B is evaluated using INT8 inference.

For MIMIR experiments, we use Pythia 160M, 1.4B, 2.8B, 6.9B, and 12B.

🚀 Running

We provide SLURM job scripts to run all experiments:

wikimia.sh
Evaluates Loss, Zlib, Min-K%, Min-K%++, Gap-K% on WikiMIA.
wikimia_neighbor.sh
Evaluates Neighbor on WikiMIA.
mimir.sh
Evaluates Loss, Zlib, Min-K%, Min-K%++, Gap-K% on MIMIR.

After running the scripts, results will be saved to:

results/

🙏Acknowledgement

This implementation is based on the official codebase of Min-K%++.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
LICENSE		LICENSE
README.md		README.md
mimir.py		mimir.py
mimir.sh		mimir.sh
requirements.txt		requirements.txt
wikimia.py		wikimia.py
wikimia.sh		wikimia.sh
wikimia_neighbor.py		wikimia_neighbor.py
wikimia_neighbor.sh		wikimia_neighbor.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gap-K%: Measuring Top-1 Prediction Gap for Detecting Pretraining Data

🔍 Overview

⚙️ Environment

🔐 Hugging Face Access

📁 Dataset

🤖 Models

🚀 Running

🙏Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Gap-K%: Measuring Top-1 Prediction Gap for Detecting Pretraining Data

🔍 Overview

⚙️ Environment

🔐 Hugging Face Access

📁 Dataset

🤖 Models

🚀 Running

🙏Acknowledgement

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages