Time-Annealed Perturbation Sampling (TAPS) is an inference-time method for improving diversity in diffusion language models without sacrificing generation quality.
This repository contains the official implementation of TAPS and the code used to reproduce experiments reported in the paper.
*Figure: a conceptual comparison of the inference process between the base Diffusion-LM and TAPS, illustrating their different context-conditioning behaviors.*
This repository supports two diffusion language model backbones:
| Backbone | Hugging Face | Loader |
|---|---|---|
| LLaDA-8B-Instruct | [GSAI-ML/LLaDA-8B-Instruct](https://huggingface.co/GSAI-ML/LLaDA-8B-Instruct) | `transformers.AutoModel` |
| TraDo-8B-Instruct | [Gen-Verse/TraDo-8B-Instruct](https://huggingface.co/Gen-Verse/TraDo-8B-Instruct) | `transformers.AutoModelForCausalLM` |
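The table above can be read as a mapping from checkpoint to loader class. The helper below is a hypothetical sketch of that mapping, not code from this repository; in particular, `trust_remote_code=True` is an assumption (custom diffusion-LM architectures often require it), and any extra `from_pretrained` keyword arguments are left to the caller:

```python
# Hypothetical helper mapping each backbone to its transformers Auto class.
LOADERS = {
    "GSAI-ML/LLaDA-8B-Instruct": "AutoModel",
    "Gen-Verse/TraDo-8B-Instruct": "AutoModelForCausalLM",
}

def loader_name_for(repo_id: str) -> str:
    """Return the name of the transformers Auto class for a backbone."""
    return LOADERS[repo_id]

def load_backbone(repo_id: str, **kwargs):
    """Load a backbone with its matching Auto class (downloads weights)."""
    import transformers  # imported lazily so the mapping is usable offline
    loader = getattr(transformers, loader_name_for(repo_id))
    # trust_remote_code=True is an assumption for custom model code
    return loader.from_pretrained(repo_id, trust_remote_code=True, **kwargs)
```

For example, `load_backbone("GSAI-ML/LLaDA-8B-Instruct", torch_dtype="bfloat16")` would dispatch to `AutoModel.from_pretrained`.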
This project uses two separate Python environments:

- `llada`: for LLaDA-related experiments
- `trado`: for TraDo-related experiments
```bash
# Clone the repository
git clone https://github.com/Johnny221B/TAPS.git
cd TAPS

# Create the llada environment
python -m venv envs/llada
source envs/llada/bin/activate
pip install --upgrade pip
pip install -r requirements_llada.txt

# Create the trado environment
python -m venv envs/trado
source envs/trado/bin/activate
pip install --upgrade pip
pip install -r requirements_trado.txt
```

Evaluation is supported on the following benchmarks:

- GSM8K
- WritingPrompts
- NoveltyBench
- Arena-Hard-Auto
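The `--cond_embed_noise_std`, `--cond_noise_start`, and `--cond_noise_until` flags in the evaluation commands below suggest that Gaussian noise is injected into the condition embeddings only within a window of diffusion time. The sketch below illustrates one plausible schedule of this kind; the linear anneal and the exact window semantics are assumptions for illustration, and the actual TAPS schedule is the one defined in the paper:

```python
import numpy as np

def perturb_condition(cond_embeds, step, total_steps,
                      noise_std=0.35, start=0.05, until=0.95):
    """Add Gaussian noise to condition embeddings inside a time window.

    Hypothetical sketch: noise is active only while the fraction of
    completed steps lies in [start, until), and its scale is annealed
    linearly toward zero as denoising progresses.
    """
    frac = step / total_steps
    if not (start <= frac < until):
        return cond_embeds  # outside the window: condition left untouched
    # Linear anneal: full noise_std at `start`, zero at `until` (an assumption)
    scale = noise_std * (until - frac) / (until - start)
    return cond_embeds + scale * np.random.randn(*np.shape(cond_embeds))
```

With the defaults above, noise would be applied for roughly the middle 90% of the denoising trajectory and fade out as generation completes, which matches the intuition of perturbing early, exploratory steps more than late, committed ones.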
Example: evaluating LLaDA with TAPS (embedding-noise mode) on WritingPrompts:

```bash
cd TAPS
CUDA_VISIBLE_DEVICES=0,1 accelerate launch --num_processes 2 -m benchmarks.writingprompts.eval_llada_wp \
  --model_path /path/to/llada \
  --mode embedding \
  --dataset euclaise/writingprompts \
  --num_prompts 50 \
  --num_samples 16 \
  --temperature 0.7 \
  --cfg 0.0 \
  --cond_embed_noise_std 0.35 \
  --cond_noise_start 0.05 \
  --cond_noise_until 0.95 \
  --cond_embed_impl hook \
  --steps 512 \
  --gen_length 512 \
  --block_length 256 \
  --empty_cache_every 20
```

Example: evaluating TraDo with TAPS on WritingPrompts:

```bash
cd TAPS
CUDA_VISIBLE_DEVICES=0 python -m benchmarks.writingprompts.eval_trado_wp \
  --run_name trado_embedding_run \
  --mode embedding \
  --model_path /path/to/trado \
  --num_prompts 25 \
  --num_samples 16 \
  --gen_length 512 \
  --steps 4 \
  --block_length 4 \
  --temperature 0.8 \
  --seed 1234 \
  --cond_embed_noise_std 0.40 \
  --top_k 0 \
  --top_p 1.0 \
  --min_p 0.0
```

If you find TAPS useful, please cite:

```bibtex
@misc{wu2026timeannealedperturbationsamplingdiverse,
  title={Time-Annealed Perturbation Sampling: Diverse Generation for Diffusion Language Models},
  author={Jingxuan Wu and Zhenglin Wan and Xingrui Yu and Yuzhe Yang and Yiqiao Huang and Ivor Tsang and Yang You},
  year={2026},
  eprint={2601.22629},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2601.22629},
}
```
This project is released under the MIT License. See the LICENSE file for the full text.
SPDX-License-Identifier: MIT
