Release per-iteration mdata copies in compute_fake_perturbation_tests by adamklie · Pull Request #5 · EngreitzLab/PerturbNMF

adamklie · 2026-05-10T02:20:09Z

Release per-iteration mdata copies in compute_fake_perturbation_tests

Summary

Closes #3.

The fake-test inner loop calls mdata.copy() per iteration. On real datasets (~10 GB sparse rna matrix), each deep copy retains ~15 GB residual that Python's reference-counted GC can't reap fast enough between iterations. This blows past 256 GB SLURM allocations around iteration 16/50.

Change

Adds import gc, then at the end of each fake-test iteration:

del _mdata, mdata_samp
gc.collect()

Plus releases the K-loop mdata between K values to keep memory bounded across K too.

Validation

End-to-end on Huangfu HUES8 ESC (~190k cells × 36k genes, K = 30,50,60,80,100,200,250,300, sel_thresh=2.0, --number_run 50):

Before fix: OOM at iteration 16/50 of K=30 in 256 GB
After fix: completed all 8 K × 50 iterations in ~51 min, peak ~250 GB

DE (~270k cells) also completes successfully with this patch.

This is a bandaid, not the structural fix

The underlying redundancy is that the fake-test code only mutates _mdata[prog_key].obsm[guide_assignment_key] and two uns arrays — it never touches the rna modality, which is what makes mdata.copy() expensive. A more efficient refactor would mutate mdata[prog_key] in place each iteration (or operate on a small dict of overrides) and skip the deep copy entirely. That's a bigger change worth doing separately. This PR keeps the existing logic intact and just keeps memory bounded so the function actually runs to completion on realistic datasets.

🤖 Generated with Claude Code

args.reference_targets is never defined on the argparse namespace; mirror the real-test path's fallback (line 49) and use args.guide_annotation_key instead. This is the correct reference-target list since the fake-test code relabels NT guides to {'non-targeting', 'targeting'} before this call. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The fake-test inner loop calls mdata.copy() per iteration, which on real datasets (~10 GB sparse rna matrix) stacks up ~15 GB residual per iteration and OOMs around iteration 16/50 in a 256 GB allocation. Add explicit del + gc.collect() at the end of each iteration so deep-copies are reaped before the next iteration starts. Also release the K-loop mdata between K values to avoid accumulating across the K loop. This is a minimal bandaid — the structural fix would avoid the full mdata.copy() in the first place since the fake-test only mutates obsm and uns on the prog_key modality, not the rna modality (which is what makes the deep-copy expensive). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

adamklie and others added 2 commits May 8, 2026 22:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release per-iteration mdata copies in compute_fake_perturbation_tests#5

Release per-iteration mdata copies in compute_fake_perturbation_tests#5
adamklie wants to merge 2 commits into
EngreitzLab:mainfrom
adamklie:fix/utest-oom-leak

adamklie commented May 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

adamklie commented May 10, 2026

Release per-iteration mdata copies in compute_fake_perturbation_tests

Summary

Change

Validation

This is a bandaid, not the structural fix

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant