Fixes #8697 GPU memory leak by checking both image and label tensors for CUDA device #8698
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: dev
Conversation
Fixes Project-MONAI#8697 GPU memory leak by checking both image and label tensors for CUDA device Signed-off-by: benediktjohannes <benedikt.johannes.hofer@gmail.com>
📝 Walkthrough: `FgImageStats.__call__` and `LabelStats.__call__` now create local image and label tensor variables.
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~8 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In @monai/auto3dseg/analyzer.py:
- Around line 471-477: Refactor the later references that still use
d[self.image_key] and d[self.label_key] to reuse the local variables
image_tensor and label_tensor (so subsequent logic around self.do_ccp and CUDA
detection uses those locals consistently), and add a GPU unit test in
tests/apps/test_auto3dseg.py (for LabelStats) decorated with @skip_if_no_cuda
that constructs a mixed-device input (one tensor on CPU, the other on CUDA) to
exercise the new mixed-device detection and ensure behavior is correct under
self.do_ccp.
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Cache: Disabled due to data retention organization setting
Knowledge base: Disabled due to Reviews -> Disable Knowledge Base setting
📒 Files selected for processing (1)
monai/auto3dseg/analyzer.py
🧰 Additional context used
📓 Path-based instructions (1)
**/*.py
⚙️ CodeRabbit configuration file
Review the Python code for quality and correctness. Ensure variable names adhere to PEP8 style guides, are sensible and informative in regards to their function, though permitting simple names for loop and comprehension variables. Ensure routine names are meaningful in regards to their function and use verbs, adjectives, and nouns in a semantically appropriate way. Docstrings should be present for all definition which describe each variable, return value, and raised exception in the appropriate section of the Google-style of docstrings. Examine code for logical error or inconsistencies, and suggest what may be changed to addressed these. Suggest any enhancements for code improving efficiency, maintainability, comprehensibility, and correctness. Ensure new or modified definitions will be covered by existing or new unit tests.
Files:
monai/auto3dseg/analyzer.py
🧬 Code graph analysis (1)
monai/auto3dseg/analyzer.py (1)
monai/data/meta_tensor.py (1)
MetaTensor(52-609)
Small refactoring (usage of local variable instead for performance) Signed-off-by: benediktjohannes <benedikt.johannes.hofer@gmail.com>
Actionable comments posted: 0
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
monai/auto3dseg/analyzer.py (1)
487-488: Type inconsistency in empty foreground fallback. Line 488 uses `torch.Tensor([0])` while line 354 (`FgImageStats`) uses `MetaTensor([0.0])`. Since `get_foreground_label` returns `MetaTensor`, the fallback should also use `MetaTensor` for consistency. Suggested fix:

```diff
- nda_foregrounds = [nda if nda.numel() > 0 else torch.Tensor([0]) for nda in nda_foregrounds]
+ nda_foregrounds = [nda if nda.numel() > 0 else MetaTensor([0.0]) for nda in nda_foregrounds]
```
🧹 Nitpick comments (1)
monai/auto3dseg/analyzer.py (1)
481: Inconsistent variable usage. Use `image_tensor.shape[0]` instead of `d[self.image_key].shape[0]` for consistency with the refactoring. Suggested fix:

```diff
- ndas: list[MetaTensor] = [image_tensor[i] for i in range(d[self.image_key].shape[0])]  # type: ignore
+ ndas: list[MetaTensor] = [image_tensor[i] for i in range(image_tensor.shape[0])]  # type: ignore
```
🔇 Additional comments (1)
monai/auto3dseg/analyzer.py (1)
471-477: CUDA detection logic looks correct. Properly checks both tensors for CUDA presence, addressing the mixed-device memory leak scenario.
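The checked condition can be exercised in isolation. The sketch below is plain Python with no torch dependency; `FakeTensor` and `_Device` are hypothetical stand-ins that model only the `.device.type` attribute, to show how the two-clause check behaves across device combinations:

```python
from dataclasses import dataclass


@dataclass
class _Device:
    type: str


@dataclass
class FakeTensor:
    """Stand-in for torch.Tensor; only the .device attribute is modeled."""
    device: _Device


def detects_cuda(image_tensor, label_tensor):
    """Return True if either input is a (fake) tensor on a CUDA device."""
    return (
        isinstance(image_tensor, FakeTensor) and image_tensor.device.type == "cuda"
    ) or (
        isinstance(label_tensor, FakeTensor) and label_tensor.device.type == "cuda"
    )


cpu = FakeTensor(_Device("cpu"))
gpu = FakeTensor(_Device("cuda"))

print(detects_cuda(cpu, cpu))     # False: nothing on GPU
print(detects_cuda(cpu, gpu))     # True: label on GPU alone triggers cleanup
print(detects_cuda(gpu, cpu))     # True: image on GPU alone triggers cleanup
print(detects_cuda([1, 2], cpu))  # False: non-tensor inputs are ignored
```

The isinstance guard mirrors the real check's protection against numpy-array inputs.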
Another small refactoring Signed-off-by: benediktjohannes <benedikt.johannes.hofer@gmail.com>
Actionable comments posted: 0
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
monai/auto3dseg/analyzer.py (1)
488: Type inconsistency: `torch.Tensor([0])` vs `MetaTensor([0.0])`. Line 488 differs from line 354, which uses `MetaTensor([0.0])`, creating both type and dtype inconsistency. When `nda_foregrounds` is empty, it becomes `torch.Tensor([0])` (int), whereas elsewhere it is `MetaTensor` (float). This should be resolved for type safety:

```diff
- nda_foregrounds = [nda if nda.numel() > 0 else torch.Tensor([0]) for nda in nda_foregrounds]
+ nda_foregrounds = [nda if nda.numel() > 0 else MetaTensor([0.0]) for nda in nda_foregrounds]
```
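The effect of that fallback comprehension can be sketched with plain lists standing in for tensors: `len` models `numel()`, and `FALLBACK` models `MetaTensor([0.0])`. All names here are illustrative, not MONAI API:

```python
# Stand-ins for illustration: a plain list models a tensor, len() models numel(),
# and FALLBACK models MetaTensor([0.0]) — the type-consistent empty-foreground value.
FALLBACK = [0.0]


def pad_empty_foregrounds(nda_foregrounds):
    """Replace empty per-class foreground stand-ins with a uniform fallback."""
    return [nda if len(nda) > 0 else FALLBACK for nda in nda_foregrounds]


print(pad_empty_foregrounds([[3.0, 1.0], [], [2.5]]))  # [[3.0, 1.0], [0.0], [2.5]]
```

Every element downstream then has the same "type" (here, a float list), which is the consistency the review asks for.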
🔇 Additional comments (2)
monai/auto3dseg/analyzer.py (2)
471-477: CUDA detection logic looks correct. The fix properly addresses the GPU memory leak by checking both `image_tensor` and `label_tensor` for CUDA device presence before calling `torch.cuda.empty_cache()` at line 517. This handles mixed-device scenarios as intended.
481-482: LGTM on tensor extraction refactor. Using the local `image_tensor` and `label_tensor` variables improves readability and avoids repeated dictionary lookups.
Corrected type inconsistency Signed-off-by: benediktjohannes <benedikt.johannes.hofer@gmail.com>
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In @monai/auto3dseg/analyzer.py:
- Around line 481-482: The code calls label_tensor.astype(...) which only exists
on MetaTensor but earlier isinstance checks allow plain torch.Tensor; update the
ndas_label assignment to handle both: if isinstance(label_tensor, MetaTensor)
use label_tensor.astype(torch.int16), else if isinstance(label_tensor,
torch.Tensor) use label_tensor.to(torch.int16) (or wrap/convert to MetaTensor as
appropriate), ensuring ndas_label is a MetaTensor-compatible tensor; adjust the
isinstance logic around image_tensor/label_tensor if needed to keep types
consistent.
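The dispatch the agent prompt describes can be sketched with duck-typed stand-ins. `AstypeTensor` and `ToTensor` below are hypothetical placeholders for `MetaTensor` (which converts via `.astype()`) and plain `torch.Tensor` (which converts via `.to()`), not real MONAI or PyTorch classes:

```python
class AstypeTensor:
    """Stand-in for MetaTensor: dtype conversion via .astype()."""
    def __init__(self, dtype="float32"):
        self.dtype = dtype

    def astype(self, dtype):
        return AstypeTensor(dtype)


class ToTensor:
    """Stand-in for plain torch.Tensor: dtype conversion via .to()."""
    def __init__(self, dtype="float32"):
        self.dtype = dtype

    def to(self, dtype):
        return ToTensor(dtype)


def as_int16(label_tensor):
    """Dispatch conversion on the tensor's API, per the review suggestion."""
    if isinstance(label_tensor, AstypeTensor):
        return label_tensor.astype("int16")
    if isinstance(label_tensor, ToTensor):
        return label_tensor.to("int16")
    raise TypeError(f"unsupported label type: {type(label_tensor).__name__}")


print(as_int16(AstypeTensor()).dtype)  # int16
print(as_int16(ToTensor()).dtype)      # int16
```

Either branch yields a tensor-like object with the requested dtype, so the rest of `__call__` sees a consistent type.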
🪛 Ruff (0.14.11): monai/auto3dseg/analyzer.py:485 — TRY003: Avoid specifying long messages outside the exception class.
🔇 Additional comments (2)
monai/auto3dseg/analyzer.py (2)
471-477: CUDA detection logic looks correct. Checking both tensors ensures `torch.cuda.empty_cache()` is called whenever CuPy-backed CCP might encounter PyTorch-held GPU memory. This should fix the memory leak in mixed-device scenarios.
488: MetaTensor fallback is appropriate. `get_foreground_label` returns `MetaTensor`, so using `MetaTensor([0.0])` for empty foregrounds maintains type consistency for downstream operations.
Fixed type check and astype() usage Signed-off-by: benediktjohannes <benedikt.johannes.hofer@gmail.com>
Actionable comments posted: 0
🧹 Nitpick comments (2)
monai/auto3dseg/analyzer.py (2)
471-477: Consider using `torch.Tensor` instead of `MetaTensor` for broader CUDA detection. The current check only detects CUDA tensors if they are specifically `MetaTensor` instances. Since `MetaTensor` inherits from `torch.Tensor`, and all tensors have the `.device` attribute, using `torch.Tensor` would be more robust:

```diff
 using_cuda = (
-    isinstance(image_tensor, MetaTensor) and image_tensor.device.type == "cuda"
+    isinstance(image_tensor, torch.Tensor) and image_tensor.device.type == "cuda"
 ) or (
-    isinstance(label_tensor, MetaTensor) and label_tensor.device.type == "cuda"
+    isinstance(label_tensor, torch.Tensor) and label_tensor.device.type == "cuda"
 )
```

This would catch CUDA tensors even if someone passes a plain `torch.Tensor` (though line 482 would fail later due to `.astype()`). The isinstance guard still protects against numpy arrays.
484-485: Minor: Consider extracting the error message. Static analysis flags the long inline message. Acceptable as-is, but it could be a constant if reused elsewhere.
🔇 Additional comments (2)
monai/auto3dseg/analyzer.py (2)
481-482: LGTM. Using local variables `image_tensor` and `label_tensor` improves readability and avoids repeated dictionary lookups.
488: Good fix for type consistency. Using `MetaTensor([0.0])` instead of `torch.Tensor([0])` maintains consistent typing throughout the pipeline when handling empty foregrounds. This matches the same pattern at line 354 in `FgImageStats`.
Using torch.Tensor instead of MetaTensor for broader CUDA detection Signed-off-by: benediktjohannes <benedikt.johannes.hofer@gmail.com>
I've fixed all issues mentioned by CodeRabbit 👍
@coderabbitai review
✅ Actions performed: Review triggered.
```python
using_cuda = (
    isinstance(image_tensor, torch.Tensor) and image_tensor.device.type == "cuda"
) or (
    isinstance(label_tensor, torch.Tensor) and label_tensor.device.type == "cuda"
)
```
Suggested simplification:

```diff
-using_cuda = (
-    isinstance(image_tensor, torch.Tensor) and image_tensor.device.type == "cuda"
-) or (
-    isinstance(label_tensor, torch.Tensor) and label_tensor.device.type == "cuda"
-)
+using_cuda = any(isinstance(t, torch.Tensor) and t.device.type == "cuda" for t in (image_tensor, label_tensor))
```
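The chained form and the `any()` one-liner apply the same predicate to both operands, so they should agree in every case. A small stand-in check (no torch required; `FakeTensor` is a hypothetical placeholder modeling only `.device.type`):

```python
class FakeTensor:
    """Minimal stand-in for torch.Tensor with only a device type string."""
    def __init__(self, device_type):
        self.device = type("Device", (), {"type": device_type})()


def using_cuda_chained(image_tensor, label_tensor):
    # The original two-clause check.
    return (
        isinstance(image_tensor, FakeTensor) and image_tensor.device.type == "cuda"
    ) or (
        isinstance(label_tensor, FakeTensor) and label_tensor.device.type == "cuda"
    )


def using_cuda_any(image_tensor, label_tensor):
    # The suggested any() refactor.
    return any(
        isinstance(t, FakeTensor) and t.device.type == "cuda"
        for t in (image_tensor, label_tensor)
    )


cases = [
    (FakeTensor("cpu"), FakeTensor("cpu")),
    (FakeTensor("cpu"), FakeTensor("cuda")),
    (FakeTensor("cuda"), FakeTensor("cpu")),
    ("not a tensor", FakeTensor("cuda")),
]
for image, label in cases:
    assert using_cuda_chained(image, label) == using_cuda_any(image, label)
print("all cases agree")
```

The `any()` form also extends naturally if more tensors ever need to be checked.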
Hi @benediktjohannes, thanks for the contribution. I think it's fine just from looking at the changes, but we'll run the tests now and see. I would also like @mingxin-zheng to have a look, as I think the file originates with his work.
- Modified device detection to check BOTH image and label tensors
- torch.cuda.empty_cache() now called if EITHER tensor is on GPU
- Prevents GPU memory leaks in mixed-device scenarios