
Conversation


@arthurdjn arthurdjn commented Jun 3, 2025

Description

This PR adds support for numpy scalars (e.g. in the form of np.array(1)) in the decollate_batch function (fixes issue #8471).
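
A minimal sketch of the case this addresses (assuming a MONAI install where monai.data.decollate_batch is importable):

import numpy as np
from monai.data import decollate_batch

scalar = np.array(1)           # 0-d numpy array wrapping a Python int
out = decollate_batch(scalar)  # previously raised "TypeError: iteration over a 0-d array"
print(out)                     # with this change the scalar value is returned, e.g. 1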

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).
  • Breaking change (fix or new feature that would cause existing functionality to change).
  • New tests added to cover the changes.
  • Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
  • Quick tests passed locally by running ./runtests.sh --quick --unittests --disttests.
  • In-line docstrings updated.
  • Documentation updated, tested make html command in the docs/ folder.

@arthurdjn arthurdjn force-pushed the support-decollate-batch-numpy-scalars branch 5 times, most recently from 187c141 to c438fe0 on June 3, 2025 at 13:03
@arthurdjn arthurdjn marked this pull request as ready for review June 3, 2025 14:20
@@ -625,6 +625,8 @@ def decollate_batch(batch, detach: bool = True, pad=True, fill_value=None):
        type(batch).__module__ == "numpy" and not isinstance(batch, Iterable)
    ):
        return batch
    if isinstance(batch, np.ndarray) and batch.ndim == 0:
Contributor

Thanks for the PR! Do you think it might be beneficial to convert the array into a tensor? This way, the data could be handled more consistently.

Author

We could; I don't think it matters for my use cases. As long as the function handles numpy scalars in the form of an array, it works for me!

I will add this change and convert it to a tensor there (L629) if you prefer :)

Contributor

Thanks for the quick fix!
May I ask the reason for only converting to a tensor when batch.ndim == 0 here?

Author

@arthurdjn arthurdjn Jun 6, 2025

I noticed different behavior when using the decollate_batch function on torch tensors vs. numpy arrays (see discussion #8472), so I don't want to convert numpy arrays to torch tensors, as that would introduce breaking changes.

This PR only addresses issue #8471, since I think that behavior was unintended and numpy scalars should be supported.

@arthurdjn arthurdjn force-pushed the support-decollate-batch-numpy-scalars branch from 451c207 to 49d4954 on June 4, 2025 at 09:00
@arthurdjn arthurdjn requested a review from KumoLiu June 4, 2025 09:01
@ericspod
Member

ericspod commented Jun 10, 2025

Could we consider a more complete solution? The issue, it seems, is that 0-d arrays are iterable but can't be iterated over. We already check for non-iterable things in decollate_batch here. Can we modify this to correctly pick up when the batch is a 0-d array and just return it in that case? Or return its contents?
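
For context, a standalone sketch (not code from the PR) of the behavior described above: numpy.ndarray defines __iter__, so even a 0-d array passes an isinstance(..., Iterable) check, yet actually iterating it raises.

from collections.abc import Iterable
import numpy as np

a = np.array(1)
print(isinstance(a, Iterable))  # True, so the existing non-iterable guard is skipped
try:
    iter(a)
except TypeError as exc:
    print(exc)                  # iteration over a 0-d array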

@arthurdjn
Author

arthurdjn commented Jun 10, 2025

Thanks for the feedback. The initial PR was:

if isinstance(batch, (float, int, str, bytes)) or (
    type(batch).__module__ == "numpy" and not isinstance(batch, Iterable)
):
    return batch
if isinstance(batch, np.ndarray) and batch.ndim == 0:
    return batch.item() if detach else batch
# rest ...

Is this something that you find more complete?

Note

I refactored the PR to convert the numpy array to a torch tensor, as suggested by @KumoLiu.
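
For reference, a standalone sketch of the conversion-based variant (the same pattern appears in the review diff further down): the 0-d ndarray is converted so the existing torch.Tensor scalar handling in decollate_batch can take over.

import numpy as np
import torch

batch = np.array(3)
if isinstance(batch, np.ndarray) and batch.ndim == 0:
    batch = torch.from_numpy(batch)  # hand the 0-d value to the tensor branch
print(type(batch), batch.item())     # <class 'torch.Tensor'> 3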

@ericspod
Member

Is this something that you find more complete?

What I had in mind was more of the following change:

...
    if batch is None or isinstance(batch, (float, int, str, bytes)):
        return batch
    if getattr(batch, "ndim", -1) == 0:  # assumes only Numpy objects and Pytorch tensors have ndim
        return batch.item() if detach else batch
    if isinstance(batch, torch.Tensor):
        if detach:
            batch = batch.detach()
# REMOVE
#      if batch.ndim == 0:
#          return batch.item() if detach else batch
...

@arthurdjn
Author

Thanks! I will update the PR to include these changes.

Contributor

coderabbitai bot commented Sep 5, 2025

Important

Review skipped

Review was skipped as selected files did not have any reviewable changes.

💤 Files selected but had no reviewable changes (1)
  • notebooks/semantic_segmentation.ipynb


Walkthrough

Adds a pre-check in monai/data/utils.py::decollate_batch for 0‑dimensional inputs: if the incoming batch has ndim == 0 and exposes an item() method, the function returns batch.item() when detach=True, otherwise returns the original 0‑d array (short‑circuits before the torch.Tensor branch). No other control flow or logic changes and no public API or exports are modified.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Nitpick comments (3)
monai/data/utils.py (3)

628-635: Prefer generic 0-D detection early; drop special-case conversion.

Handle any object with ndim==0 up-front, then the torch branch can skip its own scalar check. This reduces duplication and avoids unnecessary conversion for NumPy scalars when detach=False.

Apply this diff (and remove the torch scalar early-return below):

@@
-    if isinstance(batch, (float, int, str, bytes)) or (
-        type(batch).__module__ == "numpy" and not isinstance(batch, Iterable)
-    ):
-        return batch
-    if isinstance(batch, np.ndarray) and batch.ndim == 0:
-        batch = torch.from_numpy(batch)
+    if isinstance(batch, (float, int, str, bytes)) or (
+        type(batch).__module__ == "numpy" and not isinstance(batch, Iterable)
+    ):
+        return batch
+    # generic 0-D objects (NumPy/Torch/others exposing ndim): treat as scalars
+    if getattr(batch, "ndim", -1) == 0:
+        return batch.item() if detach else batch
@@
-    if isinstance(batch, torch.Tensor):
-        if detach:
-            batch = batch.detach()
-        if batch.ndim == 0:
-            return batch.item() if detach else batch
+    if isinstance(batch, torch.Tensor):
+        if detach:
+            batch = batch.detach()
+        # ndim==0 handled above

628-630: Graceful fallback for unsupported NumPy dtypes.

torch.from_numpy will fail for some dtypes (e.g., datetime64, object). Consider catching and falling back to .item() or returning the ndarray unchanged.

-    if isinstance(batch, np.ndarray) and batch.ndim == 0:
-        batch = torch.from_numpy(batch)
+    if isinstance(batch, np.ndarray) and batch.ndim == 0:
+        try:
+            batch = torch.from_numpy(batch)
+        except (TypeError, ValueError):
+            return batch.item() if detach else batch

614-621: Doc/test touch-up for 0-D NumPy arrays.

Please note 0-D NumPy array behavior in the decollate_batch docstring and add unit tests for the following (a brief sketch follows this list):

  • np.array(1) with detach=True/False
  • Nested structures containing 0-D arrays
  • Edge dtype (e.g., bool, float32)
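
A minimal pytest-style sketch of such tests (names, parametrization, and expectations are illustrative, based on the behavior described in this review, and not taken from the MONAI test suite):

import numpy as np
import pytest
from monai.data import decollate_batch


@pytest.mark.parametrize("value", [np.array(1), np.array(True), np.array(1.5, dtype=np.float32)])
def test_decollate_0d_array_detach(value):
    # detach=True (the default) should hand back a plain Python scalar
    assert decollate_batch(value, detach=True) == value.item()


def test_decollate_0d_array_no_detach():
    # with detach=False a 0-d array/tensor is expected back rather than a Python scalar
    out = decollate_batch(np.array(1), detach=False)
    assert getattr(out, "ndim", None) == 0
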
📜 Review details


📥 Commits

Reviewing files that changed from the base of the PR and between 0968da2 and c1339ec.

📒 Files selected for processing (1)
  • monai/data/utils.py (1 hunks)
🔇 Additional comments (1)
monai/data/utils.py (1)

628-635: Good fix: avoids 0-D ndarray iteration error and aligns with tensor path.

Converting 0-D NumPy arrays to torch first prevents the TypeError raised by iterating 0-D arrays and lets the existing scalar-tensor handling (.item() when detach=True) kick in. Looks correct.

@arthurdjn arthurdjn force-pushed the support-decollate-batch-numpy-scalars branch from abb8218 to ec3e5d9 on September 5, 2025 at 10:27
fix linter

Signed-off-by: Arthur Dujardin <[email protected]>

fix numpy decollate multi arrays

Signed-off-by: Arthur Dujardin <[email protected]>

fix linter

Signed-off-by: Arthur Dujardin <[email protected]>

fix numpy scalar support

Signed-off-by: Arthur Dujardin <[email protected]>

minor refactoring for typing

Signed-off-by: Arthur Dujardin <[email protected]>

convert scalar array to tensor

Signed-off-by: Arthur Dujardin <[email protected]>

update decollate item
@arthurdjn arthurdjn force-pushed the support-decollate-batch-numpy-scalars branch from ec3e5d9 to f9dba63 on September 5, 2025 at 10:28
pre-commit-ci bot and others added 2 commits September 5, 2025 10:29