Handle 0-d ndarrays in scalar isinstance checks by h-mayorquin · Pull Request #1415 · hdmf-dev/hdmf

h-mayorquin · 2026-03-03T06:47:40Z

Chained after #1414 (_is_collection). Motivated by hdmf-dev/hdmf-zarr#325 (zarr v2 to v3 migration).

hdmf's get_type functions infer element dtype by recursively indexing with data[0] until reaching a scalar, then calling type() on it. With numpy and zarr v2, data[0] on a 1-d float array returns a numpy scalar (e.g., numpy.float64), which passes isinstance(val, float) and has no __len__, so all downstream checks work. With zarr v3 (following the Python array API standard), data[0] returns a 0-d ndarray instead. A 0-d ndarray fails isinstance(val, (int, float, str, bool)) and type() returns numpy.ndarray rather than the element dtype. PR #1414 fixed the crash path (__len__ heuristic), but isinstance checks in other parts of the codebase still silently take the wrong branch when they encounter a 0-d ndarray.
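The difference described above can be reproduced with NumPy alone. The following is a minimal sketch, not hdmf code: it assumes that `np.asarray(data[0])` is a fair stand-in for what an array-API-style `data[0]` returns, and it does not import zarr.

```python
import numpy as np

data = np.array([1.5, 2.5, 3.5])

# NumPy indexing yields a NumPy scalar; numpy.float64 subclasses Python float.
np_scalar = data[0]
scalar_passes = isinstance(np_scalar, float)

# Stand-in (assumption) for an array-API-style data[0]: a 0-d ndarray.
zero_d = np.asarray(data[0])
zero_d_passes = isinstance(zero_d, (int, float, str, bool))

print(type(np_scalar).__name__, scalar_passes)
print(type(zero_d).__name__, zero_d.ndim, zero_d_passes)

# The element dtype is still available on the 0-d array, just not via type().
print(zero_d.dtype)
```

The 0-d array carries the same value and dtype, but both `isinstance` and `type()` see the container rather than the element.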

This PR adds a _unwrap_scalar helper in hdmf.utils that converts 0-d ndarrays to Python scalars via .item(), and applies it at the remaining isinstance checks that compare against Python scalar types.
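A helper of this kind might look roughly like the sketch below. The name `unwrap_scalar_sketch` and its body are hypothetical and are not taken from the hdmf source; the only detail drawn from the description is the use of `.item()`, which in NumPy returns a plain Python scalar.

```python
import numpy as np


def unwrap_scalar_sketch(value):
    """Hypothetical stand-in for _unwrap_scalar; not the hdmf implementation."""
    # Only 0-d ndarrays are unwrapped; everything else passes through unchanged.
    if isinstance(value, np.ndarray) and value.ndim == 0:
        return value.item()
    return value


zero_d = np.asarray(np.float64(2.5))
unwrapped = unwrap_scalar_sketch(zero_d)
untouched = unwrap_scalar_sketch([1, 2, 3])

print(type(unwrapped).__name__, isinstance(unwrapped, float))
```

Calling the helper immediately before an `isinstance(val, (int, float, str, bool))` check lets the existing branch logic stay as it is.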

Together with #1414, this eliminates the need for the __getitem__ monkey-patch in hdmf-zarr PR #325.

Checklist

Did you update CHANGELOG.md with your changes?
Does the PR clearly describe the problem and the solution?
Have you reviewed our Contributing Guide?
Does the PR use "Fix #XXX" notation to tell GitHub to close the relevant issue numbered XXX when the PR is merged?

codecov · 2026-03-03T06:48:43Z

Codecov Report

❌ Patch coverage is 88.46154% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 93.08%. Comparing base (ebc5a3b) to head (05423f5).
⚠️ Report is 1 commits behind head on dev.

Files with missing lines	Patch %	Lines
src/hdmf/backends/hdf5/h5tools.py	60.00%	1 Missing and 1 partial ⚠️
src/hdmf/common/table.py	85.71%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##              dev    #1415   +/-   ##
=======================================
  Coverage   93.08%   93.08%           
=======================================
  Files          41       41           
  Lines       10007    10014    +7     
  Branches     2060     2061    +1     
=======================================
+ Hits         9315     9322    +7     
  Misses        416      416           
  Partials      276      276


zarr v3 scalar indexing returns 0-d ndarrays, which fail check_type(arg, int). Unwrap before the type check so ElementIdentifiers validation works with zarr v3 arrays. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… guards
- container.py: Data.__len__ uses _get_length for zarr v3 Arrays without __len__
- h5tools.py: use _get_length when sizing datasets during export
- objectmapper.py: use _get_length for compound dtype shape, unwrap 0-d rows

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

VectorData.get, DynamicTableRegion.get, DynamicTableRegion.shape, DynamicTable.add_column, and EnumData.__add_term all call len() on self.data which fails with zarr v3 Arrays that lack __len__. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The function returns shape[0], not len(). The new name reflects the actual semantics: the size of the first dimension, which is what the array API standard exposes via .shape rather than __len__. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

After discussion, _get_length is the clearest name: it maps to what Python programmers understand as "how many items if I iterate this", which is what every call site needs. The implementation detail of using shape[0] vs len() belongs in the docstring, not the name. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
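The semantics discussed in these commit messages (prefer `shape[0]`, fall back to `len()`) can be sketched as follows. Both `get_length_sketch` and `ShapeOnlyArray` are hypothetical names for illustration; the real `_get_length` in hdmf may differ, and the toy class only imitates an array that exposes `.shape` without `__len__`.

```python
import numpy as np


class ShapeOnlyArray:
    """Toy array exposing .shape but no __len__ (illustrative only, not zarr)."""

    def __init__(self, shape):
        self.shape = tuple(shape)


def get_length_sketch(data):
    """Hypothetical: size of the first dimension, i.e. items seen when iterating."""
    shape = getattr(data, "shape", None)
    if shape is not None:
        if len(shape) == 0:
            # Mirror len() on a 0-d array, which also raises TypeError.
            raise TypeError("0-d data has no length")
        return shape[0]
    # Plain Python sequences have no .shape, so fall back to len().
    return len(data)


print(get_length_sketch(ShapeOnlyArray((4, 3))))
print(get_length_sketch(np.zeros((5, 2))))
print(get_length_sketch([1, 2, 3]))
```

Routing every call site through one function of this shape keeps the `shape[0]` versus `len()` decision in a single place.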

…k_typing_for_type

rly · 2026-03-10T18:15:53Z

@h-mayorquin could you please resolve the conflicts in this PR? Thanks.

h-mayorquin · 2026-03-10T18:37:55Z


    def __len__(self):
-        return len(self.__data)
+        return _get_length(self.__data)


I am unsure why those were not merged with #1414

Ok, I think I only found those errors later and did some further additions here.

But maybe we should have tests for those?

Yeah, let's take this opportunity to add some. Those tests were missing in the first place.

Added. I also improved the docstring and unified some uses of len that were not going through the new centralized function.

Let me know if you think there is something else to add.

rly · 2026-03-11T09:09:49Z

I haven't searched through where else len could be replaced, but these changes here look good to me. If you spot others, please feel free to open a new PR.

rly · 2026-03-11T09:10:03Z

Thanks @h-mayorquin !

h-mayorquin · 2026-03-11T14:18:42Z

> I haven't searched through where else len could be replaced, but these changes here look good to me. If you spot others, please feel free to open a new PR.

Yes, I double-checked. I will be on the lookout for more uses.

h-mayorquin added 4 commits March 3, 2026 00:15

Better collection detection

1c5861e

Add better type detection for 0 dimensional arrays

9bdf858

fix ruff

4e4c611

ruff

68e95a4

This was referenced Mar 3, 2026

Migrate hdmf-zarr from zarr-python v2 to v3 hdmf-dev/hdmf-zarr#325

Open

Remove zarr.Array monkey-patches, use ndim-based detection hdmf-dev/hdmf-zarr#333

Open

h-mayorquin and others added 11 commits March 3, 2026 02:20

Re-trigger CI after len() and unwrap_scalar fixes

97ced09

Merge base branch and resolve import conflict in table.py

ba5a1f7

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Remove draft markdown files accidentally committed

d915d63

Merge branch 'remove_duck_typing_for_array_detection' into remove_duc…

dac32ea

…k_typing_for_type

Ryan suggestion

7375f89

Merge branch 'remove_duck_typing_for_array_detection' into remove_duc…

3e4d211

…k_typing_for_type

Base automatically changed from remove_duck_typing_for_array_detection to dev March 10, 2026 18:15

fix conflicts

7dafb41

h-mayorquin commented Mar 10, 2026


h-mayorquin added 3 commits March 10, 2026 14:06

missing _get_length

4249094

utils

9eca5eb

one regression test in validate

05423f5

rly approved these changes Mar 11, 2026


rly merged commit d878593 into dev Mar 11, 2026
27 of 33 checks passed

rly deleted the remove_duck_typing_for_type branch March 11, 2026 09:10

rly merged 19 commits into dev from remove_duck_typing_for_type

2 participants
