Public API for buffer objects #2876

TomAugspurger · 2025-02-28T15:56:36Z

This moves the public imports from buffer things out of zarr.core.buffer.

Abstract stuff is availble under zarr.abc.buffer.
Concrete implementations are available under zarr.buffer.{cpu,gpu}.

I haven't added any new tests, but I updated the tests in tests/test_buffer.py to use the public API.

This moves the public imports from buffer things out of `zarr.core`. Abstract stuff is availble under `zarr.abc.buffer`. Concrete implementations are available under `zarr.buffer.{cpu,gpu}`.

dstansby · 2025-03-04T20:55:41Z

Do you think it's worth just moving the actual code to the new public location, so it doesn't live in zarr.core.buffer any more? That would make for a simpler code base and be my preference, but I might be missing some reason not to move the code.

TomAugspurger · 2025-03-05T02:46:57Z

Mmm, I think I'd still want the implementation to be in a private module so that implementation details like numpy imports don't leak into the public API's namespace. So it'd end up looking pretty similar. And if we're moving stuff out of zarr.core I'd want to do it properly, since I wouldn't be surprised if people were already using it. IMO not worth it right now.

TomAugspurger · 2025-03-05T17:29:36Z

I wouldn't be surprised if people were already using it.

Yep: https://github.com/earth-mover/icechunk/blob/584d4f2b0aad1160f8e4ba9a48ebbf770c058c00/icechunk-python/python/icechunk/store.py#L13

I think this PR takes care of those zarr.core.buffer imports. IMO, we should ensure that all the types in our public API (like BytesLike) are exported somewhere, but that should be done separately.

dstansby

I left some comments/suggestions inline.

I also think we should move the code to where we expect users to import it from. This is because:

having code in zarr.core.buffer, which we expcititly mark as private API, could lead developers to think they can make API changes without deprecations. But in reality it's public API.
It forces us to import code in our tests from where users are also importing it

I appreciate it's a bit more work to move the code around, but that's the cost of being a stable and widely used library 😄

And if we're moving stuff out of zarr.core I'd want to do it properly, since I wouldn't be surprised if people were already using it.

We decided that the API in zarr.core is private, so I think we should feel free to do what we want there - if downstream users are using it then I don't think that's our problem, since we don't document it's existence. On a practical level I can appreciate just moving the code might cause issues, in which case we could still have the code importable from zarr.core, but issue deprecation warnings before removing it completely.

src/zarr/buffer/__init__.py

TomAugspurger · 2025-03-28T14:30:14Z

I don't plan to move the implementation at this time. We already know that some libraries, like icechunk, are depending on it. I don't want to break those implementations because Zarr lacked a public API for this previously. We need to offer a public API first and then give them some time to migrate over.

I'll post a proposed plan over in #2621.

TomAugspurger · 2025-03-28T15:08:50Z

Should be good to go.

dstansby

As well as my inline comments/suggestions, I noticed that there are parts of the API that are typed with zarr.core.buffer... - it would be good to change those to use the new public interface. As an example, zarr.abc.codec.ArrayArrayCodec

dstansby · 2025-03-28T15:12:02Z

docs/user-guide/extending.rst

@@ -83,7 +83,10 @@ Coming soon.
 Custom array buffers
 --------------------

-Coming soon.
+zarr-python provides control where and how arrays stored in memory through


Suggested change

zarr-python provides control where and how arrays stored in memory through

Zarr-Python provides control for where and how arrays stored in memory through

We're inconsistent about zarr-python vs. Zarr-python in the docs.

docs/user-guide/extending.rst

src/zarr/buffer/__init__.py

dstansby · 2025-03-28T15:18:47Z

I don't plan to move the implementation at this time. We already know that some libraries, like icechunk, are depending on it. I don't want to break those implementations because Zarr lacked a public API for this previously. We need to offer a public API first and then give them some time to migrate over.

👍 - my suggestion would be to introduce those deprecations in this PR for the buffer stuff that's been made public, but happy to punt that to a later PR if that's easier.

TomAugspurger · 2025-03-28T19:13:42Z

The latest pair of commits updates our zarr.config, which references zarr.core.buffer. We needed to slightly update our registration code to let the class being registered have a different qualname / import path than where they're defined.

TomAugspurger · 2025-03-28T19:34:14Z

Thanks for the review.

I noticed that there are parts of the API that are typed

This is proving to be a bit more work than I can do right now. Feel free to push a commit if you're able to, or we can merge this as is since and handle that later.

TomAugspurger · 2025-03-28T19:35:05Z

👍 - my suggestion would be to introduce those deprecations in this PR for the buffer stuff that's been made public, but happy to punt that to a later PR if that's easier.

I'd recommend waiting. A release with this public API will give us a chance to migrate packages like icechunk without their users ever seeing a warning.

d-v-b · 2025-04-20T19:20:04Z

It would be nice to get this merged. In the interest of moving forward here, @dstansby are you OK with sorting out some of the requested changes in follow-up PRs?

dstansby · 2025-04-22T21:56:20Z

I had a think about this again, and I actually think fairly strongly that if we are moving where we want users to import this code from, then we should deprecate importing it from the old locations at the same time, in the same PR. This would make sure:

It's logistically possible to deprecate the old namespace (otherwise, if it's impossible there's no point complicating things with a new namespace)
Doing the deprecation happens and isn't lost.

I know it's extra work, but I think in order to make sure the current nice bit of work is worth it, pushing users towards using the new work (ie deprecating the old import path) should happen at the same time.

dstansby · 2025-04-22T22:00:04Z

that's my two cents - if there are a couple of other developers that feel strongly in the other direction (splitting creation of new API/deprecating old so there's a period with duplicate API), then happy to do that. I have a gut feeling that it has the potential to go wrong and create a mess if not done in one go though.

TomAugspurger · 2025-04-27T12:48:18Z

If you want to do that then go for it.I don’t like introducing warnings to users of downstream libraries before those libraries have had a chance to adapt.On Apr 22, 2025, at 5:00 PM, David Stansby ***@***.***> wrote: that's my two cents - if there are a couple of other developers that feel strongly in the other direction (splitting creation of new API/deprecating old so there's a period with duplicate API), then happy to do that. I have a gut feeling that it has the potential to go wrong and create a mess if not done in one go though.—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you authored the thread.Message ID: ***@***.***> dstansby left a comment (zarr-developers/zarr-python#2876) that's my two cents - if there are a couple of other developers that feel strongly in the other direction (splitting creation of new API/deprecating old so there's a period with duplicate API), then happy to do that. I have a gut feeling that it has the potential to go wrong and create a mess if not done in one go though. —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you authored the thread.Message ID: ***@***.***>

d-v-b · 2025-06-07T10:02:57Z

should we fold this into the 3.1 release?

TomAugspurger · 2025-06-07T12:26:10Z

If @dstansby is OK with things more or less as is I'll fix the merge conflict. Otherwise I don't plan to work on it.

dstansby · 2025-06-07T12:40:00Z

This definitely shouldn't be merged without a clear migration plan explaining what exsiting users of zarr.core.buffer should do (presumably just switch their imports to zarr.buffer?).

I'm okay with adding the new API, then deprecating later 👍 - could you open an issue to make sure deprecating isn't forgotten though?

The config uses the public `zarr.buffer.cpu.Buffer`, which differs from the implementation path `zarr.core.buffer.cpu.Buffer`. This is OK because the public API for getting the buffer doesn't depend on where it's implemented at.

TomAugspurger · 2025-06-07T16:16:49Z

clear migration plan explaining

Updated the release note.

then deprecating later

#2621

dstansby

👍 thanks for the udpated changelog. I'm not sure if we want to put this in 3.1 or before - I guess there's no harm in putting it in 3.0.x as long as we don't do the deprecation until 3.2?

dstansby · 2025-06-17T08:05:14Z

When conflicts are fixed, this is good to be merged

Public API for buffer objects

d4be973

This moves the public imports from buffer things out of `zarr.core`. Abstract stuff is availble under `zarr.abc.buffer`. Concrete implementations are available under `zarr.buffer.{cpu,gpu}`.

github-actions bot added the needs release notes Automatically applied to PRs which haven't added release notes label Feb 28, 2025

changelog

efa674d

github-actions bot removed the needs release notes Automatically applied to PRs which haven't added release notes label Mar 1, 2025

dstansby reviewed Mar 28, 2025

View reviewed changes

src/zarr/buffer/__init__.py Outdated Show resolved Hide resolved

src/zarr/buffer/__init__.py Outdated Show resolved Hide resolved

src/zarr/buffer/__init__.py Outdated Show resolved Hide resolved

src/zarr/buffer/__init__.py Outdated Show resolved Hide resolved

TomAugspurger added 3 commits March 28, 2025 09:17

Merge remote-tracking branch 'upstream/main' into tom/fix/public-buffers

ef2da4d

absolute imports

4ff4f7e

fixed warning in doc build

7946745

TomAugspurger mentioned this pull request Mar 28, 2025

Make zarr.core private #2621

Open

dstansby requested changes Mar 28, 2025

View reviewed changes

TomAugspurger added 2 commits March 28, 2025 13:01

Updated config

5cf1bde

Updated config

50792a4

wording

674ca51

dstansby mentioned this pull request Apr 29, 2025

create a module for group metadata #3019

Open

TomAugspurger added 2 commits June 7, 2025 07:48

Merge remote-tracking branch 'upstream/main' into tom/fix/public-buffers

3143a89

doc

83c8c32

TomAugspurger force-pushed the tom/fix/public-buffers branch from 839b9a6 to 83c8c32 Compare June 7, 2025 13:00

TomAugspurger added 2 commits June 7, 2025 08:19

backwards compat

de66999

dstansby approved these changes Jun 7, 2025

View reviewed changes

dstansby added this to the 3.1.0 milestone Jun 17, 2025

Merge remote-tracking branch 'upstream/main' into tom/fix/public-buffers

c28a425

dstansby merged commit f68bf06 into zarr-developers:main Jun 18, 2025
30 checks passed

	zarr-python provides control where and how arrays stored in memory through
	Zarr-Python provides control for where and how arrays stored in memory through

Uh oh!

Public API for buffer objects #2876

Public API for buffer objects #2876

Uh oh!

Conversation

TomAugspurger commented Feb 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dstansby commented Mar 4, 2025

Uh oh!

TomAugspurger commented Mar 5, 2025

Uh oh!

TomAugspurger commented Mar 5, 2025

Uh oh!

dstansby left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

TomAugspurger commented Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

TomAugspurger commented Mar 28, 2025

Uh oh!

dstansby left a comment

Choose a reason for hiding this comment

Uh oh!

dstansby Mar 28, 2025

Choose a reason for hiding this comment

Uh oh!

TomAugspurger Mar 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dstansby commented Mar 28, 2025

Uh oh!

TomAugspurger commented Mar 28, 2025

Uh oh!

TomAugspurger commented Mar 28, 2025

Uh oh!

TomAugspurger commented Mar 28, 2025

Uh oh!

d-v-b commented Apr 20, 2025

Uh oh!

dstansby commented Apr 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dstansby commented Apr 22, 2025

Uh oh!

TomAugspurger commented Apr 27, 2025 via email

Uh oh!

d-v-b commented Jun 7, 2025

Uh oh!

TomAugspurger commented Jun 7, 2025

Uh oh!

dstansby commented Jun 7, 2025

Uh oh!

TomAugspurger commented Jun 7, 2025

Uh oh!

dstansby left a comment

Choose a reason for hiding this comment

Uh oh!

dstansby commented Jun 17, 2025

Uh oh!

Uh oh!

Uh oh!

TomAugspurger commented Feb 28, 2025 •

edited

Loading

TomAugspurger commented Mar 28, 2025 •

edited

Loading

dstansby commented Apr 22, 2025 •

edited

Loading