-
Notifications
You must be signed in to change notification settings - Fork 47
Description
We still have dependency issues with kerchunk not supporting zarr-python v3, but icechunk requiring it (see #321). It unfortunately doesn't look like these will be resolved in kerchunk any time soon.
Right now we only actually use kerchunk code directly within VirtualiZarr for:
- the netCDF3 reader
- the HDF5 reader (which we now have Sean's alternative
HDF
reader for) - the FITS reader (which is cool but fewer people use)
We really want to be able to pin zarr-python>=3.0.0
, for so many reasons. But right now doing so would break those readers in VirtualiZarr because of the kerchunk incompatibility.
I realised yesterday that the most expedient thing to do here might be to just vendor (i.e. copy-paste) the code for the FITS and netCDF3 readers into VirtualiZarr.
The implementations for these are:
- the only part of kerchunk we need to keep right now to make our tests pass (as we don't currently have a working TIFF reader - see open_virtual_dataset fails to open tiffs #291),
- they already work with zarr v3 (I think - the netCDF3 implementation doesn't even import zarr so that one really should work),
- they have barely been updated in over a year so we're not likely to miss out on regular new changes,
- they're an easily abstracted component that we can fix/replace later,
- They're not that big, ~300 lines each.
That would allow us to go full steam ahead with all the other things we need to do to be able to pin zarr-python>=3.0.0
(e.g. #374, #182, #175).
FYI @bamford - I see you're the last person that committed to the FITS reader (fsspec/kerchunk#531), so you will want to be aware of us potentially forking it!
Metadata
Metadata
Assignees
Labels
Type
Projects
Status