Closed
Description
Following discussion from pangeo-data/pangeo#19
How would we go about implementing a concat or append function for zarr data stores? I am imagining something like xr.concat here. Its not clear to me how this would work when using open_mfdataset
.
zarray_1 = xr.open_zarr(store=gcsmap)
ds2 = xr.open_dataset(path_to_netcdf)
xr.concat([zarray_1, ds2])
Problem description
If you are using cloud storage facility like gcs, ds.to_zarr
can fail before the completion of the upload. This is a problem for multi-TB datasets as the entire process needs to be restarted without any way to resume where you left off.
Expected Output
new zarr dataset with additional dataset appended along appropriate dim