Description
I'm trying to use CMIP6 NetCDF datas on the fly, from http server at ESGF
They expose data on opendap , download_url and gridftp.
There are some limitation due to opendap service so I'm trying to use kerchunk to open the data through download_url.
Here is the notebook showing the problem.
Kerchunk work perfect, Xarray recognise the file with chunks. But when starting to access from multiple dask workers to access the data, some data are missing, and for each try, it is not the same data missing.
I downloaded the NetcdF in question to local s3 bucket and did same operation, no data missing.
My guess is probably server is not responding time to time, (but I do not know )
I would like to stay with access through kerchunk directory from the server, and avoid doing wget to all the file we will analyse.
Is there anyway to
- show err so that we know there is something not going well
- automatic re-try if there is no answer from server ?
Can fsspec do something to fix this problem? (or this is more on Xarray side?)
Thank you for your help