Skip to content

Loading xarray based on kerchunke'd catalogue may sometime get NaN from unstable http access? #253

Closed
fsspec/filesystem_spec
#1120
@tinaok

Description

@tinaok

I'm trying to use CMIP6 NetCDF datas on the fly, from http server at ESGF

They expose data on opendap , download_url and gridftp.
There are some limitation due to opendap service so I'm trying to use kerchunk to open the data through download_url.

Here is the notebook showing the problem.

Kerchunk work perfect, Xarray recognise the file with chunks. But when starting to access from multiple dask workers to access the data, some data are missing, and for each try, it is not the same data missing.

I downloaded the NetcdF in question to local s3 bucket and did same operation, no data missing.

My guess is probably server is not responding time to time, (but I do not know )

I would like to stay with access through kerchunk directory from the server, and avoid doing wget to all the file we will analyse.

Is there anyway to

  • show err so that we know there is something not going well
  • automatic re-try if there is no answer from server ?
    Can fsspec do something to fix this problem? (or this is more on Xarray side?)

Thank you for your help

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions