
Handle large datasets efficiently #582

Open
dalonsoa opened this issue Oct 9, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@dalonsoa
Collaborator

dalonsoa commented Oct 9, 2024

  • Some models will require data at a much higher temporal resolution than the wider model update tick, for example sub-daily or daily inputs to the Abiotic model.
  • The input data files for this use case can be very large – not something we really want to ingest into the Data object at model startup and hold in RAM.
  • So, where do we store this kind of data, and is there a way to lazily load it as required? This might be something that dask is well suited to, since it handles lazy loading of chunked data – see the sketch below.
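As a rough illustration, here is a minimal sketch of what lazy loading could look like using xarray backed by dask. The file name, variable name and chunk sizes are hypothetical placeholders, not anything that exists in the current codebase:

```python
import xarray as xr

# Passing `chunks` makes xarray back each variable with a dask array, so
# only the metadata is read here; the values stay on disk until needed.
dataset = xr.open_dataset(
    "abiotic_subdaily_inputs.nc",  # hypothetical large input file
    chunks={"time": 24},           # e.g. one day of hourly data per chunk
)

# Selecting a slice is still lazy: no data has been loaded yet.
one_tick = dataset["air_temperature"].isel(time=slice(0, 24))

# Only this call pulls the required chunk(s) into memory.
values = one_tick.compute()
print(values.shape)
```

In principle, the Data object could then hold a lazy reference like this and only pull the chunks needed for each update step, rather than loading the whole file at startup.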
@dalonsoa dalonsoa added the enhancement New feature or request label Oct 9, 2024
@dalonsoa
Collaborator Author

dalonsoa commented Oct 9, 2024

@vgro, we will need an example simulation with at least one BIG file and some indication of where it is used, so we can explore how best to handle it memory-wise.

@alexdewar alexdewar self-assigned this Oct 9, 2024