@cjfuller
Collaborator

While experimenting with zarr for arrays instead of dask, I found that dask gets a modest additional speedup from even larger chunks. This commit sizes chunks up to the amount of available memory, capped at 2**31-1 bytes. That cap was a hard limit for zarr, and dask seemed to hang when tested with larger chunks.

Tested by:

  • ./dev/format.sh
  • ./dev/autofix_lint.sh
  • ./dev/type_check.sh
  • ./dev/run_tests.sh
  • Ran a manual test on an 8x8 FOV, 3000x3000 px, 5-channel image and confirmed it completes faster than before.
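
The chunk-size rule described above (use available memory, but never more than 2**31-1 bytes) can be sketched roughly as follows. This is an illustration only, not the code from this commit: the function names, the `itemsize` parameter, and the idea of counting whole 2D planes per chunk are all assumptions made for the example.

```python
# Hypothetical sketch of the capping rule; names are illustrative, not from the PR.

# Upper bound stated in the description: 2**31 - 1 bytes.
MAX_CHUNK_BYTES = 2**31 - 1


def chunk_byte_budget(available_bytes: int, max_bytes: int = MAX_CHUNK_BYTES) -> int:
    """Return the largest chunk size to use, in bytes.

    The budget is the available memory, clamped to max_bytes.
    """
    if available_bytes <= 0:
        raise ValueError("available_bytes must be positive")
    return min(available_bytes, max_bytes)


def planes_per_chunk(height: int, width: int, itemsize: int, budget_bytes: int) -> int:
    """Return how many whole 2D planes fit in the byte budget (at least one)."""
    plane_bytes = height * width * itemsize
    return max(1, budget_bytes // plane_bytes)


if __name__ == "__main__":
    # Assumed example: 16 GiB available, 3000x3000 px planes, 2 bytes per pixel.
    budget = chunk_byte_budget(16 * 2**30)
    print(budget, planes_per_chunk(3000, 3000, 2, budget))
```

In practice the `available_bytes` argument could come from something like `psutil.virtual_memory().available`; whether this commit reads memory that way is not stated in the description.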

@cjfuller cjfuller requested a review from hongquanli January 16, 2025 02:09
@cjfuller cjfuller merged commit 41f4cd9 into main Jan 16, 2025
@cjfuller cjfuller deleted the colin/dynamic_chunk_size_with_memory_limit branch January 16, 2025 02:09