Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generalize delayed #7811

Open
wants to merge 153 commits into
base: main
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
153 commits
Select commit Hold shift + click to select a range
15fc2b8
generalise chunk methods to allow cubed
TomNicholas Sep 10, 2022
5e05b71
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 10, 2022
cff89ee
fic typing typo
TomNicholas Sep 10, 2022
039973b
Merge branch 'cubed_integration' of https://github.com/TomNicholas/xa…
TomNicholas Sep 10, 2022
60d44bc
fixed circular import
TomNicholas Sep 11, 2022
5ddba7e
fix some mypy errors
TomNicholas Sep 11, 2022
37d0d66
added cubed to mypy ignore list
TomNicholas Sep 11, 2022
67d7efc
Merge branch 'main' into cubed_integration
TomNicholas Sep 11, 2022
cdcb3fb
simplify __array_ufunc__ check
TomNicholas Sep 12, 2022
73e4563
Revert "simplify __array_ufunc__ check" as I pushed to wrong branch
TomNicholas Sep 12, 2022
5995685
update cubed array type
TomNicholas Sep 20, 2022
46223ae
Merge branch 'main' into cubed_integration
TomNicholas Sep 20, 2022
320b09f
fix missed conflict
TomNicholas Sep 20, 2022
3facfd6
sketch for ChunkManager adapter class
TomNicholas Sep 20, 2022
c616a85
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 20, 2022
ecabaa4
Remove erroneous docstring about usage of map_blocks
TomNicholas Sep 23, 2022
e53a588
apply_ufunc -> apply_gufunc
TomNicholas Sep 23, 2022
fe21edd
chunk -> from_array
TomNicholas Sep 23, 2022
3f6aedc
remove staticmethods
TomNicholas Sep 23, 2022
ea8f482
attempt to type methods of ABC
TomNicholas Sep 23, 2022
c49ab8e
from_array
TomNicholas Sep 23, 2022
26d1868
attempt to specify types
TomNicholas Sep 23, 2022
e9b4a33
method for checking array type
TomNicholas Sep 26, 2022
3a43b00
Merge branch 'main' into pr/7019
Illviljan Oct 27, 2022
c7c9589
Update pyproject.toml
Illviljan Oct 27, 2022
fc051e3
Merge branch 'main' into cubed_integration
TomNicholas Jan 21, 2023
56e9d0f
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 21, 2023
3b16cca
fixed import errors
TomNicholas Jan 22, 2023
7ac3323
generalize .chunk method kwargs
TomNicholas Jan 22, 2023
e732b87
used dask functions in dask chunkmanager
TomNicholas Jan 22, 2023
68930eb
Merge branch 'cubed_integration' of https://github.com/TomNicholas/xa…
TomNicholas Jan 22, 2023
8442e1f
define signatures for apply_gufunc, blockwise, map_blocks
TomNicholas Jan 22, 2023
3717431
prototype function to detect which parallel backend to use
TomNicholas Jan 22, 2023
78d8969
Merge branch 'main' into cubed_integration
TomNicholas Mar 6, 2023
7ac6531
add cubed.apply_gufunc
TomNicholas Mar 6, 2023
e423bfb
ruffify
TomNicholas Mar 6, 2023
149db9d
add rechunk and compute methods for cubed
TomNicholas Mar 6, 2023
280c563
xr.apply_ufunc now dispatches to chunkmanager.apply_gufunc
TomNicholas Mar 6, 2023
42186e7
CubedManager.chunks
TomNicholas Mar 6, 2023
103a755
attempt to keep dask and cubed imports lazy
TomNicholas Mar 6, 2023
f2bce3d
generalize idxmax
TomNicholas Mar 6, 2023
f09947d
move unify_chunks import to ChunkManager
TomNicholas Mar 6, 2023
e760f10
generalize Dataset.load()
TomNicholas Mar 6, 2023
b1a4e35
check explicitly for chunks attribute instead of hard-coding cubed
TomNicholas Mar 6, 2023
5320f4d
better function names
TomNicholas Mar 6, 2023
45ed5d2
add cubed version of unify_chunks
TomNicholas Mar 6, 2023
eec096b
recognize wrapped duck dask arrays (e.g. pint wrapping dask)
TomNicholas Mar 6, 2023
c64ff5f
add some tests for fetching ChunkManagers
TomNicholas Mar 7, 2023
8a37905
add from_array_kwargs to open_dataset
TomNicholas Mar 9, 2023
989d6bb
add from_array_kwargs to open_zarr
TomNicholas Mar 9, 2023
8c7fe79
pipe constructors through chunkmanager
TomNicholas Mar 9, 2023
0222b55
generalize map_blocks inside coding
TomNicholas Mar 9, 2023
9d6cf6b
Merge branch 'main' into cubed_integration
TomNicholas Mar 13, 2023
afc6abc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 13, 2023
2c0cc26
fixed full_like
TomNicholas Mar 13, 2023
1a255cf
Merge branch 'cubed_integration' of https://github.com/TomNicholas/xa…
TomNicholas Mar 13, 2023
c398d98
add from_array_kwargs to open_zarr
TomNicholas Mar 13, 2023
598bf12
don't import dask.tokenize
TomNicholas Mar 14, 2023
7bef188
fix bugs with passing from_array_kwargs down
TomNicholas Mar 14, 2023
7af5395
generalise reductions by adding to chunkmanager
TomNicholas Mar 14, 2023
287e96c
moved nanfirst/nanlast to duck_array_ops from dask_array_ops
TomNicholas Mar 14, 2023
8bbc141
generalize interp
TomNicholas Mar 14, 2023
6cfe9fa
generalized chunk_hint function inside indexing
TomNicholas Mar 14, 2023
4ca044b
DaskIndexingAdapter->ChunkedIndexingAdapter
TomNicholas Mar 14, 2023
8ed5ed6
Merge branch 'main' into cubed_integration
TomNicholas Mar 14, 2023
2a4c38b
Revert "DaskIndexingAdapter->ChunkedIndexingAdapter"
TomNicholas Mar 14, 2023
45a4c98
pass cubed-related kwargs down through to_zarr by adding .store to Ch…
TomNicholas Mar 14, 2023
dee5b33
fix typing_extensions on py3.9
TomNicholas Mar 14, 2023
176d7fa
fix ImportError with cubed array type
TomNicholas Mar 14, 2023
9e58d6d
give up trying to import TypeAlias in CI
TomNicholas Mar 14, 2023
a6219a0
fix import of T_Chunks
TomNicholas Mar 14, 2023
9f21994
fix no_implicit_optional warnings
TomNicholas Mar 14, 2023
eb7bb0b
don't define CubedManager if cubed can't be imported
TomNicholas Mar 14, 2023
57733de
fix local mypy errors
TomNicholas Mar 14, 2023
4c58b28
don't explicitly pass enforce_ndim into dask.array.map_blocks
TomNicholas Mar 15, 2023
d07830c
fix drop_axis default
TomNicholas Mar 15, 2023
c1bf040
Merge branch 'main' into cubed_integration
TomNicholas Mar 16, 2023
3ae21d9
use indexing adapter on cubed arrays too
TomNicholas Mar 16, 2023
7ef0129
use array API-compatible version of astype function
TomNicholas Mar 16, 2023
ec22963
whatsnew
TomNicholas Mar 16, 2023
4c8d773
document new kwargs
TomNicholas Mar 16, 2023
f4de577
add chunkmanager entrypoint
TomNicholas Mar 21, 2023
1cd7283
move CubedManager to a separate package
TomNicholas Mar 21, 2023
5386711
guess chunkmanager based on whats available
TomNicholas Mar 21, 2023
6b173de
Merge branch 'main' into cubed_integration
TomNicholas Mar 22, 2023
c431a5f
fix bug with tokenizing
TomNicholas Mar 22, 2023
7ab9047
adapt tests to emulate existence of entrypoint
TomNicholas Mar 22, 2023
72f8f5f
use fixture to setup/teardown dummy entrypoint
TomNicholas Mar 22, 2023
34c6aea
refactor to make DaskManager unavailable if dask not installed
TomNicholas Mar 22, 2023
fb9466d
typing
TomNicholas Mar 22, 2023
ffd2e21
Merge branch 'main' into cubed_integration
TomNicholas Mar 22, 2023
36b2be0
move whatsnew to latest xarray version
TomNicholas Mar 22, 2023
77a1e4e
remove superfluous lines from whatsnew
TomNicholas Mar 22, 2023
a6222f9
fix bug where zarr backend attempted to use dask when not installed
TomNicholas Mar 23, 2023
61fe236
Remove rogue print statement
TomNicholas Mar 23, 2023
447d1f1
Merge branch 'cubed_integration' of https://github.com/TomNicholas/xa…
TomNicholas Mar 23, 2023
a7a6a6e
Clarify what's new
TomNicholas Mar 23, 2023
aa64996
use monkeypatch to mock registering of dummy chunkmanager
TomNicholas Mar 23, 2023
fec1a13
Merge branch 'cubed_integration' of https://github.com/TomNicholas/xa…
TomNicholas Mar 23, 2023
db11947
more tests for guessing chunkmanager correctly
TomNicholas Mar 23, 2023
2c18df6
raise TypeError if no chunkmanager found for array types
TomNicholas Mar 23, 2023
2e49154
Correct is_chunked_array check
TomNicholas Mar 23, 2023
748e90d
vendor dask.array.core.normalize_chunks
TomNicholas Mar 24, 2023
70804f4
Merge branch 'cubed_integration' of https://github.com/TomNicholas/xa…
TomNicholas Mar 24, 2023
dae2fe4
add default implementation of rechunk in ABC
TomNicholas Mar 24, 2023
4ef500c
remove cubed-specific type check in daskmanager
TomNicholas Mar 24, 2023
ba66419
nanfirst->chunked_nanfirst
TomNicholas Mar 24, 2023
7fd4617
revert adding cubed to NON_NUMPY_SUPPORTED_ARRAY_TYPES
TomNicholas Mar 24, 2023
69d77c9
licensing to vendor functions from dask
TomNicholas Mar 24, 2023
8337857
fix bug
TomNicholas Mar 24, 2023
9850a46
ignore mypy error
TomNicholas Mar 24, 2023
488fd5b
separate chunk_manager kwarg from from_array_kwargs dict
TomNicholas Mar 29, 2023
00bcf6c
rename kwarg to chunked_array_type
TomNicholas Mar 29, 2023
ff1f8ab
Merge branch 'main' into cubed_integration
TomNicholas Mar 30, 2023
844726d
refactor from_array_kwargs in .chunk ready for deprecation
TomNicholas Apr 4, 2023
3d56a3d
print statements in test so I can comment on them
TomNicholas Apr 5, 2023
1952c55
remove print statements now I've commented on them in PR
TomNicholas Apr 5, 2023
3ba8d42
should fix dask naming tests
TomNicholas Apr 5, 2023
b15411c
Merge branch 'main' into cubed_integration
TomNicholas Apr 5, 2023
53d6094
make dask-specific kwargs explicit in from_array
TomNicholas Apr 5, 2023
7dc6581
debugging print statements
TomNicholas Apr 6, 2023
fcaf499
Revert "debugging print statements"
TomNicholas Apr 6, 2023
64df7e8
fix gnarly bug with auto-determining chunksizes caused by not referri…
TomNicholas Apr 6, 2023
747ada5
hopefully fix broken docstring
TomNicholas Apr 6, 2023
9b33ab7
Revert "make dask-specific kwargs explicit in from_array"
TomNicholas Apr 6, 2023
c8d5aa1
Merge branch 'main' into cubed_integration
TomNicholas Apr 6, 2023
6a7a043
show chunksize limit used in failing tests
TomNicholas Apr 6, 2023
20f92c6
move lazy indexing adapter up out of chunkmanager code
TomNicholas Apr 6, 2023
796a577
try upgrading minimum version of dask
TomNicholas Apr 6, 2023
29d0c92
Revert "try upgrading minimum version of dask"
TomNicholas Apr 11, 2023
031017b
un-vendor dask.array.core.normalize_chunks
TomNicholas Apr 11, 2023
a8c3413
Merge branch 'main' into cubed_integration
TomNicholas Apr 11, 2023
14a1226
refactored to all passing ChunkManagerEntrypoint objects directly
TomNicholas Apr 12, 2023
5dd9d35
Remove redundant Nones from types
TomNicholas Apr 17, 2023
5a46294
From future import annotations
TomNicholas Apr 17, 2023
d6b56c6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 17, 2023
471d22a
From functools import annotations
TomNicholas Apr 17, 2023
8378f43
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 17, 2023
f8b1020
From future import annotations
TomNicholas Apr 17, 2023
907c15b
Merge branch 'main' into cubed_integration
TomNicholas Apr 17, 2023
11676ab
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 17, 2023
76ce09e
defined type for NormalizedChunks
TomNicholas May 1, 2023
127c184
Merge branch 'cubed_integration' of https://github.com/TomNicholas/xa…
TomNicholas May 1, 2023
604bbf3
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] May 1, 2023
7604594
standardized capitalization of ChunkManagerEntrypoint
TomNicholas May 1, 2023
97537dd
Merge branch 'cubed_integration' of https://github.com/TomNicholas/xa…
TomNicholas May 1, 2023
026fe17
Merge branch 'main' into cubed_integration
TomNicholas May 1, 2023
355555f
ensure ruff doesn't remove import
TomNicholas May 1, 2023
6eac87a
ignore remaining typing errors stemming from unclear dask typing for …
TomNicholas May 2, 2023
f4224f6
Merge branch 'main' into cubed_integration
TomNicholas May 2, 2023
4ec8370
rename store_kwargs->chunkmanager_store_kwargs
TomNicholas May 2, 2023
316c63d
missed return value
TomNicholas May 2, 2023
c183e73
redirect dask.delayed in open_mfdataset through ChunkManager
TomNicholas May 2, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
  • Loading branch information
pre-commit-ci[bot] committed Apr 17, 2023
commit 8378f4365ff8e698d9ab4e6039662d66dc7550e1
32 changes: 15 additions & 17 deletions xarray/core/parallelcompat.py
Original file line number Diff line number Diff line change
Expand Up @@ -14,9 +14,7 @@
Any,
Callable,
Generic,
Optional,
TypeVar,
Union,
)

import numpy as np
Expand All @@ -31,7 +29,7 @@


@functools.lru_cache(maxsize=1)
def list_chunkmanagers() -> dict[str, "ChunkManagerEntrypoint"]:
def list_chunkmanagers() -> dict[str, ChunkManagerEntrypoint]:
"""
Return a dictionary of available chunk managers and their ChunkManagerEntrypoint objects.

Expand All @@ -49,7 +47,7 @@ def list_chunkmanagers() -> dict[str, "ChunkManagerEntrypoint"]:

def load_chunkmanagers(
entrypoints: Sequence[EntryPoint],
) -> dict[str, "ChunkManagerEntrypoint"]:
) -> dict[str, ChunkManagerEntrypoint]:
"""Load entrypoints and instantiate chunkmanagers only once."""

loaded_entrypoints = {
Expand All @@ -65,8 +63,8 @@ def load_chunkmanagers(


def guess_chunkmanager(
manager: Union[str, "ChunkManagerEntrypoint", None]
) -> "ChunkManagerEntrypoint":
manager: str | ChunkManagerEntrypoint | None,
) -> ChunkManagerEntrypoint:
"""
Get namespace of chunk-handling methods, guessing from what's available.

Expand Down Expand Up @@ -100,7 +98,7 @@ def guess_chunkmanager(
)


def get_chunked_array_type(*args) -> "ChunkManagerEntrypoint":
def get_chunked_array_type(*args) -> ChunkManagerEntrypoint:
"""
Detects which parallel backend should be used for given set of arrays.

Expand Down Expand Up @@ -167,11 +165,11 @@ def chunks(self, data: T_ChunkedArray) -> T_Chunks:
@abstractmethod
def normalize_chunks(
self,
chunks: Union[tuple, int, dict, str],
shape: Union[tuple[int], None] = None,
limit: Union[int, None] = None,
dtype: Union[np.dtype, None] = None,
previous_chunks: Union[tuple[tuple[int, ...], ...], None] = None,
chunks: tuple | int | dict | str,
shape: tuple[int] | None = None,
limit: int | None = None,
dtype: np.dtype | None = None,
previous_chunks: tuple[tuple[int, ...], ...] | None = None,
) -> tuple[tuple[int, ...], ...]:
"""Called by open_dataset"""
...
Expand Down Expand Up @@ -202,10 +200,10 @@ def reduction(
self,
arr: T_ChunkedArray,
func: Callable,
combine_func: Optional[Callable] = None,
aggregate_func: Optional[Callable] = None,
axis: Optional[Union[int, Sequence[int]]] = None,
dtype: Optional[np.dtype] = None,
combine_func: Callable | None = None,
aggregate_func: Callable | None = None,
axis: int | Sequence[int] | None = None,
dtype: np.dtype | None = None,
keepdims: bool = False,
) -> T_ChunkedArray:
"""Used in some reductions like nanfirst, which is used by groupby.first"""
Expand Down Expand Up @@ -262,7 +260,7 @@ def unify_chunks(

def store(
self,
sources: Union[T_ChunkedArray, Sequence[T_ChunkedArray]],
sources: T_ChunkedArray | Sequence[T_ChunkedArray],
targets: Any,
**kwargs: dict[str, Any],
):
Expand Down