
CI: 05/02/25 upstream sync #399

Open · wants to merge 1,191 commits into base: rocm-main

Conversation

rocm-repo-management-api-2[bot]

Daily sync with upstream

apaszke and others added 30 commits April 23, 2025 05:32
…matting

There are no restrictions on window sizes in that backend.

Also, replace Markdown quotations with Note/Warning blocks in the GPU reference
for added clarity.

PiperOrigin-RevId: 750555285
PiperOrigin-RevId: 750575644
PiperOrigin-RevId: 750600271
At the moment mypy isn't correctly detecting errors related to jaxlib. A future change will fix that, and this PR fixes the errors that will be revealed by it.

PiperOrigin-RevId: 750603531
Users should now be able to instantiate aliased `smem` buffers by using a
`RefUnion`, which takes a variadic number of trees of refs as input.
`RefUnion` represents a union/coproduct of all its operands: its operand
groups alias each other (overlap in memory), while the elements within each
group represent products and are laid out consecutively in memory.

The resulting aliased `smem` ref can then be unfolded into a flat structure
using assignment inside the kernel. Here is an example:

```
# Assumed imports for this example (the original snippet omits them).
import functools

import jax
import jax.numpy as jnp
from jax.experimental import pallas as pl
from jax.experimental.pallas import mosaic_gpu as plgpu
from jax.experimental.pallas import pallas_call

@functools.partial(
  pallas_call,
  out_shape=jax.ShapeDtypeStruct([128], jnp.float32),
  in_specs=[pl.BlockSpec((256,))],
  out_specs=pl.BlockSpec((128,), memory_space=plgpu.GMEM),
  scratch_shapes=[
    # The 256-element ref aliases (overlaps) the group of two 128-element refs.
    plgpu.RefUnion(
      plgpu.SMEM((256,), jnp.float32),
      [
        plgpu.SMEM((128,), jnp.float32),
        plgpu.SMEM((128,), jnp.float32),
      ],
    )
  ],
)
def kernel(x_ref, o_ref128, aliased_ref):
  # Unfold the aliased ref into its flat structure via unpacking assignment.
  smem_ref256, _, smem_ref128 = aliased_ref
  smem_ref256[...] = x_ref[...] + 1
  plgpu.commit_smem()
  plgpu.copy_smem_to_gmem(smem_ref128, o_ref128)
```

PiperOrigin-RevId: 750624152
…issue example is already varying on `x` and we were dropping that when we called into `lax_internal._one`

Fixes jax-ml#28193

PiperOrigin-RevId: 750634402
Having the directory structure of the jaxlib wheel differ from that of the source tree confuses type checkers such as mypy, since they sometimes find type stubs in the installed jaxlib wheel and sometimes in the installed source tree.

Instead:
* don't include type stubs in the jaxlib wheel
* don't install the jaxlib wheel as part of pre-commit
* make sure that the type stubs (and the underlying libraries) occupy the same position in the `jaxlib/` directory of the JAX source tree as they would in the installed jaxlib wheel.

For now, we leave some stubs that forward from the old locations to the new locations for certain headers and modules. These will be removed after migrating some users.

PiperOrigin-RevId: 750650528
PiperOrigin-RevId: 750652690
Renaming only, no functional changes intended.

There are two reasons to do this:
* I want to split some XLA-specific things out of the JAX wheel and move them back into the XLA repository. It would be nice if the name "xla" could be reserved for that extension instead.
* There are lots of jax-specific things in this extension.

PiperOrigin-RevId: 750709831
PiperOrigin-RevId: 750722897
PiperOrigin-RevId: 750732175
…es aren't removed in sharding propagation.

PiperOrigin-RevId: 750739094
…_p from `check_rep` to `check_vma`.

PiperOrigin-RevId: 750741689
… param->value dict

The parameters must be specified via a dataclass or a mapping from a backend to the
corresponding dataclass.

PiperOrigin-RevId: 750750391
…nd also change docs to point to `jax.shard_map`

PiperOrigin-RevId: 750760353
PiperOrigin-RevId: 750907599
…pgroup logic in the dialect lowering.

The `DialectBarrierRef` class has the same interface as `BarrierRef`, but uses mgpu ops for initialization and `expect_arrive_tx`. This makes the IR cleaner and also lets us adjust arrival counts and bytes in the dialect lowering, which in turn keeps the high-level code cleaner.

The new lowering always has all threads in a warpgroup arrive when using WG semantics. Previously only a single thread arrived, but keeping that behavior would have complicated things going forward.

The existing tests (including the one that's no longer skipped) cover the new behavior.

PiperOrigin-RevId: 750948900
Google-ML-Automation and others added 28 commits May 1, 2025 06:10
I don't expect that we actually need the device ordinal to be defined on the execution context, but we can add it back in statically (it's already decoded in the handler) if necessary.

PiperOrigin-RevId: 753573598
This is quite helpful while trying to debug the load/store routines.

PiperOrigin-RevId: 753599963
… Layout API rename!

PiperOrigin-RevId: 753636449
…t. So `1.0:f32` -> `1.0:f32[]`

PiperOrigin-RevId: 753640777
This issue occurs when some of the leaves have custom `__eq__` methods defined on them, which either raise errors when compared to certain other types (see http://cl/753579906) or return values whose truthiness cannot be evaluated, e.g.:

```
import jax.tree_util as jtu
import numpy as np

jtu.all_leaves(
  [[np.asarray([1, 2])]],
  is_leaf=lambda x: jtu.all_leaves([x]),
)
```

```
ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()
```

This fix avoids equality issues by using the `is` operator instead of `==`, and introduces tests for the case where `is_leaf` is provided.
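
For illustration, a minimal sketch (not the actual patch) of why identity comparison sidesteps the problem for a NumPy leaf:

```
import numpy as np

a = np.asarray([1, 2])

# `==` on NumPy arrays is elementwise, so using its result in a boolean
# context raises the ValueError shown above:
#     if a == a: ...
# `is` compares object identity and always yields a plain bool:
assert a is a
```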

PiperOrigin-RevId: 753684035
Useful for filtering events by function name or differentiating between events.

PiperOrigin-RevId: 753695215
… in_axes, out_axes, axis_name)`. This change does NOT make the API public.

The API semantics are as follows:
* `smap` only allows going into `Manual` mode one mesh axis at a time via the `axis_name` argument.

* The mesh needs to be present in the context via `use_mesh` or `set_mesh`.

* If `in_axes` or `out_axes` contains `None`, the corresponding input or output is **replicated**. This is similar to `vmap`, where `None` means an unmapped input.

* If the context mesh is in full explicit mode, `in_axes` can be inferred from the arguments. But how do we tell `smap` to do that? We **can't** use `None`, because `None` means replicated in `smap`.
So we introduce a singleton called `Infer` which, when passed to `smap`, tells it to infer the in_axes (in_specs) from the arguments. For example: `smap(f, in_axes=Infer, out_axes=0, axis_name='x')`.
You always have the option of specifying `in_axes` rather than inferring them, even in full explicit mode :) (See the sketch after this list.)
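
Below is a minimal, hypothetical usage sketch of these semantics. `smap` and `Infer` are not public, so the import location shown is an assumption, as is the exact mesh setup; only the call signature and the meaning of `in_axes`/`out_axes`/`axis_name` come from the description above.

```
import jax
import jax.numpy as jnp

# Hypothetical import location; the note above does not say where these live.
from jax.experimental.shard_map import smap, Infer

mesh = jax.make_mesh((2,), ('x',))

def f(block):
  # Runs in Manual mode over mesh axis 'x'; `block` is the per-device shard.
  return block * 2

with jax.sharding.use_mesh(mesh):
  # in_axes=0 maps axis 0 of the argument over 'x'; None would mean replicated.
  out = smap(f, in_axes=0, out_axes=0, axis_name='x')(jnp.arange(8.0))
  # In full explicit mode, in_axes=Infer would infer the specs from the
  # arguments instead of them being given explicitly.
```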

PiperOrigin-RevId: 753695446
Updates LLVM usage to match
[7752e0a10b25](llvm/llvm-project@7752e0a10b25)

PiperOrigin-RevId: 753710403
Extracting `.tar` files is 10 times faster than extracting `.tar.xz` files. By enabling `.tar` file usage in RBE jobs, we will save at least one minute of execution time in all Bazel RBE GPU jobs.

PiperOrigin-RevId: 753730448
PiperOrigin-RevId: 753737826
The XLA GPU runtime does not yet handle device assertions well and will hang
if the assert is triggered. However, the assertion output still appears in
stderr, so I think having `cf.assert` support is still useful.

PiperOrigin-RevId: 753742121
PiperOrigin-RevId: 753786660
… just forward out_sharding to their lax variants.

PiperOrigin-RevId: 753797017
PiperOrigin-RevId: 753797982
rocm-repo-management-api-2[bot] requested a review from a team as a code owner May 2, 2025 06:02
rocm-repo-management-api-2[bot] enabled auto-merge (rebase) May 2, 2025 06:02