Add interpreter for AllGatherOp #1727

Merged
merged 3 commits into openxla:main from all_gather
Aug 19, 2023

Conversation

ghpvnist (Member) commented Aug 9, 2023

We have the following constraints in the spec (a small positive example is sketched after the list):

(I1) `operand`: tensor or per-tensor quantized tensor.
(I2) `all_gather_dim`: constant of type `si64`.
(I3) `replica_groups`: 2-dimensional tensor constant of type `si64`.
(I4) `channel_id`: constant of type `si64`.
(I5) `use_global_device_ids`: constant of type `i1`.
(C1) `0 <= all_gather_dim < rank(operand)`.
(C2) `is_unique(replica_groups)`.
(C3) `size(replica_groups)` is defined as:
* `num_replicas` if `cross_replica` is used.
* `num_replicas` if `cross_replica_and_partition` is used.
* `num_processes` if `flattened_ids` is used.
(C4) `0 <= replica_groups < size(replica_groups)`.
(C5) If `use_global_device_ids = true`, then `channel_id > 0`.
(C6) `type(result) = type(operand)` except:
* `dim(result, all_gather_dim) =
  dim(operand, all_gather_dim) * dim(process_groups, 1)`.
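
For reference, here is a minimal positive example of the op (a sketch in the style of the spec's `all_gather` example; the concrete shapes, replica groups, and channel handle are illustrative), showing C6's result-shape rule:

```mlir
// num_replicas: 2, num_partitions: 1, so cross_replica yields one group of 2 processes.
// C6: dim(result, 1) = dim(operand, 1) * dim(process_groups, 1) = 2 * 2 = 4.
%result = "stablehlo.all_gather"(%operand) {
  all_gather_dim = 1 : i64,
  replica_groups = dense<[[0, 1]]> : tensor<1x2xi64>,
  channel_handle = #stablehlo.channel_handle<handle = 0, type = 0>
} : (tensor<2x2xi64>) -> tensor<2x4xi64>
```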

These constraints will be comprehensively covered by the following tests:

I1: a) `operand` is not a tensor. (Covered by ODS).
I2: a) `all_gather_dim` is not a constant of type `si64`. (Covered by ODS).
I3: a) `replica_groups` is not a 2-dimensional tensor constant.
I3: b) `element_type(replica_groups) != si64`. (Covered by ODS).
I4: a) `channel_id` is not a constant of type `si64`. (Covered by ODS).
I5: a) `use_global_device_ids` is not a constant of type `i1`. (Covered by ODS).
C1: a) `all_gather_dim < 0`.
C1: b) `all_gather_dim >= rank(operand)`.
C2: a) `is_unique(replica_groups) = false`.
C3: a) `size(replica_groups)` is defined as:
* `num_replicas` if `cross_replica` is used.
* `num_replicas` if `cross_replica_and_partition` is used.
* `num_processes` if `flattened_ids` is used. 
C4: a) `replica_groups < 0`.
C4: b) `replica_groups >= size(replica_groups)`.
C5: a) `use_global_device_ids = true` and `channel_id <= 0`.
C6: a) `type(result) != type(operand)` except:
* `dim(result, all_gather_dim) =
  dim(operand, all_gather_dim) * dim(process_groups, 1)`.

If we drop the "Covered by ODS" pieces, this will leave us with the following test cases (one of which is sketched after the list):

I3: a) `replica_groups` is not a 2-dimensional tensor constant.
C1: a) `all_gather_dim < 0`.
C1: b) `all_gather_dim >= rank(operand)`.
C2: a) `is_unique(replica_groups) = false`.
C3: a) `size(replica_groups)` is defined as:
* `num_replicas` if `cross_replica` is used.
* `num_replicas` if `cross_replica_and_partition` is used.
* `num_processes` if `flattened_ids` is used. 
C4: a) `replica_groups < 0`.
C4: b) `replica_groups >= size(replica_groups)`.
C5: a) `use_global_device_ids = true` and `channel_id <= 0`.
C6: a) `type(result) != type(operand)` except:
* `dim(result, all_gather_dim) =
  dim(operand, all_gather_dim) * dim(process_groups, 1)`.
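
As a sketch of what one of these negative tests might look like (the function name and diagnostic text below are hypothetical, not the verifier's exact wording), here is C1a expressed as an MLIR test expecting a verifier error:

```mlir
// C1a: all_gather_dim must not be negative.
func.func @all_gather_c1a(%operand : tensor<2x2xi64>) -> tensor<2x4xi64> {
  // expected-error@+1 {{all_gather_dim cannot be negative}}
  %result = "stablehlo.all_gather"(%operand) {
    all_gather_dim = -1 : i64,
    replica_groups = dense<[[0, 1]]> : tensor<1x2xi64>,
    channel_handle = #stablehlo.channel_handle<handle = 0, type = 0>
  } : (tensor<2x2xi64>) -> tensor<2x4xi64>
  func.return %result : tensor<2x4xi64>
}
```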

Notes:

* C3a verification is infeasible since `num_replicas` and `num_partitions` are not known statically.
* C4b verification is infeasible since `size(replica_groups)` is not known statically.

closes #1118

@ghpvnist ghpvnist added the Interpreter and Migrate to MHLO labels Aug 9, 2023
@ghpvnist ghpvnist requested a review from burmako August 9, 2023 23:53
@ghpvnist ghpvnist assigned burmako and ghpvnist and unassigned burmako Aug 9, 2023
@ghpvnist ghpvnist marked this pull request as draft August 10, 2023 00:02
@ghpvnist ghpvnist assigned burmako and unassigned ghpvnist Aug 10, 2023
@ghpvnist ghpvnist marked this pull request as ready for review August 10, 2023 00:08
@ghpvnist ghpvnist assigned ghpvnist and unassigned burmako Aug 10, 2023
@ghpvnist ghpvnist marked this pull request as draft August 10, 2023 01:18
@ghpvnist ghpvnist assigned burmako and unassigned ghpvnist Aug 10, 2023
@ghpvnist ghpvnist marked this pull request as ready for review August 10, 2023 18:12
@ghpvnist ghpvnist force-pushed the all_gather branch 5 times, most recently from 94fc83c to 16e0cec on August 12, 2023 07:05
@burmako burmako assigned ghpvnist and unassigned burmako Aug 18, 2023
@ghpvnist ghpvnist assigned burmako and unassigned ghpvnist Aug 18, 2023
@burmako burmako assigned ghpvnist and unassigned burmako Aug 18, 2023
@ghpvnist ghpvnist merged commit 2a38fde into openxla:main Aug 19, 2023
7 checks passed
@ghpvnist ghpvnist deleted the all_gather branch August 19, 2023 00:06
penagos pushed a commit to penagos/stablehlo that referenced this pull request Sep 21, 2023
penagos pushed a commit to penagos/stablehlo that referenced this pull request Sep 21, 2023
@ghpvnist ghpvnist added the Spec label Sep 29, 2023