Merge branch 'master' into feat-general_basic_communication #8477

Merged
68 commits
2817305
Add distributed optional run (#8372)
jackalcooper Jun 7, 2022
c07f587
autoprof supports bandwidth (#8367)
daquexian Jun 7, 2022
f237503
remove tmp buffer of cumprod cpu backward kernel (#8369)
liufengwei0103 Jun 7, 2022
7feedc4
Move tensor api to cpython part3 (#8342)
marigoold Jun 7, 2022
e224be4
Add nn_functor Check (#7910)
MARD1NO Jun 8, 2022
c83fcaa
Add FusedMatmulBiasAddReluDropout [OneEmbedding] (#8222)
MARD1NO Jun 8, 2022
0fd888c
fix gather 0-dim tensor bug (#8376)
BBuf Jun 8, 2022
33634a2
add api to apply external job pass (#8370)
hjchen2 Jun 8, 2022
a8576ad
Add condition to find-test-cache-distributed (#8387)
jackalcooper Jun 8, 2022
b6abc62
warp dim util (#8382)
Flowingsun007 Jun 9, 2022
c10a30c
fix_bug_in_broadcast_min_max_grad_and_broadcast_like (#8379)
clackhan Jun 9, 2022
2e17cc3
fix bug about index (#8388)
liufengwei0103 Jun 9, 2022
469f72d
LogicalSliceAssign support full slice sbp (#8344)
wyg1997 Jun 9, 2022
d98132e
fix_tensor_from_numpy_mem_leak_bug (#8391)
clackhan Jun 9, 2022
82f4af5
Make of_pyext_obj static only to make sure only a python ext so has p…
jackalcooper Jun 9, 2022
813ffa7
Adjust tolerance setting in embedding_renorm unit test (#8394)
EsdeathYZH Jun 10, 2022
6aa2416
support front end compile for job to iree (#8249)
howin98 Jun 10, 2022
95240c2
Feat/zero mix with mp (#8036)
strint Jun 10, 2022
b3be1da
Revert embedding normal path and fix amp list (#8374)
EsdeathYZH Jun 10, 2022
5686581
replace fixed_vector with small_vector and make Shape inherit from it…
daquexian Jun 10, 2022
994c0f8
Light plan for debug (#8396)
chengtbf Jun 11, 2022
39a2d45
disable terminfo to fix missing terminfo symbols (#8400)
daquexian Jun 11, 2022
15eef18
fix bug of ZeRO MP in complex case (#8404)
chengtbf Jun 12, 2022
e6e864c
Remove redundant output_lbns in ir (#8409)
jackalcooper Jun 13, 2022
3b42b2f
Dev FusedCrossInteraction[OneEmbedding] (#8335)
MARD1NO Jun 13, 2022
ba56c84
add exe graph physical shape check msg (#8002)
strint Jun 13, 2022
f704b49
add batch_matmul sbp (#8385)
guo-ran Jun 13, 2022
c289645
suppress gcc11 false positive warning (#8401)
daquexian Jun 13, 2022
e8547b4
fix variable op conversion to tosa error in ninja c1 (#8412)
howin98 Jun 13, 2022
312a4f4
Fix eval error in FusedMLP (#8413)
MARD1NO Jun 14, 2022
2d80a33
Init NCCL communicator in graph mode unifiedly (#8263)
leaves-zwx Jun 14, 2022
37832cc
fix dim_scatter 0-dim tensor bug (#8418)
BBuf Jun 14, 2022
bed02f4
target based external libraries (#8421)
daquexian Jun 14, 2022
b58da14
Refine hardcoded attr setting/getting in ir (#8420)
jackalcooper Jun 14, 2022
041f787
Replace cu115 with cu116 in nightly (#8423)
jackalcooper Jun 14, 2022
9bf8090
fix repeat interleave 0-size tensor bug (#8414)
BBuf Jun 14, 2022
3129ba8
Autotest support print input in ci (#8383)
BBuf Jun 14, 2022
56ace89
Modify sbp.split()'s karg: axis to dim (#8411)
lixiang007666 Jun 14, 2022
f416857
Feat/graph logical op debug repr (#8131)
strint Jun 14, 2022
2d71117
rm some test case in test_fused_dot_feature_interaction_pooling_sum (…
guo-ran Jun 15, 2022
932a693
Remove unused linkages (#8426)
jackalcooper Jun 15, 2022
a8f633f
refactor stride (#8402)
daquexian Jun 15, 2022
a818086
Move Tensor.__setitem__ and global related api to Python/C api (#8375)
marigoold Jun 15, 2022
e4347b8
cuda add default error msg (#8427)
guo-ran Jun 15, 2022
4af9b7a
Refactor ShapeView (#8422)
daquexian Jun 15, 2022
362f19c
turn on view slice (#8302)
Flowingsun007 Jun 16, 2022
3896b6f
Add flow env init rdma api (#8415)
clackhan Jun 16, 2022
357ef40
add 1d send recv in nccl logical (#8355)
strint Jun 16, 2022
f6c3cb6
Support iree ci (#8419)
howin98 Jun 16, 2022
f7532fd
Feat straighten task nodes (#8347)
Yipeng1994 Jun 17, 2022
d7ef39f
Refactor NLLLoss to support split class dim (#8380)
leaves-zwx Jun 18, 2022
d79ba3d
Strict ordering in memory reuse algorithm (#8441)
chengtbf Jun 18, 2022
5d74efa
Support broadcast in fused_softmax kernel (#8321)
MARD1NO Jun 20, 2022
962690d
Merge slice and logical slice (#8416)
wyg1997 Jun 20, 2022
a1e91da
Graph block.config.set_stage() for recommended Pipeline api. (#8442)
chengtbf Jun 20, 2022
3cbf392
Update PolynomialLR's doc and paramater (#8430)
basicv8vc Jun 20, 2022
6f9cc3f
Add mv op (#8445)
zhongshsh Jun 22, 2022
ca3cbdd
enable oneflow_iree(python package) and corresponding test works in c…
howin98 Jun 22, 2022
7c701ee
Dev tensor is pinned api (#8447)
Flowingsun007 Jun 22, 2022
1a9f6a8
Nd sbp tensor str (#8458)
liufengwei0103 Jun 22, 2022
42d53ad
Patch sbp cost (#8378)
Yipeng1994 Jun 22, 2022
9a5e750
Decouple stream and instruction (#7607)
lixinqi Jun 22, 2022
8dcfbc1
fix_tensor_numpy_to_avoid_gpu_mem_increase (#8449)
clackhan Jun 22, 2022
64e6e4d
Rename user op tensor shape to shape view (#8433)
lixinqi Jun 23, 2022
8238431
speedup global test (#8468)
hjchen2 Jun 23, 2022
20d0efe
Set the minimum mtu value for IB communication connection (#8451)
ouyangyu Jun 23, 2022
53db4d3
Merge branch 'master' into feat-general_basic_communication
Yipeng1994 Jun 23, 2022
5714de0
Merge branch 'master' into merge-general_basic_communication
Yipeng1994 Jun 23, 2022
2 changes: 1 addition & 1 deletion .github/workflows/canary.yml
@@ -55,7 +55,7 @@ jobs:
- name: Checkout Oneflow-Inc/oneflow
if: ${{ github.event.inputs.oneflow-ref == '' }}
uses: actions/checkout@v2
- uses: Oneflow-Inc/get-oneflow@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow@support-iree-ci
name: Build manylinux
id: build-cuda
with:
2 changes: 1 addition & 1 deletion .github/workflows/on_merge.yml
@@ -15,6 +15,6 @@ jobs:
if: github.event.pull_request.merged == true
runs-on: ubuntu-latest
steps:
- uses: Oneflow-Inc/get-oneflow/update-benchmark-history@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/update-benchmark-history@support-iree-ci
name: Update benchmark history
timeout-minutes: 10
8 changes: 4 additions & 4 deletions .github/workflows/release.yml
@@ -33,7 +33,7 @@ jobs:
with:
ref: ${{ github.event.pull_request.head.sha }}
repository: ${{github.event.pull_request.head.repo.full_name}}
- uses: Oneflow-Inc/get-oneflow/cache-complete/matrix/build@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/cache-complete/matrix/build@support-iree-ci
name: find cache
id: find-cache
timeout-minutes: 5
@@ -45,7 +45,7 @@ jobs:
release
oneflow-src: ${{ env.ONEFLOW_SRC }}
entries: |
cu115
cu116
cu112
cu102
cpu
@@ -74,7 +74,7 @@ jobs:
python3 -m pip install -U pip setuptools wheel --user
python3 -m pip install oss2 --user
- uses: actions/checkout@v2
- uses: Oneflow-Inc/get-oneflow@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow@support-iree-ci
name: Build ${{ matrix.entry }}
if: ${{ matrix.entry !='cpu' }}
with:
@@ -98,7 +98,7 @@ jobs:
3.8
3.9
3.10
- uses: Oneflow-Inc/get-oneflow@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow@support-iree-ci
name: Build ${{ matrix.entry }}
if: ${{ matrix.entry =='cpu' }}
with:
4 changes: 2 additions & 2 deletions .github/workflows/simple.yml
@@ -245,7 +245,7 @@ jobs:
repository: Oneflow-Inc/conda-env
ref: 30a7f00eb48ee9009d85a848e720823e5054c66b
path: conda-env
- uses: Oneflow-Inc/get-oneflow@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow@support-iree-ci
name: Build with gcc7
if: ${{ matrix.build-type == 'gcc7'}}
with:
@@ -254,7 +254,7 @@ jobs:
oneflow-build-env: conda
conda-env-file: conda-env/dev/gcc7/environment-v2.yml
conda-env-name: oneflow-dev-gcc7-v2
- uses: Oneflow-Inc/get-oneflow@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow@support-iree-ci
name: Build with clang10
if: ${{ matrix.build-type == 'clang10'}}
with:
64 changes: 45 additions & 19 deletions .github/workflows/test.yml
@@ -16,6 +16,8 @@ env:
FLOW_VISION_COMMIT: ca8ebc663b58667cf8cd1b6ef0c861522780b7bb
LIBAI_SRC: libai
LIBAI_COMMIT: 7d31d9781e5f2d559dc0820f599e0bed798488ca
ONEFLOW_IREE_SRC: oneflow_iree
ONEFLOW_IREE_COMMIT: 4322cbad2545877b1664aa8e0f17a17f6b5f687c
TEST_WITH_TORCH_IMG_TAG: registry.cn-beijing.aliyuncs.com/oneflow/test-with-pytorch-1.10.0-cuda11.3-cudnn8-runtime:afaf913e02a4ba02db92260daee22f99121cef62
MLIR_DOCKER_ARGS: "-e ONEFLOW_MLIR_ENABLE_ROUND_TRIP=1 -e ONEFLOW_MLIR_PREFER_NHWC=0 -e ONEFLOW_MLIR_ENABLE_INFERENCE_OPTIMIZATION=1"

@@ -25,7 +27,7 @@ jobs:
runs-on: ubuntu-latest
if: github.event.pull_request.draft == false && github.base_ref == 'master' && contains(github.event.pull_request.requested_reviewers.*.login, 'oneflow-ci-bot')
steps:
- uses: Oneflow-Inc/get-oneflow/priority-pr@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/priority-pr@support-iree-ci
name: Check priority PR closed
id: save-cache
timeout-minutes: 5
@@ -159,7 +161,7 @@ jobs:
fi
echo "is_secrets_accessible=1" >> $GITHUB_ENV
- name: Wait for GPU slot
uses: Oneflow-Inc/get-oneflow/wait-for-gpu@single-matrix-for-efficiency
uses: Oneflow-Inc/get-oneflow/wait-for-gpu@support-iree-ci
if: env.is_secrets_accessible == '1'
timeout-minutes: 90
continue-on-error: true
@@ -183,7 +185,7 @@ jobs:
with:
ref: ${{ github.event.pull_request.head.sha }}
repository: ${{github.event.pull_request.head.repo.full_name}}
- uses: Oneflow-Inc/get-oneflow/cache-complete/matrix/build@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/cache-complete/matrix/build@support-iree-ci
name: find cache
id: find-cache
timeout-minutes: 5
@@ -230,7 +232,7 @@ jobs:
with:
ref: ${{ github.event.pull_request.head.sha }}
repository: ${{github.event.pull_request.head.repo.full_name}}
- uses: Oneflow-Inc/get-oneflow/cache-complete@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/cache-complete@support-iree-ci
name: Save cache if successful
id: save-cache
timeout-minutes: 5
@@ -244,13 +246,14 @@ jobs:
run: |
echo "::error file=test.yml,line=204,col=10::steps.save-cache.outputs.cache-hit != matrix.cache-hit"
exit 1
- uses: Oneflow-Inc/get-oneflow@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow@support-iree-ci
name: Build manylinux ${{ matrix.entry }}
id: build-cpu
if: ${{ matrix.entry =='cpu' && !matrix.cache-hit }}
with:
cmake-init-cache: ${{ env.ONEFLOW_SRC }}/cmake/caches/ci/cpu.cmake
build-script: ${{ env.ONEFLOW_SRC }}/ci/manylinux/build.sh
run-lit: true
oneflow-src: ${{ env.ONEFLOW_SRC }}
oneflow-build-env: manylinux
wheelhouse-dir: ${{ env.WHEELHOUSE_DIR }}
@@ -265,7 +268,7 @@ jobs:
python-versions: |
3.6
3.7
- uses: Oneflow-Inc/get-oneflow@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow@support-iree-ci
name: Build manylinux ${{ matrix.entry }}
id: build-cuda
if: ${{ matrix.entry =='cu102' && !matrix.cache-hit }}
@@ -285,7 +288,7 @@ jobs:
clean-ccache: ${{ contains(github.event.pull_request.labels.*.name, 'need-clean-ccache') }}
python-versions: |
3.7
- uses: Oneflow-Inc/get-oneflow@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow@support-iree-ci
name: Build ${{ matrix.entry }}
if: ${{ matrix.entry == 'llvm13' && !matrix.cache-hit }}
with:
@@ -324,7 +327,7 @@ jobs:
})
- name: Upload packed liboneflow
if: ${{ !fromJson(matrix.cache-hit) && matrix.entry != 'llvm13' && matrix.entry != 'cu102_xla' }}
uses: Oneflow-Inc/get-oneflow/digest/upload@single-matrix-for-efficiency
uses: Oneflow-Inc/get-oneflow/digest/upload@support-iree-ci
timeout-minutes: 10
with:
digest: ${{ steps.save-cache.outputs.build-digest }}
@@ -335,7 +338,7 @@ jobs:
dst-dir: cpack
- name: Upload whl
if: ${{ !fromJson(matrix.cache-hit) && matrix.entry != 'llvm13' && matrix.entry != 'cu102_xla' }}
uses: Oneflow-Inc/get-oneflow/digest/upload@single-matrix-for-efficiency
uses: Oneflow-Inc/get-oneflow/digest/upload@support-iree-ci
timeout-minutes: 10
with:
digest: ${{ steps.save-cache.outputs.build-digest }}
@@ -360,7 +363,7 @@ jobs:
with:
ref: ${{ github.event.pull_request.head.sha }}
repository: ${{github.event.pull_request.head.repo.full_name}}
- uses: Oneflow-Inc/get-oneflow/cache-complete/matrix/test@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/cache-complete/matrix/test@support-iree-ci
name: find cache
id: find-cache
timeout-minutes: 5
@@ -391,7 +394,7 @@ jobs:
with:
ref: ${{ github.event.pull_request.head.sha }}
repository: ${{github.event.pull_request.head.repo.full_name}}
- uses: Oneflow-Inc/get-oneflow/cache-complete/matrix/test@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/cache-complete/matrix/test@support-iree-ci
name: find cache
id: find-cache
timeout-minutes: 5
@@ -455,12 +458,20 @@ jobs:
# please use a commit here
ref: ${{ env.LIBAI_COMMIT}}
path: ${{ env.LIBAI_SRC}}
- name: Checkout Oneflow-Inc/oneflow_iree
if: ${{ !fromJson(matrix.cache-hit) && contains(matrix.runs-on, 'self-hosted') }}
uses: actions/checkout@v2
with:
repository: Oneflow-Inc/oneflow_iree
# please use a commit here
ref: ${{ env.ONEFLOW_IREE_COMMIT}}
path: ${{ env.ONEFLOW_IREE_SRC}}
- name: Remove container
timeout-minutes: 45
if: ${{ contains(matrix.runs-on, 'self-hosted') }}
run: |
docker rm -f ${{ env.TEST_CONTAINER_NAME }} || true
- uses: Oneflow-Inc/get-oneflow/cache-complete@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/cache-complete@support-iree-ci
name: Save cache if successful
id: save-cache
timeout-minutes: 5
@@ -476,7 +487,7 @@ jobs:
exit 1
- name: Download wheel and packed liboneflow
if: ${{ !fromJson(matrix.cache-hit) && contains(matrix.runs-on, 'self-hosted') }}
uses: Oneflow-Inc/get-oneflow/digest/download@single-matrix-for-efficiency
uses: Oneflow-Inc/get-oneflow/digest/download@support-iree-ci
id: download-digest
timeout-minutes: 10
with:
@@ -486,7 +497,7 @@ jobs:
ssh-tank-path: ${{ env.SSH_TANK_PATH }}
- name: Get primary node
if: ${{ !fromJson(matrix.cache-hit) && contains(matrix.runs-on, 'self-hosted') }}
uses: Oneflow-Inc/get-oneflow/master-address@single-matrix-for-efficiency
uses: Oneflow-Inc/get-oneflow/master-address@support-iree-ci
id: get-primary-node
with:
rank: ${{ matrix.rank }}
@@ -559,6 +570,7 @@ jobs:
docker exec ${TEST_CONTAINER_NAME} python3 -m pip install -e ${{ env.FLOW_VISION_SRC}}
docker exec ${TEST_CONTAINER_NAME} python3 -m pip install pybind11 --user
docker exec ${TEST_CONTAINER_NAME} python3 -m pip install -e ${{ env.LIBAI_SRC}}
docker exec ${TEST_CONTAINER_NAME} python3 -m pip install -e ${{ env.ONEFLOW_IREE_SRC}}
- name: Module API test (distributed)
timeout-minutes: 90
if: ${{ !fromJson(matrix.cache-hit) && matrix.test-type == 'module' && matrix.device == 'cuda' && fromJson(matrix.is-distributed) }}
@@ -648,12 +660,20 @@ jobs:
# please use a commit here
ref: ${{ env.LIBAI_COMMIT}}
path: ${{ env.LIBAI_SRC}}
- name: Checkout Oneflow-Inc/oneflow_iree
if: ${{ !fromJson(matrix.cache-hit) && contains(matrix.runs-on, 'self-hosted') }}
uses: actions/checkout@v2
with:
repository: Oneflow-Inc/oneflow_iree
# please use a commit here
ref: ${{ env.ONEFLOW_IREE_COMMIT}}
path: ${{ env.ONEFLOW_IREE_SRC}}
- name: Remove container
timeout-minutes: 45
if: ${{ contains(matrix.runs-on, 'self-hosted') }}
run: |
docker rm -f ${{ env.TEST_CONTAINER_NAME }} || true
- uses: Oneflow-Inc/get-oneflow/cache-complete@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/cache-complete@support-iree-ci
name: Save cache if successful
id: save-cache
timeout-minutes: 5
@@ -669,7 +689,7 @@ jobs:
exit 1
- name: Download wheel and packed liboneflow
if: ${{ !fromJson(matrix.cache-hit) && contains(matrix.runs-on, 'self-hosted') }}
uses: Oneflow-Inc/get-oneflow/digest/download@single-matrix-for-efficiency
uses: Oneflow-Inc/get-oneflow/digest/download@support-iree-ci
id: download-digest
timeout-minutes: 10
with:
@@ -781,6 +801,7 @@ jobs:
docker exec ${TEST_CONTAINER_NAME} python3 -m pip install -e ${{ env.FLOW_VISION_SRC}}
docker exec ${TEST_CONTAINER_NAME} python3 -m pip install pybind11 --user
docker exec ${TEST_CONTAINER_NAME} python3 -m pip install -e ${{ env.LIBAI_SRC}}
docker exec ${TEST_CONTAINER_NAME} python3 -m pip install -e ${{ env.ONEFLOW_IREE_SRC}}
- name: Run OneFlow doctor
if: ${{ !fromJson(matrix.cache-hit) && contains(matrix.runs-on, 'self-hosted') }}
run: |
@@ -865,7 +886,7 @@ jobs:
body: "<details>\n <summary>Speed stats:</summary>\n\n ``` \n${{ steps.speed.outputs.stats }}\n ``` \n\n</details>".replace(/\\n/g, '\n')
})
- name: Module API test
timeout-minutes: 45
timeout-minutes: 60
if: ${{ !fromJson(matrix.cache-hit) && matrix.test-type == 'module' && !fromJson(matrix.is-distributed) }}
run: |
docker exec -e ONEFLOW_TEST_DIR=$PWD/python/oneflow/test/modules ${{ env.TEST_CONTAINER_NAME }} bash ci/test/generic_test_multi_client.sh
@@ -883,6 +904,11 @@ jobs:
docker exec -e ONEFLOW_TEST_DEVICE_NUM=4 -w $PWD/${{ env.LIBAI_SRC }} ${{ env.TEST_CONTAINER_NAME }} python3 -m oneflow.distributed.launch --nproc_per_node 4 -m unittest -f tests/models/test_gpt.py
docker exec -e ONEFLOW_TEST_DEVICE_NUM=4 -w $PWD/${{ env.LIBAI_SRC }} ${{ env.TEST_CONTAINER_NAME }} python3 -m oneflow.distributed.launch --nproc_per_node 4 -m unittest -f tests/models/test_t5.py
docker exec -e ONEFLOW_TEST_DEVICE_NUM=4 -w $PWD/${{ env.LIBAI_SRC }} ${{ env.TEST_CONTAINER_NAME }} python3 -m oneflow.distributed.launch --nproc_per_node 4 -m unittest -f tests/models/test_vit.py
- name: oneflow_iree test
timeout-minutes: 45
if: ${{ !fromJson(matrix.cache-hit) && matrix.test-type == 'misc' }}
run: |
docker exec -w $PWD/${{ env.ONEFLOW_IREE_SRC }} ${{ env.TEST_CONTAINER_NAME }} python3 -m pytest examples
- name: Expensive tests (models, cases require exclusive access to GPU)
timeout-minutes: 45
if: ${{ !fromJson(matrix.cache-hit) && (matrix.test-type == 'speed-test' || (matrix.test-type == 'misc' && matrix.device == 'cpu')) && !fromJson(matrix.is-distributed) }}
@@ -908,7 +934,7 @@ jobs:
- name: Benchmark Test
timeout-minutes: 100
if: ${{ !fromJson(matrix.cache-hit) && matrix.test-type == 'benchmark' && matrix.device == 'cuda' }}
uses: Oneflow-Inc/get-oneflow/pytest-benchmark@single-matrix-for-efficiency
uses: Oneflow-Inc/get-oneflow/pytest-benchmark@support-iree-ci
with:
collect-path: ${{ env.FLOW_VISION_SRC }}/benchmark
container-name: ${{ env.TEST_CONTAINER_NAME }}
@@ -961,7 +987,7 @@ jobs:
ref: ${{ github.event.pull_request.head.sha }}
repository: ${{github.event.pull_request.head.repo.full_name}}
fetch-depth: 0
- uses: Oneflow-Inc/get-oneflow/cache-complete@single-matrix-for-efficiency
- uses: Oneflow-Inc/get-oneflow/cache-complete@support-iree-ci
name: Save cache if successful
id: save-cache
timeout-minutes: 5
5 changes: 5 additions & 0 deletions ci/manylinux/build-gcc7.sh
@@ -31,6 +31,11 @@ cmake -S ${ONEFLOW_CI_SRC_DIR} -C ${ONEFLOW_CI_CMAKE_INIT_CACHE} -DPython3_EXECU
# cmake build
cd ${ONEFLOW_CI_BUILD_DIR}
cmake --build . --parallel ${ONEFLOW_CI_BUILD_PARALLEL}
if [ ! -z "$ONEFLOW_CI_BUILD_RUN_LIT" ]; then
${ONEFLOW_CI_PYTHON_EXE} -m pip install -i https://mirrors.aliyun.com/pypi/simple --user flowvision==0.1.0
export PATH=$PATH:$(dirname $ONEFLOW_CI_PYTHON_EXE)
cmake --build . -t c1
fi

# build pip
cd ${ONEFLOW_CI_SRC_DIR}
5 changes: 5 additions & 0 deletions ci/manylinux/build.sh
@@ -27,6 +27,11 @@ cmake -S ${ONEFLOW_CI_SRC_DIR} -C ${ONEFLOW_CI_CMAKE_INIT_CACHE} -DPython3_EXECU
# cmake build
cd ${ONEFLOW_CI_BUILD_DIR}
cmake --build . --parallel ${ONEFLOW_CI_BUILD_PARALLEL}
if [ ! -z "$ONEFLOW_CI_BUILD_RUN_LIT" ]; then
${ONEFLOW_CI_PYTHON_EXE} -m pip install -i https://mirrors.aliyun.com/pypi/simple --user flowvision==0.1.0
export PATH=$PATH:$(dirname $ONEFLOW_CI_PYTHON_EXE)
cmake --build . -t c1
fi

# build pip
cd ${ONEFLOW_CI_SRC_DIR}
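
Note on the hunk above: the new block in ci/manylinux/build.sh only runs the `c1` target when `ONEFLOW_CI_BUILD_RUN_LIT` is set to a non-empty value. The diff does not show where that variable comes from; it is an assumption that the `run-lit: true` input added in test.yml is what sets it. A minimal sketch of the same opt-in guard, using a hypothetical helper `run_optional_lit` that only echoes what it would do instead of invoking cmake:

```shell
#!/usr/bin/env bash
# Sketch only: run_optional_lit is a hypothetical name, not part of the PR.
# It mirrors the guard in build.sh but echoes instead of running cmake.
run_optional_lit() {
  # [ -n "$VAR" ] is equivalent to [ ! -z "$VAR" ]:
  # true only when the variable is set and non-empty.
  if [ -n "${ONEFLOW_CI_BUILD_RUN_LIT:-}" ]; then
    echo "would run: cmake --build . -t c1"
  else
    echo "skipping lit target"
  fi
}

# Variable unset: the optional step is skipped.
unset ONEFLOW_CI_BUILD_RUN_LIT
run_optional_lit

# Variable set for this one call: the optional step runs.
ONEFLOW_CI_BUILD_RUN_LIT=true run_optional_lit
```

Because the guard tests for a non-empty string rather than a specific value, any value (including `false` or `0`) would enable the step; whether that matters depends on how the calling action exports the variable.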
24 changes: 24 additions & 0 deletions cmake/caches/cn/fast/mlir-cpu.cmake
@@ -0,0 +1,24 @@
set(BUILD_SHARED_LIBS YES CACHE BOOL "")
# uncomment only if you know what you are doing
# set(CMAKE_LINK_DEPENDS_NO_SHARED YES CACHE BOOL "")
set(BUILD_CUDA NO CACHE BOOL "")
set(BUILD_GIT_VERSION NO CACHE BOOL "")
set(TREAT_WARNINGS_AS_ERRORS YES CACHE BOOL "")
set(BUILD_HWLOC NO CACHE BOOL "")
set(BUILD_TESTING OFF CACHE BOOL "")
set(WITH_MLIR YES CACHE BOOL "")
set(WITH_MLIR_CUDA_CODEGEN NO CACHE BOOL "")
set(THIRD_PARTY_MIRROR aliyun CACHE STRING "")
set(PIP_INDEX_MIRROR "https://pypi.tuna.tsinghua.edu.cn/simple" CACHE STRING "")
set(CMAKE_BUILD_TYPE RelWithDebInfo CACHE STRING "")
set(CMAKE_GENERATOR Ninja CACHE STRING "")
set(CMAKE_C_COMPILER_LAUNCHER ccache CACHE STRING "")
set(CMAKE_CXX_COMPILER_LAUNCHER ccache CACHE STRING "")
set(CMAKE_INTERPROCEDURAL_OPTIMIZATION OFF CACHE BOOL "")
set(CMAKE_EXE_LINKER_FLAGS_INIT "-fuse-ld=lld" CACHE STRING "")
set(CMAKE_MODULE_LINKER_FLAGS_INIT "-fuse-ld=lld" CACHE STRING "")
set(CMAKE_SHARED_LINKER_FLAGS_INIT "-fuse-ld=lld" CACHE STRING "")
set(CPU_THREADING_RUNTIME SEQ CACHE STRING
"when using lld with TBB enabled, there will be linkage error")
set(BUILD_HWLOC OFF CACHE BOOL "")
set(WITH_ONEDNN OFF CACHE BOOL "")