Add NPU FusedAdam support #4343
Conversation
* origin/master: (48 commits)
  Fix autotune to support Triton 2.1 (microsoft#4340)
  Fix skipped inference tests (microsoft#4336)
  Suppress noise (microsoft#4310)
  Fix a bug in the implementation of dequantization for inference (microsoft#3433)
  DS-Chat BLOOM: Fix Attention mask (microsoft#4338)
  clear redundant timers (microsoft#4308)
  Add release version checking (microsoft#4328)
  Fix Zero3 contiguous grads, reduce scatter false accuracy issue (microsoft#4321)
  Clean up modeling code (microsoft#4320)
  Handle empty parameter groups (microsoft#4277)
  Update README.md (microsoft#4316)
  README update (microsoft#4303)
  Update release and bump patch versioning flow (microsoft#4286)
  added a bert-model check for triton (microsoft#4266)
  ZeRO-Inference v2 release
  bump to 0.10.4
  Update index.md (microsoft#4297)
  fix user args parsing of string with spaces on runner (microsoft#4265)
  ZeRO-Inference refresh (microsoft#4197)
  AMD Kernel Compatibility Fixes (microsoft#3180)
  ...
@tjruwase @jeffra @RezaYazdaniAminabadi @cmikeh2 Sorry to bother you, but could you review this PR?
* origin/master:
  Allow multiple inference engines in single script (microsoft#4384)
  adds triton flash attention2 kernel (microsoft#4337)
  Fix llama meta tensor loading in AutoTP and kernel injected inference (microsoft#3608)
  Fix min torch version (microsoft#4375)
  Fix multinode runner to properly append to PDSH_SSH_ARGS_APPEND (microsoft#4373)
  add the missing method (microsoft#4363)
  Openfold fix (microsoft#4368)
  deepspeed4science japanese blog (microsoft#4369)
  deepspeed4science chinese blog (microsoft#4366)
  Enable workflow dispatch on Torch 1.10 CI tests (microsoft#4361)
  Update conda env to have max pydantic version (microsoft#4362)
  add deepspeed4science blog link (microsoft#4364)
  added check to avoid undefined behavior when the input_id length is greater than max_tokens (microsoft#4349)
  Add the policy to run llama model from the official repo (microsoft#4313)
  fix deepspeed4science links (microsoft#4358)
  DeepSpeed4Science (microsoft#4357)
  Support InternLM (microsoft#4137)
  Pass base_dir to model files can be loaded for auto-tp/meta-tensor. (microsoft#4348)
@tjruwase Good day. This PR is approved and ready to be merged. Could you retrigger this workflow and merge it? Thanks :-)
Sorry for the delay; however, there seems to be a formatting issue. Please take a look.
Resolve format checking errors
Co-authored-by: Hz, Ji <hzji210@gmail.com>
As long as we fix these two blank lines, the format-check error should be resolved.
@CurryRice233, it is best to use this guide for formatting issues: https://github.com/microsoft/DeepSpeed/blob/master/CONTRIBUTING.md#prerequisites
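For anyone hitting a similar failure: the prerequisites guide sets up pre-commit hooks that enforce PEP 8-style spacing, and stray blank lines are a common trip-up. Below is a hypothetical illustration (not the actual diff from this PR; the function names are made up) of the kind of spacing issue such a check flags:

```python
# Hypothetical illustration of a blank-line style violation of the kind a
# pre-commit/flake8 check flags. pycodestyle rule E303 ("too many blank
# lines") allows at most two consecutive blank lines between top-level
# definitions.
import torch


def scale_grad(g: torch.Tensor, factor: float) -> torch.Tensor:
    """Dummy helper, present only to demonstrate the spacing rule."""
    return g * factor



def apply_update(p: torch.Tensor, g: torch.Tensor, lr: float = 1e-3) -> None:
    # Three blank lines precede this definition -> E303; deleting one
    # blank line makes the check pass.
    p.sub_(lr * g)
```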
Co-authored-by: Hz, Ji <hzji210@gmail.com>
Thank you, new skill acquired 😉. By the way, could you retrigger this workflow again?
@tjruwase Hi, could you retrigger this workflow again and merge it? Thanks 😀
* add npu support dtypes
* add npu fused_adam support
* add license
* Update accelerator/npu_accelerator.py
* Update op_builder/npu/fused_adam.py

---------

Co-authored-by: jializheng <jializheng@huawei.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Hz, Ji <hzji210@gmail.com>
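For context on what this change adds, here is a minimal sketch of the two pieces involved: an accelerator that reports which dtypes the NPU supports, and the eager-mode Adam math that a fused kernel performs in a single launch. This is an illustration only, not the PR's actual accelerator/npu_accelerator.py or op_builder/npu/fused_adam.py code; the class name, the dtype list, and the use of plain torch ops are assumptions.

```python
# Minimal sketch (NOT this PR's actual code) of the two pieces the PR adds:
# (1) an accelerator reporting which dtypes the NPU supports, and
# (2) a FusedAdam-style parameter update, written here with plain torch
#     ops so it runs anywhere; a real builder would dispatch to a fused
#     NPU kernel instead of this eager per-tensor loop.
from typing import List

import torch


class NPUAcceleratorSketch:
    """Hypothetical stand-in for an NPU accelerator class."""

    def supported_dtypes(self) -> List[torch.dtype]:
        # Assumption: the NPU backend handles fp32, fp16, and bf16.
        return [torch.float, torch.half, torch.bfloat16]


def adam_step(params, grads, exp_avgs, exp_avg_sqs, step: int,
              lr: float = 1e-3, beta1: float = 0.9, beta2: float = 0.999,
              eps: float = 1e-8) -> None:
    """Eager reference for the update a fused Adam kernel performs."""
    bc1 = 1 - beta1 ** step  # bias corrections
    bc2 = 1 - beta2 ** step
    for p, g, m, v in zip(params, grads, exp_avgs, exp_avg_sqs):
        m.mul_(beta1).add_(g, alpha=1 - beta1)           # first moment
        v.mul_(beta2).addcmul_(g, g, value=1 - beta2)    # second moment
        denom = (v / bc2).sqrt_().add_(eps)              # sqrt(v_hat) + eps
        p.addcdiv_(m / bc1, denom, value=-lr)            # p -= lr * m_hat / denom
```

The point of the "fused" variant is that this per-tensor loop collapses into a single kernel launch across all parameters, which is what the new NPU op builder wires up.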
Add NPU FusedAdam support.