Skip to content

[NPU]:Added support for the layer_norm operator in npu#1113

Open
TianHao324 wants to merge 1 commit intolinkedin:mainfrom
TianHao324:layer_npu
Open

[NPU]:Added support for the layer_norm operator in npu#1113
TianHao324 wants to merge 1 commit intolinkedin:mainfrom
TianHao324:layer_npu

Conversation

@TianHao324
Copy link
Contributor

Summary

Added NPU-friendly operator implementations. Previously, only some unsupported writing methods were addressed based on the GPU implementation. However, when the n_col is large, the problem of UB overflow still occurs. This modification has solved this issue, and the performance has improved by 2-3 times compared to the original implementation. However, due to device limitations, the performance is still somewhat lower than that of Hugging Face. We will continue to follow up on the performance issues in the future.

Testing Done

image
  • Hardware Type: Atlas 800I A2
  • run make test to ensure correctness
  • run make checkstyle to ensure code style
  • run make test-convergence to ensure convergence

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant