@yingxudeng
Collaborator

No description provided.

@yingxudeng yingxudeng changed the title from "feat: introduce USE_NPU_TORCH flag for debugging and enhance NPU support for Qwen3-Dense." to "feat: introduce USE_NPU_TORCH flag for debugging and enhance NPU support for Qwen3-Dense[4/N]." Dec 23, 2025
@yingxudeng
Collaborator Author

yingxudeng commented Dec 23, 2025

image

This PR cannot be merged or tested until #572, #563, and #583 have landed.

Please focus your review only on the commit shown in the image above. All other commits can be ignored for now. Once the dependent PRs are merged, I will rebase the branch to clean up the history.

@yingxudeng yingxudeng force-pushed the feat/npu_backend_torch_3_4_20251223 branch from 43a1e2d to 3d505b6 Compare December 24, 2025 03:06
- cos_ = cos_sin_vec[0].view({-1, rotary_dim});
- sin_ = cos_sin_vec[1].view({-1, rotary_dim});
+ cos_ = cos_sin_vec[0].reshape({-1, rotary_dim});
+ sin_ = cos_sin_vec[1].reshape({-1, rotary_dim});
Collaborator

Does some operation change the original tensor?
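For context on the view-to-reshape change being asked about: in PyTorch, `view` never copies and fails when the tensor's memory layout is incompatible with the requested shape, while `reshape` returns a view when it can and a copy otherwise. Neither call modifies the original tensor's data. The sketch below is a toy model of that behaviour in plain C++, not libtorch code and not the code in this PR; `Tensor2D`, `make_tensor`, `transpose`, `shares_storage`, and the two free functions are hypothetical names, and the toy `view` is stricter than the real one because it accepts only fully contiguous inputs.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <stdexcept>
#include <vector>

// Toy 2-D strided tensor (hypothetical, for illustration only).
// Several tensors may point at the same storage buffer.
struct Tensor2D {
  std::shared_ptr<std::vector<float>> storage;
  std::size_t offset;
  std::size_t rows, cols;
  std::size_t row_stride, col_stride;

  float& at(std::size_t r, std::size_t c) const {
    return (*storage)[offset + r * row_stride + c * col_stride];
  }
  // Row-major contiguity check (simplified: ignores size-1 dimensions).
  bool is_contiguous() const { return col_stride == 1 && row_stride == cols; }
};

// Allocate a contiguous rows x cols tensor filled with 0, 1, 2, ...
Tensor2D make_tensor(std::size_t rows, std::size_t cols) {
  auto data = std::make_shared<std::vector<float>>(rows * cols);
  for (std::size_t i = 0; i < data->size(); ++i) (*data)[i] = static_cast<float>(i);
  return Tensor2D{data, 0, rows, cols, cols, 1};
}

// Transpose by swapping shape and strides; shares storage, usually non-contiguous.
Tensor2D transpose(const Tensor2D& t) {
  return Tensor2D{t.storage, t.offset, t.cols, t.rows, t.col_stride, t.row_stride};
}

bool shares_storage(const Tensor2D& a, const Tensor2D& b) {
  return a.storage == b.storage;
}

// view: never copies. In this toy model it requires a contiguous input.
Tensor2D view(const Tensor2D& t, std::size_t new_rows, std::size_t new_cols) {
  if (new_rows * new_cols != t.rows * t.cols)
    throw std::invalid_argument("shape does not match number of elements");
  if (!t.is_contiguous())
    throw std::runtime_error("view: layout not compatible, use reshape");
  return Tensor2D{t.storage, t.offset, new_rows, new_cols, new_cols, 1};
}

// reshape: returns a view when possible, otherwise copies into fresh storage.
Tensor2D reshape(const Tensor2D& t, std::size_t new_rows, std::size_t new_cols) {
  if (new_rows * new_cols != t.rows * t.cols)
    throw std::invalid_argument("shape does not match number of elements");
  if (t.is_contiguous()) return view(t, new_rows, new_cols);
  auto data = std::make_shared<std::vector<float>>();
  data->reserve(t.rows * t.cols);
  for (std::size_t r = 0; r < t.rows; ++r)      // copy in logical row-major order
    for (std::size_t c = 0; c < t.cols; ++c)
      data->push_back(t.at(r, c));
  return Tensor2D{data, 0, new_rows, new_cols, new_cols, 1};
}
```

In this model, writes through a `view` result are visible in the source tensor, whereas a `reshape` result may or may not alias the source depending on layout. Whether `cos_sin_vec[0]` and `cos_sin_vec[1]` are actually non-contiguous at this point in the PR is not stated in the thread.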

@yingxudeng yingxudeng force-pushed the feat/npu_backend_torch_3_4_20251223 branch from 3d505b6 to 5b5dcf2 Compare December 29, 2025 07:36
@yingxudeng yingxudeng marked this pull request as draft January 7, 2026 06:17
3 participants