CUDA template for element-wise kernels #4007

Merged 18 commits on Dec 17, 2020


liujuncheng
Collaborator

No description provided.

oneflow-ci-bot removed their request for review December 16, 2020 16:09
oneflow-ci-bot removed their request for review December 17, 2020 05:39
oneflow-ci-bot removed their request for review December 17, 2020 09:24
liujuncheng merged commit d2096ae into master Dec 17, 2020
liujuncheng deleted the dev_cuda_elemwise_template branch December 17, 2020 09:25
liujuncheng added a commit that referenced this pull request Jun 3, 2021
* CUDA template for element-wise kernels

* Add empty line

* ternary

* int8_t/int32_t/int64_t

* rename to .cuh

* tpm

* fix Ternary PackSize

* refine

* factory

* GenericLauncher

* UnaryPrimitive

  fix
  result_of
  BinaryPrimitive
  TernaryPrimitive
  pack_size
  ApplyGeneric
  ApplyGeneric
  fix
  GenericLauncher
  fix
  refine

* aligned_storage

* ApplyPack

* refine

* refine

* XYZ=>ABC

Former-commit-id: d2096ae
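
The PR has no description, so for readers unfamiliar with the terms in the commit list (`pack_size`, `ApplyPack`, unary/binary/ternary functors), here is a rough host-side C++ sketch of the general idea behind a packed element-wise template. It is not the code merged here: `PackSize`, `Pack`, `ApplyUnary`, and the 16-byte pack width are hypothetical names and assumptions chosen for illustration, and the loop stands in for what a CUDA kernel would typically distribute across threads.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical names throughout; an illustration, not the merged implementation.
constexpr std::size_t kMaxPackBytes = 16;  // assumption: 128-bit wide loads/stores

// Number of elements of T that fit in one pack (at least 1).
template <typename T>
constexpr std::size_t PackSize() {
  return kMaxPackBytes / sizeof(T) > 0 ? kMaxPackBytes / sizeof(T) : 1;
}

// N elements stored contiguously and aligned to the size of the whole pack,
// so the pack can be moved as a single wide memory transaction.
template <typename T, std::size_t N>
struct alignas(sizeof(T) * N) Pack {
  T elem[N];
};

// Applies a unary functor element-wise: whole packs first, then a scalar tail.
template <typename Functor, typename R, typename A>
void ApplyUnary(Functor f, std::size_t n, R* out, const A* in) {
  assert(out != nullptr && in != nullptr);
  // Use the smaller pack size so both input and output packs fit in 16 bytes.
  constexpr std::size_t kPack =
      PackSize<A>() < PackSize<R>() ? PackSize<A>() : PackSize<R>();
  const std::size_t n_pack = n / kPack;
  for (std::size_t p = 0; p < n_pack; ++p) {  // on a GPU: one pack per thread step
    Pack<A, kPack> a;
    std::memcpy(&a, in + p * kPack, sizeof(a));  // stands in for a vectorized load
    Pack<R, kPack> r;
    for (std::size_t i = 0; i < kPack; ++i) { r.elem[i] = f(a.elem[i]); }
    std::memcpy(out + p * kPack, &r, sizeof(r));  // stands in for a vectorized store
  }
  // Elements that do not fill a whole pack are handled one at a time.
  for (std::size_t i = n_pack * kPack; i < n; ++i) { out[i] = f(in[i]); }
}
```

Binary and ternary variants would follow the same shape with two or three input pointers, taking the smallest pack size across all operand types. The sketch uses `alignas` rather than the `aligned_storage` mentioned in the commit list; how the merged code combines the two is not shown in this page.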