Add FasterCodeGen/FasterGPTJ #3017
paddlenlp/ops/patches/FasterTransformer/fastertransformer/cuda/masked_multihead_attention.cu
Also, be sure to verify whether this change affects other models.
Please update the README as well.
Verified: the models under sample work without issues.
paddlenlp/ops/patches/FasterTransformer/fastertransformer/cuda/attention_kernels.cu
paddlenlp/ops/patches/FasterTransformer/fastertransformer/cuda/decoding_kernels.cu
paddlenlp/ops/patches/FasterTransformer/fastertransformer/cuda/masked_multihead_attention.h
...enlp/ops/patches/FasterTransformer/fastertransformer/cuda/masked_multihead_attention_utils.h
paddlenlp/ops/patches/FasterTransformer/fastertransformer/cuda/topk_kernels.cu
paddlenlp/ops/patches/FasterTransformer/fastertransformer/utils/allocator.h
paddlenlp/ops/patches/FasterTransformer/fastertransformer/cuda/attention_kernels.cuh
paddlenlp/ops/patches/FasterTransformer/fastertransformer/cuda/attention_kernels.cu
...enlp/ops/patches/FasterTransformer/fastertransformer/cuda/masked_multihead_attention_utils.h
paddlenlp/ops/patches/FasterTransformer/fastertransformer/cuda/topk_kernels.cu
24ad27f to d9f41be
If only the TODOs remain and there are no other issues, let's merge this soon and submit it for testing.
PR types
New features
PR changes
Models
Description