Skip to content

Conversation

@chengduoZH
Copy link
Contributor

No description provided.

@chengduoZH chengduoZH requested a review from pkuyym March 23, 2018 12:34
self.axis = 1


class TestConcatOp2(OpTest):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems that this case is somehow duplicated with the above one, how about change to test axis==0?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, there are two CUDA kernels for Concat, they represent two case, TestConcatOp only covers one, TestConcatOp2 can covers the other.


T** outs_gpu =
T** dev_out_gpu_data =
reinterpret_cast<T**>(outputs_data.CUDAMutableData(context.GetPlace()));
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please double check that whether static_cast is more suitable.

Copy link
Contributor

@pkuyym pkuyym left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@chengduoZH chengduoZH merged commit 4a92e89 into PaddlePaddle:develop Mar 26, 2018
blacksheep-Aristotle pushed a commit to blacksheep-Aristotle/Paddle that referenced this pull request Nov 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants