Conversation

@AboorvaDevarajan
Member

Use the pre-populated convertor flag `CONVERTOR_CUDA` to check the type of buffer
instead of calling `cuPointerGetAttribute` again, which incurs additional overhead
for CPU buffers.

Signed-off-by: Aboorva Devarajan <abodevar@in.ibm.com>
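
As a rough sketch of the idea in C (hypothetical helper names, not the actual OMPI diff): the convertor's flags are populated when the convertor is prepared, so the per-buffer driver query can be replaced by a cheap flag test.

```c
/* Hypothetical sketch of the optimization -- not the actual Open MPI code.
 * Assumes <cuda.h>, <stdint.h>, and opal/datatype/opal_convertor.h are available. */

/* Before: query the CUDA driver for every buffer, including host buffers. */
static int buffer_is_gpu_query(const void *buf)
{
    CUmemorytype mem_type = 0;
    CUresult rc = cuPointerGetAttribute(&mem_type,
                                        CU_POINTER_ATTRIBUTE_MEMORY_TYPE,
                                        (CUdeviceptr)(uintptr_t)buf);
    /* Plain host memory typically fails the query or reports a host memory type. */
    return (CUDA_SUCCESS == rc) && (CU_MEMORYTYPE_DEVICE == mem_type);
}

/* After: reuse the flag cached on the convertor at prepare time. */
static inline int buffer_is_gpu_cached(const opal_convertor_t *convertor)
{
    return (convertor->flags & CONVERTOR_CUDA) != 0;
}
```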
@AboorvaDevarajan
Member Author

There is one semantic difference introduced in this patch: when the `CONVERTOR_CUDA` flag is used, only the source buffer is checked for pack and only the receive buffer for unpack, whereas `opal_cuda_check_bufs` checks both the source and destination buffers for pack and unpack. As far as I have verified, this should not break anything.

But is there a specific reason why `opal_cuda_check_bufs` is used instead of `CONVERTOR_CUDA` in this case? Thanks.
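
To make the difference concrete, the pack path could be sketched like this (hypothetical wrapper names; the unpack path is symmetric with the receive buffer):

```c
/* Hypothetical illustration of the semantic difference -- not code from this PR. */

/* Old check on the pack path: both pointers are inspected, so a GPU
 * destination buffer would also trigger the CUDA path. */
int pack_uses_cuda_old(char *dest, char *src)
{
    return opal_cuda_check_bufs(dest, src);
}

/* New check on the pack path: only the buffer the convertor was prepared
 * with (the source/send buffer) is reflected in CONVERTOR_CUDA. */
int pack_uses_cuda_new(const opal_convertor_t *send_convertor)
{
    return (send_convertor->flags & CONVERTOR_CUDA) != 0;
}
```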

Contributor

@awlauria awlauria left a comment

Looks good to me.

I'd wait for @markalle's thoughts before merging

@markalle
Contributor

Thanks, this is a good improvement. I like the idea of having that data cached, and this seems to be how OMPI is trying to cache the data.

After looking at how OMPI sets the `CONVERTOR_CUDA` flag, I don't actually think OMPI's CUDA check is legitimate, so I agree with making this change. Then maybe separately we should argue about the correctness of the `CONVERTOR_CUDA` flag in general throughout OMPI.

@awlauria awlauria merged commit 5585b03 into open-mpi:master Mar 11, 2021
@awlauria
Contributor

@AboorvaDevarajan can you cherry-pick over to v5.0.x?
