Skip to content

ofi BTL reading uninitialized values during finalize #6345

@jsquyres

Description

@jsquyres

While doing some other testing, I noticed that the OFI BTL is doing some bad reads during MPI_Finalize. I don't know if these are actually causing problems anywhere, but they certainly have the potential to do so.

Can someone from Intel please have a look?

$ valgrind --num-callers=50 --tool=memcheck ./hello_c 
==2379== Memcheck, a memory error detector
==2379== Copyright (C) 2002-2015, and GNU GPL'd, by Julian Seward et al.
==2379== Using Valgrind-3.11.0 and LibVEX; rerun with -h for copyright info
==2379== Command: ./hello_c
==2379== 
==2379== Conditional jump or move depends on uninitialised value(s)
==2379==    at 0x535B307: opal_interval_tree_reader_get_token (opal_interval_tree.c:126)
==2379==    by 0x535C8A2: opal_interval_tree_traverse (opal_interval_tree.c:705)
==2379==    by 0x54287C2: mca_rcache_base_vma_tree_iterate (rcache_base_vma_tree.c:105)
==2379==    by 0x5428391: mca_rcache_base_vma_iterate (rcache_base_vma.c:153)
==2379==    by 0xA103374: mca_rcache_grdma_finalize (rcache_grdma_module.c:543)
==2379==    by 0x5427D16: mca_rcache_base_module_destroy (rcache_base_create.c:113)
==2379==    by 0xA7179A4: mca_btl_ofi_finalize (btl_ofi_module.c:317)
==2379==    by 0x53C3172: mca_btl_base_close (btl_base_frame.c:203)
==2379==    by 0x53A5F6F: mca_base_framework_close (mca_base_framework.c:252)
==2379==    by 0x4CFCFD9: mca_bml_base_close (bml_base_frame.c:130)
==2379==    by 0x53A5F6F: mca_base_framework_close (mca_base_framework.c:252)
==2379==    by 0x4C76C01: ompi_mpi_finalize (ompi_mpi_finalize.c:449)
==2379==    by 0x4CB62F1: PMPI_Finalize (pfinalize.c:54)
==2379==    by 0x4008E9: main (hello_c.c:24)
==2379== 
==2379== Use of uninitialised value of size 8
==2379==    at 0x535A332: opal_thread_compare_exchange_strong_32 (thread_usage.h:156)
==2379==    by 0x535B34E: opal_interval_tree_reader_get_token (opal_interval_tree.c:133)
==2379==    by 0x535C8A2: opal_interval_tree_traverse (opal_interval_tree.c:705)
==2379==    by 0x54287C2: mca_rcache_base_vma_tree_iterate (rcache_base_vma_tree.c:105)
==2379==    by 0x5428391: mca_rcache_base_vma_iterate (rcache_base_vma.c:153)
==2379==    by 0xA103374: mca_rcache_grdma_finalize (rcache_grdma_module.c:543)
==2379==    by 0x5427D16: mca_rcache_base_module_destroy (rcache_base_create.c:113)
==2379==    by 0xA7179A4: mca_btl_ofi_finalize (btl_ofi_module.c:317)
==2379==    by 0x53C3172: mca_btl_base_close (btl_base_frame.c:203)
==2379==    by 0x53A5F6F: mca_base_framework_close (mca_base_framework.c:252)
==2379==    by 0x4CFCFD9: mca_bml_base_close (bml_base_frame.c:130)
==2379==    by 0x53A5F6F: mca_base_framework_close (mca_base_framework.c:252)
==2379==    by 0x4C76C01: ompi_mpi_finalize (ompi_mpi_finalize.c:449)
==2379==    by 0x4CB62F1: PMPI_Finalize (pfinalize.c:54)
==2379==    by 0x4008E9: main (hello_c.c:24)
==2379== 
==2379== Use of uninitialised value of size 8
==2379==    at 0x535A34B: opal_thread_compare_exchange_strong_32 (thread_usage.h:156)
==2379==    by 0x535B34E: opal_interval_tree_reader_get_token (opal_interval_tree.c:133)
==2379==    by 0x535C8A2: opal_interval_tree_traverse (opal_interval_tree.c:705)
==2379==    by 0x54287C2: mca_rcache_base_vma_tree_iterate (rcache_base_vma_tree.c:105)
==2379==    by 0x5428391: mca_rcache_base_vma_iterate (rcache_base_vma.c:153)
==2379==    by 0xA103374: mca_rcache_grdma_finalize (rcache_grdma_module.c:543)
==2379==    by 0x5427D16: mca_rcache_base_module_destroy (rcache_base_create.c:113)
==2379==    by 0xA7179A4: mca_btl_ofi_finalize (btl_ofi_module.c:317)
==2379==    by 0x53C3172: mca_btl_base_close (btl_base_frame.c:203)
==2379==    by 0x53A5F6F: mca_base_framework_close (mca_base_framework.c:252)
==2379==    by 0x4CFCFD9: mca_bml_base_close (bml_base_frame.c:130)
==2379==    by 0x53A5F6F: mca_base_framework_close (mca_base_framework.c:252)
==2379==    by 0x4C76C01: ompi_mpi_finalize (ompi_mpi_finalize.c:449)
==2379==    by 0x4CB62F1: PMPI_Finalize (pfinalize.c:54)
==2379==    by 0x4008E9: main (hello_c.c:24)
==2379== 
==2379== Use of uninitialised value of size 8
==2379==    at 0x535B390: opal_interval_tree_reader_return_token (opal_interval_tree.c:141)
==2379==    by 0x535C8F1: opal_interval_tree_traverse (opal_interval_tree.c:707)
==2379==    by 0x54287C2: mca_rcache_base_vma_tree_iterate (rcache_base_vma_tree.c:105)
==2379==    by 0x5428391: mca_rcache_base_vma_iterate (rcache_base_vma.c:153)
==2379==    by 0xA103374: mca_rcache_grdma_finalize (rcache_grdma_module.c:543)
==2379==    by 0x5427D16: mca_rcache_base_module_destroy (rcache_base_create.c:113)
==2379==    by 0xA7179A4: mca_btl_ofi_finalize (btl_ofi_module.c:317)
==2379==    by 0x53C3172: mca_btl_base_close (btl_base_frame.c:203)
==2379==    by 0x53A5F6F: mca_base_framework_close (mca_base_framework.c:252)
==2379==    by 0x4CFCFD9: mca_bml_base_close (bml_base_frame.c:130)
==2379==    by 0x53A5F6F: mca_base_framework_close (mca_base_framework.c:252)
==2379==    by 0x4C76C01: ompi_mpi_finalize (ompi_mpi_finalize.c:449)
==2379==    by 0x4CB62F1: PMPI_Finalize (pfinalize.c:54)
==2379==    by 0x4008E9: main (hello_c.c:24)

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions