Orion, Intel environment for spack-stack-1.8.0 breaks the system tar
command #1355
Description
Describe the bug
The Orion, Intel spack-stack-1.8.0 environment, specifically the LD_LIBRARY_PATH setting, interferes with the execution of the system tar
command. See the next section on reproducing the error.
Simply running tar
outside of the ecbuild command gets the same failure. After some tracing it appears that things go awry when loading the gzip functionality, where the spack-stack /apps/contrib/spack-stack/spack-stack-1.8.0/envs/ue-intel-2021.9.0/install/intel/2021.9.0/libxcrypt-4.4.35-ebrdc3w/lib/libcrypt.so.2
shared library gets loaded instead of the system libcrypt library (/usr/lib64/lib/libcrypt.so.2
).
I found that if LD_LIBRARY_PATH is unset (or /usr/lib64/lib is prepended to the front) that the tar command then works properly.
I need help with coming up with a workable fix for this. I've tried
- Alter the CRTM test/CMakeLists.txt file to unset LD_LIBRARY_PATH before running tar
- This involves the cmake execute_process command, and I have probably done something wrong when I tried this. It sure seems you should be able to get this to work.
- Defining and exporting a bash function for tar that that unsets LD_LIBRARY_PATH then runs /usr/bin/tar
- This works when run in the shell, but does not work when run from cmake
Does anyone have any ideas about how to address this?
Also, could someone check my environment setup to make sure I'm not missing something.
Thanks!
To Reproduce
Steps to reproduce the behavior:
load the intel environment by sourcing a script file that contains the following sequence:
#!/bin/bash
echo "Loading EWOK-SKYLAB Environment Using Spack-Stack 1.8.0"
SPACK_STACK_INTEL_ENV=/apps/contrib/spack-stack/spack-stack-1.8.0/envs/ue-intel-2021.9.0
# load modules
module purge
module use $SPACK_STACK_INTEL_ENV/install/modulefiles/Core
module load stack-intel/2021.9.0
module load stack-intel-oneapi-mpi/2021.9.0
module load stack-python/3.11.7
jedi-host-post-load() {
module swap git-lfs git-lfs/3.1.2
}
# This is a fix for the issue where the spack-stack-1.8.0 udunits
# module does not get loaded propery. Without this workaround, the
# udunits module from the "spack-managed" gets loaded instead and
# ecbuild on jedi-bundle fails.
#
# Setting LMOD_TMOD_FIND_FIRST gets rid of the default marking
# of modules, and the modification of MODULEPATH makes sure
# that spack-stack-1.8.0 modules are found first before same
# named modules in other directories (ie, "spack-managed")
export LMOD_TMOD_FIND_FIRST=yes
module use $SPACK_STACK_INTEL_ENV/install/modulefiles/intel/2021.9.0
# Load JEDI modules
module load jedi-fv3-env
module load ewok-env
module load soca-env
The export and module use commands near the end are the workaround to get the proper udunits package to load.
Run ecbuild:
ecbuild -DPython3_EXECUTABLE=$(which python3) $JEDI_SRC
This results in the following error:
-- Building tests for CRTM v3.1.1.
-- Downloading CRTM coeffs files from: https://bin.ssec.wisc.edu/pub/s4/CRTM//fix_REL-3.1.1.2.tgz to /work2/noaa/jcsda/herbener/jedi/build-intel/test_data/3.1.1/fix_REL-3.1.1.2.tgz
-- Checking if /work2/noaa/jcsda/herbener/jedi/build-intel/test_data/3.1.1/fix_REL-3.1.1.2 already exists...
-- Untarring the downloaded file (~2 minutes) to /work2/noaa/jcsda/herbener/jedi/build-intel/test_data/3.1.1
tar: Relink `/apps/spack-managed/gcc-11.3.1/intel-oneapi-compilers-2023.1.0-sb753366rvywq75zeg4ml5k5c72xgj72/compiler/2023.1.0/linux/compiler/lib/intel64_lin/libimf.so' with `/usr/lib64/libm.so.6' for IFUNC symbol `sincosf'
CMake Error at crtm/test/CMakeLists.txt:106 (message):
Failed to untar the file.
-- Configuring incomplete, errors occurred!
Expected behavior
The tar command run from the CRTM CMake configuration should complete successfully.
System:
Orion, Intel
Additional context
Add any other context about the problem here.
Metadata
Type
Projects
Status
Done