Skip to content

Update Tribol and MFEM#1534

Merged
chapman39 merged 21 commits intodevelopfrom
task/chapman39/update-tpls-2026-02-05
Feb 27, 2026
Merged

Update Tribol and MFEM#1534
chapman39 merged 21 commits intodevelopfrom
task/chapman39/update-tpls-2026-02-05

Conversation

@chapman39
Copy link
Collaborator

@chapman39 chapman39 commented Feb 5, 2026

This PR

mfem patches

about dfem-related tests and examples

test_dfem_explicit_dynamics, dfem_gpu, and dfem_force_gpu cannot yet run with a GPU execution space. Each get this warning:

/usr/WS2/smithdev/libs/smith/toss_4_x86_64_ib_cray/2026_02_19_11_29_48/llvm-amdgpu-6.4.2/mfem-4.9.1.1-qga6acnz7ejqrjnjn4i2qsixu3yuycd3/include/mfem/fem/dfem/util.hpp:602:17: error: failed to meet occupancy target given by 'amdgpu-waves-per-eu' in '_ZN4mfem6future19forall_kernel_shmemIZZZNS0_22DifferentiableOperator13AddIntegratorINS0_6Entity7ElementEN5smith14InnerQFunctionINS0_6tensorIdJLi2ELi2EEEENS6_18StressDivQFunctionINS6_37StVenantKirchhoffWithFieldDensityDfemEJNS6_15ScalarParameterILi0EEEEEEJRKS9_SG_SG_SG_ddEEENS0_5tupleIJNS0_8GradientILi5EEENSJ_ILi0EEENSJ_ILi1EEENSJ_ILi2EEENSJ_ILi3EEENS0_6WeightENS0_5ValueILi4EEEEEENSI_IJNS0_3SumILi5EEEEEESt16integer_sequenceImJLm0ELm4EEEEEvRT0_T1_T2_RKNS_15IntegrationRuleERKNS_5ArrayIiEET3_ENKUlmE_clEmENUlRSt6vectorINS_6VectorESaIS1C_EERKS1C_E_clES1F_S1H_EUliPdE_EEvT_i': desired occupancy was 4, final occupancy is 3 [-Werror,-Wpass-failed]

i decided to revert that error back to a warning. but it may be possible to resolve it by simplifying the kernel resource usage (e.g. reducing ir_order) - according to livchat at least. given my limited physics knowledge i decided not to go too deep into that route.

the test_dfem_explicit_dynamics test is the only one of the 3 programs listed above that actually used gpu as execution space. as a result, i get this error after the test passes. (output has been simplified)

[meemee@rzadams1017:build-rzadams-toss_4_x86_64_ib_cray-llvm-amdgpu@6.4.2_hip-release]$ f1 ./tests/test_dfem_explicit_dynamics 
flux run -N1 -x ./tests/test_dfem_explicit_dynamics

...

Global test environment tear-down
[==========] 1 test from 1 test suite ran. (266 ms total)
[  PASSED  ] 1 test.
terminate called after throwing an instance of 'umpire::runtime_error'
  what():  ! Umpire runtime_error [/usr/WS2/smithdev/libs/smith/toss_4_x86_64_ib_cray/2026_02_13_17_24_00/build_stage/meemee/spack-stage-umpire-2025.09.0-ifzxz2gfr6lwe2noancq2srjpbwxvhot/spack-src/src/umpire/util/AllocationMap.cpp:255]: Cannot remove 0x1ba37f50
    Backtrace: 15 frames
    0 0xddc975 No dladdr: ./tests/
...
    14 0xc24a1e No dladdr: ./tests/test_dfem_explicit_dynamics(_start+0x2e) [0xc24a1e]

[SIGNAL]: Received signal 6 (Aborted), exiting
terminate called recursively

to resolve this error for now i set the execution space to CPU, though we will probably want to fix this soon, so ill make an issue. #1546

@chapman39 chapman39 self-assigned this Feb 5, 2026
@chapman39 chapman39 added the TPL Third-party libraries label Feb 5, 2026
@chapman39 chapman39 marked this pull request as ready for review February 24, 2026 18:43
packages:
axom:
require:
- spec: "@0.12.0.1"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure if you need this for your current development, but we recently released axom@0.13

set(CALIPER_DIR "${TPL_ROOT}/caliper-2.12.1-am6ehnzp2m2z5f6ekxcuwrf2ugzhewqi" CACHE PATH "")

# ENZYME not built
set(ENZYME_DIR "${TPL_ROOT}/enzyme-0.0.249-zpsgmqbv7vldpmzugxl7vx3n5b7qapkk" CACHE PATH "")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🔥

@chapman39 chapman39 merged commit 7a15e3d into develop Feb 27, 2026
10 checks passed
@chapman39 chapman39 deleted the task/chapman39/update-tpls-2026-02-05 branch February 27, 2026 00:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

TPL Third-party libraries

Projects

None yet

Development

Successfully merging this pull request may close these issues.

tuo rocm build with +enzyme

5 participants