You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
To efficiently run on multi GPU nodes we need to control access at a per GPU level. RESOURCE_GROUPS would allow multiple tests using one GPU each to run simulatanously. Currently we have a single lock and only 1 GPU is used. RESOURCE_GROUPS would also accommodate future multiple GPU tests. https://cmake.org/cmake/help/latest/prop_test/RESOURCE_GROUPS.html
Scripts would have to read an environment or input variable for non-default #GPUs=1
Describe the solution you'd like
Switch from LOCKing to resource groups.
Describe alternatives you've considered
None.
Additional context
Essential to make good use of multi-GPU nodes and allow e.g. efficient running of the performance tests on them.
Threads, MPI, and cpu cores could also be handled similarly but GPUs are most constrained.
The text was updated successfully, but these errors were encountered:
Hmm. If there is not a better solution, we could simply abort for GPU builds when the environment variable is not set and give the user the instructions to set it.
e.g. No --resource-spec-file given, abort but give a link to a basic one we include in our repo.
Is your feature request related to a problem? Please describe.
To efficiently run on multi GPU nodes we need to control access at a per GPU level. RESOURCE_GROUPS would allow multiple tests using one GPU each to run simulatanously. Currently we have a single lock and only 1 GPU is used. RESOURCE_GROUPS would also accommodate future multiple GPU tests. https://cmake.org/cmake/help/latest/prop_test/RESOURCE_GROUPS.html
Scripts would have to read an environment or input variable for non-default #GPUs=1
Describe the solution you'd like
Switch from LOCKing to resource groups.
Describe alternatives you've considered
None.
Additional context
Essential to make good use of multi-GPU nodes and allow e.g. efficient running of the performance tests on them.
Threads, MPI, and cpu cores could also be handled similarly but GPUs are most constrained.
The text was updated successfully, but these errors were encountered: