Extend container support to all Tier 1 platforms #85

Merged

@EdwardSnyder-NOAA EdwardSnyder-NOAA commented Mar 29, 2024

Description

This PR extends support for the Land DA develop container to all Tier 1 platforms. To accomplish this, a new script was added to the ufs-containers repo.

Most T1 platforms have Intel MPI installed and can run the workflow as described in the documentation. The exceptions are Derecho, Gaea, and Hercules, which use different Intel compilers and MPI implementations. On Derecho and Gaea, the setup_container.sh script from the ufs-containers repo must also be run. On Derecho, the script replaces the Slurm commands with their PBS equivalents, because PBS is the job scheduler used there. On Gaea, the script switches the mpiexec commands to srun and adds additional sbatch commands so that the jobs can run there.
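
For readers unfamiliar with this kind of rewrite, here is a minimal sketch of the idea. It is not the actual setup_container.sh from ufs-containers. The function name `adapt_for_platform` and the specific command mappings (sbatch to qsub, squeue to qstat, mpiexec to srun) are assumptions chosen for illustration, and the additional sbatch commands added for Gaea are not shown.

```shell
#!/usr/bin/env bash
# Hypothetical sketch only: NOT the real setup_container.sh.
# Reads job-script text on stdin and writes a platform-adapted copy to stdout.

adapt_for_platform() {
  local platform="$1"   # assumed lowercase platform name, e.g. "derecho" or "gaea"
  case "$platform" in
    derecho)
      # Derecho uses PBS, so swap Slurm commands for their PBS counterparts.
      # (Illustrative mapping; the real script may change more than this.)
      sed -e 's/sbatch/qsub/g' -e 's/squeue/qstat/g'
      ;;
    gaea)
      # Gaea launches MPI tasks with srun instead of mpiexec.
      sed -e 's/mpiexec/srun/g'
      ;;
    *)
      # Other platforms: pass the text through unchanged.
      cat
      ;;
  esac
}
```

For example, piping the line `mpiexec -n 4 ./exe` through `adapt_for_platform gaea` would yield `srun -n 4 ./exe`, while any platform name other than the two handled above leaves the input untouched.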

All the relevant information to run this new process was added to the documentation.

Some reorganization occurred in the develop branch, so renaming the submodules path was needed in the do_submit_cycle.sh script.

The container was tested with both the era5 and gswp3 cases on all T1 platforms. I can provide the locations of these tests if needed.

When this PR is merged, the Land DA develop container will need to be rebuilt and pushed to all T1 platforms.

Anticipated changes to regression tests:

  • Is a baseline change expected?

Subcomponents involved:

  • DA_update (ufs-community/land-DA)
  • ufsLand.fd (NOAA-EPIC/ufs-land-driver-emc-dev)
  • ufs_model.fd (ufs-community/ufs-weather-model)
  • vector2tile_converter.fd (NOAA-PSL/land-vector2tile)
  • none

Linked PR's and Issues:

Testing (for CM's):

  • RDHPCS
    • Hera
    • Orion
    • Jet
    • Gaea
    • Cheyenne
  • CI
    • Completed
  • PW-Clouds
    • AWS
    • AZURE
    • GCP

@chan-hoo chan-hoo commented Apr 1, 2024

This PR looks good. Approving. @jkbk2004, please let me know what you think.

@jkbk2004 jkbk2004 left a comment

It's not necessary to list all machines if we don't fully support analysis. But I don't have a technical issue that would block this PR.

@chan-hoo chan-hoo merged commit 90e503e into ufs-community:develop Apr 1, 2024
2 checks passed
@EdwardSnyder-NOAA EdwardSnyder-NOAA mentioned this pull request Apr 1, 2024
15 tasks