Build ufs_weather_model on Gaea-C5 + update cubed sphere gitmodules for perturbation/increments for cold starts #2271 #2269
Conversation
Hello. This PR is in support of NOAA-EMC/global-workflow#2535. Please let me know how I should proceed with testing. Since this PR will not affect other platforms, do I need to run RT on Hera/Derecho/Hercules? Thanks for your time! |
@DavidBurrows-NCO The develop branch builds and runs OK on Gaea-C5. Can you explain why we need the code changes in this PR? Are there any issues porting global-workflow to Gaea-C5? |
@jkbk2004 Have you looked into @DavidBurrows-NCO's changes to see if they work for the UFSWM on Gaea-C5? |
Sorry for accidentally closing and re-opening. Anyway, @BrianCurtis-NOAA @DavidBurrows-NCO the Gaea modulefile in this PR is a lot cleaner. Let me start a run on Gaea. @RatkoVasic-NOAA can you take a look at this modulefile update for Gaea? |
One thing that concerned me was the removal of stack-python, as it might cause issues with the abort_dep_tasks.py script. |
Good morning. The previous modulefile didn't load cmake, so you get the following error upon building:
I added the cmake module load and did some cleanup as well. The modulefile for Gaea now resembles those of gfs_utils, ufs_utils, and upp. |
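A missing build tool like the one described above can be caught before the build starts. The following is a minimal sketch, assuming only that the loaded modules put their tools on PATH; the function name `missing_tools` and the tool list are hypothetical and not part of the UFSWM build scripts.

```python
import shutil


def missing_tools(tools):
    """Return the names in `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    # Hypothetical tool list for illustration; adjust to what the build needs.
    missing = missing_tools(["cmake"])
    if missing:
        print("Not on PATH:", ", ".join(missing))
    else:
        print("All required build tools found")
```

Running this after loading the modulefile gives a quick indication of whether cmake is visible to the build environment.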
These changes worked for me. @BrianCurtis-NOAA can you give me an example so I can try to reproduce a failure with abort_dep_tasks.py? |
Mostly with importing the Python ecflow package in https://github.com/ufs-community/ufs-weather-model/blob/develop/tests/abort_dep_tasks.py#L3. But it sounds like it runs OK. @RatkoVasic-NOAA a decision point on your side is whether to control this specifically through stack-python. I think it's safe to keep moving with stack-python. @DavidBurrows-NCO can you put the stack-python line back so we can move on? |
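Whether the import concern applies in a given environment can be checked without running the full script. This is a small sketch using the standard library; the helper name `module_available` is hypothetical, and it only reports whether the active Python can locate a top-level package, not whether the package works correctly.

```python
import importlib.util


def module_available(name):
    """Return True if the top-level module `name` can be located by the active Python."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # "ecflow" is the package imported by tests/abort_dep_tasks.py.
    print("ecflow importable:", module_available("ecflow"))
```

Running it once with and once without the stack-python module loaded would show whether that module changes the result.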
3a9d23d to db99a8b
@BrianCurtis-NOAA @RatkoVasic-NOAA I added stack-python load back into the module file and tested a build. |
This concern is based only on rt.sh failing when run with ECFlow on Gaea. @RatkoVasic-NOAA are you using ecflow to run the rt.sh suite? |
@BrianCurtis-NOAA I didn't run the full test suite, just about 10 test cases to see if it breaks compilation or some executions. I used rocoto. I can now try the same thing with ecflow. |
The 10 selected tests worked for me using both ecflow and rocoto. |
OK Great. @jkbk2004 please run the full rt suite on Gaea-C5 using ecflow and we can then merge this PR with another non-baseline changing PR once that's completed. |
Sure! |
@DavidBurrows-NCO The rap clm_lake case is crashing with this PR. The experiment is at /gpfs/f5/epic/scratch/Jong.Kim/RT_RUNDIRS |
@jkbk2004 I am also getting a failure with that test. Here are some highlights from the output:
Are you getting the same error? I don't see any other hints in the working directory for a reason for this failure. |
I cannot see your directories due to restrictive permissions. What you've shown me so far are errors that have nothing to do with CLM lake. I cannot debug further until I see the stack trace. |
@SamuelTrahanNOAA Can I move the working dir somewhere on Gaea for you? My Gaea groups are epic and ufs-ard.
Thanks, Sam! |
I don't know where your working directory resides. You don't necessarily need to move it; I just need the permissions to allow world access. This is not a stack trace:
The stack trace would include a trace of the files and line numbers to the point in the stack with the divide by zero. |
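The world-access requirement mentioned above can be checked before sharing a path. The sketch below assumes a POSIX file system and looks only at the classic mode bits (any ACLs are ignored); the function name `blocked_for_others` is hypothetical.

```python
import stat
from pathlib import Path


def blocked_for_others(path):
    """Return the path components that users outside the owner and group cannot use.

    The target directory needs read and execute bits for 'others', each parent
    directory needs the execute (traverse) bit, and a target file needs read.
    """
    target = Path(path).resolve()
    blocked = []
    for component in [target, *target.parents]:
        mode = component.stat().st_mode
        if stat.S_ISDIR(mode):
            if component == target:
                needed = stat.S_IROTH | stat.S_IXOTH
            else:
                needed = stat.S_IXOTH
        else:
            needed = stat.S_IROTH
        if mode & needed != needed:
            blocked.append(str(component))
    return blocked
```

An empty list suggests the path is reachable by other users; any entry returned is a component whose permissions would need to be opened up.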
@SamuelTrahanNOAA The working dir is here: |
HYCOM-interface has a change but no .gitmodules entry. Please fix. |
The HYCOM hash is the one already committed on emc/develop. Good to go with the hash update. |
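A check of this kind can be scripted. The following is a rough sketch, assuming the .gitmodules content is available as text and the list of changed submodule paths is supplied by the caller; the function names and the sample paths in the usage note are hypothetical.

```python
import configparser


def submodule_paths(gitmodules_text):
    """Return the set of `path` values declared in .gitmodules content."""
    parser = configparser.ConfigParser(interpolation=None)
    # Strip indentation so indented keys are never read as continuation lines.
    cleaned = "\n".join(line.strip() for line in gitmodules_text.splitlines())
    parser.read_string(cleaned)
    return {
        parser[section]["path"]
        for section in parser.sections()
        if "path" in parser[section]
    }


def missing_entries(changed_paths, gitmodules_text):
    """Return the changed submodule paths that have no .gitmodules entry."""
    declared = submodule_paths(gitmodules_text)
    return sorted(p for p in changed_paths if p not in declared)
```

For example, calling `missing_entries(["FV3", "HYCOM-interface/HYCOM"], text)` with a `text` that declares only `FV3` would return the second path.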
on-behalf-of @ufs-community <ecc.platform@noaa.gov>
@jkbk2004 @FernandoAndrade-NOAA I talked to Brian offline. We're good to skip acorn. |
Thanks for everyone's help with this PR @jkbk2004 @zach1221 @FernandoAndrade-NOAA @BrianCurtis-NOAA @SamuelTrahanNOAA @RatkoVasic-NOAA @DusanJovic-NOAA @junwang-noaa @weihuang-jedi |
Commit Queue Requirements:
Description:
Commit Message:
Priority:
Git Tracking
UFSWM:
Sub component Pull Requests:
UFSWM Blocking Dependencies:
Changes
Regression Test Changes (Please commit test_changes.list):
Input data Changes:
Library Changes/Upgrades:
Testing Log: