Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added ase Nudged Elastic Band in the NewtonNet recipe #2176

Merged
merged 514 commits into from
Jan 10, 2025

Conversation

kumaranu
Copy link
Contributor

@kumaranu kumaranu commented May 24, 2024

Summary of Changes

>>
Target here is to add ase's NEB as the double ended transition state search method for NewtonNet recipe.

  • I need to make sure that geodesic code is added as a preprocessor for the double ended transition state search.
  • NEB will take the output of Geodesic
  • Sella will them run from the top point of NEB.
  • Tests will need to be added for all the files keeping the coverage of more than 99% before taking it for review.
  • Currently, there is a lot of unstructured code in the ts.py and in the test. This will need to be cleaned up to match with the rest of the recipe's in QuAcc.
  • The summary dictionaries will need to be written so that the data can be passed back to the database.
  • Also, the requirements will be added in the NewtonNet's recipe, however, it is a general problem where ideally the calculator can be changed. If a decision is made later on to move it out of NewtonNet then things will (hopefully) be still usable a general path optimization process and can be added along with the static, relax, freq, and thermo jobs. But, right now, NewtonNet seems like an OK place for this to be put in.

<<

Checklist

  • I have read the "Guidelines" section of the contributing guide. Don't lie! 😉
  • My PR is on a custom branch and is not named main.
  • I have added relevant, comprehensive unit tests.

Notes

  • Your PR will likely not be merged without proper and thorough tests.
  • If you are an external contributor, you will see a comment from @buildbot-princeton. This is solely for the maintainers.
  • When your code is ready for review, ping one of the active maintainers.

@buildbot-princeton
Copy link
Collaborator

Can one of the admins verify this patch?

@kumaranu kumaranu changed the title Neb nn WIP: Added ase Nudged Elastic Band in the NewtonNet recipe May 24, 2024
Copy link

codecov bot commented May 24, 2024

Codecov Report

Attention: Patch coverage is 98.17073% with 3 lines in your changes missing coverage. Please review.

Project coverage is 97.46%. Comparing base (31484cd) to head (9ee830d).
Report is 3 commits behind head on main.

Files with missing lines Patch % Lines
src/quacc/runners/ase.py 93.75% 3 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2176      +/-   ##
==========================================
+ Coverage   97.44%   97.46%   +0.02%     
==========================================
  Files          85       86       +1     
  Lines        3557     3708     +151     
==========================================
+ Hits         3466     3614     +148     
- Misses         91       94       +3     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @kumaranu: I know this is still a WIP but I wanted to provide some initial comments. The biggest thing to note is that in quacc you cannot write things out to the current working directory or any relative paths. Quacc does not cd into the working directory and so there is no guarantee that the current working directory or any relative path is going to be where the runtime/results directory is.

If you are looking to write out files before the calculation completes, then you need write it out based on the tmpdir that is made when quacc.runners.pre.calc_setup is called. For instance, in phonopy, you can see that certain files are written out (e.g. phonopy.yaml, phonopy_auto_band_structure.yaml) based on the tmpdir of the runtime directory:

# Perform staging operations
tmpdir, job_results_dir = calc_setup(atoms)
# Run phonopy
phonon.forces = forces
phonon.produce_force_constants()
phonon.run_mesh(with_eigenvectors=True)
phonon.run_total_dos()
phonon.run_thermal_properties(t_step=t_step, t_max=t_max, t_min=t_min)
phonon.auto_band_structure(
write_yaml=True, filename=Path(tmpdir, "phonopy_auto_band_structure.yaml")
)
phonon.save(Path(tmpdir, "phonopy.yaml"), settings={"force_constants": True})
phonon.directory = job_results_dir

If you are writing to disk after the calculation completes and a schema is returned, you can query schema["dir_name"] to get the path where the results folder is. In any case, there is going to need to be a fairly significant amount of refactoring here to ensure that your I/O goes where you expect it to go. These functions may have been working fine for you because you are using FireWorks which cd's into a custom directory for each run, but that is very unique to FireWorks. If you were to run these calculations without a workflow engine, for instance, you'll see that the files are not being written out to the unique results folder for that calculation.

Most of your issues in this PR boil down to the fact that you are trying to put too much into the recipes themselves, which are meant to be very well-defined units of work. In reality, most of your additions here should be quacc.runners functions that are called within your new recipes. I/O happens in runners and schemas depending on if it's meant to be done before/during or after the calculation. If using ChatGPT, it has no knowledge of this.

pyproject.toml Outdated
@@ -49,7 +49,7 @@ defects = ["pymatgen-analysis-defects>=2023.8.22", "shakenbreak>=3.2.0"]
jobflow = ["jobflow[fireworks]>=0.1.14", "jobflow-remote>=0.1.0"]
mlp = ["matgl>=1.0.0", "chgnet>=0.3.3", "mace-torch>=0.3.3", "torch-dftd>=0.4.0"]
mp = ["atomate2>=0.0.14"]
newtonnet = ["newtonnet>=1.1"]
newtonnet = ["newtonnet>=1.1", "geodesic-interpolate @ git+https://github.com/virtualzx-nad/geodesic-interpolate.git"]
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can't have direct git links in pyproject.toml (unfortunately) because PyPI does not allow us to upload new version of quacc with URLs. The best course of action here is to simply make it clear in the @requires decorator that this package should be installed.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @Andrew-S-Rosen ,
I want to have tests that run for geodesic interpolate on GitHub. Is there a way you see to do that in case we are not able to put URLs?
Thanks.

Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen Jun 5, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kumaranu: Simply remove the direct link from pyproject.toml. The requirements.txt files in the tests folder are what is used by CI.

src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
@Andrew-S-Rosen Andrew-S-Rosen marked this pull request as draft May 31, 2024 03:15
@kumaranu
Copy link
Contributor Author

kumaranu commented Jun 10, 2024

Hi @Andrew-S-Rosen

I have added a function in runners/ase.py with its test.

My test is running alright locally but it is complaining that the required packages are not installed such as "sella", "NewtonNet", "geodesic" etc.
https://github.com/Quantum-Accelerators/quacc/actions/runs/9449377214/job/26025492513?pr=2176

Maybe the reason for this problem is that requirements-newtonnet.txt is not called for the packages that are present inside runners/ase.py. But I do not know which file will used to get that list of required packages.

Please help resolve this issue. Thanks.

Tagging Sam here as I was discussing this issue with him recently.
@samblau

@kumaranu
Copy link
Contributor Author

Hi @Andrew-S-Rosen

I have added a function in runners/ase.py with its test.

My test is running alright locally but it is complaining that the required packages are not installed such as "sella", "NewtonNet", "geodesic" etc. https://github.com/Quantum-Accelerators/quacc/actions/runs/9449377214/job/26025492513?pr=2176

Maybe the reason for this problem is that requirements-newtonnet.txt is not called for the packages that are present inside runners/ase.py. But I do not know which file will used to get that list of required packages.

Please help resolve this issue. Thanks.

Tagging Sam here as I was discussing this issue with him recently. @samblau

I added the following three lines to requirements.txt inside the tests directory to solve that problem:
newtonnet==1.1.1
geodesic-interpolate @ git+https://github.com/virtualzx-nad/geodesic-interpolate.git
sella==2.3.4

@@ -133,7 +133,7 @@ def test_ase_relax_job(tmp_path, monkeypatch):
output["parameters"]["orcasimpleinput"]
== "def2-tzvp engrad normalprint wb97x-d3bj xyzfile"
)
assert output["parameters_opt"]["fmax"] == 0.1
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please don't change pre-existing tests. They should pass just fine.

pyproject.toml Outdated
@@ -50,7 +50,7 @@ jobflow = ["jobflow[fireworks]>=0.1.14", "jobflow-remote>=0.1.0"]
mlp = ["matgl>=1.1.2", "chgnet>=0.3.3", "mace-torch>=0.3.3", "torch-dftd>=0.4.0"]
mp = ["atomate2>=0.0.14"]
newtonnet = ["newtonnet>=1.1"]
parsl = ["parsl[monitoring]>=2024.5.27; platform_system!='Windows'"]
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please don't change unrelated versions. They should work just fine.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't know what was changed here. I see that this file is the same as the one in Quantum-Accelerator: main. Maybe you changed it back already.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did I change the parsl version from 2023.10.23 to 2.24.5.27?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I took care of it. However, there are several other changes that still need to be reverted back, as shown in the file comparison view.

@kumaranu
Copy link
Contributor Author

kumaranu commented Jun 10, 2024

Hi @Andrew-S-Rosen
I have added a function in runners/ase.py with its test.
My test is running alright locally but it is complaining that the required packages are not installed such as "sella", "NewtonNet", "geodesic" etc. https://github.com/Quantum-Accelerators/quacc/actions/runs/9449377214/job/26025492513?pr=2176
Maybe the reason for this problem is that requirements-newtonnet.txt is not called for the packages that are present inside runners/ase.py. But I do not know which file will used to get that list of required packages.
Please help resolve this issue. Thanks.
Tagging Sam here as I was discussing this issue with him recently. @samblau

I added the following three lines to requirements.txt inside the tests directory to solve that problem: newtonnet==1.1.1 geodesic-interpolate @ git+https://github.com/virtualzx-nad/geodesic-interpolate.git sella==2.3.4

Although the original problem is solved, I saw two new errors, tblite and mlp tests both of which I did not touch.

mlp test were just complaining that they could not find mace. I added the following lines in the requirements.txt to get rid of that error.
mace-torch==0.3.4
torch-dftd==0.4.0

However, the error in tblite seems much different as the tests ran fine but the numbers don't match. I have left that as it is:
https://github.com/Quantum-Accelerators/quacc/actions/runs/9456768276/job/26049315963?pr=2176

@Andrew-S-Rosen
Copy link
Member

@kumaranu: The tests/requirements.txt file cannot have the various packages you have added to it. It is meant to include a minimum set of packages corresponding to the tests-core CI run. If your tests require things like mace, they need to be running with the tests-mlps run. The reason the TBLite issues are happening is because TBLite is not compatible with torch (tblite/tblite#110).

I do not quite understand what your goal is here with tests/requirements.txt. You have added NewtonNet tests, which should only run if NewtonNet is installed.

@kumaranu
Copy link
Contributor Author

@kumaranu: The tests/requirements.txt file cannot have the various packages you have added to it. It is meant to include a minimum set of packages corresponding to the tests-core CI run. If your tests require things like mace, they need to be running with the tests-mlps run. The reason the TBLite issues are happening is because TBLite is not compatible with torch (tblite/tblite#110).

I do not quite understand what your goal is here with tests/requirements.txt. You have added NewtonNet tests, which should only run if NewtonNet is installed.

I have removed NewtonNet and Sella from the test_ase.py requirements inside runners directory. It is using BFGS and EMT now.
@Andrew-S-Rosen @samblau

@Andrew-S-Rosen
Copy link
Member

@kumaranu: Thanks. I will review the PR once you undo all the unrelated changes you've made (there are still many) and remove commented out code blocks.

@@ -320,6 +326,153 @@ def run_vib(
return vib


def run_path_opt(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's rename this run_neb.

@@ -275,6 +275,40 @@ def summarize_vib_and_thermo(
)


def summarize_path_opt_run(dyn: Optimizer) -> OptSchema:
Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen Jun 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And then we can rename this summarize_neb_run. It also needs a proper docstring.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some more changes to revert later.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some more changes to revert later.

Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @kumaranu. I have provided an initial review below. There are many areas for cleanup, but overall the logic seems mostly reasonable. If you can address these changes, that would be ideal. Please note that you should also be checking GitHub Actions CI to see what tests fail. You'll see that your docs and pre-commit CI runners failed. Clicking the "details" will show you why.

Once you make these changes, I will review it once more and can assist with remaining issues. Thank you. Please also "resolve" the comments when you are done addressing them.

src/quacc/__init__.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
tests/core/runners/test_ase.py Outdated Show resolved Hide resolved
)
],
)
def test_run_neb(
Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen Jun 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Your test of run_neb should not rely on geodesic-interpolate. We want to make sure that we are only testing one thing at a time (hence the name a "unit test").

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The NEB function needs list of ASE images. I am avoiding it because it will just become a long list of coordinates in this file. Not sure how to do it cleanly.

tests/core/runners/test_ase.py Outdated Show resolved Hide resolved
Comment on lines 240 to 242
assert neb_summary["trajectory_results"][1]["energy"] == pytest.approx(
1.098, abs=0.01
)
Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen Jun 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A more thorough test of the output schema is required. Also, in general, let's not use such a loose tolerance if possible. This will help us capture errors appropriately.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same problem as the geodesic here because I am using geodesic as the precursor to this step.

Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen Jun 21, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are too many things being tested here at once is the major problem. For instance, runners/test_ase.py should be testing the runner: run_neb. It should be asserting that run_neb() functions appropriately and nothing else. There should be no major precursor to that step and nothing afterwards (i.e. no summarizing).

You'd then make a separate test for summarizing (e.g. tests.cores.schemas) and so on and so forth.

The idea of a "unit test" is that you are testing a specific unit of work.

Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen Jun 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please do not make unrelated changes in your PR like the code changes in this file.

@Andrew-S-Rosen Andrew-S-Rosen self-assigned this Jun 22, 2024
@kumaranu
Copy link
Contributor Author

kumaranu commented Jun 27, 2024

Hi @Andrew-S-Rosen,

I am not sure why one of my tests is failing on Windows all of a sudden. Please let me know if you see anything obvious to you. Thanks.

@Andrew-S-Rosen
Copy link
Member

It looks like a tolerance issue and should be not a problem. You can change the tolerance slightly (just enough for the tests to pass). I am not sure the source either.

Copy link
Member

@Andrew-S-Rosen Andrew-S-Rosen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @kumaranu and @samblau: Please see my comments below. Thank you again for your contribution, and I hope that this helps.

tests/core/runners/test_ase.py Outdated Show resolved Hide resolved
tests/core/runners/test_ase.py Outdated Show resolved Hide resolved
tests/core/runners/test_ase.py Outdated Show resolved Hide resolved
tests/core/runners/test_ase.py Outdated Show resolved Hide resolved
src/quacc/schemas/ase.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
Comment on lines 48 to 59
class NebSchema(TypedDict):
relax_reactant: OptSchema
relax_product: OptSchema
geodesic_results: list[Atoms]
neb_results: dict

class NebTsSchema(TypedDict):
relax_reactant: OptSchema
relax_product: OptSchema
geodesic_results: list[Atoms]
neb_results: dict
ts_results: TSSchema
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The neb_results: dict should use the type hint for the return from summarize_neb_run (if we keep that function) rather than dict.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure how to do this one. I see that quacc.types has the base classes to define the schemas but I don't have those for summarize_neb_run.

src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
src/quacc/recipes/newtonnet/ts.py Outdated Show resolved Hide resolved
Comment on lines 285 to 300
@job
@requires(
has_newtonnet, "NewtonNet must be installed. Refer to the quacc documentation."
)
@requires(
has_geodesic_interpolate,
"geodesic-interpolate must be installed. Refer to the quacc documentation.",
)
def neb_job(
reactant_atoms: Atoms,
product_atoms: Atoms,
relax_job_kwargs: dict[str, Any] | None = None,
calc_kwargs: dict[str, Any] | None = None,
geodesic_interpolate_kwargs: dict[str, Any] | None = None,
neb_kwargs: dict[str, Any] | None = None,
) -> NebSchema:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I haven't commented on this previously but really should now:

In quacc, the @job is a discrete unit of work. A @flow is a workflow, concatenating multiple distinct @jobs. Here (and in your other recipes, for what it's worth), you are actually doing a workflow. You have two relaxations (two "jobs") before running an NEB calculation.

Normally, I would say this is not permitted. If we were running a Q-Chem calculation for instance, this would be highly undesirable because you would want the reactant and product relaxation jobs to be done concurrently, you'd want those results to be stored in your database, and if one fails you'd still like to store the results from the other. You can't do that if everything is packed in one @job.

The challenge here is that because NewtonNet and MLPs are so fast, it is tempting to pack everything into one @job. For users of workflow engines (like FireWorks) where each @job is a Slurm job, this means you'd be queuing more than you need. But not all workflow engines work that way (and even FireWorks has the option to not re-queue each job).

Jobflow does not support the @flow pattern, so I'm not going to inconvenience you by requiring that you implement it here. But I wanted to bring up this difference: what you are actually doing here is making a flow that combines steps from multiple jobs.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the recipes like neb_ts_job and geodesic_ts_job will most likely go away. I have left them working right now but will probably remove them after a discussion together. I have put the workflows on a separate repository so things like the flow can be done over there. I have however not use jobflow over there so the parallel nature of the execution of the codes and also the fireworks related functionalities don't exist in those workflows as well.

@kumaranu
Copy link
Contributor Author

hey @kumaranu , any progress on this? Thanks!

Hi @samblau ,
Things have been quite busy lately. Sorry, I couldn't get back to it earlier.

I reduced the timing of the tests by adding max_iter to them and also got rid of writing to the current directory problem. Let's see if it works out OK or not. Thanks!

Comment on lines 628 to 629
if interval > n_iter:
pass
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kumaranu: FYI that this code block doesn't actually do anything. I'm going to remove it, but if you meant for it to do something else just let me know.

@Andrew-S-Rosen Andrew-S-Rosen changed the title WIP: Added ase Nudged Elastic Band in the NewtonNet recipe Added ase Nudged Elastic Band in the NewtonNet recipe Jan 10, 2025
@Andrew-S-Rosen
Copy link
Member

Alright friends, @kumaranu and @samblau. It is time for the merge! Is it perfect? No. Is it good enough? We'll find out!

I did a lot of refactoring here, but things seem to be okay. Let's move.

@Andrew-S-Rosen Andrew-S-Rosen merged commit 6295c77 into Quantum-Accelerators:main Jan 10, 2025
20 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

4 participants