Feature/distance-preconditioning #1428

SaltyChiang · 2024-01-09T16:24:15Z

We have been working on the bottom quark with the clover action recently. We cannot obtain a stable plateau in the effective mass diagram even though we have set the residual to be 1e-15. I implement the distance preconditioning algorithm proposed in https://arxiv.org/pdf/1006.4028.pdf.

Two parameters distance_pc_alpha0 and distance_pc_t0 are added to QudaInverParam, which follow Eq.(9) in the paper. I treated the preconditioning as a special solver instead of a new Dirac action, so I just modified the Wilson Dslash and didn't make a new Dirac class. The preconditioning is only enabled when calling inverQuda.

I use a point source to solve propagators with different distance_pc_alpha0 and draw the effective mass diagram generated from the pseudoscalar meson correlation functions. The effect of the preconditioning is shown below.

The effective mass plot becomes much better with alpha0=1.5.

The preconditioning is only enabled for CG inverter now due to a lack of testing. This could be enabled in more inverters later.

I created wilson_distance.cu, wilson_clover_distance.cu and wilson_clover_distance_preconditioned.cu, because the compilation time will be extremely long if I add distance and non-distance functions in one file.

Use `alpha` and `source_time` in `QudaInvertParam` to configure distance preconditioning.

…over`.

…ead.

…ter compilation. It's not `std::if_enablt_t`'s fault.

maddyscientist

Thanks @SaltyChiang for this PR, and for making aware of this method (which I was not aware before your work 😄 ).

It's great to have this feature added to QUDA, but one issue is that there is a lot of code duplication here. A few ideas:

I wonder if we can have a single applyWilson function, that simply takes an extra template parameter distance_pc to avoid having two different definitions of applyWilson.
Then instead of

if (d == 3) {
  out += ratio_fwd * (U * in.project(d, proj_dir)).reconstruct(d, proj_dir);
} else {
  out += (U * in.project(d, proj_dir)).reconstruct(d, proj_dir);
}

we could use something like

out += ratio_fwd[d] * (U * in.project(d, proj_dir)).reconstruct(d, proj_dir);

where ratio_fwd would be a constexpr array that would trivially evaluate as 1.0 when distance_pc = 1 or if d < 3. I think that would allow us to have a single applyWilson definition.

For the different .cu files: perhaps we could have a single .hpp file that contains all the boilerplate and have this file included in the stub files dslash_wilson.cu and dslash_wilson_distance.cu, for example, where the only different between the two is a booled template parameter. E.g., like we have dslash_coarse.cu and dslash_coarse_dagger.cu, etc.

Have you given any consideration to the light quark preconditioning technique outlined in the paper?

include/dirac_quda.h

SaltyChiang · 2024-01-17T04:26:58Z

I wonder if we can have a single applyWilson function, that simply takes an extra template parameter distance_pc to avoid having two different definitions of applyWilson.

Then instead of
if (d == 3) {
  out += ratio_fwd * (U * in.project(d, proj_dir)).reconstruct(d, proj_dir);
} else {
  out += (U * in.project(d, proj_dir)).reconstruct(d, proj_dir);
}
we could use something like
out += ratio_fwd[d] * (U * in.project(d, proj_dir)).reconstruct(d, proj_dir);
where ratio_fwd would be a constexpr array that would trivially evaluate as 1.0 when distance_pc = 1 or if d < 3. I think that would allow us to have a single applyWilson definition.

@maddyscientist That's sounds better than my implementation. Thanks and I'll try to do that these days.

For the different .cu files: perhaps we could have a single .hpp file that contains all the boilerplate and have this file included in the stub files dslash_wilson.cu and dslash_wilson_distance.cu, for example, where the only different between the two is a booled template parameter. E.g., like we have dslash_coarse.cu and dslash_coarse_dagger.cu, etc.

I'll check these files, thank you for your advice.

Have you given any consideration to the light quark preconditioning technique outlined in the paper?

Sure. The light quark preconditioning described in the paper requires a 4D $\alpha(x)$. Notice the coefficient is $\frac{1}{\cosh[\alpha_0(x_\mu-L_\mu/2)]}$, which is the inverse of that in the heavy quark case. Preconditioning in space dimensions seems to make sense only if we solve the propagator on a point source, which is not a common situation. Also, you might notice that I force the alpha0 parameter to be positive here:

quda/lib/interface_quda.cpp

Lines 2742 to 2745 in b9be584

    
           // Force the alpha0 to be positive. 
        
           // A negative alpha0 matches something like Eq.(12) in arXiv:1006.4028, but the effect doesn't 
        
           // seem to be good. Disable the negative situation as QUDA already has multigrid for light quarks. 
        
           const double alpha0 = abs(param->distance_pc_alpha0);

In fact, leaving the alpha0 negative will trigger a t-direction light quark preconditioning. (I use the sign of alpha0 to decide whether to apply the coefficient or the inverse) I've tried this with the CG solver, but don't see a significant improvement in iteration numbers. Using multigrid for light quarks should be a much better strategy in my opinion.

mathiaswagner

just reviewed cmake but that looks uncontroversial.

…ndition.

SaltyChiang · 2024-04-12T15:14:09Z

@maddyscientist I move some common parts of dslash subroutine into public hpp files for Wilson and clover. I tried your advice about setting coefficients in Wilson kernel and the performance without distance preconditioning is the same as before. Do you think these modifications fit the QUDA's standard? I'll add more comments and tests for distance precondition later.

maddyscientist

Hi @SaltyChiang, I see your changes and this looks like a great improvement, thanks for your efforts here. I think we'll be able to get this merged with a bit more cleanup. I've left some comments, but some other outstanding things:

Make the distance preconditioning an optional CMake parameter, so that a user can disable it if they don't need it to reduce compilation time and binary size
Provide a sensible error message if distance preconditioning is requested for a Dirac operator type that it's not supported on
Can you think of a good test for the distance preconditioning? Ideally we'd want to augment the invert_test suite to test this.
- Also good to test that apply the reweighting to a quark field and removing it recovers the original field.
Apply clang format

include/kernels/dslash_wilson.cuh

include/kernels/dslash_wilson_clover.cuh

include/kernels/dslash_wilson_clover_preconditioned.cuh

SaltyChiang · 2024-04-13T19:16:31Z

Make the distance preconditioning an optional CMake parameter, so that a user can disable it if they don't need it to reduce compilation time and binary size

A CMake parameter called QUDA_WILSON_DISTANCE is added to enable the distance preconditioning code.

Provide a sensible error message if distance preconditioning is requested for a Dirac operator type that it's not supported on

I show that only Wilson and clover dslash support the distance preconditioning in the error message, not sure if this is enough.

Can you think of a good test for the distance preconditioning? Ideally we'd want to augment the invert_test suite to test this.

Also good to test that apply the reweighting to a quark field and removing it recovers the original field.

Two parameters --distance-pc-alpha0 and --distance-pc-t0 added to invert_test cli app. The verification code should be compatible with distance preconditioning now, as the solution vector should not differ much from the normal one. A function named verifySpinorDistanceReweight will be called if distance preconditioning is required. I'm wondering if the test implementation fits the QUDA style.

There are some other modifications:

Applying and removing distance reweighting is performed in MatQuda, MatDagMatQuda and dslashQuda.
invertMultiShiftQuda will raise an error if distance preconditioning parameters are set.
Requiring distance preconditioning with single or half cuda_prec and cuda_sloppy_prec raises a warning due to the poor convergence in this situation.
I'm not sure about the implementation of invertMultiSrcQuda, do you think it's safe to enable distance precondition here?
We are testing more solver types.

include/dirac_quda.h

maddyscientist · 2024-04-15T21:59:35Z

A CMake parameter called QUDA_WILSON_DISTANCE is added to enable the distance preconditioning code.

Maybe make this parameter QUDA_DIRAC_DISTANCE_PRECONDITIONING, since it might be extended to other Dirac operators in the future, and the more descriptive name will help explain it better.

I show that only Wilson and clover dslash support the distance preconditioning in the error message, not sure if this is enough.

That's fine, thank you.

Two parameters --distance-pc-alpha0 and --distance-pc-t0 added to invert_test cli app. The verification code should be compatible with distance preconditioning now, as the solution vector should not differ much from the normal one. A function named verifySpinorDistanceReweight will be called if distance preconditioning is required. I'm wondering if the test implementation fits the QUDA style.

For testing, I was thinking you should add some specific distance preconditioning to invert_test_gtest.hpp. That way, if the Dirac operator is being tested is the Wilson or Clover operator, and distance preconditioning is enabled, then we would test distance preconditioning with the CG solver. We have a bunch of conditional tests in invert_test_gtest at the moment, so it shouldn't be hard to add this. If we include a distance preconditioning test in invert_test_gtest, then it would be automatically run when ctest is run. If you're not sure how to do this, I can add it after we merge in your branch.

Applying and removing distance reweighting is performed in MatQuda, MatDagMatQuda and dslashQuda.

invertMultiShiftQuda will raise an error if distance preconditioning parameters are set.

Does distance preconditioning break the multi-shift solver? I haven't thought too hard about this, but naively the distance preconditioning shouldn't break the shifted nature of the Krylov space.

Requiring distance preconditioning with single or half cuda_prec and cuda_sloppy_prec raises a warning due to the poor convergence in this situation.

Understood, that's fine.

I'm not sure about the implementation of invertMultiSrcQuda, do you think it's safe to enable distance precondition here?

I think this should be fine, and just work.

Use `QUDA_DIRAC_DISTANCE_PRECONDITIONING` to build the code for distance preconditioning.

SaltyChiang · 2024-04-16T09:43:51Z

Maybe make this parameter QUDA_DIRAC_DISTANCE_PRECONDITIONING, since it might be extended to other Dirac operators in the future, and the more descriptive name will help explain it better.

The parameter is renamed to QUDA_DIRAC_DISTANCE_PRECONDITIONING now.

Does distance preconditioning break the multi-shift solver? I haven't thought too hard about this, but naively the distance preconditioning shouldn't break the shifted nature of the Krylov space.

There shouldn't be any mathematical problem with the multi-shift solver. I considered that the multi-shift solver is usually used in RHMC algorithm, where the source is a spinor in which all time slices are filled with random values. This indicates that the solution will not have a small magnitude at t far from distance_pc_t0, where the distance preconditioning does not give any improvement in precision.

For testing, I was thinking you should add some specific distance preconditioning to invert_test_gtest.hpp. That way, if the Dirac operator is being tested is the Wilson or Clover operator, and distance preconditioning is enabled, then we would test distance preconditioning with the CG solver. We have a bunch of conditional tests in invert_test_gtest at the moment, so it shouldn't be hard to add this. If we include a distance preconditioning test in invert_test_gtest, then it would be automatically run when ctest is run. If you're not sure how to do this, I can add it after we merge in your branch.

Some tests are skipped if distance preconditioning is enabled with --distance-pc-alpha0 and --distance-pc-t0. Some tests with distance preconditioning enabled are added to tests/CMakeLists.txt. Do you think the implementation is correct?

SaltyChiang · 2024-04-17T07:52:24Z

Enable distance preconditioning for other solvers except for multigrid. I tried some tests and they passed:

./invert_test --dslash-type wilson --dim 4 4 4 8 --niter 1000 --ngcrkrylov 8 --matpc even-even --distance-pc-alpha0 0.1 --distance-pc-t0 1 --enable-testing true
./invert_test --dslash-type clover --compute-clover true --dim 4 4 4 8 --niter 1000 --ngcrkrylov 8 --matpc even-even --distance-pc-alpha0 0.1 --distance-pc-t0 1 --enable-testing true
./invert_test --dslash-type clover --compute-clover true --dim 4 4 4 8 --niter 1000 --ngcrkrylov 8 --matpc even-even-asym --distance-pc-alpha0 0.1 --distance-pc-t0 1 --enable-testing true

maddyscientist

Thanks for all the work you've done on this @SaltyChiang. The PR is looking great. Just a couple of things left to do before merging this:

replicate the additions to QudaInvertParam in the Fortran interface
move a function definition into spinor_reweight.cu

Once you've done this, then we can get this merged.

lib/interface_quda.cpp

include/quda.h

maddyscientist · 2024-05-16T20:31:38Z

Also need this CI build error to be fixed

/home/ghrunner/runners/lattice/quda/_work/quda/quda/lib/interface_quda.cpp:1788:80: error: data argument not used by format string [-Werror,-Wformat-extra-args]
      errorQuda("Multigrid solver doesn't support distance preconditioning\n", param.inv_type);

Fix errors in the STRICT build.

…-preconditioning

maddyscientist · 2024-05-17T18:47:51Z

@Jenkins ok to test

SaltyChiang added 13 commits June 21, 2023 15:47

Apply distance preconditioning to Dslash.

c4a77c6

Use `alpha` and `source_time` in `QudaInvertParam` to configure distance preconditioning.

Merge branch 'develop' into feature/distance-preconditioning

06e64f9

Don't check alpha value in kernel.

e0d813f

Fix branch condition.

5ec23d8

Fix bugs while using multiple GPUs.

c074634

Synchronize alpha and source_time in ApplyWilson/`ApplyWilsonCl…

9add962

…over`.

Handle distance preconditioning parameters in Dirac.

0fc0b93

Reuse kernel code.

c1abed0

Reweight ColorSpinorField in quda for distance preconditioning.

f43e9b7

Merge branch 'develop' into feature/distance-preconditioning

516fa14

enable_if_t takes too much time to compile. Use if constexpr inst…

fc0c9a6

…ead.

Split distance and non-distance functions in sperate cu files for fas…

a40f987

…ter compilation. It's not `std::if_enablt_t`'s fault.

Add some comments about alpha0 and t0.

b9be584

SaltyChiang requested review from a team as code owners January 9, 2024 16:24

maddyscientist reviewed Jan 16, 2024

View reviewed changes

include/dirac_quda.h Outdated Show resolved Hide resolved

mathiaswagner approved these changes Feb 5, 2024

View reviewed changes

SaltyChiang added 2 commits February 6, 2024 20:31

Merge normal and distance preconditioned applyWilson.

bf040d5

clang-format.

b1562a2

SaltyChiang marked this pull request as draft March 8, 2024 09:43

SaltyChiang added 2 commits April 9, 2024 14:43

Merge branch 'develop' into feature/distance-preconditioning

bbfaf12

Use multiple inherit to construct Dslash argumets with distance preco…

383e337

…ndition.

SaltyChiang force-pushed the feature/distance-preconditioning branch 2 times, most recently from f1c1bbe to 12231df Compare April 9, 2024 18:30

Bug fix for distance precondition.

ea3567d

SaltyChiang force-pushed the feature/distance-preconditioning branch from 12231df to ea3567d Compare April 9, 2024 18:47

SaltyChiang added 2 commits April 10, 2024 22:55

Performance regression withou distance preconditioning.

133a1e2

Move common part of dslash into hpp files.

be532b7

SaltyChiang marked this pull request as ready for review April 12, 2024 15:09

SaltyChiang changed the title ~~Use distance preconditioning to solve precise heavy quark propagator with Wilson and Wilson-Clover actions.~~ Feature/distance-preconditioning Apr 12, 2024

maddyscientist requested changes Apr 12, 2024

View reviewed changes

include/kernels/dslash_wilson.cuh Outdated Show resolved Hide resolved

include/kernels/dslash_wilson_clover.cuh Outdated Show resolved Hide resolved

include/kernels/dslash_wilson_clover_preconditioned.cuh Outdated Show resolved Hide resolved

SaltyChiang added 2 commits April 13, 2024 18:28

Add QUDA_WILSON_DISTANCE to enable distance preconditioning.

0a7bb99

Add test for distance preconditioning.

56d2b81

Fix a bug in verifySpinorDistanceReweight for multi GPUs.

b7f0910

maddyscientist reviewed Apr 15, 2024

View reviewed changes

include/dirac_quda.h Outdated Show resolved Hide resolved

SaltyChiang added 2 commits April 16, 2024 15:48

Move distance preconditioning parameters from DiracWilson to Dirac.

1a410db

Use `QUDA_DIRAC_DISTANCE_PRECONDITIONING` to build the code for distance preconditioning.

Add tests about distance preconditioning.

38a5509

Enable distance preconditioining for other solvers except for multigrid.

59a4592

Fix typo and clang-format.

6e64e83

maddyscientist reviewed May 16, 2024

View reviewed changes

lib/interface_quda.cpp Show resolved Hide resolved

include/quda.h Show resolved Hide resolved

SaltyChiang added 2 commits May 17, 2024 11:39

Add parameters to the Fortran interface module.

afcdc8e

Fix errors in the STRICT build.

Merge remote-tracking branch 'upstream/develop' into feature/distance…

493876e

…-preconditioning

maddyscientist approved these changes May 17, 2024

View reviewed changes

maddyscientist merged commit f8855bb into lattice:develop May 17, 2024

SaltyChiang deleted the feature/distance-preconditioning branch May 18, 2024 10:26

SaltyChiang mentioned this pull request May 9, 2025

A new gauge fixing algorithm which returns the rotation field. #1481

Open

12 tasks

Feature/distance-preconditioning #1428

Feature/distance-preconditioning #1428

Uh oh!

Conversation

SaltyChiang commented Jan 9, 2024

Uh oh!

maddyscientist left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

SaltyChiang commented Jan 17, 2024

Uh oh!

mathiaswagner left a comment

Choose a reason for hiding this comment

Uh oh!

SaltyChiang commented Apr 12, 2024

Uh oh!

maddyscientist left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SaltyChiang commented Apr 13, 2024

Uh oh!

Uh oh!

maddyscientist commented Apr 15, 2024

Uh oh!

SaltyChiang commented Apr 16, 2024

Uh oh!

SaltyChiang commented Apr 17, 2024

Uh oh!

maddyscientist left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

maddyscientist commented May 16, 2024

Uh oh!

maddyscientist commented May 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants