Improve mappers, add ModeBasedMapper #1301

grossardt · 2023-12-12T21:30:42Z

Summary

Adds a new class ModeBasedMapper which implements the basic functionality of the mode based mappers (JW, BK, parity, direct mapper) through a Pauli table. Reverts pauli_table to instance method and removes all caching from the parent class. As discussed in #1289.

Performance testing

I have performed the tests as in #545 and #644 with code and more plots available here. The results look like this:

Essentially, the effect of caching the Pauli table methods seems marginal. As there appear to be some fringe cases, where caching is somewhat useful (e.g. performances almost doubles for the BK and Parity mappers when mapping large numbers of length 10 number operators but not for length 5 or 15) and no real harm is being done, I implement the caching on the level of the individual pauli_table implementations. For this, the instance method pauli_table simply calls the private static method _pauli_table which is cached. This way, one can keep the caching for the existing mappers, but can also implement new mappers that actually use that pauli_table is an instance method.

coveralls · 2023-12-12T23:36:26Z

Pull Request Test Coverage Report for Build 8719469367

Details

74 of 76 (97.37%) changed or added relevant lines in 8 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.03%) to 86.908%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
qiskit_nature/second_q/mappers/mode_based_mapper.py	44	46	95.65%

Totals
Change from base Build 8719460949:	0.03%
Covered Lines:	8968
Relevant Lines:	10319

💛 - Coveralls

woodsp-ibm · 2023-12-13T15:32:57Z

In looking at the number op timing in your notebook, it looks like #644, in that the timing includes the operator build out. Which is not the case in the parity op one where the operator is built before the call to map() rather than being built as the operator parameter.

I do not know how closely things were looked at before. What might be interesting to know is if you put timing in the code around the pauli table buildout how long does that take in comparison to the overall mapping. I imagine that varies a lot though right since the table is built based on the operator width (register size) but the mapping depends on how many terms the operator has.

#644 seems to talk about auxiliary operators - if these are normally relatively few terms then the overhead of building the table for each may end up being more significant. And I guess the table being cached should save some memory too.

grossardt · 2023-12-14T23:54:47Z

In looking at the number op timing in your notebook, it looks like #644, in that the timing includes the operator build out. Which is not the case in the parity op one where the operator is built before the call to map() rather than being built as the operator parameter.

I do not know how closely things were looked at before. What might be interesting to know is if you put timing in the code around the pauli table buildout how long does that take in comparison to the overall mapping. I imagine that varies a lot though right since the table is built based on the operator width (register size) but the mapping depends on how many terms the operator has.

Yes, right. Based on the result my suspicion is that building the Pauli table is actually not the most expensive part, but it's a good idea to look at this explicitly. I will try to find some time to do that (but possibly only after Christmas).

#644 seems to talk about auxiliary operators - if these are normally relatively few terms then the overhead of building the table for each may end up being more significant. And I guess the table being cached should save some memory too.

That's also a good point. Generally I just took these two examples from the previous issues without much questioning, but I think it would be a good idea to put some thought into coming up with some realistic and meaningful benchmark in some sense.

That being said: as I wrote above, I think there is no harm done by doing the caching (on the level of the individual mapper's pauli_table method).

My guess is that the compose and simplify calls in mode_based_mapping are quite expensive. I see if I can time that as well. Possibly, improving the efficiency of simplify would do more for the efficiency of the mappers as well, as compared to caching the pauli_table.

The main difference regarding the caching is that sparse_pauli_operators is not cached any longer, which probably explains the somewhat better performance of the old version even when compared to the new version with caching. However, the composition of the creation/annihilation operators in sparse_pauli_operators seems to not have a large effect overall.

On a different but related note:

I was wondering what the point is of having these two separate methods, pauli_table and sparse_pauli_operators? As I understand it, mode based mapping is supposed to capture all those cases in which the single particle Fock space operators $a_j$, $a_j^\dagger$ are each mapped to some SparsePauliOp. It is obvious that these Pauli operators are then $(P_j \pm Q_j)/2$ where for the common mappers (JW, Parity, BK) $P_j$ and $Q_j$ are single Pauli string operators. I don't see, however, why ModeBasedMapper shouldn't be more general and accept any such mapping.

Now, pauli_table determines these single Pauli strings $P_j$ and $Q_j$, whereas sparse_pauli_operators takes over the combination to $a_j$ and $a_j^\dagger$, or $(P_j \pm Q_j)/2$, respectively. Does it really make sense to keep these two steps as separate methods? I can't think of use cases that would need $P_j$ and $Q_j$ as intermediate results in this process. If anything, it is a mere convenience to prevent repetition of these ~4-6 lines of code in the implementation for the child classes. On the other hand, combining the two methods into a single one that just returns the sparse_pauli_operators matching the creation/annihilation operators may allow for more efficient implementations that do not need to first build the operators $P_j$ and $Q_j$.

My personal sentiment is that I would expect pauli_table to return what sparse_pauli_operators returns, namely a list of Pauli operators corresponding to all the modes in the system, and it should for every mapper separately implement the most efficient way to obtain this list, with or without the detour via calculating $P_j$ and $Q_j$.

Neither pauli_table nor sparse_pauli_operators is currently called anywhere outside of mode_based_mapping (at least within qiskit-nature), so it would probably still be possible without too much harm to change this, but maybe there is a reason for the splitting that I am missing?

grossardt · 2024-01-12T22:44:18Z

Happy New Year everyone! To continue where we left off, I changed the timing of the number operator to not include building the operators. That doesn't change much, however, even for the total time. So building the operators seems to take negligible time compared to the mapper, but caching seems to not have much benefit.
In an attempt to come up with some more practical test, I also built versions of the Fermi-Hubbard Hamiltonian with creation/annihilation operators transformed by a random unitary. I built n operators of size n for n = 1...9, and the result again shows only marginal benefit of the caching. (But there is this marginal advantage, so I would leave the caching in as is.)

For me, the only open question is the "different note" above about whether the pauli_table and sparse_pauli_operators methods should be merged.

mrossinek

A few minor details, but overall this LGTM.

Regarding your question about pauli_table and sparse_pauli_operators: the separation is likely historical. I don't have a strong opinion on whether this is kept as is or not. However, changing it will require proper deprecation.

Maybe the approach suggested in #1340 could inform a potential new implementation.

mrossinek · 2024-02-23T10:39:59Z

qiskit_nature/second_q/mappers/logarithmic_mapper.py

@@ -143,7 +143,7 @@ def _logarithmic_encoding(
            op.chop()
            spin_op_encoding.append(op)

-        return tuple(spin_op_encoding)
+        return (spin_op_encoding[0], spin_op_encoding[1], spin_op_encoding[2], spin_op_encoding[3])


Not sure why this is necessary

I changed this because otherwise mypy fails due to this issue: python/mypy#7509

qiskit_nature/second_q/mappers/mode_based_mapper.py

releasenotes/notes/improve-mappers-b55cb0ca5fd656e4.yaml

woodsp-ibm · 2024-03-07T12:51:25Z

releasenotes/notes/improve-mappers-b55cb0ca5fd656e4.yaml

+  - |
+    The class :class:`.second_q.mappers.ModeBasedMapper` has been added to implement mode based
+    mapping via a Pauli table (previously part of :class:`.second_q.mappers.QubitMapper`).
+upgrade:


upgrade is supposed to be things an end user needs to do to upgrade (alter) their code for the release e.g. things have been removed and users code needs to be adjusted. How much is this really just change and part of the features statement - though honestly this is supposed to be more an end user facing statement around the changes where I am not sure how much, if anything, this internal change affects them or surfaces to them in any way. I imagine code they had before using the mappers works just as it did before.

I see Max's comment saying some methods have been removed - that is a case where the user would have to change things so that makes sense, so that would be upgrade, but that just seems to be the last paragraph there

That explanation makes sense. I have removed the point about changes in caching, and the inheritance which I believe has no end user facing consequences. The removal of public methods as pointed out by @mrossinek should obviously stay. Based on your reasoning, the info that pauli_table is now an instance method instead of a class method should stay, I guess, in case someone is calling it on a class in their code.

mrossinek

Thanks for your patience and hard work on this! You substantially improved the design of our mappers with this PR 👍

grossardt added 2 commits December 11, 2023 23:08

Implement ModeBasedMapper no caching

d23cb86

Added caching and release note

3e318ba

grossardt requested review from woodsp-ibm, mrossinek, robertodr, matteoacrossi and ftroisi as code owners December 12, 2023 21:30

grossardt added 4 commits December 12, 2023 22:31

Merge branch 'main' into fix-improve-mappers-issue1289

f38cd64

Update release note to pass spell check

d268913

fix typing

61042e0

fix typing for Python <3.10

1a4b2e0

Merge branch 'main' into fix-improve-mappers-issue1289

ecd55eb

grossardt mentioned this pull request Jan 28, 2024

Add Ternary Tree Mapper #1313

Open

Merge branch 'main' into fix-improve-mappers-issue1289

5c6f393

mrossinek mentioned this pull request Feb 23, 2024

Speed up fermionic mappers by means of an intermediate conversion to majorana operators #1340

Open

mrossinek previously approved these changes Feb 23, 2024

View reviewed changes

grossardt added 2 commits February 26, 2024 23:20

Merge branch 'main' into fix-improve-mappers-issue1289

7f9f7c3

suggestions by mrossinek plus changes needed to pass pylint

4477de7

grossardt dismissed mrossinek’s stale review via 4477de7 February 26, 2024 22:42

grossardt requested a review from mrossinek February 26, 2024 23:08

robertodr previously approved these changes Mar 7, 2024

View reviewed changes

woodsp-ibm reviewed Mar 7, 2024

View reviewed changes

Update improve-mappers-b55cb0ca5fd656e4.yaml

b1ab3c0

grossardt dismissed robertodr’s stale review via b1ab3c0 March 11, 2024 11:42

Merge branch 'main' into fix-improve-mappers-issue1289

39a2bb2

grossardt requested a review from woodsp-ibm March 11, 2024 11:59

grossardt requested a review from robertodr March 11, 2024 11:59

mrossinek approved these changes Apr 12, 2024

View reviewed changes

mrossinek added 2 commits April 17, 2024 10:55

Merge branch 'main' into fix-improve-mappers-issue1289

466e3d8

Merge branch 'main' into fix-improve-mappers-issue1289

1c68e63

mrossinek merged commit 295788f into qiskit-community:main Apr 17, 2024
16 checks passed

grossardt deleted the fix-improve-mappers-issue1289 branch April 19, 2024 12:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve mappers, add ModeBasedMapper #1301

Improve mappers, add ModeBasedMapper #1301

grossardt commented Dec 12, 2023

coveralls commented Dec 12, 2023 •

edited

Loading

woodsp-ibm commented Dec 13, 2023

grossardt commented Dec 14, 2023

grossardt commented Jan 12, 2024 •

edited

Loading

mrossinek left a comment

mrossinek Feb 23, 2024

grossardt Feb 26, 2024

woodsp-ibm Mar 7, 2024 •

edited

Loading

grossardt Mar 11, 2024

mrossinek left a comment

Improve mappers, add ModeBasedMapper #1301

Improve mappers, add ModeBasedMapper #1301

Conversation

grossardt commented Dec 12, 2023

Summary

Performance testing

coveralls commented Dec 12, 2023 • edited Loading

Pull Request Test Coverage Report for Build 8719469367

Details

💛 - Coveralls

woodsp-ibm commented Dec 13, 2023

grossardt commented Dec 14, 2023

On a different but related note:

grossardt commented Jan 12, 2024 • edited Loading

mrossinek left a comment

Choose a reason for hiding this comment

mrossinek Feb 23, 2024

Choose a reason for hiding this comment

grossardt Feb 26, 2024

Choose a reason for hiding this comment

woodsp-ibm Mar 7, 2024 • edited Loading

Choose a reason for hiding this comment

grossardt Mar 11, 2024

Choose a reason for hiding this comment

mrossinek left a comment

Choose a reason for hiding this comment

coveralls commented Dec 12, 2023 •

edited

Loading

grossardt commented Jan 12, 2024 •

edited

Loading

woodsp-ibm Mar 7, 2024 •

edited

Loading