
Fix pytorch gradients #1450

Merged: 50 commits into master from fix_autodiff, Oct 10, 2024
Conversation

@Simone-Bordoni (Contributor) commented Sep 16, 2024

Fix #1449

Checklist:

  • Reviewers confirm new code works as expected.
  • Tests are passing.
  • Coverage does not decrease.
  • Documentation is updated.

@Simone-Bordoni Simone-Bordoni added the bug Something isn't working label Sep 16, 2024
@Simone-Bordoni Simone-Bordoni self-assigned this Sep 16, 2024
@renatomello renatomello added this to the Qibo 0.2.13 milestone Sep 17, 2024
@BrunoLiegiBastonLiegi (Contributor) commented Sep 17, 2024

I found a potential problem when you cast the parameters of parametrized gates:

return self.np.tensor(x, dtype=self.dtype, requires_grad=self.requires_grad)

here you cast to self.dtype, which is usually torch.complex128, but it should actually be torch.float32/64 since this is a parameter. I don't know whether this is the origin of the problem with the gradients, though.

EDIT: never mind, you necessarily have to cast to complex, otherwise pytorch complains about mismatching dtypes when you build the matrix elements...
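As a quick standalone sanity check of the point above (plain torch, not qibo code): casting a real parameter to complex128 does not by itself break autograd, so the gradient still reaches the original float parameter.

import torch

theta = torch.tensor(0.5, dtype=torch.float64, requires_grad=True)
theta_c = theta.to(torch.complex128)   # differentiable dtype cast
phase = torch.exp(-0.5j * theta_c)     # a typical complex matrix element, exp(-i*theta/2)
loss = phase.real                      # cos(theta/2), a real-valued loss
loss.backward()
print(theta.grad)                      # ≈ -0.5 * sin(0.25) ≈ -0.124, i.e. non-zero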

@Simone-Bordoni (Contributor, Author) commented Sep 17, 2024

I found a potential problem when you cast the parameters of parametrized gates:

return self.np.tensor(x, dtype=self.dtype, requires_grad=self.requires_grad)

here you cast to self.dtype, which is usually torch.complex128, but it should actually be torch.float32/64 since this is a parameter. I don't know whether this is the origin of the problem with the gradients, though.
EDIT: never mind, you necessarily have to cast to complex, otherwise pytorch complains about mismatching dtypes when you build the matrix elements...

Actually, this may be part of the problem. I am rewriting the whole casting function in a better way, differentiating the matrix_dtype from the parameters_dtype.

@Simone-Bordoni (Contributor, Author) commented

Now the gradients are passing; however, their final value is always zero. I have tried visualizing the computational graph (shown below) and all the operations seem to be performed correctly.
I am currently testing a simple circuit with just one rotation gate (test_torch_gradients.py).
Over the next few days I will not be able to work further on this issue; I will resume working on the gradients next week. In the meantime, if you find any possible reason for this problem, let me know, it would be very helpful.
[image: computational graph of the loss]

@Simone-Bordoni (Contributor, Author) commented

I found out the problem was with the target state. Now the gradients are passing correctly;
as soon as possible I will clean up the code and it should be ready for a review.

@Simone-Bordoni Simone-Bordoni marked this pull request as ready for review September 24, 2024 13:47
@Simone-Bordoni (Contributor, Author) commented

I think that now everything has been fixed.
The changes required were bigger than expected: I had to add the backend in the gate decompositions and in other parts.
I have added a test to check the correct backpropagation in test_torch_gradients.py, so that it can be easily moved to qiboml.
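For reference, the kind of check being described can be sketched in plain torch (illustrative only; the actual test in test_torch_gradients.py goes through the qibo pytorch backend): train a single rotation angle so the circuit output matches a target state, and verify that the gradient is non-zero and the loss converges.

import torch

target = torch.tensor([0.0, 1.0], dtype=torch.complex128)          # target state |1>
theta = torch.tensor(0.1, dtype=torch.float64, requires_grad=True)
optimizer = torch.optim.SGD([theta], lr=0.5)

for _ in range(100):
    optimizer.zero_grad()
    t = theta.to(torch.complex128)                                  # differentiable cast
    cos, sin = torch.cos(t / 2), torch.sin(t / 2)
    rx = torch.stack([cos, -1j * sin, -1j * sin, cos]).reshape(2, 2)     # RX(theta) matrix
    state = rx @ torch.tensor([1.0, 0.0], dtype=torch.complex128)        # act on |0>
    loss = 1.0 - torch.abs(torch.vdot(target, state)) ** 2               # infidelity
    loss.backward()
    optimizer.step()

assert theta.grad is not None and loss.item() < 1e-3   # gradients flow and the loss converges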

codecov bot commented Sep 24, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.93%. Comparing base (36ed3fe) to head (3f22d8d).
Report is 51 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1450      +/-   ##
==========================================
- Coverage   99.93%   99.93%   -0.01%     
==========================================
  Files          81       81              
  Lines       11820    11790      -30     
==========================================
- Hits        11812    11782      -30     
  Misses          8        8              
Flag Coverage Δ
unittests 99.93% <100.00%> (-0.01%) ⬇️


@BrunoLiegiBastonLiegi (Contributor) left a comment

Thanks @Simone-Bordoni, looks good. The only two things that I am not a fan of are:

  • The new requires_grad argument added to cast. I would remove that and, when possible, move the functions that need it inside the backend as methods (like the _curve_fit case discussed above). For the cases where that is not possible, I would create a specific backend.cast_parameter():
# numpy
def cast_parameter(self, x, **kwargs):
    return self.cast(x, **kwargs)
# torch
def cast_parameter(self, x, **kwargs):
    return self.cast(x, **kwargs).requires_grad_()
  • The fact that the decomposition needs the backend now. Ideally, I don't think this should be the case, but I don't know whether it is possible to avoid in practice.
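To make the proposal concrete, here is a minimal self-contained sketch with stand-in backend classes (these are not qibo's actual classes, just an illustration of the behavioural difference the suggestion would introduce):

import numpy as np
import torch

class NumpyBackendSketch:
    def cast(self, x, dtype=np.float64):
        return np.asarray(x, dtype=dtype)
    def cast_parameter(self, x, **kwargs):
        return self.cast(x, **kwargs)                     # no autograd on numpy

class PyTorchBackendSketch:
    def cast(self, x, dtype=torch.float64):
        return torch.as_tensor(x, dtype=dtype)
    def cast_parameter(self, x, **kwargs):
        return self.cast(x, **kwargs).requires_grad_()    # leaf tensor tracked by autograd

theta_np = NumpyBackendSketch().cast_parameter(0.3)
theta_pt = PyTorchBackendSketch().cast_parameter(0.3)
assert theta_pt.requires_grad and not hasattr(theta_np, "requires_grad")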

Resolved review threads on: tests/test_backends_torch_gradients.py (4), tests/test_gates_gates.py (2), tests/test_models_hep.py, tests/test_quantum_info_entanglement.py
@Simone-Bordoni (Contributor, Author) commented

Thanks @Simone-Bordoni, looks good. The only two things that I am not a fan of are:

  • The new requires_grad argument added to cast. I would remove that and, when possible, move the functions that need it inside the backend as methods (like the _curve_fit case discussed above). For the cases where that is not possible, I would create a specific backend.cast_parameter():
# numpy
def cast_parameter(self, x, **kwargs):
    return self.cast(x, **kwargs)
# torch
def cast_parameter(self, x, **kwargs):
    return self.cast(x, **kwargs).requires_grad_()
  • The fact that the decomposition needs the backend now. Ideally, I don't think this should be the case, but I don't know whether it is possible to avoid in practice.

Regarding the decomposition: as it is now, you need the backend.
Regarding cast_parameter, I can add it to the other backends; however, the correction you are suggesting is not possible, as cast_parameter differs a lot from the cast function:

  • cast_parameter tries to convert integers to floats in torch;
  • the default dtype for cast_parameter is torch.float64 (self.parameter_dtype), as opposed to cast, whose default is torch.complex128 (self.dtype) and is used on matrices;
  • cast performs different operations on torch to match the tensorflow behaviour, for example when handling lists of arrays.

So I agree on adding a cast_parameter function to all backends, but it should not be tied to the standard cast().
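A compressed sketch of that distinction (class name and default dtypes here are illustrative, not the actual qibo backend implementation):

import torch

class PyTorchCastingSketch:
    dtype = torch.complex128            # default for matrices/states, used by cast
    parameter_dtype = torch.float64     # default for gate parameters, used by cast_parameter

    def cast(self, x, dtype=None):
        return torch.as_tensor(x, dtype=dtype or self.dtype)

    def cast_parameter(self, x, dtype=None):
        if isinstance(x, int):          # integers are promoted to float so they can carry gradients
            x = float(x)
        return torch.as_tensor(x, dtype=dtype or self.parameter_dtype).requires_grad_()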


@Simone-Bordoni (Contributor, Author) commented

Actually, cast_parameter() was made to cast a single parameter for matrix_parametrized().
An idea could be to make it a helper method in the torch backend and to create a new public method like cast_parameters() to be used in these situations.
Anyway, I don't know whether @renatomello agrees on adding this new function to the numpy backend to avoid different backend behaviours.
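Purely as an illustration of that idea (the names below are hypothetical, not the actual qibo API): a private single-parameter helper plus a public plural method.

import torch

class PyTorchBackendSketch:
    parameter_dtype = torch.float64

    def _cast_parameter(self, x):
        # private helper: cast one gate parameter (the matrix_parametrized use case)
        return torch.as_tensor(x, dtype=self.parameter_dtype).requires_grad_()

    def cast_parameters(self, parameters):
        # public entry point: cast an iterable of gate parameters
        return [self._cast_parameter(p) for p in parameters]

params = PyTorchBackendSketch().cast_parameters([0.1, 0.2, 0.3])
assert all(p.requires_grad for p in params)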

@renatomello renatomello changed the title Fix pytorch gradients Fix pytorch gradients Oct 7, 2024
@BrunoLiegiBastonLiegi BrunoLiegiBastonLiegi added this pull request to the merge queue Oct 10, 2024
Merged via the queue into master with commit d290601 Oct 10, 2024
27 checks passed
@renatomello renatomello deleted the fix_autodiff branch October 17, 2024 05:29
Labels: bug (Something isn't working)
Projects: none yet
3 participants