Multivariate r2 #1310
Conversation
This PR is incomplete, though. It is really just an idea proposal, so please don't merge before we work out the details.
Thanks a lot @harahu for this pull request. Multiple things come to mind:
Thank you for your thoughts on this @gabrieldemarmiesse! I definitely agree that there should be a high threshold for breaking with the scikit-learn behaviour. I did ask a question regarding this on Cross Validated, hoping to get some feedback. I really am no expert myself.
Did not get much response on Cross Validated, but I did discuss this with my colleague @ttvand. The only generalization of R2 we felt comfortable claiming made sense was the one where you calculate individual scores per label element. This corresponds with the multioutput="raw_values" behaviour in scikit-learn. Any further feedback on this, @gabrieldemarmiesse?
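For concreteness, here is a minimal NumPy sketch of what "individual scores per label element" means; it is illustrative only (not the code in this PR) and assumes targets of shape (n_samples, n_outputs):

```python
import numpy as np

def r2_per_output(y_true, y_pred):
    """R2 computed independently for each output column (illustrative sketch)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2, axis=0)                # residual sum of squares per column
    ss_tot = np.sum((y_true - y_true.mean(axis=0)) ** 2, axis=0)   # total sum of squares per column
    return 1.0 - ss_res / ss_tot                                   # one R2 score per output

y_true = np.array([[0.5, 1.0], [1.0, 2.0], [2.0, 3.0]])
y_pred = np.array([[0.4, 1.1], [1.1, 1.9], [2.1, 3.2]])
print(r2_per_output(y_true, y_pred))  # vector of per-output R2 values
```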
I believe you made the right call here. We don't need to think about it too much if we just copy what scikit-learn does and make tests to ensure the results are the same for the three multioutput values. Users will pick whatever they want from what is available. I think that's perfect, and it also has the nice property that we (I) don't need to check the math to merge, since scikit-learn does it for us (the review is going to be much faster). Give me some time to review the technical side of things. Thanks again for all your work!
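A parity test along these lines could look roughly like the sketch below. The reference values come from scikit-learn's r2_score; the call to the metric under review is left as a hypothetical placeholder, since the final class name and constructor arguments are exactly what is being reviewed:

```python
import numpy as np
from sklearn.metrics import r2_score

y_true = np.array([[0.5, 1.0], [1.0, 2.0], [2.0, 3.0]])
y_pred = np.array([[0.4, 1.1], [1.1, 1.9], [2.1, 3.2]])

for mode in ("raw_values", "uniform_average", "variance_weighted"):
    expected = r2_score(y_true, y_pred, multioutput=mode)
    # actual = metric_under_review(y_true, y_pred, multioutput=mode)  # hypothetical call
    # np.testing.assert_allclose(actual, expected, rtol=1e-6)
    print(mode, expected)
```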
Everything is good. Just some style remarks to address to try to improve readability, and we're good to merge this!
Perfect, thanks a lot!
* Proposal for multivariate r2
* More flavours of multioutput
* Improve an unfinished exception message
* Simplifications to increase readability
* Pass some arguments that should be passed
I noticed that the RSquared metric does not behave in the same way as the scikit-learn implementation of the same metric for multivariate regression (in scikit-learn known as "multioutput"). First I thought I should mend it to align with the scikit-learn implementation that calculates separate scores for each label element and aggregates the scores. Then I started asking myself whether there might be other ways of interpreting R2 in higher dimensions. Looking at the definition presented in https://en.wikipedia.org/wiki/Coefficient_of_determination, it does look like it might make sense to extend it to higher-rank data types, but this can be done in different ways, and I am unsure what would make sense mathematically. The scikit-learn approach at least makes some sense to me, as it just treats the model as a model for n separate dependent variables at the same time.

I would really like a discussion on how R2 should be understood in higher dimensions, but if we are to go with the scikit-learn interpretation, this PR is one way to go about it. It is slightly annoying that you have to specify the shape of y when initializing the metric object. This could be solved by dynamically setting the shape like it is done in the MeanTensor metric. Although more user friendly, I don't think the code is as clean, so there is a trade-off.

I modified a test to show how this acts for higher-dimensional data, currently matching the multioutput="raw_values" flavour of the scikit-learn implementation.
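As a rough illustration of the dynamic-shape alternative mentioned above (creating the per-output accumulators lazily on the first update, in the spirit of MeanTensor, instead of taking the shape of y in the constructor), a hypothetical Keras metric could look like the sketch below. The class name and weight names are made up and this is not the code in this PR:

```python
import tensorflow as tf

class LazyShapeR2(tf.keras.metrics.Metric):
    """Hypothetical sketch: per-output R2 with lazily created accumulators."""

    def __init__(self, name="lazy_r2", **kwargs):
        super().__init__(name=name, **kwargs)
        self._built = False

    def _build(self, n_outputs):
        # Create one accumulator slot per output column on first use.
        self.sum = self.add_weight(name="sum", shape=(n_outputs,), initializer="zeros")
        self.squared_sum = self.add_weight(name="squared_sum", shape=(n_outputs,), initializer="zeros")
        self.res = self.add_weight(name="residual", shape=(n_outputs,), initializer="zeros")
        self.count = self.add_weight(name="count", initializer="zeros")
        self._built = True

    def update_state(self, y_true, y_pred, sample_weight=None):
        y_true = tf.cast(y_true, self.dtype)
        y_pred = tf.cast(y_pred, self.dtype)
        if not self._built:
            self._build(y_true.shape[-1])
        self.sum.assign_add(tf.reduce_sum(y_true, axis=0))
        self.squared_sum.assign_add(tf.reduce_sum(tf.square(y_true), axis=0))
        self.res.assign_add(tf.reduce_sum(tf.square(y_true - y_pred), axis=0))
        self.count.assign_add(tf.cast(tf.shape(y_true)[0], self.dtype))

    def result(self):
        mean = self.sum / self.count
        total = self.squared_sum - self.count * tf.square(mean)  # sum of (y - mean)^2 per output
        return 1.0 - self.res / total  # per-output R2, i.e. the "raw_values" flavour
```

The trade-off discussed above is visible here: no shape argument is needed up front, but the variables are created inside update_state, which is less clean than building everything in the constructor.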