Get correct coefs for ridge regression #486

topepo · 2021-05-11T18:46:24Z

closes #431

Makes a special parsnip parameter that allows users to set the penalty value independent of the full regularization path. This can help with pure ridge models where glmnet may not produce the correct values.

juliasilge

So are we not protecting against folks still doing the wrong thing in the penalty = 0 case? They will need to see this in the documentation to know what to do?

R/multinom_reg.R

man/rmd/logistic-reg.Rmd

topepo · 2021-05-12T13:09:16Z

So are we not protecting against folks still doing the wrong thing in the penalty = 0 case? They will need to see this in the documentation to know what to do?

They will have to see the documentation to understand that.

We could fix the path values to a vector that would probably capture 90% of the cases (and document that). We can't effectively check the penalty value before setting a default; at parsnip predict-time, we don't know the range of penalties that will be used at predict-time.

The trade-off is:

👍 People won't silently get the wrong answers for ridge regression models.
👎 Model results might be slightly different than if we didn't set lambda; the default might cause poor results for penalties higher than our default.

Either is fine with me.

juliasilge · 2021-05-12T15:37:32Z

OK, I was fuzzy on our options before but I think I am clearer now.

I think we both agree that it is much more common for folks to want to use ridge regression than to end up needing very high penalty values. On the other hand, getting slightly different results than the underlying model is something that we know confuses people and that will happen basically for all cases of using glmnet ever if we set a path of lambdas.

This makes me think we shouldn't set a default. I am definitely a little worried that this is like a GOTCHA that people are not going to see; maybe in #456 we can think about how to highlight this kind of issue on, say, the glmnet landing page.

github-actions · 2021-05-27T00:57:26Z

This pull request has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.

topepo added 4 commits May 10, 2021 21:02

prototype of changes for #431

20d5ba4

further changes for #431

9ddb7ac

Merge branch 'master' into path-values

fb8d78b

check values first (and evaluate them)

1fb7988

topepo requested a review from juliasilge May 11, 2021 18:46

juliasilge added 3 commits May 11, 2021 15:36

Update NEWS

2353875

Clarify comment for FUTURE ME

a5fc49b

Edits for clarity in docs

40101e0

juliasilge reviewed May 11, 2021

View reviewed changes

R/multinom_reg.R Outdated Show resolved Hide resolved

juliasilge reviewed May 11, 2021

View reviewed changes

man/rmd/logistic-reg.Rmd Show resolved Hide resolved

juliasilge added 2 commits May 12, 2021 09:25

Use linear_reg translate for multinom_reg for now

4f04f77

Update NEWS

8ff0a28

topepo merged commit fc21c9e into master May 12, 2021

topepo deleted the path-values branch May 12, 2021 16:09

github-actions bot locked and limited conversation to collaborators May 27, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Get correct coefs for ridge regression #486

Get correct coefs for ridge regression #486

Uh oh!

topepo commented May 11, 2021

Uh oh!

juliasilge left a comment

Uh oh!

Uh oh!

Uh oh!

topepo commented May 12, 2021

Uh oh!

juliasilge commented May 12, 2021

Uh oh!

github-actions bot commented May 27, 2021

Uh oh!

Uh oh!

Get correct coefs for ridge regression #486

Get correct coefs for ridge regression #486

Uh oh!

Conversation

topepo commented May 11, 2021

Uh oh!

juliasilge left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

topepo commented May 12, 2021

Uh oh!

juliasilge commented May 12, 2021

Uh oh!

github-actions bot commented May 27, 2021

Uh oh!

Uh oh!