Scoring model as a good practice #464

ArturoAmorQ · 2021-09-24T12:01:41Z

Closes #444

Though in almost all of the notebooks we score the presented model somehow, there are some exceptions:

logistic_regression.py (not really important as we have an intuition by eye but would be a nice-to-have, or least we can add a Warning message, I think)
linear_models_ex_05.py and its solution
trees_hyperparameters.py

On the same line of thought, ensemble_hyperparameters.py computes cv scores for parameter tuning but it doesn't pass the best parameters to a final test-set scoring.

Plotting a validation curve using the train set but no further scoring on the test-set (as in Solution for Exercise M6.03 and Solution for Exercise M6.04) is discussed in this forum post. The former is not addressed in this PR as the AdaBoost notebook will be fixed elsewhere, the latter was solved by computing asking the student to score the model with n_iter_no_change=5.

glemaitre

Otherwise looks good

python_scripts/ensemble_hyperparameters.py

python_scripts/linear_models_sol_05.py

python_scripts/logistic_regression.py

python_scripts/trees_hyperparameters.py

glemaitre

Otherwise LGTM

python_scripts/ensemble_hyperparameters.py

python_scripts/trees_hyperparameters.py

ArturoAmorQ · 2021-12-10T11:11:08Z

Comments addressed, thanks!

ogrisel

There are conflicts to solve and in general I think we should add an analysis of the new test scores.

python_scripts/ensemble_sol_04.py

python_scripts/ensemble_hyperparameters.py

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

…n-mooc into AddScore

ogrisel

I think this PR should be split because python_scripts/trees_hyperparameters.py needs more work as explained below while the rest should be good to go with the following small suggestion.

python_scripts/ensemble_hyperparameters.py

python_scripts/ensemble_sol_04.py

python_scripts/trees_hyperparameters.py

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org> Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> 2fc9dd3

ArturoAmorQ added 2 commits September 24, 2021 13:48

Scoring model as a good practice

a7071f4

add clarification on exercise

4de9a1c

glemaitre reviewed Nov 19, 2021

View reviewed changes

ArturoAmorQ added 3 commits November 26, 2021 14:53

Mention that the ranking is based on the average

71bc6df

Integrate the score in plot title

44883cc

Move scoring to leave plotting as single cell

bbd5107

ArturoAmorQ requested a review from glemaitre November 29, 2021 09:05

glemaitre approved these changes Dec 8, 2021

View reviewed changes

python_scripts/ensemble_hyperparameters.py Outdated Show resolved Hide resolved

python_scripts/trees_hyperparameters.py Outdated Show resolved Hide resolved

ArturoAmorQ added 4 commits December 10, 2021 11:58

Correct inexact proposition

1d4f6f4

Remove unnecesary show method

5f54a3c

Improve woring of conclusion paragraph

5dfb0e3

Correct text formating

1678980

ArturoAmorQ requested a review from glemaitre December 10, 2021 11:11

ogrisel reviewed Dec 10, 2021

View reviewed changes

python_scripts/ensemble_sol_04.py Outdated Show resolved Hide resolved

python_scripts/ensemble_sol_04.py Outdated Show resolved Hide resolved

python_scripts/ensemble_hyperparameters.py Outdated Show resolved Hide resolved

ArturoAmorQ and others added 6 commits December 10, 2021 20:41

Improve presicion of words

2caa6f9

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

Solve conflicts

8643e9f

Merge branch 'AddScore' of https://github.com/ArturoAmorQ/scikit-lear…

771c3ca

…n-mooc into AddScore

Remove deprecated plotting function

24a917c

Add conclusions to exercise notebook

d3c9844

Add interpretation to held-out score

d9e24e4

ogrisel reviewed Dec 14, 2021

View reviewed changes

ogrisel added 2 commits December 14, 2021 18:34

Apply suggestions from code review

8f38e74

Commit missing suggestion because it was hidden in resolved thread

1899313

glemaitre self-assigned this Jan 6, 2022

glemaitre added 2 commits January 6, 2022 09:33

Merge remote-tracking branch 'origin/master' into pr/ArturoAmorQ/464

977c66b

iter

cf60eb7

glemaitre mentioned this pull request Jan 6, 2022

WIP Use score in tree hyperparameter notebook #503

Open

fix

81c292c

glemaitre merged commit 2fc9dd3 into INRIA:master Jan 6, 2022

github-actions bot pushed a commit that referenced this pull request Jan 6, 2022

[ci skip] Scoring model as a good practice (#464)

196f7c9

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org> Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> 2fc9dd3

Scoring model as a good practice #464

Scoring model as a good practice #464

Uh oh!

Conversation

ArturoAmorQ commented Sep 24, 2021

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ArturoAmorQ commented Dec 10, 2021

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants