-
Notifications
You must be signed in to change notification settings - Fork 584
Scoring model as a good practice #464
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
glemaitre
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Otherwise looks good
glemaitre
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Otherwise LGTM
|
Comments addressed, thanks! |
ogrisel
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are conflicts to solve and in general I think we should add an analysis of the new test scores.
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
ogrisel
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this PR should be split because python_scripts/trees_hyperparameters.py needs more work as explained below while the rest should be good to go with the following small suggestion.
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org> Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> 2fc9dd3
Closes #444
Though in almost all of the notebooks we score the presented model somehow, there are some exceptions:
On the same line of thought, ensemble_hyperparameters.py computes cv scores for parameter tuning but it doesn't pass the best parameters to a final test-set scoring.
Plotting a validation curve using the train set but no further scoring on the test-set (as in Solution for Exercise M6.03 and Solution for Exercise M6.04) is discussed in this forum post. The former is not addressed in this PR as the AdaBoost notebook will be fixed elsewhere, the latter was solved by computing asking the student to score the model with
n_iter_no_change=5.