-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DOC improve introduction to undersampling methods #1018
DOC improve introduction to undersampling methods #1018
Conversation
@glemaitre ready for review |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just 2 nitpics, otherwise LGTM
doc/under_sampling.rst
Outdated
You can refer to | ||
:ref:`sphx_glr_auto_examples_under-sampling_plot_comparison_under_sampling.py`. | ||
One way of handling imbalanced datasets is to reduce the number of observations from | ||
the majority class or classes. The most well known algorithm in this group is random |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the majority class or classes
I think it is a bit weird to say majority classes since usually we would refer to a single class. What about saying something like:
"reduce the number of observations from all classes but the minority class (i.e. the one with the least number of observations)."
doc/under_sampling.rst
Outdated
We will discuss the different algorithms throughout this document. | ||
|
||
Refer to :ref:`sphx_glr_auto_examples_under-sampling_plot_comparison_under_sampling.py` | ||
for a comparison of the different undersampling methodologies. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think that we this sentence because sphinx will use the title which is already stating "Compare under-sampling samplers".
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure I understand this suggestion, but let me try a fix
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OK, I got it.
done |
related to #853
As per request, splitting into smaller PRs