Skip to content

[DOCS] Add total feature importance to classification example #1382

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Sep 30, 2020

Conversation

lcawl
Copy link
Contributor

@lcawl lcawl commented Sep 29, 2020

Related to elastic/kibana#78238 and elastic/ml-cpp#1387 and #1378 and #1379

This PR adds total feature importance values to the classification example (https://www.elastic.co/guide/en/machine-learning/master/flightdata-classification.html). It also adds an example of a classification job with more than two classes in the feature importance overview (https://www.elastic.co/guide/en/machine-learning/master/ml-feature-importance.html).

It also refreshes some screenshots in the example that were out-dated.

Preview

@lcawl lcawl changed the title [DOCS] Classification total feature importance [DOCS] Add total feature importance to classification example Sep 30, 2020
@lcawl lcawl requested review from valeriy42 and qn895 September 30, 2020 01:27
@lcawl lcawl marked this pull request as ready for review September 30, 2020 01:27
Copy link
Contributor

@valeriy42 valeriy42 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Comment on lines +407 to +414
If you chose to calculate {feat-imp}, the destination index also contains
`ml.feature_importance` objects. Every field that is included in the
{classanalysis} (known as a _feature_ of the data point) is assigned a {feat-imp}
value. This value has both a magnitude and a direction (positive or negative),
which indicates how each field affects a particular prediction. Only the most
significant values (in this case, the top 10) are stored in the index. However,
the trained model metadata also contains the average magnitude of the {feat-imp}
values for each field across all the training data. You can view this
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good description! 👍

@lcawl lcawl merged commit fca79db into elastic:master Sep 30, 2020
@lcawl lcawl deleted the classification-total-feature-importance branch September 30, 2020 14:22
lcawl added a commit to lcawl/stack-docs that referenced this pull request Sep 30, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants