Skip to content

[DOCS] Add total feature importance to classification example #1382

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Sep 30, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
268 changes: 192 additions & 76 deletions docs/en/stack/ml/df-analytics/flightdata-classification.asciidoc

Large diffs are not rendered by default.

Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
13 changes: 10 additions & 3 deletions docs/en/stack/ml/df-analytics/ml-feature-importance.asciidoc
Original file line number Diff line number Diff line change
Expand Up @@ -18,11 +18,19 @@ in future iterations of your trained model.

You can see the average magnitude of the {feat-imp} values for each field across
all the training data in {kib} or by using the
{ref}/get-inference.html[get trained model API]. For example:
{ref}/get-inference.html[get trained model API]. For example, {kib} shows the
total feature importance for each field in {regression} or binary
{classanalysis} results as follows:

[role="screenshot"]
image::images/flights-regression-total-importance.png["Total {feat-imp} values for a {regression} {dfanalytics-job} in {kib}"]

If the {classanalysis} involves more than two classes, {kib} uses colors to show
how the impact of each field varies by class. For example:

[role="screenshot"]
image::images/diamonds-classification-total-importance.png["Total {feat-imp} values for a {classification} {dfanalytics-job} in {kib}"]

You can also examine the feature importance values for each individual
prediction. In {kib}, you can see these values in JSON objects or decision plots:

Expand All @@ -41,8 +49,7 @@ value is positive, it increases the prediction value.
By default, {feat-imp} values are not calculated. To generate this information,
when you create a {dfanalytics-job} you must specify the
`num_top_feature_importance_values` property. For example, see
<<flightdata-regression>>.
//and <<flightdata-classification>>.
<<flightdata-regression>> and <<flightdata-classification>>.

The {feat-imp} values are stored in the {ml} results field for each document in
the destination index. The number of {feat-imp} values for each document might
Expand Down