22931: Feature Influence Rename (#99)

sdconrox · jackx111 · web-flow · commit 03e399216b77 · 2025-02-26T14:33:33.000-05:00
Refactored all mentions of renamed functions from [FeatureInfluenceRenames.xls](https://diveplane-my.sharepoint.com/:x:/g/personal/cmack_howso_com/EV9BqmEDGolAhoqPkKLIld0BMIyVFwH8EDo2pt8-78XqHw?e=rroB5A) --------- Co-authored-by: jack-xia-dp <144161208+jackx111@users.noreply.github.com>
diff --git a/source/getting_started/concepts.rst b/source/getting_started/concepts.rst
@@ -24,7 +24,7 @@ advances enable all of Engine's insight and analysis capabilities, including sta
 
 - Outperform other commonly used feature importance metrics, including SHAP.
 
-  Howso quantifies individual feature contributions to a prediction, i.e., how much an individual feature impacts a prediction. The concept of feature contribution is similar to the data science concept of "feature importance". However,
+  Howso quantifies individual prediction contributions to a prediction, i.e., how much an individual feature impacts a prediction. The concept of prediction contributions is similar to the data science concept of "feature importance". However,
   Howso is robust against several common challenges (correlated features, redundant features, difference in scale between features, and multiple distinguishing features) faced by other feature importance tools,
   including the SHAP metric, which often lead to misleading results.
 
diff --git a/source/getting_started/intro.rst b/source/getting_started/intro.rst
@@ -69,7 +69,7 @@ Howso values gracious intellectual honesty. In that spirit, we're telling you up
 - Very large datasets
 
   Handling very large datasets with subtle signals (e.g., datasets requiring tens of millions of records and/or thousands of features to capture the complex relationships within the data)
-  currently requires manual work from engineering, data science, and subject matter expert teams. However, currently available Howso tools, including ablation and non-robust feature contribution calculations,
+  currently requires manual work from engineering, data science, and subject matter expert teams. However, currently available Howso tools, including ablation and non-robust prediction contribution calculations,
   can be used to help identify subsamples of large datasets that
   contain enough signal to be used for data science analysis.
 
diff --git a/source/getting_started/terminology.rst b/source/getting_started/terminology.rst
@@ -98,40 +98,39 @@ The mean absolute error between a predicted value and actual value for a predict
 uncertainty. Residuals may be for a given prediction, and expected Residuals may be for a given feature, either
 globally across the entire model or for a particular prediction.
 
-.. _contribution:
+.. _pc:
 
-Contribution
-------------
+Prediction Contributions (PC)
+-----------------------------
 
-Feature contribution is the difference between a prediction in an action feature when each feature or case is
-considered versus not considered. Case contribution is the same but for a case rather than a feature. When applied in
+Prediction contributions is the measured difference between a prediction in an action feature when each feature (Feature Prediction Contributions)
+or case (Case Prediction Contributions) is considered versus not considered. When Feature Prediction Contributions is applied in
 a robust fashion, this is an approximation of the commonly used SHAP feature importance measure. The difference being
 that SHAP is an exact value of a model (which itself is just an approximation of the data) whereas robust contribution is an
 approximation of the feature importance of the relationships expressed in the data.
 
-.. _mda:
+.. _ac:
 
-MDA
----
-
-The *Mean Decrease in Accuracy* (MDA) of an Action Feature is mean decrease in accuracy of removing a feature.  MDA units are on the same scale as the Action feature(s), and will be probabilities for categorical features.
+Accuracy Contributions (AC)
+---------------------------
+Accuracy contributions is the accuracy difference in an action feature when each feature (Feature Accuracy Contributions)
+or case (Case Accuracy Contributions) is considered versus not considered.
 
 .. _robust:
 
 Robust
 ------
 
-A feature or case contribution or MDA that is robust means that it is computed over the power set of possible
-combinations of features or cases, as approximated by a uniform distribution. For feature contributions, robust means
+A feature or case contribution that is robust means that it is computed over the power set of possible
+combinations of features or cases, as approximated by a uniform distribution. For prediction contributions, robust means
 it is an approximation to the well-known SHAP values.
 
 .. _relavant_features:
 
 Relevant Features
 -----------------
 
-Features whose values were important in determining prediction value(s). Generally, this refers to feature MDA or
-contribution, which yield similar but complementary insights.
+Features whose values were important in determining prediction value(s). Generally, this refers to prediction or accuracy contributions, which yield similar but complementary insights.
 
 .. _contexts:
 
@@ -313,8 +312,8 @@ Influential Cases
 
 The cases which were identified as most influential during a prediction, along with their weights when predicting the
 expected value or drawing a value from the distribution of expected values for generative outputs. The influential
-cases are a subset of the :ref:`most_similar_cases`, returning only those cases whose cumulative influence weights added in 
-descending order is below the influential weight threshold. 
+cases are a subset of the :ref:`most_similar_cases`, returning only those cases whose cumulative influence weights added in
+descending order is below the influential weight threshold.
 
 .. _boundary_cases:
 
diff --git a/source/user_guide/advanced_capabilities/case_importance.rst b/source/user_guide/advanced_capabilities/case_importance.rst
@@ -29,9 +29,8 @@ Concepts & Terminology
 
 How-To Guide
 ------------
-Case importance is similar to feature importance in that it comprises of two metrics, case mean decrease in accuracy (MDA) and case contribution.
-As opposed to influential and similar cases which examines the influence of cases on a single case or prediction, case importance examines how important a case is in regards to the overall predictions on a group of cases. Case importance share the same underlying methodology with  :doc:`Feature Importance <feature_importance>`.
-Unlike feature contributions, case contributions are calculated just locally. Conceptually, local metrics use either a specific subset of the cases that are trained into the Trainee or a set of new cases.
+Case importance is similar to feature importance in that it comprises of two metrics, Accuracy Contributions for Case and Prediction Contributions for Case.
+Unlike global feature importance metrics, case contributions are calculated just locally. Conceptually, local metrics use either a specific subset of the cases that are trained into the Trainee or a set of new cases.
 
 Setup
 ^^^^^
@@ -41,19 +40,19 @@ The :class:`~Trainee` will be referenced as ``trainee`` in the sections below.
 Case Contributions
 ^^^^^^^^^^^^^^^^^^
 
-Case contributions can be retrieved by setting ``case_contributions_robust`` or ``case_contributions_full`` to ``True``.
+Case contributions can be retrieved by setting ``case_robust_prediction_contributions`` or ``case_full_prediction_contributions`` to ``True``.
 
 .. code-block:: python
 
-    details = {'case_contributions_robust': True}
+    details = {'case_robust_prediction_contributions': True}
 
-Case MDA
-^^^^^^^^
-Case MDA can be retrieved by setting ``case_mda_robust`` or ``case_mda_full`` to ``True``.
+Case Accuracy Contributions
+^^^^^^^^^^^^^^^^^^^^^^^^^^^
+Case Accuracy Contributions can be retrieved by setting ``case_robust_accuracy_contributions`` or ``case_full_accuracy_contributions`` to ``True``.
 
 .. code-block:: python
 
-    details = {'case_mda_robust': True}
+    details = {'case_robust_accuracy_contributions': True}
 
 
 React
@@ -75,8 +74,8 @@ The results can be retrieved in the ``details`` section of the results.
 
 .. code-block:: python
 
-    case_contributions = pd.DataFrame(results['details']['case_contributions'][0])
-    case_mda = pd.DataFrame(results['details']['case_mda'][0])
+    case_prediction_contributions = pd.DataFrame(results['details']['prediction_contributions'][0])
+    case_accuracy_contributions = pd.DataFrame(results['details']['accuracy_contributions'][0])
 
 
 Complete Code
@@ -112,7 +111,7 @@ The code from all of the steps in this guide is combined below:
 
     trainee.analyze(context_features=context_features, action_features=action_features)
 
-    details = {'case_contributions_robust': True}
+    details = {'case_robust_prediction_contributions': True}
 
     results = trainee.react(
         test_case[context_features],
@@ -121,7 +120,7 @@ The code from all of the steps in this guide is combined below:
         details=details
     )
 
-    case_contributions = pd.DataFrame(results['details']['case_contributions_robust'][0])
+    case_contributions = pd.DataFrame(results['details']['case_robust_prediction_contributions'][0])
 
 API References
 --------------
diff --git a/source/user_guide/advanced_capabilities/feature_importance.rst b/source/user_guide/advanced_capabilities/feature_importance.rst
@@ -5,13 +5,13 @@ Feature Importance
 ==================
 .. topic:: What is covered in this user guide
 
-    In this guide, you will learn how to compute the feature importance metrics, :ref:`Feature Contributions <contribution>` and :ref:`Feature Mean Decrease in Accuracy (MDA) <mda>` from a Trainee. Feature importance metrics
+    In this guide, you will learn how to compute the feature importance metrics, :ref:`Prediction Contributions (PC) <pc>` and :ref:`Accuracy Contributions (AC) <ac>` from a Trainee. Feature importance metrics
     provides information about which features are useful for predicting a target or :ref:`action <action_features>` feature. In addition to learning informative metrics about the data and the model, these insights can be used as guidance for further action such as feature selection or feature engineering.
 
 
 Objectives: what you will take away
 -----------------------------------
-- **How-To** Retrieve the different types of feature importance metrics across several different categories: :doc:`global vs local <../concepts/global_vs_local>`, and :ref:`robust` vs non-robust (full) :ref:`Feature Contributions <contribution>` and :ref:`Feature MDA <mda>`.
+- **How-To** Retrieve the different types of feature importance metrics across several different categories: :doc:`global vs local <../concepts/global_vs_local>`, and :ref:`robust` vs non-robust (full) :ref:`Prediction Contributions <pc>` and :ref:`Accuracy Contributions <ac>`.
 
 
 Prerequisites: before you begin
@@ -33,9 +33,9 @@ recommend being familiar with the following concepts:
 - :ref:`residual`
 - :ref:`robust`
 - :ref:`contribution`
-- :ref:`mda`
+- :ref:`ac`
 
-The two metrics available for feature importance is feature :ref:`contribution` and feature :ref:`mda`.
+The two metrics available for feature importance is feature :ref:`contribution` and feature :ref:`ac`.
 
 Robust vs Non-Robust (Full)
 ^^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -52,34 +52,34 @@ The created :class:`~Trainee` will be referenced as ``trainee`` in the sections
 Global Feature Importance
 ^^^^^^^^^^^^^^^^^^^^^^^^^
 To get global feature importance metrics, :py:meth:`Trainee.react_aggregate`, is called on a trained and analyzed Trainee. :py:meth:`Trainee.react_aggregate` calls react internally on the cases already trained into the Trainee and calculates the metrics. In this method, the desired metrics can be selected as parameters. These parameters are named individually
-in the ``details`` parameter and setting them to ``True`` will calculate and return the desired metrics. For example, ``feature_mda_robust`` and ``feature_contributions_robust`` will calculate the robust versions of MDA and Feature Contributions, while ``feature_mda_full`` and ``feature_contributions_full`` will calculate the non-robust (full) versions.
-An action feature must be specified. ``feature_influences_action_feature`` is recommended for feature influence metrics such as feature contributions and mda, especially when used in conjunction with retrieving prediction stats, however, ``action_feature`` can be also used as well. ``action_feature`` sets the action feature for both influence metrics and prediction stats. Since often
+in the ``details`` parameter and setting them to ``True`` will calculate and return the desired metrics. For example, ``feature_robust_accuracy_contributions`` and ``feature_robust_prediction_contributions`` will calculate the robust versions of Accuracy Contributions and Prediction Contributions, while ``feature_full_accuracy_contributions`` and ``feature_full_prediction_contributions`` will calculate the non-robust (full) versions.
+An action feature must be specified. ``feature_influences_action_feature`` is recommended for feature influence metrics such as prediction contributions and accuracy contributions, especially when used in conjunction with retrieving prediction stats, however, ``action_feature`` can be also used as well. ``action_feature`` sets the action feature for both influence metrics and prediction stats. Since often
 only the influence metrics's action feature is intended to be set, ``feature_influences_action_feature`` provides a more precise parameter.
 
 .. code-block:: python
 
-    feature_contributions_robust = trainee.react_aggregate(
+    feature_robust_prediction_contributions = trainee.react_aggregate(
         context_features=context_features,
         feature_influences_action_feature=action_features[0],
-        details={'feature_contributions_robust' : True}
+        details={'feature_robust_prediction_contributions' : True}
     )
 
-    feature_mda_robust = trainee.react_aggregate(
+    feature_robust_accuracy_contributions = trainee.react_aggregate(
         context_features=context_features,
         feature_influences_action_feature=action_features[0],
-        details={'feature_mda_robust': True}
+        details={'feature_robust_accuracy_contributions': True}
     )
 
 Local Feature Importance
 ^^^^^^^^^^^^^^^^^^^^^^^^
-To get local feature importance metrics, :py:meth:`Trainee.react`, is first called on a trained and analyzed Trainee. In this method, the desired metrics, ``feature_contributions_robust`` and ``feature_mda_robust``, can be selected as inputs to the ``details`` parameters as key value pairs from a dictionary. These parameters are named individually
+To get local feature importance metrics, :py:meth:`Trainee.react`, is first called on a trained and analyzed Trainee. In this method, the desired metrics, ``feature_robust_prediction_contributions`` and ``feature_robust_accuracy_contributions``, can be selected as inputs to the ``details`` parameters as key value pairs from a dictionary. These parameters are named individually
 and setting them to ``True`` will calculate the desired metrics. Robust calculations are performed by default.
 
 .. code-block:: python
 
     details = {
-        'feature_contributions_robust':True,
-        'feature_mda_robust':True,
+        'feature_robust_prediction_contributions':True,
+        'feature_robust_accuracy_contributions':True,
     }
 
     results = trainee.react(
@@ -94,31 +94,14 @@ are calculated in :py:meth:`Trainee.react` from the previous step.
 
 .. code-block:: python
 
-    feature_contributions_robust = results['explanation']['feature_contributions_robust']
-    feature_mda_robust = results['explanation']['feature_mda_robust']
+    feature_robust_prediction_contributions = results['details']['feature_robust_prediction_contributions']
+    feature_robust_accuracy_contributions = results['details']['feature_robust_accuracy_contributions']
 
 
 .. warning::
 
-    Contributions and MDA are also metrics for cases and not just features, so please be aware when reading other guides that may use those terms.
+    Accuracy and Prediction Contributions are also metrics for cases and not just features, so please be aware when reading other guides that may use those terms.
 
-Contribution and MDA matrices
-^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
-
-Howso also provides the two metrics in a matrix view, where for each row which represent the action feature, you can identify the contributions of all
-the other context features to that prediction. Since these matrices may not be symmetrical, examining the differences between the upper and lower triangular matrices
-may reveal additional insights. Please see the linked recipe for more information.
-
-:meth:`Trainee.get_contribution_matrix` and :meth:`Trainee.get_mda_matrix` gets these matrices respectively.
-
-.. warning::
-
-    Matrices may be computationally expensive.
-
-.. code-block:: python
-
-    contrib_matrix = trainee.get_contribution_matrix()
-    mda_matrix = trainee.get_mda_matrix()
 
 Combined Code
 ^^^^^^^^^^^^^
@@ -154,21 +137,21 @@ Combined Code
     trainee.train(df)
     trainee.analyze()
 
-    feature_contributions_robust = trainee.react_aggregate(
+    feature_robust_prediction_contributions = trainee.react_aggregate(
         context_features=context_features,
         feature_influences_action_feature=action_features[0],
-        details={"feature_contributions_robust" : True}
+        details={"feature_robust_prediction_contributions" : True}
     )
 
-    feature_mda_robust = trainee.react_aggregate(
+    feature_robust_accuracy_contributions = trainee.react_aggregate(
         context_features=context_features,
         feature_influences_action_feature=action_features[0],
-        details={"feature_mda_robust" : True}
+        details={"feature_robust_accuracy_contributions" : True}
     )
 
     details = {
-        'feature_contributions_robust':True,
-        'feature_mda_robust':True,
+        'feature_robust_prediction_contributions':True,
+        'feature_robust_accuracy_contributions':True,
     }
 
     results = trainee.react(
@@ -178,12 +161,8 @@ Combined Code
         details=details
     )
 
-    feature_contributions_robust = results['explanation']['feature_contributions_robust']
-    feature_mda_robust = results['explanation']['feature_mda_robust']
-
-    contrib_matrix = trainee.get_contribution_matrix()
-    mda_matrix = trainee.get_mda_matrix()
-
+    feature_robust_prediction_contributions = results['explanation']['feature_robust_prediction_contributions']
+    feature_robust_accuracy_contributions = results['explanation']['feature_robust_accuracy_contributions']
 
 API References
 --------------
@@ -192,6 +171,4 @@ API References
 - :py:meth:`Trainee.analyze`
 - :py:meth:`Trainee.react`
 - :py:meth:`Trainee.react_aggregate`
-- :py:meth:`Trainee.get_contribution_matrix`
-- :py:meth:`Trainee.get_mda_matrix`
 
diff --git a/source/user_guide/basic_capabilities/conviction.rst b/source/user_guide/basic_capabilities/conviction.rst
@@ -86,7 +86,7 @@ specific cases in :py:meth:`Trainee.react`
 .. code-block:: python
 
     details = {
-        'feature_residuals_robust': True
+        'feature_robust_residuals': True
     }
 
     results = trainee.react(
@@ -144,7 +144,7 @@ The code from all of the steps in this guide is combined below:
     print(familiarity_conviction_addition)
 
     details = {
-        'feature_residuals_robust': True,
+        'feature_robust_residuals': True,
         'similarity_conviction': True
     }
 
@@ -178,7 +178,7 @@ Below is an example of expected output from this sample code:
     target
     0       1
     {'action_features': ['target'],
-    'feature_residuals_robust': [{'age': 8.888516681825308,
+    'feature_robust_residuals': [{'age': 8.888516681825308,
                                 'capital-gain': 416.7392605164004,
                                 'capital-loss': 59.906358535804515,
                                 'education': 0.4523004291045252,
diff --git a/source/user_guide/basic_capabilities/residuals.rst b/source/user_guide/basic_capabilities/residuals.rst
diff --git a/source/user_guide/concepts/global_vs_local.rst b/source/user_guide/concepts/global_vs_local.rst