Skip to content

Commit 67fe773

Browse files
SPARK-5502 reworded model prediction rules to use more general language rather than the code/implementation specific terms
1 parent 80bd4c3 commit 67fe773

File tree

1 file changed

+12
-9
lines changed

1 file changed

+12
-9
lines changed

docs/mllib-isotonic-regression.md

Lines changed: 12 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -37,15 +37,18 @@ Training returns an IsotonicRegressionModel that can be used to predict
3737
labels for both known and unknown features. The result of isotonic regression
3838
is treated as piecewise linear function. The rules for prediction therefore are:
3939

40-
* If testData exactly matches a boundary then associated prediction is returned.
41-
In case there are multiple predictions with the same boundary then one of them
42-
is returned. Which one is undefined (same as java.util.Arrays.binarySearch).
43-
* If testData is lower or higher than all boundaries then first or last prediction
44-
is returned respectively. In case there are multiple predictions with the same
45-
boundary then the lowest or highest is returned respectively.
46-
* If testData falls between two values in boundary array then prediction is treated
47-
as piecewise linear function and interpolated value is returned. In case there are
48-
multiple values with the same boundary then the same rules as in previous point are used.
40+
* If the prediction input exactly matches a training feature
41+
then associated prediction is returned. In case there are multiple predictions with the same
42+
feature then one of them is returned. Which one is undefined
43+
(same as java.util.Arrays.binarySearch).
44+
* If the prediction input is lower or higher than all training features
45+
then prediction with lowest or highest feature is returned respectively.
46+
In case there are multiple predictions with the same feature
47+
then the lowest or highest is returned respectively.
48+
* If the prediction input falls between two training features then prediction is treated
49+
as piecewise linear function and interpolated value is calculated from the
50+
predictions of the two closest features. In case there are multiple values
51+
with the same feature then the same rules as in previous point are used.
4952

5053
### Examples
5154

0 commit comments

Comments
 (0)