
Support early stopping of prediction in CLI #565

Merged
merged 13 commits into master from prediction on May 30, 2017

Conversation

guolinke
Collaborator

@cbecker
I pushed many changes.
I think the type of the early_stop_instance can be auto-inferred from the objective function.

Another question: do users really need to create the early_stop_instance themselves? Isn't passing the parameters round_period and threshold_margin to the predict function enough?

@Laurae2
Contributor

Laurae2 commented May 29, 2017

Just some questions:

pred_early_stop: is this for prediction early stopping, or did the regular early stopping method change?

pred_early_stop_freq: is it good to set it to 1?

pred_early_stop_margin: what is the margin?

@guolinke
Collaborator Author

@Laurae2 this is only for prediction.
@cbecker can you answer the questions about pred_early_stop_freq and pred_early_stop_margin?

@cbecker
Contributor

cbecker commented May 29, 2017

Hello, I cannot take a look right now, but I'll do it first thing tomorrow morning. Thanks for the PR!

@cbecker
Contributor

cbecker commented May 30, 2017

@cbecker can you answer the questions about pred_early_stop_freq and pred_early_stop_margin?

Early stopping at prediction time for classification does the following:

  • For each sample, every pred_early_stop_freq iterations, look at the classifier's score and compute the margin (see the sketch below). For binary classification, the margin is simply the absolute value of the score; for multiclass, it is the highest score minus the second-highest score.
  • If margin > pred_early_stop_margin, stop iterating and return the current classifier score. This means we are confident enough about the label we are assigning to this sample.
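
For reference, here is a minimal C++ sketch of that margin check; the function names (PredictionMargin, ShouldStopEarly) are illustrative only, not the actual code of this PR:

```cpp
#include <algorithm>
#include <cmath>
#include <functional>
#include <vector>

// Margin for one sample, as described above:
// binary classification -> absolute value of the single raw score,
// multiclass -> highest score minus the second-highest score.
double PredictionMargin(const std::vector<double>& scores) {
  if (scores.size() == 1) {
    return std::fabs(scores[0]);
  }
  std::vector<double> sorted(scores);
  std::partial_sort(sorted.begin(), sorted.begin() + 2, sorted.end(),
                    std::greater<double>());
  return sorted[0] - sorted[1];
}

// Checked only every pred_early_stop_freq iterations: stop adding trees
// for this sample once its margin exceeds pred_early_stop_margin.
bool ShouldStopEarly(const std::vector<double>& scores,
                     double pred_early_stop_margin) {
  return PredictionMargin(scores) > pred_early_stop_margin;
}
```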

Choosing appropriate values for the parameters:

  • pred_early_stop_freq: the lowest value is 1, but then the margin is computed at every iteration, which can slow down prediction. I typically use `pred_early_stop_freq = 25` in my experiments, but it depends on the classifier and how it was trained.
  • pred_early_stop_margin: the lowest value is zero, which means prediction stops as soon as possible and the classification is likely to be very poor. pred_early_stop_margin = infinity means that early stopping is never applied. I typically use pred_early_stop_margin = 1.5. Again, this depends on the classifier: a classifier trained with a low shrinkage (e.g. 0.0001) will need a much lower value here than a classifier trained with shrinkage 0.5 (see the example config below).
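
On the CLI side that this PR adds, these settings would go into the prediction config. A rough sketch of such a config file, assuming the parameter names discussed in this thread (the file names are placeholders):

```
task = predict
data = test.txt
input_model = LightGBM_model.txt
# enable early stopping during prediction
pred_early_stop = true
# check the margin every 25 iterations
pred_early_stop_freq = 25
# stop for a sample once its margin exceeds 1.5
pred_early_stop_margin = 1.5
```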

Let me know if there's anything unclear, thanks for the CLI integration :)

@guolinke
Collaborator Author

@cbecker
I want to delete these two c_api functions: https://github.com/Microsoft/LightGBM/blob/master/src/c_api.cpp#L1118-L1140,
and pass three parameters (pred_early_stop, pred_early_stop_freq, pred_early_stop_margin) into the predict functions in the c_api directly.
Do you think that is okay?

@cbecker
Contributor

cbecker commented May 30, 2017

I want to delete these two c_api functions: https://github.com/Microsoft/LightGBM/blob/master/src/c_api.cpp#L1118-L1140,
and pass three parameters (pred_early_stop, pred_early_stop_freq, pred_early_stop_margin) into the predict functions in the c_api directly.
Do you think that is okay?

Yes, that works for me. I use the pure C++ API whenever I can and avoid the C API. About the latter, one thing I'm afraid of is that the predict() parameter list keeps growing, and if we need to change something in the prediction early-stopping code, we would have to update all of the Python and R APIs as well, because otherwise they would crash due to missing C function arguments. Right now, if we were to change something about early stopping, we would just need to modify LGBM_PredictionEarlyStopInstanceCreate and the changes would propagate automatically.

Therefore I think it is safer and cleaner the way it is now, but if you think it's advantageous to have it in a single function, that works for me too.

@guolinke
Collaborator Author

@cbecker
I want to use only one additional parameter, named "parameter", with string type, and parse what we need from it. As a result, we don't need to worry about the parameter list growing.
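
As an illustration of the idea, here is a simplified stand-in for that parsing step (ParseParameters is hypothetical; it is not LightGBM's actual ConfigBase::Str2Map):

```cpp
#include <sstream>
#include <string>
#include <unordered_map>

// Parse "key1=value1 key2=value2 ..." into a map, in the spirit of
// Str2Map. Simplified: no quoting, aliases, or error handling.
std::unordered_map<std::string, std::string> ParseParameters(const std::string& parameter) {
  std::unordered_map<std::string, std::string> result;
  std::istringstream iss(parameter);
  std::string token;
  while (iss >> token) {
    const auto pos = token.find('=');
    if (pos != std::string::npos) {
      result[token.substr(0, pos)] = token.substr(pos + 1);
    }
  }
  return result;
}

// The predict call could then read the early-stopping settings from the map, e.g.
//   auto params = ParseParameters(
//       "pred_early_stop=true pred_early_stop_freq=25 pred_early_stop_margin=1.5");
//   bool pred_early_stop = (params["pred_early_stop"] == "true");
```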

@cbecker
Contributor

cbecker commented May 30, 2017

@cbecker
I want to use only one additional parameter, named "parameter", with string type, and parse what we need from it. As a result, we don't need to worry about the parameter list growing.

I see, that makes sense. I don't know about the speed penalty though. We'd be doing this for every sample we want to classify, right?

@guolinke
Collaborator Author

@cbecker
No, it is a one-time init.
But it will create a new instance of EarlyStopInstance (and of Predictor) every time prediction is called.

@@ -175,8 +175,11 @@ class Booster {
} else {
is_raw_score = false;
}
auto param = ConfigBase::Str2Map(parameter);
Contributor

I'm really concerned about this. Do you know how long Str2Map() takes? We'll be doing this for each instance we want to classify. IMO it would be worth benchmarking before/after to see how much this hurts performance when only a few trees are being used (with early stopping disabled).

Contributor

I just saw your comment about the one-time init; I will have a second look now.

Collaborator Author

We won't call the c_api once per sample, right?
Prediction is called once for many instances.

@cbecker
Contributor

cbecker commented May 30, 2017

No, it is a one-time init.
But it will create a new instance of EarlyStopInstance (and of Predictor) every time prediction is called.

I agree this makes sense when many samples are classified in a single call, which I guess is the goal of this function. For a moment I was confused by the PredictRaw() function; thanks for the clarification.

@cbecker
Contributor

cbecker commented May 30, 2017

We won't call the c_api once per sample, right?
Prediction is called once for many instances.

Yes, it makes sense.

guolinke merged commit 6d4c7b0 into master on May 30, 2017
guolinke deleted the prediction branch on May 31, 2017 at 03:15
guolinke added a commit that referenced this pull request Oct 9, 2017
* fix multi-threading.

* fix name style.

* support in CLI version.

* remove warnings.

* Not default parameters.

* fix if...else... .

* fix bug.

* fix warning.

* refine c_api.

* fix R-package.

* fix R's warning.

* fix tests.

* fix pep8 .
lock bot locked as resolved and limited conversation to collaborators on Mar 12, 2020