
Handling of missings (in train + predict) #238

Closed
pat-s opened this issue Jun 6, 2019 · 1 comment

pat-s (Member) commented Jun 6, 2019

This has been discussed several times in mlr.

As far as I can see, the current behavior in mlr is the one introduced by this PR: mlr-org/mlr#2099

mllg (Member) commented Jun 27, 2019

Missing values are now handled according to the following policy:

  • If a learner is capable of handling missing values during train(), it should get the missings property.
  • Learners which cannot handle missing values in the test set should predict NA for those observations.
  • Predicting NA raises an exception unless a fallback learner is defined; all rows with NA predictions are filled in with the predictions of the fallback learner.

I know that this is not perfect and that there may be rare occasions where you need more flexibility. However, I believe this is a statistically sound approach (unlike na.rm = TRUE during performance assessment). Additionally, you can always impute missing values with a PipeOp.
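To make the two options concrete, here is a hedged sketch in R of (1) a fallback learner and (2) up-front imputation via a PipeOp. It assumes the mlr3 and mlr3pipelines packages; exact constructor names (e.g. `as_learner()`, the `$fallback` field) have evolved since this issue was closed, so treat the calls as illustrative rather than as the definitive API.

```r
library(mlr3)
library(mlr3pipelines)

# A built-in task with missing values (assumption: available in mlr3's task dictionary).
task = tsk("pima")

# Option 1: define a fallback learner. Rows for which the main learner
# predicts NA are filled in by the featureless fallback instead of
# raising an exception during resampling.
learner = lrn("classif.rpart")
learner$fallback = lrn("classif.featureless")

# Option 2: impute missing values before training with a PipeOp,
# then wrap the pipeline as a single learner.
graph_learner = as_learner(po("imputemean") %>>% lrn("classif.rpart"))

# Either learner can then be resampled as usual:
rr = resample(task, graph_learner, rsmp("cv", folds = 3))
rr$aggregate(msr("classif.acc"))
```

Option 2 is usually preferable when the downstream learner lacks the missings property, since imputation is learned on the training split and applied consistently at predict time.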

@mllg mllg closed this as completed Jun 27, 2019