Fixing prediction methods to be more standardized #167
Conversation
Catchup to upstream
…nto robustCaretList
@zachmayer This is a bit of a large PR, so just ping me with a comment when you have a chance to review it. Hopefully I can answer any questions you have, and I'm also willing to make whatever changes you think make sense.
I'd also recommend changing the Git Check settings to only pick up git checks on the "NOT_CRAN" version of
@zachmayer Just a ping to see about moving this forward. I think it'll solve some of the recently reported issues like #172.
This file has a couple of untested lines: https://coveralls.io/builds/3539104/source?filename=R%2FcaretEnsemble.R Could you add one or two more tests to hit the red lines? Meanwhile, I've gotta go figure out why our travis builds are failing.
args <- lapply(args, eval, parent.frame()) # convert from symbols to objects
if(exists("newdata", args)){
  # tmp <- args$newdata
  if(anyNA(args$newdata)){
I'm pretty sure this line means that we will never get to the "Missing data found..." keepNA argument below, but I can't be sure: the data could be complete, yet for some functions it could still produce an NA prediction. This seems pretty difficult to identify a test case for.
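To make the concern above concrete, here is a minimal sketch of this kind of guard. The function name and warning text are illustrative, not the PR's actual code; the point is that an anyNA check on newdata catches missing inputs, while NA predictions from complete data would slip past it.

```r
# Hypothetical guard, mirroring the anyNA(args$newdata) check discussed above.
# Note: a model can still return NA predictions even when this check passes,
# which is why the branch below it is hard to write a test for.
check_newdata <- function(newdata) {
  if (anyNA(newdata)) {
    warning("Missing data found in newdata; predictions may contain NAs")
    return(TRUE)
  }
  FALSE
}

df_missing  <- data.frame(x = c(1, NA, 3))
df_complete <- data.frame(x = c(1, 2, 3))
check_newdata(df_missing)   # warns and returns TRUE
check_newdata(df_complete)  # returns FALSE
```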
Fixed the build issue, wooooooo
Also, instead of merging master, you can also rebase off master and then force push. It's a little bit harder to do, but it makes for a slightly nicer commit history.
OK, I think I did the rebase, but I'm not sure. At any rate we're even with master again. I think what needs to happen is I need to properly deprecate
👍 from me. I'm re-testing right now, and will merge when all the tests pass
OK one last push to test my deprecation of
@zachmayer It builds, but the coverage check is a little wonky. I can't get those deprecated bits to be tested, but I don't want to junk them because legacy code may use them (I'm speaking selfishly here). If we merge this, I'm happy to figure out the merge conflicts it might cause with the other PR.
I edited the thresholds and re-ran
@zachmayer Looks like we're good to go. Thanks! After this gets merged in, I'll rebase and start tackling the two issues that cropped up related to prediction, #172 and #171. This closes #167.
Awesome! I'm away for the weekend but will merge asap so we can move on!
Fixing prediction methods to be more standardized
This PR addresses #165 and #158 by making sure that predict.caretEnsemble and predict.caretStack provide similar output and have similar inputs. There are also large changes to the unit tests to make sure that the tests now check the correct structure of the output from these two functions. predict.caretStack is particularly heavily modified here -- it can now produce prediction errors based on the model disagreement about each observation, mimicking the behavior of predict.caretEnsemble. It uses the variable importance of each model's predictions in the stacked model to define the weights. When these predictions are transformed before ensembling, no standard error is returned or calculated, but the variable importance can still be returned as a weights attribute on the output vector.
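The disagreement-based error described above can be sketched in plain R. This is not the package's actual implementation; the matrix of member-model predictions and the importance weights are made-up illustrations of the idea: a weighted ensemble prediction plus a weighted spread across models as the error estimate, with the importance weights attached as a "weights" attribute.

```r
# Hypothetical member-model predictions: rows = observations, cols = models.
preds <- cbind(glm = c(0.62, 0.10, 0.85),
               rf  = c(0.70, 0.05, 0.95))
# Assumed variable-importance weights from the stacked model, summing to 1.
imp <- c(glm = 0.4, rf = 0.6)

# Importance-weighted ensemble prediction for each observation.
wmean <- as.vector(preds %*% imp)

# Weighted standard deviation across models: a rough per-observation
# "prediction error" driven by how much the member models disagree.
werr <- sqrt(as.vector(((preds - wmean)^2) %*% imp))

# Expose the importance weights on the output, as the PR description suggests.
out <- wmean
attr(out, "weights") <- imp
```

Observations where the member models agree closely get a small werr; observations where they diverge get a large one, which is the behavior the PR describes for predict.caretStack.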