Commit 54a0b42

codemzs authored and shauheen committed
FastTree: Instantiate feature map for disk transpose and make Generalized Additive Models predictor resilient when feature map is not available. (#122)
* Instantiate feature map for disk transpose and make Generalized Additive Models predictor resilient when feature map is not available.
1 parent 5d03b54 commit 54a0b42

22 files changed, +4560 -17 lines changed
@@ -0,0 +1,53 @@
+maml.exe TrainTest test=%Data% tr=BinaryClassificationGamTrainer dout=%Output% data=%Data% out=%Output% seed=1
+Not adding a normalizer.
+Making per-feature arrays
+Changing data from row-wise to column-wise
+Warning: Skipped 16 instances with missing features during training
+Processed 683 instances
+Binning and forming Feature objects
+Starting to train ...
+Training calibrator.
+TEST POSITIVE RATIO: 0.3448 (241.0/(241.0+458.0))
+Confusion table
+ ||======================
+PREDICTED || positive | negative | Recall
+TRUTH ||======================
+ positive || 227 | 14 | 0.9419
+ negative || 13 | 445 | 0.9716
+ ||======================
+Precision || 0.9458 | 0.9695 |
+OVERALL 0/1 ACCURACY: 0.961373
+LOG LOSS/instance: 0.145652
+Test-set entropy (prior Log-Loss/instance): 0.929318
+LOG-LOSS REDUCTION (RIG): 84.326961
+AUC: 0.991198
+
+OVERALL RESULTS
+---------------------------------------
+AUC: 0.991198 (0.0000)
+Accuracy: 0.961373 (0.0000)
+Positive precision: 0.945833 (0.0000)
+Positive recall: 0.941909 (0.0000)
+Negative precision: 0.969499 (0.0000)
+Negative recall: 0.971616 (0.0000)
+Log-loss: 0.145652 (0.0000)
+Log-loss reduction: 84.326961 (0.0000)
+F1 Score: 0.943867 (0.0000)
+AUPRC: 0.971819 (0.0000)
+
+---------------------------------------
+Physical memory usage(MB): %Number%
+Virtual memory usage(MB): %Number%
+%DateTime% Time elapsed(s): %Number%
+
+--- Progress log ---
+[1] 'FastTree data preparation' started.
+[1] 'FastTree data preparation' finished in %Time%.
+[2] 'FastTree in-memory bins initialization' started.
+[2] 'FastTree in-memory bins initialization' finished in %Time%.
+[3] 'FastTree feature conversion' started.
+[3] 'FastTree feature conversion' finished in %Time%.
+[4] 'GAM training' started.
+[4] 'GAM training' finished in %Time%.
+[5] 'Saving model' started.
+[5] 'Saving model' finished in %Time%.
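
The summary metrics in this baseline can be cross-checked against its confusion table. The following is a minimal sketch, not part of the commit: the function name `confusion_metrics` and its dictionary keys are hypothetical, and the formulas are the standard definitions of accuracy, precision, recall and F1, which happen to reproduce the logged values.

```python
def confusion_metrics(tp: int, fn: int, fp: int, tn: int) -> dict:
    """Derive summary metrics from binary confusion-table counts.

    Hypothetical helper for checking the baseline by hand; it is not
    how maml.exe computes its output.
    """
    total = tp + fn + fp + tn
    return {
        "accuracy": (tp + tn) / total,
        "positive_precision": tp / (tp + fp),
        "positive_recall": tp / (tp + fn),
        "negative_precision": tn / (tn + fn),
        "negative_recall": tn / (tn + fp),
        "f1": 2 * tp / (2 * tp + fp + fn),
    }


# Counts read from the confusion table in the baseline above:
# truth positive -> 227 predicted positive, 14 predicted negative
# truth negative -> 13 predicted positive, 445 predicted negative
metrics = confusion_metrics(tp=227, fn=14, fp=13, tn=445)
for name, value in metrics.items():
    print(f"{name}: {value:.6f}")
```

Comparing the printed values with the OVERALL RESULTS block is a quick way to spot a baseline whose table and summary have drifted apart.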
@@ -0,0 +1,4 @@
+BinaryClassificationGamTrainer
+AUC Accuracy Positive precision Positive recall Negative precision Negative recall Log-loss Log-loss reduction F1 Score AUPRC Learner Name Train Dataset Test Dataset Results File Run Time Physical Memory Virtual Memory Command Line Settings
+0.991198 0.961373 0.945833 0.941909 0.969499 0.971616 0.145652 84.32696 0.943867 0.971819 BinaryClassificationGamTrainer %Data% %Data% %Output% 99 0 0 maml.exe TrainTest test=%Data% tr=BinaryClassificationGamTrainer dout=%Output% data=%Data% out=%Output% seed=1
+

ZBaselines/SingleDebug/BinaryClassificationGamTrainer/BinaryClassificationGamTrainer-TrainTest-breast-cancer.txt

Lines changed: 700 additions & 0 deletions
Large diffs are not rendered by default.
@@ -0,0 +1,49 @@
+maml.exe TrainTest test=%Data% tr=BinaryClassificationGamTrainer{dt+} dout=%Output% data=%Data% out=%Output% seed=1
+Not adding a normalizer.
+Making per-feature arrays
+Changing data from row-wise to column-wise on disk
+Warning: 16 of 699 examples will be skipped due to missing feature values
+Processed 683 instances
+Binning and forming Feature objects
+Starting to train ...
+Training calibrator.
+TEST POSITIVE RATIO: 0.3448 (241.0/(241.0+458.0))
+Confusion table
+ ||======================
+PREDICTED || positive | negative | Recall
+TRUTH ||======================
+ positive || 227 | 14 | 0.9419
+ negative || 13 | 445 | 0.9716
+ ||======================
+Precision || 0.9458 | 0.9695 |
+OVERALL 0/1 ACCURACY: 0.961373
+LOG LOSS/instance: 0.145652
+Test-set entropy (prior Log-Loss/instance): 0.929318
+LOG-LOSS REDUCTION (RIG): 84.326961
+AUC: 0.991198
+
+OVERALL RESULTS
+---------------------------------------
+AUC: 0.991198 (0.0000)
+Accuracy: 0.961373 (0.0000)
+Positive precision: 0.945833 (0.0000)
+Positive recall: 0.941909 (0.0000)
+Negative precision: 0.969499 (0.0000)
+Negative recall: 0.971616 (0.0000)
+Log-loss: 0.145652 (0.0000)
+Log-loss reduction: 84.326961 (0.0000)
+F1 Score: 0.943867 (0.0000)
+AUPRC: 0.971819 (0.0000)
+
+---------------------------------------
+Physical memory usage(MB): %Number%
+Virtual memory usage(MB): %Number%
+%DateTime% Time elapsed(s): %Number%
+
+--- Progress log ---
+[1] 'FastTree disk-based bins initialization' started.
+[1] 'FastTree disk-based bins initialization' finished in %Time%.
+[2] 'GAM training' started.
+[2] 'GAM training' finished in %Time%.
+[3] 'Saving model' started.
+[3] 'Saving model' finished in %Time%.
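
The LOG-LOSS REDUCTION (RIG) line in this baseline appears to follow from the two lines before it. The sketch below is an illustration only: the helper names are hypothetical, and it assumes the test-set entropy is the base-2 entropy of the class prior and that RIG is the relative drop from that prior, expressed as a percentage. Those assumptions reproduce the logged numbers to within rounding, but they are not taken from the commit itself.

```python
import math


def prior_entropy_bits(positives: int, negatives: int) -> float:
    """Entropy of the class prior in bits (assumed base-2 logarithm)."""
    p = positives / (positives + negatives)
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def log_loss_reduction(prior_loss: float, model_loss: float) -> float:
    """Relative reduction of log-loss versus the prior, as a percentage."""
    return 100.0 * (prior_loss - model_loss) / prior_loss


# Class counts from the TEST POSITIVE RATIO line: 241 positive, 458 negative.
entropy = prior_entropy_bits(241, 458)

# Logged values: prior log-loss 0.929318, model log-loss 0.145652.
rig = log_loss_reduction(0.929318, 0.145652)

print(f"prior entropy: {entropy:.6f}")
print(f"log-loss reduction: {rig:.4f}")
```

Because the inputs here are the six-decimal values from the log, the computed reduction agrees with the logged 84.326961 only to a few decimal places.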
@@ -0,0 +1,4 @@
+BinaryClassificationGamTrainer
+AUC Accuracy Positive precision Positive recall Negative precision Negative recall Log-loss Log-loss reduction F1 Score AUPRC /dt Learner Name Train Dataset Test Dataset Results File Run Time Physical Memory Virtual Memory Command Line Settings
+0.991198 0.961373 0.945833 0.941909 0.969499 0.971616 0.145652 84.32696 0.943867 0.971819 + BinaryClassificationGamTrainer %Data% %Data% %Output% 99 0 0 maml.exe TrainTest test=%Data% tr=BinaryClassificationGamTrainer{dt+} dout=%Output% data=%Data% out=%Output% seed=1 /dt:+
+
