Skip to content

Enables FastTreeBinaryClassificationCategoricalSplitTest and BinaryClassifierTesterThresholdingTest #255

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 7 commits into from
May 30, 2018
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
@@ -0,0 +1,56 @@
maml.exe TrainTest test=%Data% tr=FastTreeBinaryClassification{nl=5 mil=5 lr=0.25 iter=20 mb=255} dout=%Output% loader=Text{sep=, header+ col=Label:14 col=Cat:TX:1,3,5-9,13} data=%Data% out=%Output% seed=1 xf=Cat{col=Cat} xf=Concat{col=Features:Cat}
Not adding a normalizer.
Making per-feature arrays
Changing data from row-wise to column-wise
Processed 32561 instances
Binning and forming Feature objects
Reserved memory for tree learner: 4980 bytes
Starting to train ...
Not training a calibrator because it is not needed.
TEST POSITIVE RATIO: 0.2362 (3846.0/(3846.0+12435.0))
Confusion table
||======================
PREDICTED || positive | negative | Recall
TRUTH ||======================
positive || 1,982 | 1,864 | 0.5153
negative || 895 | 11,540 | 0.9280
||======================
Precision || 0.6889 | 0.8609 |
OVERALL 0/1 ACCURACY: 0.830539
LOG LOSS/instance: 0.537244
Test-set entropy (prior Log-Loss/instance): 0.788708
LOG-LOSS REDUCTION (RIG): 31.883066
AUC: 0.871960

OVERALL RESULTS
---------------------------------------
AUC: 0.871960 (0.0000)
Accuracy: 0.830539 (0.0000)
Positive precision: 0.688912 (0.0000)
Positive recall: 0.515341 (0.0000)
Negative precision: 0.860937 (0.0000)
Negative recall: 0.928026 (0.0000)
Log-loss: 0.537244 (0.0000)
Log-loss reduction: 31.883066 (0.0000)
F1 Score: 0.589618 (0.0000)
AUPRC: 0.670582 (0.0000)

---------------------------------------
Physical memory usage(MB): %Number%
Virtual memory usage(MB): %Number%
%DateTime% Time elapsed(s): %Number%

--- Progress log ---
[1] 'Building term dictionary' started.
[1] (%Time%) 32561 examples Total Terms: 100
[1] 'Building term dictionary' finished in %Time%.
[2] 'FastTree data preparation' started.
[2] 'FastTree data preparation' finished in %Time%.
[3] 'FastTree in-memory bins initialization' started.
[3] 'FastTree in-memory bins initialization' finished in %Time%.
[4] 'FastTree feature conversion' started.
[4] 'FastTree feature conversion' finished in %Time%.
[5] 'FastTree training' started.
[5] 'FastTree training' finished in %Time%.
[6] 'Saving model' started.
[6] 'Saving model' finished in %Time%.
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
FastTreeBinaryClassification
AUC Accuracy Positive precision Positive recall Negative precision Negative recall Log-loss Log-loss reduction F1 Score AUPRC /lr /nl /mil /iter Learner Name Train Dataset Test Dataset Results File Run Time Physical Memory Virtual Memory Command Line Settings
0.87196 0.830539 0.688912 0.515341 0.860937 0.928026 0.537244 31.88307 0.589618 0.670582 0.25 5 5 20 FastTreeBinaryClassification %Data% %Data% %Output% 99 0 0 maml.exe TrainTest test=%Data% tr=FastTreeBinaryClassification{nl=5 mil=5 lr=0.25 iter=20 mb=255} dout=%Output% loader=Text{sep=, header+ col=Label:14 col=Cat:TX:1,3,5-9,13} data=%Data% out=%Output% seed=1 xf=Cat{col=Cat} xf=Concat{col=Features:Cat} /lr:0.25;/nl:5;/mil:5;/iter:20

Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@

Per-feature gain summary for the boosted tree ensemble:
marital-status.Married-civ-spouse 1
occupation.Exec-managerial 0.399117896915284
occupation.Prof-specialty 0.391227805213273
education.Bachelors 0.334466450228397
education.Masters 0.287494503306969
education.Prof-school 0.204600035294214
education.Doctorate 0.187985462587772
occupation.Sales 0.150856624152445
occupation.Other-service 0.147997280910012
relationship.Own-child 0.137529286784067
marital-status.Never-married 0.131366568817433
education.7th-8th 0.122497278808477
workclass.Self-emp-inc 0.119940186278871
education.HS-grad 0.113890497534289
occupation.Tech-support 0.104801559978143
workclass.Self-emp-not-inc 0.0911869726068162
native-country.Mexico 0.081461599208144
workclass.Federal-gov 0.0785782301236086
sex.Male 0.0783006369313957
relationship.Wife 0.0749144815167656
education.11th 0.0746051436850124
occupation.Handlers-cleaners 0.0601209002480329
native-country.United-States 0.0593295291546631
occupation.Farming-fishing 0.0583149634881263
education.10th 0.0544727793656118
education.9th 0.052272828491541
workclass.Local-gov 0.0519635659099365
occupation.? 0.0479530108171804
occupation.Protective-serv 0.0451534478142547
Loading