Jonathan Hillman and Toby Dylan Hocking, Optimizing ROC Curves with a Sort-Based Surrogate Loss Function for Binary Classification and Changepoint Detection, arXiv:2107.01285
This paper proposes a new surrogate loss function, the Area Under Min of FP and FN (AUM). Minimizing AUM was shown to result in maximizing the Area Under the ROC Curve (AUC). The reference C++ implementation of Algorithm 1 (AUM and directional derivative computation) is in the R package aum. A PyTorch implementation of AUM is in figure-aum-neural-networks-data.py.
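For concreteness, here is a minimal from-scratch R sketch of the AUM for binary classification (labels in {0,1}, real-valued scores). The helper name aum_binary is hypothetical, not the aum package API; the package's Algorithm 1 additionally computes the directional derivatives used for gradient descent.

```r
# Hypothetical helper (not the aum package API): AUM for binary classification.
# FP(c) and FN(c) are step functions of the threshold c that change only at the
# sorted scores, so AUM is a sum of min(FP,FN) values times interval lengths.
aum_binary <- function(pred, label) {
  ord <- order(pred)  # O(N log N) sort, the dominant cost
  p <- pred[ord]
  y <- label[ord]
  k <- seq_len(length(p) - 1)
  fn <- cumsum(y == 1)[k]                # positives with score <= p[k]
  fp <- sum(y == 0) - cumsum(y == 0)[k]  # negatives with score >  p[k]
  sum(pmin(fp, fn) * diff(p))
}
aum_binary(pred = c(2, -1, 0.5, 1), label = c(0, 1, 0, 1))  # 3, imperfect ranking
```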
Before running some of the replication scripts below, you will need to clone the data repositories: https://github.com/tdhock/feature-learning-benchmark and https://github.com/tdhock/neuroblastoma-data.
- Figure 1: Example changepoint data with non-monotonic FP. Left PNG (data), Right PNG (error), R code.
- Figure 2: Synthetic example with AUC greater than one. Left PNG (AUM), Right PNG (AUC), R code.
- Figure 3: Real data example with AUC greater than one. Left PNG (error functions), Right PNG (AUM/AUC functions), R code.
- Figure 4: optimizing AUM maximizes AUC. Left PNG, R code (iterations); Right PNG, R code (ROC curves).
- Figure 5: Test AUM/AUC comparison across multiple seeds and datasets. Top PNG, Bottom PNG, R code.
- Figure 6: Binary classification test AUC. Left PNG (comparing AUM variants), Right PNG (AUM compared to baselines), R code.
- Figure 7: PNG (Test AUC of neural network for image classification), R code.
- Figure 8: speed comparison. PNG, R code.
Interactive version: http://ml.nau.edu/viz/2021-11-12-aum-convexity/
Source: figure-aum-convexity-interactive.R
- Aligning two error curves (peak detection).
- 8 error curves with 2 different alignments.
- Some ROC curves with AUC>1.
figure-auc-improved-interactive.R
Title: Efficient line search for optimizing Area Under the ROC Curve in gradient descent
Abstract: Receiver Operating Characteristic (ROC) curves are useful for evaluation in binary classification and changepoint detection, but difficult to use for learning since the Area Under the Curve (AUC) is piecewise constant (gradient zero almost everywhere). Recently the Area Under Min (AUM) of false positive and false negative rates has been proposed as a differentiable surrogate for AUC. In this paper we study the piecewise linear/constant nature of the AUM/AUC, and propose new efficient path-following algorithms for choosing the learning rate which is optimal for each step of gradient descent (line search), when optimizing a linear model. Remarkably, our proposed line search algorithm has the same log-linear asymptotic time complexity as gradient descent with constant step size, but it computes a complete representation of the AUM/AUC as a function of step size. In our empirical study of binary classification problems, we verify that our proposed algorithm is fast and exact; in changepoint detection problems we show that the proposed algorithm is just as accurate as grid search, but faster.
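To make the line-search idea concrete, here is a hedged stand-in in R, assuming the aum_binary() helper sketched near the top of this page: it picks the best step size on a fixed grid, whereas the proposed algorithm follows the exact piecewise linear AUM as a function of step size.

```r
# Grid-based stand-in for the exact line search (names here are hypothetical).
# X: feature matrix, y: binary labels, weight: current linear model parameters,
# direction: descent direction (e.g. negative gradient of AUM).
grid_line_search <- function(X, y, weight, direction,
                             step.grid = 10^seq(-4, 1, by = 0.5)) {
  aum.grid <- sapply(step.grid, function(step) {
    aum_binary(as.numeric(X %*% (weight + step * direction)), y)
  })
  step.grid[which.min(aum.grid)]  # best grid point, not the exact minimizer
}
```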
Slides PDF
- https://github.com/tdhock/max-generalized-auc/blob/master/HOCKING-slides-toronto.pdf
- https://github.com/tdhock/max-generalized-auc/blob/master/HOCKING-slides-sherbrooke.pdf
See also https://github.com/tdhock/two-new-algos-sci-ml
data_Classif_batchtools_best_valid.R makes the figure below.
data_Classif.R makes data_Classif.RData
data_Classif_figure.R makes the figure below.
figure-aum-convexity-new.R makes the figure below.
figure-line-search-example-binary.R makes the figure below.
figure-more-than-one-new.R makes new figures for the AUM line search paper:
figure-line-search-example.R makes a figure/demo for the new paper/slides.
figure-more-than-one.R was modified to make a heat map version of the supplementary material figure, and a rate version of AUM:
JSM Slides “Efficient line search optimization of penalty functions in supervised changepoint detection,” PDF, LaTeX source.
figure-line-search-interactive.R makes a new interactive figure showing the line search.
figure-line-search-complexity-compare.R studies the asymptotic complexity of different versions of the line search.
figure-line-search-complexity.R is a more systematic version of this experiment, computing how many iterations it takes to reach a min-AUM step size (tdhock/aum#5).
Result CSV: figure-line-search-complexity.csv; figure below.
figure-line-grid-search-interactive.R compares grid search to line search with a linear number of iterations (equal to the number of input diffs/breakpoints B).
figure-auc-improved.R makes the figure below, which shows one way to measure the irregularity of ROC curves: the difference between the AUC and the AUC of the monotonic version of the curve.
HOCKING-slides-prescott.tex makes HOCKING-slides-prescott.pdf
with new figure code figure-aum-grad-speed-binary.R, which makes the figure below.
figure-compare-hinge-loss.R makes the figure below.
New image classification experiment: figure-aum-neural-networks-data.py, adapted from the torch AUM code at https://tdhock.github.io/blog/2022/aum-learning/
figure-aum-neural-networks.R makes the figure below.
Additional figures in figure-more-than-one.R
The figure below, from the aum package accuracy comparison vignette, suggests that experiments on sonar data could provide convincing evidence of superior accuracy.
figure-sonar-comparisons-data.R makes figure-sonar-comparisons.csv
figure-sonar-comparisons.R reads that and makes the figure below.
figure-aum-convexity-interactive.R makes an interactive figure.
Interactive versions:
- 2 Feb 2023, bigger text size http://ml.nau.edu/viz/2021-11-12-aum-convexity/
- 7 Nov 2021, continuity in pred.diff interaction http://bl.ocks.org/tdhock/raw/e3f56fa419a6638f943884a3abe1dc0b
- 6 Nov 2021, no continuity in pred.diff interaction http://bl.ocks.org/tdhock/raw/de3979318d5255dd6e21ff907e2f3fb4
HOCKING-slides.tex makes HOCKING-slides.pdf for the ML lab / Math colloquium.
figure-aum-grad-speed-binary-cpp-data.R makes binary classification timing data, figure-aum-grad-speed-binary-cpp-data.csv
figure-aum-grad-speed-binary-cpp.R makes the figure below.
figure-aum-grad-speed.R was updated to make the figure below.
figure-unbalanced-grad-desc.R was updated to make a new figure (useful for slides, probably).
Updated figure-aum-convexity.R to make new figures.
Updated figure-aum-grad-speed.R to make a new figure.
figure-aum-grad-speed-binary.R makes the figures below.
The figure above shows time differences between sorted (linear) and unsorted (log-linear) predictions; a timing sketch follows below.
The figure below shows differences between algos (AUM comparable to logistic, whether or not predictions are sorted).
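A hedged sketch of that sorted/unsorted timing difference, reusing the aum_binary() helper from the top of this page: pre-sorting lets us skip the order() call, leaving only linear-time cumsum work.

```r
# aum_binary_sorted() is a hypothetical variant that assumes pred is already
# increasing, so only the O(N) cumsum/pmin/diff work remains.
aum_binary_sorted <- function(pred, label) {
  k <- seq_len(length(pred) - 1)
  fn <- cumsum(label == 1)[k]
  fp <- sum(label == 0) - cumsum(label == 0)[k]
  sum(pmin(fp, fn) * diff(pred))
}
N <- 1e6
pred <- rnorm(N)
y <- rbinom(N, 1, 0.5)
o <- order(pred)
system.time(aum_binary(pred, y))               # log-linear: includes the sort
system.time(aum_binary_sorted(pred[o], y[o]))  # linear: sort done beforehand
```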
figure-aum-grad-speed-data.R makes figure-aum-grad-speed-data.csv
figure-aum-grad-speed.R reads that and makes the figure below.
figure-unbalanced-grad-desc-data.R makes figure-unbalanced-grad-desc-data.rds
figure-unbalanced-grad-desc.R reads that and makes the figures below.
The figure above shows that the AUM variant which uses total number of errors (count) is more accurate than the AUM variant which uses the normalized error (rate).
The figure above shows that the AUM is at least as accurate as squared.hinge.all.pairs, whereas logistic.weighted is less accurate.
figure-logistic-weights.R makes the figure below.
This figure shows that cv.glmnet does fine with 5% positive labels, but stops learning when we get down to 1% positive labels. This suggests that we should try 1% for comparing aum.rate and aum.count.
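A minimal sketch of that imbalance experiment on simulated data (the real experiment uses the benchmark data sets, so the data generation here is an assumption); cv.glmnet is from the glmnet package.

```r
library(glmnet)
set.seed(1)
N <- 5000
P <- 20
prop.pos <- 0.01  # try 0.05 for the 5% setting discussed above
X <- matrix(rnorm(N * P), N, P)
score <- X %*% rnorm(P)
# Threshold the latent score so about 1% of labels are positive.
y <- as.integer(score > quantile(score, 1 - prop.pos))
fit <- cv.glmnet(X, y, family = "binomial")
pred.prob <- predict(fit, X, type = "response", s = "lambda.min")
```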
figure-DNA-Sonar-subtrain-valid-data.R makes figure-DNA-Sonar-subtrain-valid-data.csv.gz
figure-DNA-Sonar-subtrain-valid.R analyzes those data.
figure-binary-test-auc-data.R makes figure-binary-test-auc-data.rds
figure-binary-test-auc.R makes the figure below.
figure-test-fold-monotonic.R makes the figure below.
```
> meta.dt[, .(data.name, test.fold, features, n.train, mean.breaks)]
                   data.name test.fold features n.train mean.breaks
1:           ATAC_JV_adipose         4       29     341    6.665689
2: H3K27ac-H3K4me3_TDHAM_BP          2       26    1865    4.145845
3:        H3K4me3_XJ_immune          2       28     216    5.902778
4:        H3K4me3_XJ_immune          4       28     216    6.134259
5:                systematic         1      117    3322    1.010235
> (meta.stats <- meta.tall[, .(
+   min=min(value),
+   max=max(value)
+ ), by=variable])
      variable        min         max
1:    features  26.000000  117.000000
2:     n.train 216.000000 3322.000000
3: mean.breaks   1.010235    6.665689
```
figure-aum-train-both.R makes the figure below.
figure-aum-train-data.R makes figure-aum-train-data.rds
figure-aum-train.R makes the figure below.
figure-aum-optimized-data.R makes figure-aum-optimized-data.rds
figure-aum-optimized.R reads those data and makes the figure below, which shows N=54 predicted values with min error, then predicted values optimized via AUM gradient descent.
- TODO: do the same with a linear model, train error/AUC.
- TODO: AUM figures?
figure-binary-class.R makes a figure showing what the FP/FN curves look like for binary classification; a sketch of these curves follows.
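A small sketch of those FP/FN curves (fp_fn_curves is a hypothetical helper, using the same conventions as aum_binary() above): both are step functions of the prediction threshold.

```r
fp_fn_curves <- function(pred, label) {
  ord <- order(pred)
  p <- pred[ord]
  y <- label[ord]
  data.frame(
    threshold = p,
    fn = cumsum(y == 1),                # positives with score <= threshold
    fp = sum(y == 0) - cumsum(y == 0))  # negatives with score >  threshold
}
curves <- fp_fn_curves(rnorm(20), rbinom(20, 1, 0.5))
matplot(curves$threshold, curves[, c("fp", "fn")],
        type = "s", xlab = "threshold", ylab = "errors")
```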
figure-aum-convexity.R makes the figure below.
figure-fn-not-monotonic.R makes the figure below.
figure-more-than-one.R makes the figure below.
figure-linear-model-test-analyze.R makes the figure below.
Some R scripts for interactive experimentation with the gradient descent algorithm for learning a linear model that minimizes AUM:
- figure-linear-model.R uses penaltyLearning::IntervalRegressionCV for initialization.
- figure-linear-model-zero-init.R uses zero vector for init.
R script with a OneFold function that computes train/valid/test error; it can be parallelized over 198 test folds on the cluster (a usage sketch follows the results list below).
Initial results on two data sets (ATAC, CTCF) show that:
- Train AUM decreases as a function of iterations (each iteration does a line search, so that is expected).
- IntervalRegressionCV init is much more accurate (in terms of test AUM, AUC, errors) than zero init. The best linear model is not as accurate as the best predictions, obtained by running gradient descent on just the predicted values (without a linear model).
- Using early stopping regularization (select number of iterations with min AUM on validation set) does not decrease test AUM using IntervalRegressionCV initialization.
- The linear model which is best in terms of test AUM, over all iterations, is not much better than the initial iteration, for these two data sets.
- Do we see any improvement on other test folds / data sets?
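A hypothetical usage sketch for the parallelization mentioned above; OneFold and its fold-index argument are assumptions based on the description, and mclapply is just one way to run it on a cluster node.

```r
library(parallel)
# Run OneFold on each of the 198 test folds, using all available cores;
# each element of fold.results is assumed to be a data frame of errors.
fold.results <- mclapply(1:198, OneFold, mc.cores = detectCores())
results <- do.call(rbind, fold.results)
```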
figure-compare-hinge-loss-data.R makes figure-compare-hinge-loss-data.csv
figure-compare-hinge-loss.R makes the figure below.
figure-neuroblastomaProcessed-combinations.R makes new figure that highlights counter-examples for the proposition (AUC=1 implies AUM=0) and shows that there are no counter-examples for the converse.
auc.improved.R copied from https://github.com/tdhock/feature-learning-benchmark/blob/master/auc.improved.R
figure-curveAlignment.R computes the derivative of the area under min(fp,fn); updated viz: http://ml.nau.edu/viz/2019-08-19-curveAlignment-aub-deriv/
figure-neuroblastomaProcessed-combinations-interactive.R makes http://ml.nau.edu/viz/2019-08-16-generalized-roc/
curveAlignment.R and figure-curveAlignment.R
http://members.cbio.mines-paristech.fr/~thocking/figure-max-auc/
figure-aub-convexity.R creates figures which show that the aub function (the earlier name for what the papers above call AUM) is continuous but not convex; a hedged demo follows:
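This demo reuses the aum_binary() helper from the top of this page: AUM evaluated along a segment between two prediction vectors is piecewise linear, hence continuous, but its kinks need not bend upward as convexity would require (some random draws may happen to look convex, so try a few seeds).

```r
set.seed(1)
y <- rbinom(10, 1, 0.5)
pred1 <- rnorm(10)
pred2 <- rnorm(10)
s.grid <- seq(0, 1, by = 0.01)
aum.path <- sapply(s.grid, function(s) {
  aum_binary((1 - s) * pred1 + s * pred2, y)  # AUM along the segment
})
plot(s.grid, aum.path, type = "l", xlab = "interpolation s", ylab = "AUM")
```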
figure-neuroblastomaProcessed-complex-loon.R has code for an interactive plot using loon.
figure-neuroblastomaProcessed-combinations.R creates the following figure which plots auc vs aub:
Note that the min AUM=0 has AUC=1, and the points with AUC>1 have AUM>0. Thus minimizing AUM seems like a reasonable criterion.
figure-neuroblastomaProcessed-complex.R creates http://members.cbio.mines-paristech.fr/~thocking/figure-neuroblastomaProcessed-complex/ which shows 8 labeled neuroblastoma data sequences with two different ROC curves / predictions. Strangely both achieve 0 errors, but the one with predictions in the finite interval has a highly non-monotonic ROC curve, and much smaller area inside the ROC polygon.
figure-neuroblastomaProcessed-combinations.R creates the following figure which shows the auc values for all of the 2^8 unique combinations of predicted values for 8 labeled profiles.
Each labeled profile has two minima: one in an infinite interval, and one in a finite interval. The panel titles show the difference d from the infinite interval limit to the predicted value, e.g. (-Inf, 1.2) with d=1 results in a predicted value of 0.2. The overall pattern is that d is relevant for AUC in the range 0.001 to 10, but it has no effect outside that range. Surprisingly, there are AUC values greater than one, which happens when there are cycles. One example is highlighted with a circle in the plot above, and the ROC curves are shown below.
https://github.com/tdhock/neuroblastoma-data/blob/master/figure-max-auc.R creates http://members.cbio.mines-paristech.fr/~thocking/figure-max-auc/