Title | Authors | In One Sentence | Summary | Date | Link | Conference |
---|---|---|---|---|---|---|
Fast AutoAugment | Sungbin Lim et al. | Density matching can speed up AutoAugment search time while achieving comparable performance | Summary | 25/05/2019 | Paper | NeurIPS 2019 |
- AutoAugment's policy search takes thousands of GPU hours even on small datasets
- Fast AutoAugment uses density matching to cut this search cost by orders of magnitude
- Fast AutoAugment requires no repeated training of child models; it searches for augmentation policies that maximize the match between the distribution of an augmented data split and that of another, unaugmented split (formalized in the objective just after this list)
- Previous DA approaches: 1) hand-designed transformations, 2) generative models, 3) searches for optimal combinations of predefined transformations
- The search space consists of operations O, each applied with probability p and magnitude \lambda
- S is the set of sub-policies; a sub-policy \tau \in S consists of N_\tau consecutive operations, each with its own probability p (a minimal code sketch follows the list)
- For exploration, split the train dataset into K folds; each fold k consists of two subsets D_M and D_A
- Train the model parameters \theta on each D_M in parallel. After training \theta, the algorithm evaluates B bundles of augmentation policies on D_A without retraining \theta. The top-N policies from each of the K folds are appended to the augmentation list T\ast (see the search-loop sketch below)
- We want data augmentation to bridge the train/validation gap; intuitively, the best augmentation is therefore the one that matches the density of D_train with the density of D_valid
- How to select the top policies? Compare the categorical cross-entropy loss of the frozen model on the augmented validation images, and use that loss to drive a Bayesian optimizer that samples promising augmentation policies (sketched at the end of this list)
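As a rough formalization of the density-matching objective (a sketch reconstructed from the bullets above, with notation adapted: R(\theta | D) denotes the performance of model \theta on dataset D, and T(D_A) the split D_A augmented by policy T):

```latex
% Density matching as a performance proxy: choose the policy T whose
% augmented split T(D_A) is scored best by a model theta* trained only
% on the unaugmented split D_M.
\mathcal{T}_{*} = \operatorname*{arg\,max}_{\mathcal{T}}
  R\!\left(\theta^{*} \mid \mathcal{T}(D_{A})\right),
\qquad \text{where} \qquad
\theta^{*} = \operatorname*{arg\,max}_{\theta} R\!\left(\theta \mid D_{M}\right)
```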
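To make the search space concrete, here is a minimal Python sketch; the `Operation` and `SubPolicy` names, the normalized magnitude range, and the stand-in transform are illustrations, not the paper's code:

```python
import random
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Operation:
    """One augmentation operation O, fired with probability p at magnitude lam."""
    transform: Callable  # image transform of the form transform(img, lam)
    p: float             # application probability
    lam: float           # magnitude \lambda, normalized here to [0, 1]

    def __call__(self, img):
        # With probability 1 - p the image passes through unchanged.
        if random.random() < self.p:
            return self.transform(img, self.lam)
        return img

@dataclass
class SubPolicy:
    """A sub-policy tau in S: N_tau consecutive stochastic operations."""
    ops: List[Operation]

    def __call__(self, img):
        for op in self.ops:  # apply the operations in sequence
            img = op(img)
        return img

# Usage: a two-operation sub-policy with a stand-in transform.
identity = lambda img, lam: img
tau = SubPolicy([Operation(identity, p=0.8, lam=0.3),
                 Operation(identity, p=0.5, lam=0.7)])
```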
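The per-fold exploration might look like the following sketch. `train_model`, `eval_loss`, and `sample_policy` are assumed helper callables (not the paper's API), and random candidate sampling stands in here for the Bayesian-optimization exploration sketched last:

```python
from sklearn.model_selection import KFold

def fast_autoaugment_search(dataset, train_model, eval_loss, sample_policy,
                            K=5, B=200, N=10):
    """Fold-based policy search. Assumed helpers:
    - train_model(split) -> trained parameters theta
    - eval_loss(theta, split, policy) -> loss of theta on the policy-augmented split
    - sample_policy() -> one candidate augmentation policy
    """
    final_policies = []  # the augmentation list T*
    for train_idx, held_idx in KFold(n_splits=K, shuffle=True).split(dataset):
        d_m = [dataset[i] for i in train_idx]  # D_M: used to train theta
        d_a = [dataset[i] for i in held_idx]   # D_A: evaluation only, never trained on
        theta = train_model(d_m)               # theta is trained once per fold
        # Score B candidate policies on the augmented D_A; theta stays frozen.
        candidates = [sample_policy() for _ in range(B)]
        candidates.sort(key=lambda t: eval_loss(theta, d_a, t))
        final_policies.extend(candidates[:N])  # keep the top-N per fold
    return final_policies
```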
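For the Bayesian search itself, one off-the-shelf realization is hyperopt's Tree-structured Parzen Estimator; the operation names in `space` and the toy objective are placeholders, where the real objective would be the frozen model's cross-entropy on the augmented validation split:

```python
from hyperopt import Trials, fmin, hp, tpe

# Hypothetical search space for a single operation: which transform,
# its probability p, and its magnitude lambda, tuned jointly by TPE.
space = {
    "op": hp.choice("op", ["rotate", "shear_x", "color", "solarize"]),
    "p": hp.uniform("p", 0.0, 1.0),
    "lam": hp.uniform("lam", 0.0, 1.0),
}

def objective(policy_params):
    # Placeholder for: categorical cross-entropy of the frozen model theta
    # on D_A augmented by this candidate policy (lower = better match).
    # A dummy quadratic keeps the sketch runnable end to end.
    return (policy_params["p"] - 0.5) ** 2 + policy_params["lam"] ** 2

best = fmin(fn=objective, space=space, algo=tpe.suggest,
            max_evals=100, trials=Trials())
print(best)  # TPE's best (op index, p, lam) found for the toy objective
```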