This repo consists of collection of papers and repos on the topic of deep learning by noisy labels. All methods listed below are briefly explained in the paper Image Classification with Deep Learning in the Presence of Noisy Labels: A Survey. More information about the topic can also be found on the survey.
In order to test label-noise-robust algorithms with benchmark datasets (mnist,mnist-fashion,cifar10,cifar100) synthetic noise generation is a necessary step. Following work provides a feature-dependent synthetic noise generation algorithm and pre-generated synthetic noisy labels for mentioned datasets.
List of papers that shed light to label noise phenomenon for deep learning:
List of works under label noise beside classification
Sources on web
Clothing1M is a real-world noisy labeled dataset which is widely used for benchmarking. Below is the test accuracies on this dataset. Note that,clothing1M contains spare 50k clean training data, but most of the methods dont use this data for fair comparison. Therefore, here I only listed methods that do not use extra 50k samples. '?' indicates that given work does not mentipon whether they used 50k clean samples or not.
Abbreviations for noise types are:
- NC -> Noisy Channel
- LNC -> Label Noise Cleansing
- DP -> Dataset Pruning
- SC -> Sample Choosing
- SIW -> Sample Importance Weighting
- LQA -> Labeler quality assesment
- RL -> Robust Losses
- ML -> Meta Learning
- MIL -> Multiple Instance Learning
- SSL -> Semi Supervised Learning
- R -> Regularizers
- EM -> Ensemble Methods
- O -> Others
Other abbreviations:
- NC -> Neurocomputing
- Tf -> Tensorflow
- Pt -> PyTorch
Starred (*) repos means code is unoffical!