Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
capsnet		capsnet
checkpoints		checkpoints
data		data
notebooks		notebooks
resnet		resnet
snn		snn
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
mypy.ini		mypy.ini
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Repository files navigation

Few-shot learning experiments

Dumping ground for miscellaneous ML experiments with focus on FSL.

Using conda to manage dependencies. Detailed list of dependencies in environment.yml and requirements.txt.

Divided in modules by method, which are further divided into submodules by dataset.

snn/omniglot/: Convolutional SNN for one-shot learning on Omniglot dataset^[1].
- Heavily based on reimplementations of the paper at https://github.com/kevinzakka/one-shot-siamese and https://github.com/fangpin/siamese-pytorch.
- Using the learning rate finder from PyTorch Lightning.
- AdamW optimizer^[2], with 1cycle learning rate policy^{[3, 4]}.
snn/librispeech/: Siamese capsule network using Thin-ResNet34 for one-shot learning on LibriSpeech dataset.
- Experimenting based on ideas from paper by Hajavi et al. ^[5].
- Thin-ResNet34 implementation copied from https://github.com/clovaai/voxceleb_trainer.
- CapsNet implementation copied from https://github.com/adambielski/CapsNet-pytorch.
- Using the learning rate finder from PyTorch Lightning.
- Optional spectogram frequency and time masking as per SpecAugment^[6].
- AdamW optimizer^[2], with 1cycle learning rate policy^{[3, 4]}.

python -m <model>.<dataset>.train --help

Example: train model snn/omniglot/ using 1 GPU:

python -O -m snn.omniglot.train --gpus 1 --num_workers 4 --batch_size 128 --max_epochs 50

Koch, Gregory, Richard Zemel, and Ruslan Salakhutdinov. "Siamese neural networks for one-shot image recognition." In ICML deep learning workshop, vol. 2. 2015.
Loshchilov, Ilya, and Frank Hutter. "Decoupled weight decay regularization." arXiv preprint arXiv:1711.05101 (2017). https://arxiv.org/abs/1711.05101.
Smith, Leslie N., and Nicholay Topin. "Super-convergence: Very fast training of neural networks using large learning rates." In Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications. Vol. 11006. International Society for Optics and Photonics, 2019. https://arxiv.org/abs/1708.07120.
https://sgugger.github.io/the-1cycle-policy.html
Hajavi, Amirhossein, and Ali Etemad. "Siamese Capsule Network for End-to-End Speaker Recognition In The Wild." arXiv preprint arXiv:2009.13480 (2020). https://arxiv.org/abs/2009.13480.
Park, Daniel S., Yu Zhang, Chung-Cheng Chiu, Youzheng Chen, Bo Li, William Chan, Quoc V. Le, and Yonghui Wu. "Specaugment on large scale datasets." In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6879-6883. IEEE, 2020. https://arxiv.org/abs/1904.08779.