allie/augmentation/audio_augmentation at master · jim-schwoebel/allie

readme.md

Here is a way to quickly augment a folder of audio files:

cd ~ 
cd allie/features/audio_augmentation
python3 augment.py /Users/jimschwoebel/allie/load_dir

augment_tsaug - adds noise and various shifts to audio files, addes 2x more data; see tutorial here.
augment_addnoise - adds noise to an audio file.
augment_noise - removes noise from audio files randomly.
augment_pitch - shifts pitch up and down to correct for gender differences.
augment_randomsplice - randomly splice an audio file to generate more data.
augment_silence - add silence to an audio file to augment a dataset.
augment_time - change time duration for a variety of audio files through making new files.
augment_volume - change volume randomly (helps to mitigate effects of microphohne distance on a model).

setting	description	default setting	all options
augment_data	whether or not to implement data augmentation policies during the model training process via default augmentation scripts.	True	True, False
default_audio_augmenters	the default augmentation strategies used during audio modeling if augment_data == True	["augment_tsaug"]	["augment_tsaug", "augment_addnoise", "augment_noise", "augment_pitch", "augment_randomsplice", "augment_silence", "augment_time", "augment_volume"]