Skip to content

Latest commit

 

History

History

audio_augmentation

Getting started

Here is a way to quickly augment a folder of audio files:

cd ~ 
cd allie/features/audio_augmentation
python3 augment.py /Users/jimschwoebel/allie/load_dir

Implemented

  • augment_tsaug - adds noise and various shifts to audio files, addes 2x more data; see tutorial here.
  • augment_addnoise - adds noise to an audio file.
  • augment_noise - removes noise from audio files randomly.
  • augment_pitch - shifts pitch up and down to correct for gender differences.
  • augment_randomsplice - randomly splice an audio file to generate more data.
  • augment_silence - add silence to an audio file to augment a dataset.
  • augment_time - change time duration for a variety of audio files through making new files.
  • augment_volume - change volume randomly (helps to mitigate effects of microphohne distance on a model).
setting description default setting all options
augment_data whether or not to implement data augmentation policies during the model training process via default augmentation scripts. True True, False
default_audio_augmenters the default augmentation strategies used during audio modeling if augment_data == True ["augment_tsaug"] ["augment_tsaug", "augment_addnoise", "augment_noise", "augment_pitch", "augment_randomsplice", "augment_silence", "augment_time", "augment_volume"]