You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Audio augmentations library for PyTorch for audio in the time-domain, with support for stochastic data augmentations as used often in self-supervised / contrastive learning. One can apply a single stochastic augmentation or create as many stochastically augmented examples from a single interface.
7
+
Audio data augmentations library for PyTorch for audio in the time-domain. The focus of this repository is to:
8
+
- Provide many audio transformations in an easy Python interface.
9
+
- Have a high test coverage.
10
+
- Easily control stochastic (sequential) audio transformations.
11
+
- Make every audio transformation differentiable with PyTorch's `nn.Module`.
12
+
- Optimise audio transformations for CPU and GPU.
7
13
8
-
This package follows the conventions set out by `torchaudio`, with audio defined as a vector of `[channel, time]`. Each individual augmentation can be initialized on its own, or be wrapped around a `RandomApply` interface which will apply the augmentation with probability `p`.
14
+
It supports stochastic transformations as used often in self-supervised, semi-supervised learning methods. One can apply a single stochastic augmentation or create as many stochastically transformed audio examples from a single interface.
15
+
16
+
This package follows the conventions set out by `torchvision` and `torchaudio`, with audio defined as a tensor of `[channel, time]`, or a batched representation `[batch, channel, time]`. Each individual augmentation can be initialized on its own, or be wrapped around a `RandomApply` interface which will apply the augmentation with probability `p`.
0 commit comments