Skip to content

Commit 109aa75

Browse files
authored
Updated README for codecov
1 parent 3817fd8 commit 109aa75

File tree

1 file changed

+12
-4
lines changed

1 file changed

+12
-4
lines changed

README.md

+12-4
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,19 @@
11
# PyTorch Audio Augmentations
22
![CI status](https://github.com/spijkervet/torchaudio-augmentations/actions/workflows/ci.yml/badge.svg)
3+
[![codecov](https://codecov.io/gh/Spijkervet/torchaudio-augmentations/branch/master/graph/badge.svg?token=0DEFJYJH5K)](https://codecov.io/gh/Spijkervet/torchaudio-augmentations)
34
[![Downloads](https://pepy.tech/badge/torchaudio-augmentations)](https://pepy.tech/project/torchaudio-augmentations)
45
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.4748582.svg)](https://zenodo.org/record/4748582#)
56

6-
Audio augmentations library for PyTorch for audio in the time-domain, with support for stochastic data augmentations as used often in self-supervised / contrastive learning. One can apply a single stochastic augmentation or create as many stochastically augmented examples from a single interface.
7+
Audio data augmentations library for PyTorch for audio in the time-domain. The focus of this repository is to:
8+
- Provide many audio transformations in an easy Python interface.
9+
- Have a high test coverage.
10+
- Easily control stochastic (sequential) audio transformations.
11+
- Make every audio transformation differentiable with PyTorch's `nn.Module`.
12+
- Optimise audio transformations for CPU and GPU.
713

8-
This package follows the conventions set out by `torchaudio`, with audio defined as a vector of `[channel, time]`. Each individual augmentation can be initialized on its own, or be wrapped around a `RandomApply` interface which will apply the augmentation with probability `p`.
14+
It supports stochastic transformations as used often in self-supervised, semi-supervised learning methods. One can apply a single stochastic augmentation or create as many stochastically transformed audio examples from a single interface.
15+
16+
This package follows the conventions set out by `torchvision` and `torchaudio`, with audio defined as a tensor of `[channel, time]`, or a batched representation `[batch, channel, time]`. Each individual augmentation can be initialized on its own, or be wrapped around a `RandomApply` interface which will apply the augmentation with probability `p`.
917

1018

1119
## Usage
@@ -19,7 +27,7 @@ num_samples = sr * 5
1927
transforms = [
2028
RandomResizedCrop(n_samples=num_samples),
2129
RandomApply([PolarityInversion()], p=0.8),
22-
RandomApply([Noise(min_snr=0.3, max_snr=0.5)], p=0.3),
30+
RandomApply([Noise(min_snr=0.001, max_snr=0.005)], p=0.3),
2331
RandomApply([Gain()], p=0.2),
2432
HighLowPass(sample_rate=sr), # this augmentation will always be applied in this aumgentation chain!
2533
RandomApply([Delay(sample_rate=sr)], p=0.5),
@@ -35,7 +43,7 @@ We can also define a stochastic augmentation on multiple transformations. The fo
3543
```python
3644
transforms = [
3745
RandomResizedCrop(n_samples=num_samples),
38-
RandomApply([PolarityInversion(), Noise(min_snr=0.3, max_snr=0.5)], p=0.8),
46+
RandomApply([PolarityInversion(), Noise(min_snr=0.001, max_snr=0.005)], p=0.8),
3947
RandomApply([Gain()], p=0.2),
4048
RandomApply([Delay(sample_rate=sr), Reverb(sample_rate=sr)], p=0.5)
4149
]

0 commit comments

Comments
 (0)