Persephone #148

Open
wants to merge 195 commits into
base: master
Commits
d4a1a65
move np models to np directory
jesus-villalba Mar 1, 2022
887bd3b
In egs, change hyperion -> hyperion.np where needed
jesus-villalba Mar 1, 2022
67a8543
merged with master
jesus-villalba Mar 3, 2022
569c9ba
added more docs and x_lengths support
jesus-villalba Mar 9, 2022
01ada9f
more docs and x_lengths support
jesus-villalba Mar 9, 2022
89c697c
more minor bugs fixed
jesus-villalba Mar 10, 2022
c82231f
***
jesus-villalba Mar 17, 2022
80cfc3c
find bug not initializing augmentation
jesus-villalba Mar 25, 2022
3a0eeff
voxceleb/v1.1 refactorized
jesus-villalba Apr 1, 2022
ed08db3
huffing face x-vectors working on multi-gpu
jesus-villalba Apr 29, 2022
e9cb8a3
added noam and triangular schedulers, added option to change dropout …
jesus-villalba May 9, 2022
5c5ef7c
changed way that training modes work in TorchModels
jesus-villalba May 16, 2022
d7599f9
fixed bugs introduced in TorchModel.train()
jesus-villalba May 16, 2022
8a7a6fd
added training models to xvector and wav2vec2xvector models
jesus-villalba May 20, 2022
90ef40a
wavlm phase 1
jesus-villalba Jun 2, 2022
25bbd0e
documented most np models
jesus-villalba Jun 2, 2022
0ecebc0
wavlm phase3
jesus-villalba Jun 9, 2022
81d4a7f
added default config to voxceleb/v2
jesus-villalba Jun 12, 2022
1d41410
added default config to voxceleb/v2
jesus-villalba Jun 14, 2022
cf433a7
added recipes with wavlmlarge
jesus-villalba Jul 4, 2022
4825d39
added configs for w2v2 xlsr
jesus-villalba Jul 8, 2022
9a3a168
added hard negative mining
jesus-villalba Jul 22, 2022
45b5dcd
refactorized segment samplers
jesus-villalba Jul 28, 2022
67c1bb8
new seq sampler works
jesus-villalba Aug 30, 2022
80a2498
modified sparse scores
jesus-villalba Sep 4, 2022
9f1f73b
software structure for transducer
Oct 3, 2022
b8ffac4
Librispeech recognition recipe
neillu23 Oct 3, 2022
09354a4
discard the change of train_wav2vec2xvector
neillu23 Oct 4, 2022
ac99960
improved utils
jesus-villalba Oct 5, 2022
42daf5c
update data preparation
neillu23 Oct 14, 2022
85a62dc
bpe data preparation and training
neillu23 Oct 16, 2022
7e61cb0
remove redundent file
neillu23 Oct 16, 2022
d8efa4f
added script for tsne plots of lre
jesus-villalba Oct 17, 2022
7a6dca1
add data preparation for recognition
neillu23 Oct 17, 2022
a100839
updated finetune xvector script
jesus-villalba Oct 18, 2022
5663416
added labels to gbe and lsvmc
jesus-villalba Oct 20, 2022
2121d87
fixed class weighted sampler
jesus-villalba Oct 22, 2022
115d00e
Add Mix Musan noise, and update the transducer data preparation
neillu23 Oct 24, 2022
dac0cc5
fix weighted sampler for unpresent classes, and changed bucket creation
jesus-villalba Oct 25, 2022
3c8a946
Merge branch 'persephone-refactor' of https://github.com/hyperion-ml/…
neillu23 Oct 25, 2022
1ff3720
update loading text and batch into the transducer
neillu23 Oct 26, 2022
a7f1e53
initial loading transducer model
neillu23 Oct 26, 2022
30420e8
Data preparation and implemented Wav2vec2Transducer Models
neillu23 Oct 31, 2022
bc85ec7
some bug fixes in subcenter loss and class-weighted sampler
jesus-villalba Nov 4, 2022
450f50d
Wav2vec2 Transducer update data augmentation
neillu23 Nov 5, 2022
5ac5496
gradient clipping and multi-gpu
neillu23 Nov 7, 2022
4be4d87
add resampling option to audio dataset
jesus-villalba Nov 9, 2022
83b7d9c
fix issue in multi-gpus training and update model parameters
neillu23 Nov 10, 2022
ff172b1
update save and load for transducer model
neillu23 Nov 13, 2022
d1a2419
fix nan in plot tsne
jesus-villalba Nov 16, 2022
4182374
fix train w2v
jesus-villalba Nov 21, 2022
2f6547a
Address comments for transducer
neillu23 Nov 22, 2022
b439137
Merge branch 'persephone-asr' into persephone-refactor
jesus-villalba Nov 22, 2022
37b4424
Merge pull request #124 from neillu23/persephone-refactor
jesus-villalba Nov 22, 2022
2cbefda
audio to duration python script
jesus-villalba Nov 30, 2022
898d4e1
upload missing file: download_lm.py
neillu23 Nov 30, 2022
49219e9
Merge branch 'persephone-refactor' of https://github.com/neillu23/hyp…
neillu23 Nov 30, 2022
75008f0
Merge pull request #125 from neillu23/persephone-refactor
jesus-villalba Nov 30, 2022
7f9ee74
added script to install k2
jesus-villalba Nov 30, 2022
fe43c49
Merge branch 'persephone-asr' of https://github.com/hyperion-ml/hyper…
jesus-villalba Nov 30, 2022
72b8b3a
fixed duration extraction in libri recipe
jesus-villalba Dec 5, 2022
9460cc3
fix dataprep
jesus-villalba Dec 5, 2022
118d1d4
fixed bucketing sampler
jesus-villalba Dec 7, 2022
49b829b
transducer training, working in clsp grid, cudnn=False
jesus-villalba Dec 8, 2022
829aa7d
w2v2 transducer with do
jesus-villalba Dec 13, 2022
43f6c4c
more transducer configs
jesus-villalba Dec 15, 2022
63a1216
Merge and add decode for ASR
neillu23 Dec 21, 2022
300d82b
recover mistakenly deleted/changed files
neillu23 Dec 21, 2022
8984b34
fix typo
neillu23 Dec 21, 2022
e14a840
Add fine-tune function for transducer ASR
neillu23 Dec 21, 2022
46760d7
Merge pull request #128 from neillu23/persephone-refactor
jesus-villalba Dec 21, 2022
90c97af
more transducer configs
jesus-villalba Dec 22, 2022
02bb457
fix beam search
jesus-villalba Dec 24, 2022
3bfb7f0
updated binaries to train w2v2 x-vectors
jesus-villalba Dec 26, 2022
2cf461d
Merge branch 'persephone-refactor' of https://github.com/hyperion-ml/…
jesus-villalba Dec 26, 2022
13dd879
update configs for w2v
jesus-villalba Dec 27, 2022
4ba13a7
isort data
jesus-villalba Dec 27, 2022
dd7b3ed
fixed filter_func_args
jesus-villalba Dec 27, 2022
b274895
trainers use dictionaries
jesus-villalba Dec 28, 2022
4f7bc3d
fixed amp
jesus-villalba Dec 28, 2022
5a138e8
update feature, embeddings datasets and samplers, added fwse and cfws…
jesus-villalba Dec 29, 2022
15db427
added scripts to extract xvectors from feats back to bin
jesus-villalba Dec 29, 2022
f9e4a24
isorted everything
jesus-villalba Dec 29, 2022
e26f5b9
new hyperparams for voxceleb
jesus-villalba Jan 26, 2023
5f0ac99
updated qmf script
jesus-villalba Jan 27, 2023
b364798
xxx
jesus-villalba Jan 27, 2023
06adb57
solved conflicts
jesus-villalba Jan 27, 2023
f15905b
new results voxceleb
jesus-villalba Jan 30, 2023
21a4764
uncoment script
jesus-villalba Jan 30, 2023
16b1316
fix bug
jesus-villalba Jan 30, 2023
76ac6f3
audio dataset with dictionary for asr
jesus-villalba Feb 9, 2023
aabbef1
start refactorizing rnn-t
jesus-villalba Feb 10, 2023
9a619a5
got results with ravana resnet100
jesus-villalba Feb 16, 2023
42f1ebd
xxx
jesus-villalba Feb 16, 2023
c2bd3db
ecapa v3 recipe
jesus-villalba Mar 2, 2023
1d78ea3
refactored beam search
jesus-villalba Mar 10, 2023
97050f9
working align length sycn dec
jesus-villalba Mar 10, 2023
db14742
finised refactoring rnn transducer
jesus-villalba Mar 22, 2023
16b1eae
Merge branch 'persephone-asr' into persephone-asr-refactor
jesus-villalba Mar 22, 2023
594e098
Merge pull request #131 from hyperion-ml/persephone-asr-refactor
jesus-villalba Mar 22, 2023
7115dbb
change configuration of residual networks with batchnorm after nonlin…
jesus-villalba Mar 24, 2023
6988485
add results with new ecapa v3
jesus-villalba Mar 27, 2023
b475d37
changed default config
jesus-villalba Mar 27, 2023
5370716
adapted adv.v1 recipe to persephone
jesus-villalba Apr 14, 2023
947047d
deleted adv.v1 and vae.v1 recipes
jesus-villalba Apr 14, 2023
083b210
started adapt adv.v2
jesus-villalba Apr 14, 2023
df273d2
fixed block and local attention with mask in conformers
jesus-villalba May 3, 2023
8f1e8ff
Merge branch 'persephone-asr' of https://github.com/hyperion-ml/hyper…
jesus-villalba May 3, 2023
0036b2b
Merge pull request #132 from hyperion-ml/persephone-asr-refactor
jesus-villalba May 3, 2023
8b81ffa
adv.v2 adapted to persephone branck
jesus-villalba May 3, 2023
92e577b
fixed conflicts
jesus-villalba May 3, 2023
e44eb75
isort everything again
jesus-villalba May 3, 2023
85282ac
Merge pull request #127 from hyperion-ml/persephone-asr
jesus-villalba May 3, 2023
24f7082
Merge pull request #133 from hyperion-ml/persephone-refactor
jesus-villalba May 3, 2023
35391de
new vox2 dataprep
jesus-villalba May 4, 2023
cf861bc
fix new vox2 dataprep durations, scp -> RecordingSet
jesus-villalba May 8, 2023
c408f74
some fixes in sre21
jesus-villalba May 11, 2023
27d579c
sre21 16k recipe finished
jesus-villalba May 20, 2023
2787891
sre21 8k adapted to persephone branck
jesus-villalba May 25, 2023
e26cb7f
Merge pull request #135 from hyperion-ml/persephone-refactor
jesus-villalba May 25, 2023
63a2bd9
added config 2.0 to vox v2
jesus-villalba Jun 30, 2023
c1d193a
finished experiments of models 2.0 in voxceleb/v2
jesus-villalba Jul 19, 2023
089061f
Merge pull request #136 from hyperion-ml/persephone-refactor
jesus-villalba Jul 19, 2023
89efce4
voxceleb v1.2 works up to snorm backend
jesus-villalba Sep 4, 2023
89c6e20
finished vox v1.2 except plda
jesus-villalba Sep 8, 2023
44f085a
introduce entry points
jesus-villalba Sep 10, 2023
6105476
make it work with cuda 11
jesus-villalba Sep 11, 2023
e4a5be1
Merge pull request #137 from hyperion-ml/persephone-entry
jesus-villalba Sep 11, 2023
392cd30
started vox/v2.1 recipe and fix some readmes
jesus-villalba Sep 11, 2023
ed35173
vox/v2.1 recipe done, not tested
jesus-villalba Sep 11, 2023
8760d05
implemented lora in w2v2, not tested
jesus-villalba Sep 12, 2023
a75610e
vox2.1 working and lora
jesus-villalba Sep 13, 2023
755af2c
Merge pull request #138 from hyperion-ml/persephone-refactor
jesus-villalba Sep 13, 2023
c23103e
lora in wavlm and hubert
jesus-villalba Sep 13, 2023
81c540b
fix bug in w2v constructors with lora
jesus-villalba Sep 15, 2023
cb9fa3c
lre22 fixed v1 done
Oct 30, 2023
af6e26e
new recipes voxceleb/v1.2 and v2.1
jesus-villalba Nov 1, 2023
773e30c
Merge pull request #139 from hyperion-ml/persephone-lre22
jesus-villalba Nov 1, 2023
c977186
added feature fuser to hf w2v xvector
jesus-villalba Nov 4, 2023
4c5c4fb
started lre22/open.v2.8k
Nov 6, 2023
f19cbfa
Merge branch 'persephone-refactor' of https://github.com/hyperion-ml/…
Nov 6, 2023
c2f0602
fix bug missing conformer xvector in __init__
jesus-villalba Nov 8, 2023
9fd2141
started to clean lre22/open.v2.8k
Nov 8, 2023
27db6a9
Merge branch 'persephone-refactor' of https://github.com/hyperion-ml/…
Nov 8, 2023
8446f78
fix param groups in hf w2vec2xvec
jesus-villalba Nov 14, 2023
398bf36
xxx
Nov 14, 2023
1d9710b
Merge branch 'persephone-refactor' of https://github.com/hyperion-ml/…
Nov 14, 2023
e49157d
clean up recipe lre22/open.v2.8k
Nov 30, 2023
d38f612
Merge pull request #140 from hyperion-ml/persephone-lre22
jesus-villalba Nov 30, 2023
ae47ce6
dino seems to be working
jesus-villalba Dec 9, 2023
4593c4b
dino seems to be working
jesus-villalba Dec 14, 2023
2492819
added clustering to dino
jesus-villalba Jan 10, 2024
524fe37
Merge branch 'persephone-refactor' of https://github.com/hyperion-ml/…
jesus-villalba Jan 10, 2024
68cc093
added ecapa and lresnet 3.1
jesus-villalba Jan 18, 2024
ecdc319
added some vox confs
Jan 19, 2024
c26d17d
pos enc in resnet xvector
jesus-villalba Jan 19, 2024
f928186
Merge branch 'persephone-refactor' of https://github.com/hyperion-ml/…
Jan 19, 2024
d99189e
saving progress in dino
jesus-villalba Mar 2, 2024
12a1bd9
added ft steps to voxceleb/ssl.v1
jesus-villalba Apr 23, 2024
ab8c285
added results to README in voxceleb/ssl.v1
jesus-villalba Apr 26, 2024
5960ec6
Merge pull request #141 from hyperion-ml/persephone-refactor
jesus-villalba Apr 26, 2024
dcabebe
added cosine loss to dino
jesus-villalba May 3, 2024
6833401
Merge pull request #142 from hyperion-ml/persephone-refactor
jesus-villalba May 3, 2024
2b707d0
xxx
May 3, 2024
0a1d2b2
xxx
May 3, 2024
63c5d0e
Merge pull request #143 from hyperion-ml/persephone-refactor
jesus-villalba May 3, 2024
5a6240e
asr conformer runs
jesus-villalba May 10, 2024
ad0561e
add max_batches arg to samplers
jesus-villalba May 13, 2024
91407ad
fix
jesus-villalba May 13, 2024
6a88ee5
fix
jesus-villalba May 14, 2024
f9f539f
fix
jesus-villalba May 14, 2024
6a733ee
fix
jesus-villalba May 14, 2024
cdb1ebb
fix
jesus-villalba May 14, 2024
49c50c9
fix
jesus-villalba May 14, 2024
e03dc8c
added first tokenizers
jesus-villalba May 15, 2024
5fb459b
rnnt decoder renamed
jesus-villalba May 15, 2024
d4aac1a
Merge pull request #144 from hyperion-ml/persephone-refactor
jesus-villalba May 15, 2024
dae8633
some fixes
jesus-villalba May 24, 2024
b62824a
some fixes
jesus-villalba May 24, 2024
c8c2d3b
Merge pull request #146 from hyperion-ml/persephone-refactor
jesus-villalba May 24, 2024
d7540bc
updated readme
jesus-villalba May 27, 2024
955fb02
fix kmeans call in mixture models
jesus-villalba Jun 10, 2024
501859a
Merge pull request #152 from hyperion-ml/persephone-refactor
jesus-villalba Jun 10, 2024
b8fe5cb
np.bool -> bool
jesus-villalba Jun 10, 2024
38cc512
Merge pull request #153 from hyperion-ml/persephone-refactor
jesus-villalba Jun 10, 2024
ad9de1c
debug gmm
jesus-villalba Jun 11, 2024
dd1fafa
Merge pull request #154 from hyperion-ml/persephone-refactor
jesus-villalba Jun 11, 2024
e27af1b
debug gmm
jesus-villalba Jun 11, 2024
05843b7
Merge pull request #155 from hyperion-ml/persephone-refactor
jesus-villalba Jun 11, 2024
44f7abb
debug gmm
jesus-villalba Jun 11, 2024
dc2cd50
Merge pull request #156 from hyperion-ml/persephone-refactor
jesus-villalba Jun 11, 2024
025a137
debug gmm
jesus-villalba Jun 11, 2024
40d0606
Merge pull request #157 from hyperion-ml/persephone-refactor
jesus-villalba Jun 11, 2024
9303c0d
remove nan assert from glob pool
jesus-villalba Jul 20, 2024
a8eec32
Merge pull request #160 from hyperion-ml/persephone-refactor
jesus-villalba Jul 20, 2024
36 changes: 16 additions & 20 deletions README.md
@@ -26,13 +26,24 @@ The full API is described in the documentation page [https://hyperion-ml.readthe
### Prerequisites

We use anaconda or miniconda, though you should be able to make it work in other python distributions
-To start, you should create a new enviroment and install PyTorch>=1.9, (older versions are not supported any longer) e.g.:
+To start, you should create a new enviroment and install PyTorch:
```
-conda create --name ${your_env} python=3.8
+conda create --name ${your_env} python=3.11
conda activate ${your_env}
-conda install pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=10.2 -c pytorch
+# We used PyTorch 2.0.1, other versions may work too
+conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.8 -c pytorch -c nvidia
+# If using k2 for ASR
+wget https://huggingface.co/csukuangfj/k2/resolve/main/ubuntu-cuda/k2-1.24.4.dev20240223+cuda11.8.torch2.0.1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
+pip install k2-1.24.4.dev20240223+cuda11.8.torch2.0.1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
```
-In next Hyperion versions, we will upgrade to Pytorch>=1.9 and drop compatibility with older PyTorch versions.
+
+For older systems with cuda 10.2 driver:
+```
+conda create --name ${your_env} python=3.10
+conda activate ${your_env}
+conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=10.2 -c pytorch
+```
+

### Installing Hyperion

@@ -41,27 +52,12 @@ In next Hyperion versions, we will upgrade to Pytorch>=1.9 and drop compatibilit
git clone https://github.com/hyperion-ml/hyperion.git
```

-- You can choose to install hyperion in the environment
+- Then install hyperion in the environment
```bash
cd hyperion
pip install -e .
```

-- Or add the hyperion toolkit to the PYTHONPATH envirnoment variable
-This option will allow you to share the same environment if you are working with several hyperion branches
-at the same time, while installing it requires to have an enviroment per branch.
-For this, you need to install the requirements
-```bash
-cd hyperion
-pip install -r requirements.txt
-```
-Then add these lines to your `~/.bashrc` or to each script that uses hyperion
-```bash
-HYP_ROOT= #substitute this by your hyperion location
-export PYTHONPATH=${HYP_ROOT}:$PYTHONPATH
-export PATH=${HYP_ROOT}/bin:$PATH
-```
-
## Recipes

There are recipes for several tasks in the `./egs` directory.
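
One way to confirm that an environment matches one of the two stacks pinned in the README hunk above is to compare the running Python and PyTorch versions against those pins. The following is a minimal sketch and not part of the PR: `TESTED_STACKS`, `parse_version`, and `matching_stack` are hypothetical names, and the labels `cuda 11.8` and `cuda 10.2` are only shorthand for the two install recipes.

```python
import re
import sys

# Version pins copied from the README hunk above; the dictionary keys are
# illustrative labels, not names used by Hyperion.
TESTED_STACKS = {
    "cuda 11.8": {"python": (3, 11), "torch": (2, 0, 1)},
    "cuda 10.2": {"python": (3, 10), "torch": (1, 12, 1)},
}


def parse_version(text):
    """Turn a string such as '2.0.1+cu118' into the tuple (2, 0, 1)."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    return tuple(int(group) for group in match.groups())


def matching_stack(python_version, torch_version):
    """Return the label of the README stack that matches, or None."""
    py = tuple(python_version[:2])
    torch_tuple = parse_version(torch_version)
    for label, pins in TESTED_STACKS.items():
        if py == pins["python"] and torch_tuple == pins["torch"]:
            return label
    return None


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed in this environment")
    else:
        label = matching_stack(sys.version_info, torch.__version__)
        print(f"torch {torch.__version__}: {label or 'untested combination'}")
```

Because the README states that other versions may work too, a `None` result here means "untested", not "unsupported".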
83 changes: 14 additions & 69 deletions apps.txt
@@ -1,69 +1,14 @@
-apply-mvn-select-frames.py
-compute-energy-vad.py
-compute-mfcc-feats.py
-copy-feats.py
-eval-cos-1vs1.py
-eval-linear-gbe-up.py
-eval-linear-gbe.py
-eval-linear-svmc.py
-eval-logistic-regression.py
-eval-plda-1vs1.py
-eval-plda-nvs1.py
-make-babble-noise-audio-files.py
-merge-h5-files.py
-pack-audio-files.py
-pack-wav-rirs.py
-plot-vector-hist.py
-plot-vector-tsne.py
-preprocess-audio-files.py
-rttm-to-bin-vad.py
-segments-to-bin-vad.py
-torch-adv-finetune-xvec-from-wav.py
-torch-adv-finetune-xvec.py
-torch-compute-mfcc-feats.py
-torch-eval-vae.py
-torch-eval-xvec-cosine-scoring-from-adv-test-wav-wavegan.py
-torch-eval-xvec-cosine-scoring-from-adv-test-wav.py
-torch-eval-xvec-cosine-scoring-from-art-test-wav.py
-torch-eval-xvec-cosine-scoring-from-test-wav.py
-torch-eval-xvec-cosine-scoring-from-transfer-adv-test-wav.py
-torch-eval-xvec-cosine-scoring-from-transfer-art-test-wav.py
-torch-eval-xvec-logits-from-wav.py
-torch-extract-xvectors-from-wav-with-rttm.py
-torch-extract-xvectors-from-wav.py
-torch-extract-xvectors-slidwin-from-wav.py
-torch-extract-xvectors-slidwin.py
-torch-extract-xvectors-vae-preproc.py
-torch-extract-xvectors.py
-torch-finetune-xvec-dfr-from-wav.py
-torch-finetune-xvec-dfr.py
-torch-finetune-xvec-from-wav.py
-torch-finetune-xvec.py
-torch-generate-adv-attacks-xvector-classif.py
-torch-generate-adv-attacks-xvector-verif.py
-torch-train-dvae.py
-torch-train-efficientnet-xvec-from-wav.py
-torch-train-efficientnet-xvec.py
-torch-train-resnet-xvec-from-wav.py
-torch-train-resnet-xvec.py
-torch-train-spinenet-xvec-from-wav.py
-torch-train-tdnn-xvec-from-wav.py
-torch-train-tdnn-xvec.py
-torch-train-transformer-xvec-v1-from-wav.py
-torch-train-transformer-xvec-v1.py
-torch-train-vae.py
-torch-train-vq-dvae.py
-torch-train-vq-vae.py
-torch-train-xvec-from-wav.py
-train-cw-up.py
-train-cw.py
-train-gaussianizer.py
-train-lda.py
-train-linear-gbe-up.py
-train-linear-gbe.py
-train-linear-svmc.py
-train-logistic-regression.py
-train-mvn.py
-train-nda.py
-train-pca.py
-train-plda.py
+compute_energy_vad.py
+extract_wav2vec2xvectors.py
+extract_xvectors_from_wav.py
+finetune_wav2vec2xvector.py
+finetune_xvector_dfr_from_feats.py
+finetune_xvector_dfr_from_wav.py
+finetune_xvector_from_feats.py
+finetune_xvector_from_wav.py
+make_babble_noise_audio_files.py
+pack_wav_rirs.py
+preprocess_audio_files.py
+train_wav2vec2xvector.py
+train_xvector_from_feats.py
+train_xvector_from_wav.py
2 changes: 1 addition & 1 deletion egs/chime5_spkdet/v1/local/score_dcf.py
@@ -15,7 +15,7 @@
from hyperion.hyp_defs import float_cpu, config_logger
from hyperion.utils.trial_scores import TrialScores
from hyperion.utils.trial_key import TrialKey
-from hyperion.metrics import fast_eval_dcf_eer as fast_eval
+from hyperion.np.metrics import fast_eval_dcf_eer as fast_eval


def score_dcf(key_file, score_file, output_path):
2 changes: 1 addition & 1 deletion egs/chime5_spkdet/v1/steps_be/eval-be-diar-v2.py
@@ -17,7 +17,7 @@
from hyperion.utils import TrialScores
from hyperion.helpers import MultiTestTrialDataReaderV2 as TDR
from hyperion.helpers import PLDAFactory as F
-from hyperion.transforms import TransformList
+from hyperion.np.transforms import TransformList


def combine_diar_scores(ndx, orig_seg, subseg_scores):
2 changes: 1 addition & 1 deletion egs/chime5_spkdet/v1/steps_be/eval-be-v1.py
@@ -19,7 +19,7 @@
from hyperion.utils.trial_scores import TrialScores
from hyperion.helpers import TrialDataReader as TDR
from hyperion.helpers import PLDAFactory as F
-from hyperion.transforms import TransformList
+from hyperion.np.transforms import TransformList


def eval_plda(
4 changes: 2 additions & 2 deletions egs/chime5_spkdet/v1/steps_be/eval-calibration-v1.py
@@ -18,8 +18,8 @@
from hyperion.utils.trial_scores import TrialScores
from hyperion.utils.trial_key import TrialKey
from hyperion.utils.trial_ndx import TrialNdx
-from hyperion.metrics import compute_act_dcf, compute_min_dcf
-from hyperion.classifiers import BinaryLogisticRegression as LR
+from hyperion.np.metrics import compute_act_dcf, compute_min_dcf
+from hyperion.np.classifiers import BinaryLogisticRegression as LR


def eval_calibration(in_score_file, ndx_file, model_file, out_score_file):
2 changes: 1 addition & 1 deletion egs/chime5_spkdet/v1/steps_be/train-be-v1.py
@@ -17,7 +17,7 @@
from hyperion.hyp_defs import float_cpu, config_logger
from hyperion.helpers import VectorClassReader as VCR
from hyperion.helpers import VectorReader as VR
-from hyperion.transforms import TransformList, LDA, LNorm
+from hyperion.np.transforms import TransformList, LDA, LNorm
from hyperion.helpers import PLDAFactory as F


4 changes: 2 additions & 2 deletions egs/chime5_spkdet/v1/steps_be/train-calibration-v1.py
@@ -17,8 +17,8 @@
from hyperion.hyp_defs import float_cpu, config_logger
from hyperion.utils.trial_scores import TrialScores
from hyperion.utils.trial_key import TrialKey
-from hyperion.metrics import compute_act_dcf, compute_min_dcf
-from hyperion.classifiers import BinaryLogisticRegression as LR
+from hyperion.np.metrics import compute_act_dcf, compute_min_dcf
+from hyperion.np.classifiers import BinaryLogisticRegression as LR


def train_calibration(score_file, key_file, model_file, prior, verbose):
12 changes: 6 additions & 6 deletions egs/dihard2019/v1/steps_diar/eval-ahc-v1.py
@@ -27,13 +27,13 @@
from hyperion.io import RandomAccessDataReaderFactory as DRF
from hyperion.io import VADReaderFactory as VRF
from hyperion.helpers import PLDAFactory as F
-from hyperion.transforms import TransformList, PCA, LNorm
-from hyperion.clustering import AHC
-from hyperion.pdfs import GMMTiedDiagCov as GMM
-from hyperion.diarization import DiarAHCPLDA as Diar
+from hyperion.np.transforms import TransformList, PCA, LNorm
+from hyperion.np.clustering import AHC
+from hyperion.np.pdfs import GMMTiedDiagCov as GMM
+from hyperion.np.diarization import DiarAHCPLDA as Diar

-# from hyperion.pdfs import GMMDiagCov as GMM2
-# from hyperion.pdfs import GMM as GMM3
+# from hyperion.np.pdfs import GMMDiagCov as GMM2
+# from hyperion.np.pdfs import GMM as GMM3


def make_timestamps(n, win_start, win_length, win_shift, win_shrink):
2 changes: 1 addition & 1 deletion egs/dihard2019/v1/steps_diar/train-plda-v1.py
@@ -22,7 +22,7 @@
from hyperion.utils import Utt2Info

# from hyperion.helpers import VectorClassReader as VCR
-from hyperion.transforms import TransformList, LDA, LNorm, PCA
+from hyperion.np.transforms import TransformList, LDA, LNorm, PCA
from hyperion.helpers import PLDAFactory as F
from hyperion.io import RandomAccessDataReaderFactory as DRF
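
The import changes in the `egs/chime5_spkdet` and `egs/dihard2019` hunks above all follow one pattern: six subpackages gain an `np` component. For out-of-tree scripts that still use the old paths, a rewrite along the following lines may help. It is a sketch, not a tool shipped with this PR: `MOVED` lists only the subpackages visible in these hunks (the full set moved by the PR may be larger), and `migrate_imports` is a hypothetical helper.

```python
import re

# Subpackages that the hunks above move under hyperion.np. Other subpackages
# seen in the same files (utils, helpers, io, hyp_defs) keep their old paths.
MOVED = ("metrics", "transforms", "classifiers", "clustering", "pdfs", "diarization")

# Match "from hyperion.<moved>" or "import hyperion.<moved>", keeping the
# keyword and its whitespace so they can be copied into the replacement.
_PATTERN = re.compile(r"\b(from|import)(\s+)hyperion\.(" + "|".join(MOVED) + r")\b")


def migrate_imports(source):
    """Rewrite old-style hyperion imports in a block of source text.

    Already migrated lines are left alone, because "hyperion.np.metrics"
    does not match "hyperion.<moved subpackage>" directly.
    """
    return _PATTERN.sub(r"\1\2hyperion.np.\3", source)


if __name__ == "__main__":
    old = "from hyperion.metrics import compute_act_dcf, compute_min_dcf"
    print(migrate_imports(old))
```

Since the pattern is purely textual, it also rewrites commented-out imports, which matches what the `eval-ahc-v1.py` hunk does by hand.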

28 changes: 28 additions & 0 deletions egs/librispeech/v0/cmd.sh
@@ -0,0 +1,28 @@
+# you can change cmd.sh depending on what type of queue you are using.
+# If you have no queueing system and want to run on a local machine, you
+# can change all instances 'queue.pl' to run.pl (but be careful and run
+# commands one by one: most recipes will exhaust the memory on your
+# machine). queue.pl works with GridEngine (qsub). slurm.pl works
+# with slurm. Different queues are configured differently, with different
+# queue names and different ways of specifying things like memory;
+# to account for these differences you can create and edit the file
+# conf/queue.conf to match your queue's configuration. Search for
+# conf/queue.conf in http://kaldi-asr.org/doc/queue.html for more information,
+# or search for the string 'default_config' in utils/queue.pl or utils/slurm.pl.
+
+if [ "$(hostname -d)" == "cm.gemini" ];then
+    export train_cmd="queue.pl --config conf/coe_gpu_long.conf --mem 4G"
+    export cuda_cmd="queue.pl --config conf/coe_gpu_long.conf --mem 20G"
+    #export cuda_cmd="queue.pl --config conf/coe_gpu_v100.conf --mem 20G"
+    export cuda_cmd="queue.pl --config conf/coe_gpu_rtx.conf --mem 40G"
+    export cuda_eval_cmd="queue.pl --config conf/coe_gpu_short.conf --mem 4G"
+    #export cuda_eval_cmd="queue.pl --config conf/coe_gpu_rtx.conf --mem 10G"
+    #export cuda_eval_cmd="queue.pl --config conf/coe_gpu_long.conf --mem 4G"
+else
+    export train_cmd="queue.pl --config conf/clsp.conf --mem 4G "
+    export cuda_cmd="queue.pl --config conf/clsp.conf --mem 20G"
+    export cuda_eval_cmd="$train_cmd"
+fi



11 changes: 11 additions & 0 deletions egs/librispeech/v0/conf/clsp.conf
@@ -0,0 +1,11 @@

+# Default configuration
+command qsub -v PATH -cwd -S /bin/bash -j y -l arch=*64* -V
+option mem=* -l mem_free=$0,ram_free=$0
+option mem=0 # Do not add anything to qsub_opts
+option num_threads=* -pe smp $0
+option num_threads=1 # Do not add anything to qsub_opts
+option max_jobs_run=* -tc $0
+default gpu=0
+option gpu=0 -l 'hostname=b[1]*|c0[123456789]*|c1[1345679]*|c2[12357]*'
+option gpu=* -l 'hostname=c0[123456789]*|c1[1345679]*|c2[12357]*,gpu=$0'
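
The `option` lines in `conf/clsp.conf` map `queue.pl` switches such as `--mem 20G` or `--gpu 1` to `qsub` arguments. The sketch below illustrates one simplified reading of those rules: a `name=*` line is a template in which `$0` is replaced by the value, a `name=value` line applies only to that exact value and takes precedence, and `default` supplies a value when the switch is absent. It is not the `queue.pl` implementation; `parse_conf` and `build_qsub` are hypothetical names, and argument ordering and error handling in the real script may differ.

```python
import re

# Contents of conf/clsp.conf as added in this PR.
CLSP_CONF = r"""
# Default configuration
command qsub -v PATH -cwd -S /bin/bash -j y -l arch=*64* -V
option mem=* -l mem_free=$0,ram_free=$0
option mem=0 # Do not add anything to qsub_opts
option num_threads=* -pe smp $0
option num_threads=1 # Do not add anything to qsub_opts
option max_jobs_run=* -tc $0
default gpu=0
option gpu=0 -l 'hostname=b[1]*|c0[123456789]*|c1[1345679]*|c2[12357]*'
option gpu=* -l 'hostname=c0[123456789]*|c1[1345679]*|c2[12357]*,gpu=$0'
"""


def parse_conf(text):
    """Split the config into base command, defaults, exact and wildcard rules."""
    command, defaults, exact, wildcard = "", {}, {}, {}
    for raw in text.splitlines():
        line = re.sub(r"\s*#.*", "", raw).strip()  # drop comments
        if not line:
            continue
        kind, _, rest = line.partition(" ")
        if kind == "command":
            command = rest
        elif kind == "default":
            name, _, value = rest.partition("=")
            defaults[name] = value
        elif kind == "option":
            key, _, args = rest.partition(" ")
            name, _, value = key.partition("=")
            if value == "*":
                wildcard[name] = args
            else:
                exact[(name, value)] = args
    return command, defaults, exact, wildcard


def build_qsub(text, **cli):
    """Return the qsub command line for the given switches (simplified)."""
    command, defaults, exact, wildcard = parse_conf(text)
    options = {**defaults, **{name: str(value) for name, value in cli.items()}}
    parts = [command]
    for name, value in options.items():
        if (name, value) in exact:          # exact value wins over the template
            args = exact[(name, value)]
        elif name in wildcard:              # substitute $0 in the template
            args = wildcard[name].replace("$0", value)
        else:                               # this sketch rejects unknown switches
            raise ValueError(f"no rule for option {name}={value}")
        if args:
            parts.append(args)
    return " ".join(parts)


if __name__ == "__main__":
    print(build_qsub(CLSP_CONF, mem="20G", gpu=1))
```

Under this reading, `--mem 4G` with no `--gpu` falls back to `gpu=0` and selects the host list that includes the `b[1]*` nodes, while `--gpu 1` restricts the job to the `c*` hosts and requests one GPU.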
4 changes: 4 additions & 0 deletions egs/librispeech/v0/conf/infer.yaml
@@ -0,0 +1,4 @@
+beam_width: 5
+decoding_method: time_sync_beam_search
+#decoding_method: greedy
+#decoding_method: align_length_sync_beam_search
34 changes: 34 additions & 0 deletions egs/librispeech/v0/conf/reverb_noise20dB_aug.yaml
@@ -0,0 +1,34 @@
+reverb_aug:
+  reverb_prob: 0.45
+  max_reverb_context: 0.5
+  rir_types:
+    smallroom:
+      weight: 1
+      rir_path: scp:data/rirs_smallroom/rirs.scp
+      rir_norm: max
+    mediumroom:
+      weight: 1
+      rir_path: scp:data/rirs_mediumroom/rirs.scp
+      rir_norm: max
+    realroom:
+      weight: 1
+      rir_path: scp:data/rirs_real/rirs.scp
+      rir_norm: max
+noise_aug:
+  noise_prob: 0.7
+  noise_types:
+    noise:
+      weight: 1
+      noise_path: data/musan_noise_proc_audio/wav.scp
+      min_snr: 10
+      max_snr: 20
+    music:
+      weight: 1
+      noise_path: data/musan_music_proc_audio/wav.scp
+      min_snr: 10
+      max_snr: 20
+    babble:
+      weight: 1
+      noise_path: data/musan_speech_babble/wav.scp
+      min_snr: 10
+      max_snr: 20
35 changes: 35 additions & 0 deletions egs/librispeech/v0/conf/reverb_noise_aug.yaml
@@ -0,0 +1,35 @@
+reverb_aug:
+  reverb_prob: 0.45
+  max_reverb_context: 0.5
+  rir_types:
+    smallroom:
+      weight: 1
+      rir_path: scp:data/rirs_smallroom/rirs.scp
+      rir_norm: max
+    mediumroom:
+      weight: 1
+      rir_path: scp:data/rirs_mediumroom/rirs.scp
+      rir_norm: max
+    realroom:
+      weight: 1
+      rir_path: scp:data/rirs_real/rirs.scp
+      rir_norm: max
+noise_aug:
+  noise_prob: 0.7
+  noise_types:
+    noise:
+      weight: 1
+      noise_path: data/musan_noise_proc_audio/wav.scp
+      min_snr: 0
+      max_snr: 18
+    music:
+      weight: 1
+      noise_path: data/musan_music_proc_audio/wav.scp
+      min_snr: 3
+      max_snr: 18
+    babble:
+      weight: 1
+      noise_path: data/musan_speech_babble/wav.scp
+      min_snr: 3
+      max_snr: 18
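
To make the `noise_aug` fields in `reverb_noise_aug.yaml` concrete, the sketch below shows one plausible interpretation: noise is added with probability `noise_prob`, the noise type is drawn in proportion to `weight`, and the SNR is drawn uniformly between `min_snr` and `max_snr`. These semantics are an assumption inferred from the field names and are not confirmed by this diff. `sample_noise` is a hypothetical helper, not Hyperion's augmentation code, and the `noise_path` entries are omitted because the sketch never reads audio.

```python
import random

# Values copied from the noise_aug section of reverb_noise_aug.yaml above.
NOISE_AUG = {
    "noise_prob": 0.7,
    "noise_types": {
        "noise": {"weight": 1, "min_snr": 0, "max_snr": 18},
        "music": {"weight": 1, "min_snr": 3, "max_snr": 18},
        "babble": {"weight": 1, "min_snr": 3, "max_snr": 18},
    },
}


def sample_noise(cfg, rng):
    """Return (noise_type, snr_db) for one utterance, or None for no noise.

    Assumed semantics: Bernoulli(noise_prob) gate, weight-proportional choice
    of type, uniform SNR within the type's [min_snr, max_snr] range.
    """
    if rng.random() >= cfg["noise_prob"]:
        return None
    names = list(cfg["noise_types"])
    weights = [cfg["noise_types"][name]["weight"] for name in names]
    name = rng.choices(names, weights=weights, k=1)[0]
    spec = cfg["noise_types"][name]
    snr_db = rng.uniform(spec["min_snr"], spec["max_snr"])
    return name, snr_db


if __name__ == "__main__":
    rng = random.Random(1234)
    for _ in range(5):
        print(sample_noise(NOISE_AUG, rng))
```

Read this way, the `reverb_noise20dB_aug.yaml` variant differs only in its SNR ranges (10 to 20 dB for every type), so it would apply noise equally often but never more loudly than 10 dB below the speech.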
