Skip to content

Automatic speech transcription and speaker identification pipeline based on Kaldi and Nextflow

Notifications You must be signed in to change notification settings

taltechnlp/est-asr-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

kaldi-offline-transcriber-nextflow

Nextflow based speech processing

Usage

First build the container: docker build . -t nextflow:latest

Start a container (name is "nextflow") and put it into background (-d). Also, mount a local directory ~/tmp/speechfiles as the container directory /opt/speechfiles.

mkdir -p ~/tmp/speechfiles
docker run --name nextflow -v ~/tmp/speechfiles:/opt/speechfiles --rm -d -t nextflow:latest

To transcribe: docker exec -it nextflow /opt/kaldi-offline-transcriber/nextflow run /opt/kaldi-offline-transcriber/transcribe.nf

About

Automatic speech transcription and speaker identification pipeline based on Kaldi and Nextflow

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published