Skip to content

Automatic speech transcription and speaker identification pipeline based on Kaldi and Nextflow

Notifications You must be signed in to change notification settings

taltechnlp/est-asr-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

kaldi-offline-transcriber-nextflow

Nextflow based speech processing

Usage

First build the container:

docker build . -t nextflow:latest

Start a container (name is "nextflow") and put it into background (-d). Also, mount a local directory ~/tmp/speechfiles as the container directory /opt/speechfiles.

mkdir -p ~/tmp/speechfiles
docker run --name nextflow -v ~/tmp/speechfiles:/opt/speechfiles --rm -d -t nextflow:latest

Install Nextflow locally:

wget -qO- https://get.nextflow.io | bash

To transcribe:

nextflow run transcribe.nf -with-docker nextflow -with-report report.html -with-trace -with-dag flowchart.png -with-weblog 'http://localhost:7700/process/'

nextflow run transcribe.nf -with-docker nextflow -with-weblog 'http://localhost:7700/process/'

About

Automatic speech transcription and speaker identification pipeline based on Kaldi and Nextflow

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published