[src] CudaDecoder endpointing #4146

hugovbraun · 2020-06-30T21:22:55Z

Built on top of #4101. Probably better to look at the diff once #4101 has been merged.

Implements endpointing directly into the cuda decoder.

Uses the rules as defined in online2/online-endpoint.h. From a user point of view, setting the high level parameters on endpointing and passing a vector in the DecodeBatch should be enough:

  void DecodeBatch(const std::vector<CorrelationID> &corr_ids,
                   const std::vector<SubVector<BaseFloat>> &wave_samples,
                   const std::vector<bool> &is_first_chunk,
                   const std::vector<bool> &is_last_chunk,
                   std::vector<std::string *> *partial_hypotheses = NULL,
                   std::vector<bool> *end_points = NULL)

Parameters (from online2/online-endpoint.h):

--endpoint.rule1.max-relative-cost : This endpointing rule requires relative-cost of final-states to be <= this value (describes how good the probability of final-states is). (float, default = inf)
--endpoint.rule1.min-trailing-silence : This endpointing rule requires duration of trailing silence(in seconds) to be >= this value. (float, default = 5)
--endpoint.rule1.min-utterance-length : This endpointing rule requires utterance-length (in seconds) to be >= this value. (float, default = 0)
--endpoint.rule1.must-contain-nonsilence : If true, for this endpointing rule to apply there mustbe nonsilence in the best-path traceback. (bool, default = false)

Easiest way to test is to pass --print-endpoints=true to the binary src/cudadecoderbin/batched-wav-nnet3-cuda-online

Internally, it provides all the necessary metrics (relative cost, number of silence phones on current best path, total length). Those rules can be modified through the command line parameters.

kkm000 · 2020-07-01T00:34:35Z

Thanks! So, you'd say that both #4101 and this one are ready to merge?

kkm000

Two of my comments to the other PR no longer apply. I'll mark them there.

src/cudadecoder/batched-threaded-nnet3-cuda-online-pipeline.cc

src/cudadecoder/cuda-decoder.h

hugovbraun · 2020-07-14T01:18:09Z

@kkm000 The known issue was fixed and we don't have any known bugs now

kkm000 · 2020-07-15T08:11:27Z

Thanks much! Merging.

auzxb · 2020-07-16T12:55:48Z

A bug happened in this commit

cuda-decoder.h:851:3: error: ‘atomic_int32_t’ in namespace ‘std’ does not name a type std::atomic_int32_t n_partial_traceback_threads_todo_;

kkm000 · 2020-07-17T12:52:05Z

@auzxb, this is fixed in #4173 today. Please open a new issue when reporting a big, we won't notice otherwise.

kkm000 requested changes Jul 1, 2020

View reviewed changes

src/cudadecoder/batched-threaded-nnet3-cuda-online-pipeline.cc Outdated Show resolved Hide resolved

src/cudadecoder/cuda-decoder.h Outdated Show resolved Hide resolved

src/cudadecoder/cuda-decoder.h Outdated Show resolved Hide resolved

kkm000 mentioned this pull request Jul 1, 2020

[src] Partial hypothesis for cuda decoder #4101

Closed

hugovbraun added 4 commits July 6, 2020 16:08

Partial hypotheses

cca068f

PR comments

16bb731

Endpointing

8036756

PR comments

0399cb1

hugovbraun force-pushed the endpointing branch from 2854b87 to 0399cb1 Compare July 6, 2020 23:09

Neg non-em partial traceback bug fix

c0db642

kkm000 approved these changes Jul 15, 2020

View reviewed changes

kkm000 merged commit 083c64d into kaldi-asr:master Jul 15, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[src] CudaDecoder endpointing #4146

[src] CudaDecoder endpointing #4146

Uh oh!

hugovbraun commented Jun 30, 2020

Uh oh!

kkm000 commented Jul 1, 2020

Uh oh!

kkm000 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hugovbraun commented Jul 14, 2020

Uh oh!

kkm000 commented Jul 15, 2020

Uh oh!

auzxb commented Jul 16, 2020

Uh oh!

kkm000 commented Jul 17, 2020

Uh oh!

Uh oh!

[src] CudaDecoder endpointing #4146

[src] CudaDecoder endpointing #4146

Uh oh!

Conversation

hugovbraun commented Jun 30, 2020

Uh oh!

kkm000 commented Jul 1, 2020

Uh oh!

kkm000 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hugovbraun commented Jul 14, 2020

Uh oh!

kkm000 commented Jul 15, 2020

Uh oh!

auzxb commented Jul 16, 2020

Uh oh!

kkm000 commented Jul 17, 2020

Uh oh!

Uh oh!