Merge utterances based on start time and end time to form a paragraph #3953
This PR addresses two issues raised in the CHiME group.
In the S01.json file of the eval set, array U02 is the reference array at the start of the session and then again in the middle. The code, however, merged all utterances of array U02 together at the start, which caused incorrect scoring for the eval set. With this fix, utterances are merged into paragraphs based on their start and end times, and the eval WER changes from 85.42 to 78.08.
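To illustrate the idea, here is a minimal sketch of time-ordered merging. It is not the code from this PR. The field names (`ref`, `start`, `end`, `words`) and the function `merge_utterances` are hypothetical simplifications of the transcription JSON, and times are assumed to be seconds as floats. Utterances are sorted by time, and only consecutive runs that share a reference array are merged, so an array that is the reference at the start and again in the middle produces two separate paragraphs.

```python
from itertools import groupby


def merge_utterances(utterances):
    """Merge consecutive utterances (in time order) that share a reference array.

    Each utterance is a dict with hypothetical keys:
    "ref" (array id), "start", "end" (seconds), and "words" (text).
    """
    # Order by time first, so grouping follows the session timeline
    # rather than collecting every utterance of one array together.
    ordered = sorted(utterances, key=lambda u: (u["start"], u["end"]))

    paragraphs = []
    # groupby only groups adjacent items, so a later run of the same
    # array becomes its own paragraph.
    for ref, run in groupby(ordered, key=lambda u: u["ref"]):
        run = list(run)
        paragraphs.append({
            "ref": ref,
            "start": run[0]["start"],
            "end": max(u["end"] for u in run),
            "words": " ".join(u["words"] for u in run),
        })
    return paragraphs


# Toy data: U02 is the reference at the start and again later on.
example = [
    {"ref": "U02", "start": 0.0, "end": 2.0, "words": "a"},
    {"ref": "U02", "start": 2.5, "end": 4.0, "words": "b"},
    {"ref": "U05", "start": 5.0, "end": 6.0, "words": "c"},
    {"ref": "U02", "start": 7.0, "end": 8.0, "words": "d"},
]
paragraphs = merge_utterances(example)
```

With the toy data above, the two early U02 utterances form one paragraph, and the later U02 utterance stays separate instead of being pulled to the start.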
Results before the fix, running the setup with the pre-trained model:
Dev: %WER 84.33 [ 49653 / 58881, 1529 ins, 35813 del, 12311 sub ]
Eval: %WER 85.42 [ 47093 / 55132, 1583 ins, 32671 del, 12839 sub ]
Results after fixing the scoring, running the setup with the pre-trained model:
Dev: %WER 84.33 [ 49653 / 58881, 1529 ins, 35813 del, 12311 sub ]
Eval: %WER 78.08 [ 43046 / 55132, 957 ins, 32045 del, 10044 sub ]
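As a quick sanity check on the numbers above, the reported WER can be recomputed from the error counts, assuming the usual definition of WER as (insertions + deletions + substitutions) divided by the number of reference words. The helper name `wer_percent` is hypothetical and used only for this check.

```python
def wer_percent(ins, dels, subs, ref_words):
    """Word error rate in percent: all errors over the reference word count."""
    return 100.0 * (ins + dels + subs) / ref_words


# Error counts copied from the result lines above.
dev = wer_percent(1529, 35813, 12311, 58881)
eval_before = wer_percent(1583, 32671, 12839, 55132)
eval_after = wer_percent(957, 32045, 10044, 55132)

print(f"Dev: {dev:.2f}  Eval before: {eval_before:.2f}  Eval after: {eval_after:.2f}")
```

The reference word count for the eval set (55132) is the same before and after the fix; only the error counts change.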
@sw005320