Skip to content

Kaldi MFCC #228

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 14 commits into from
Aug 16, 2019
Merged

Kaldi MFCC #228

merged 14 commits into from
Aug 16, 2019

Conversation

jamarshon
Copy link
Contributor

@jamarshon jamarshon commented Aug 12, 2019

We want to write compute-mfcc-feats for compliance.kaldi.mfcc in torchaudio

@jamarshon jamarshon changed the title [WIP] Kalidi MFCC [WIP] Kaldi MFCC Aug 12, 2019
@jamarshon
Copy link
Contributor Author

jamarshon commented Aug 15, 2019

current result on 70 files. those should be max error not mse
image
max relative error for fbank was 0.081825569 so seems suspicious here

@jamarshon
Copy link
Contributor Author

jamarshon commented Aug 15, 2019

Found a bug. Now the code has the following statistic for the 70 files:

Max absolute error: 0.00029754638671875
Max relative error: 5.5

The 5.5 max relative error has a max absolute error of 5.245208740234375e-06. For file

mfcc-2.0603-1.7529-0.6875-0.6250-7900-false-5086-4-0.74-false-false-true-false-true-false-4-59.0671-7431-7014-1.0000-hamming.ark

which is acceptable. Now the graph looks more like expected 😄
image

@jamarshon jamarshon changed the title [WIP] Kaldi MFCC Kaldi MFCC Aug 15, 2019
Copy link
Contributor

@vincentqb vincentqb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm glad to see that there are over 2^6 test files to test all true/false combinations :)

Thanks for providing error graphs. This looks good to me!

@vincentqb
Copy link
Contributor

I see a commit about the README, but I don't see changes in README files. Did you mean to add one?

@jamarshon
Copy link
Contributor Author

I added a small change to the README to say mfcc along with the related kaldi features we currently have
https://github.com/pytorch/audio/pull/228/files#diff-04c6e90faac2675aa89e2176d2eec7d8R25

@vincentqb
Copy link
Contributor

vincentqb commented Aug 16, 2019

I added a small change to the README to say mfcc along with the related kaldi features we currently have
https://github.com/pytorch/audio/pull/228/files#diff-04c6e90faac2675aa89e2176d2eec7d8R25

Right, I see what that meant now. Thanks!

@vincentqb
Copy link
Contributor

vincentqb commented Aug 16, 2019

I'm glad to see that there are over 2^6 test files to test all true/false combinations :)

As discussed with @cpuhrsch:

  • They don't cover all cases since they are picked randomly.
  • We also should get in the habit of building more thorough edge cases (e.g. empty tensors? all zeros?).
  • Long list of parameters can be passed easily using locals.

@jamarshon
Copy link
Contributor Author

new statistic over 145 files (same overall shape just more points on the left region) + added test for empty tensors
image

@cpuhrsch cpuhrsch merged commit a450cf8 into pytorch:master Aug 16, 2019
@jamarshon jamarshon deleted the mfcc branch August 17, 2019 22:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants