Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
Former-commit-id: b9a5c0c
  • Loading branch information
zzw922cn authored Dec 14, 2016
1 parent 6b837c6 commit 307e992
Showing 1 changed file with 4 additions and 3 deletions.
7 changes: 4 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -51,16 +51,17 @@ In folder data/mfcc, each file is a feature matrix with size timeLength*39 of on

If you want to set your own data preprocessing, you can edit [calcmfcc.py](https://github.com/zzw922cn/Automatic-Speech-Recognition/blob/master/src/feature/calcmfcc.py) or [preprocess.py](https://github.com/zzw922cn/Automatic-Speech-Recognition/blob/master/src/feature/preprocess.py).

Since the original TIMIT dataset contains 61 phonemes, we use 61 phonemes for training and evaluation, but when scoring, we mappd the 61 phonemes into 39 phonemes for better performance. The mapping details are as follows:
Since the original TIMIT dataset contains 61 phonemes, we use 61 phonemes for training and evaluation, but when scoring, we mappd the 61 phonemes into 39 phonemes for better performance. We do this mapping according to the paper [Speaker-independent phone recognition using hidden Markov models](http://repository.cmu.edu/cgi/viewcontent.cgi?article=2768&context=compsci). The mapping details are as follows:

| original phoneme(s) | mapped into phoneme |
| :------------------ | :-------------------: |
| ux | uw |
| axr | er |
| em | m |
| nx,n | en |
| nx, n | en |
| eng | ng |
| hv | hh |
| cl,bcl,dcl,gcl,epi,h#,kcl,pau,pcl,tcl,vcl | sil |
| cl, bcl, dcl, gcl, epi, h#, kcl, pau, pcl, tcl, vcl | sil |
| l | el |
| zh | sh |
| aa | ao |
Expand Down

0 comments on commit 307e992

Please sign in to comment.