dMel: Discretized Log Mel-Filterbanks

This software project accompanies the research paper dMel: Speech Tokenization Made Simple by He Bai, Tatiana Likhomanenko, Ruixiang Zhang, Zijin Gu, Zakaria Aldeneh, and Navdeep Jaitly, which covers speech tokenization for speech generation and speech recognition.

The repository contains dmel, a PyTorch-based package that discretizes the log mel-filterbanks of a given audio signal. The resulting speech representations are intended for training a decoder model that acts as a generative model of speech.
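To make the idea of discretization concrete, here is a minimal sketch in plain Python. It is not the dmel package API: the function names `discretize` and `reconstruct`, the default of 16 bins, and the choice of evenly spaced bin centers between the minimum and maximum value are all assumptions made for illustration. Each log mel value is mapped to the index of its nearest bin center, and the index can be mapped back to an approximate value.

```python
def discretize(log_mel, num_bins=16, lo=None, hi=None):
    """Map each log-mel value to an integer bin index in [0, num_bins - 1].

    log_mel is a list of frames, each frame a list of per-channel floats.
    Bin centers are evenly spaced between lo and hi (illustrative assumption).
    """
    flat = [v for frame in log_mel for v in frame]
    lo = min(flat) if lo is None else lo
    hi = max(flat) if hi is None else hi
    step = (hi - lo) / (num_bins - 1)  # spacing between adjacent bin centers
    if step == 0:
        # Constant input: every value falls into bin 0.
        return [[0 for _ in frame] for frame in log_mel], lo, step
    tokens = [
        # Round to the nearest center, then clamp out-of-range values.
        [min(num_bins - 1, max(0, round((v - lo) / step))) for v in frame]
        for frame in log_mel
    ]
    return tokens, lo, step


def reconstruct(tokens, lo, step):
    """Map bin indices back to the values of their bin centers."""
    return [[lo + t * step for t in frame] for frame in tokens]


# Two frames with three mel channels each (made-up values).
frames = [[-4.0, -2.5, 0.0], [-1.0, -3.2, -4.0]]
tokens, lo, step = discretize(frames, num_bins=16)
approx = reconstruct(tokens, lo, step)
```

With this scheme the reconstruction error of any in-range value is at most half the bin spacing, so the number of bins directly trades token vocabulary size against fidelity.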

Installation

From PyPI:

pip install dmel

From source (run in the repository root):

pip install .

Example of usage

The repository includes a code snippet, run_example.py, that extracts both dMel and Mel features and plots their representations. To run the example:

pip install torchaudio matplotlib dmel
python run_example.py

The example generates example_mel.png and example_dmel.png.

License

The repository is released under the terms given in the LICENSE file.

Citation

@article{bai2024dmel,
  title={dMel: Speech Tokenization Made Simple},
  author={Bai, He and Likhomanenko, Tatiana and Zhang, Ruixiang and Gu, Zijin and Aldeneh, Zakaria and Jaitly, Navdeep},
  journal={arXiv preprint arXiv:2407.15835},
  year={2024}
}
