Skip to content

This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀

License

Notifications You must be signed in to change notification settings

cuichenrui2000/barry_speech_tools

Repository files navigation

深度学习语音工具包

本项目为个人在深度学习语音领域研究的一些工具汇总,并会分享一些高质量的语音领域学习资料。本项目仅用于个人资料和代码的备份,欢迎大家前来学习讨论交流🎉🎉🎉

🔄 最新更新

  • [2024_08_01] 进行一些整理,将所有信息汇总完毕,等待进一步代码整理。

  • [2024_07_19] 整理完成:“qwen_using” 文件夹,提供了针对 Qwen-Audio 框架的一些尝试和思考。

  • [2024_07_19] 整理完成:“wenet_using” 文件夹,添加了 wenet 框架多机多卡训练脚本和 debug 配置。

  • [2024_05_11] 整理完成:“faster_whisper_using” 文件夹,介绍了 faster_whisper 的使用细节和 debug 进展。

  • [2024_04_17] 整理完成:“wenet_using” 文件夹,介绍了语音识别框架 wenet 的一些知识和用法。

  • [2024_04_12] 整理完成:“语音入门资料汇总.md” 和 “工具踩坑记录汇总.md”,介绍了语音入门的一些资料和容易踩坑的一些问题。

About

This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published