PaddleSpeech r1.4.0
S2T
- Add wav2vec2-zh finetune pipeline. #3012 #2916 by @zxcd
- Fix some bugs in Whisper. #2900 #2825 by @zxcd
- Add code-switch asr tal_cs recipe. #2816 #2796 by @zxcd
T2S
- Add dygraph to static、PaddleInference、Paddle2ONNX and ONNXRuntime Infer for Cantonese TTS. #2990 by @JiehangXie
- Add Cantonese test examples. #2937 by @JiehangXie
- Add VITS inference pipeline. #3002 #2972 #2883 by @yt605155624
- Rearrange encoder_infer param's order. #2983 by @443127316
- Add male speaker and Chinese-English mix ONNXRuntime infer in CLI. #2945 by @lym0302
- Add Cantonese TTS example. #2950 #2927 #2924 #2907 #2899 by @WongLaw
- Fix PWGAN TIPC. #2882 by @yt605155624
- Add a case in not_erhua. #2863 by @QuanZ9
- Fix data prepare for PaddleSlim PTQ of TTS. #2862 by @yt605155624
- Avoid using variable "attn_loss" before assignment. #2860 by @hopingZ
- add soft link for shell in example, Add skip_copy_wave in norm stage of GANVocoders to save disk. #2851 by @yt605155624
- Optimize the training of VITS. #2843 #2809 #2791 #2770 by @WongLaw
- Add StarGANv2-VC model scripts and synthsize scripts. #2842 by @yt605155624
- Add diffusion module for training diffsinger. #2868 #2832 by @HighCWu
- Fix some Text Frontend bugs. #2831 by @yt605155624
- For mixed Chinese and English speech synthesis, add SSML support for Chinese. #2830 by @jindongyi011039
- Add mkldnn and trt config for TTS Inference. #2748 by @yt605155624
- Fix dygraph to static for tacotron2. #2426 by @yt605155624
Server
Engine
- Add wfst decoder. #2886 by @SmileGoat
- Add batch recognizer decode. #2866 by @SmileGoat
- Add nnet prob cache && make 2 thread decode work. #2769 by @SmileGoat
- Engine directory refactor. #2746 by @SmileGoat
- Fix openfst download error. #2742 by @SmileGoat
Audio
- Replace kaldi fbank with kaldi-native-fbank in paddleaudio. #2799 by @SmileGoat
- Fix load paddleaudio fail. #2815 by @SmileGoat
- Update paddleaudio readme. #2801 by @SmileGoat
Demos
- Add TTS ARM Linux C++ Demo. #2991 by @SwimmingTiger
- Add Cantonese TTS in CLI. #2977 by @WongLaw
- Add ONNXRuntime infer for Cantonese TTS in CLI. #2990 by @JiehangXie
Docs
- Add u2pp_wenetspeech_static_quant to released_model.md. #2973 @zxcd
- Remove redundant dependencies and Fix some bugs in setup.py. #2970 #2871 #2867 #2853 #2771 #2767 #2764 by @yt605155624
Others
- Remove fluid API in ASR. #2944 #2859 #2852 by @zxcd
- Add python simple adadelta optimizer. #2925 by @zxcd
- Add encoding=utf-8 for text. #2896 by @zxcd #2865 by @yt605155624
- Fix Tensor.numpy()[0] to float(Tensor) to adapt 0D. #2884 by @zhouwei25
- Fix libsndfile.so not found in ubuntu18-cpu/Dockerfile. #2763 by @linkec
- Fix AttributeError "module 'distutils' has no attribute 'ccompiler'" in setup.py in ctc_decoders. #2745 by @GreatV
New Contributors
- @GreatV made their first contribution in #2745
- @linkec made their first contribution in #2763
- @cxumol made their first contribution in #2828
- @jindongyi011039 made their first contribution in #2830
- @QuanZ9 made their first contribution in #2863
- @hopingZ made their first contribution in #2860
- @zhouwei25 made their first contribution in #2884
- @EscaticZheng made their first contribution in #2915
- @chinobing made their first contribution in #2922
- @lance6716 made their first contribution in #2924
- @443127316 made their first contribution in #2983
Full Changelog: r1.3.0...r1.4.0