PaddlePaddle/PaddleSpeech r1.3.0
PaddleSpeech r1.3.0
Repository: PaddlePaddle/PaddleSpeech
Tag: r1.3.0
Published: 2022-12-14T06:38:57Z
Prerelease: no
Release notes:
Highlights
S2T
- Support U2/U2++ Conformer dy2static, and U2/U2++ C++ High Performance Streaming ASR Deployment. @zh794390558
- Add Wav2vec2ASR-en, wav2vec2.0 fine-tuning for ASR on LibriSpeech. @Zth9730
- Add Whisper CLI and Demos, supporting multilingual recognition and translation. @zxcd
- Add Wav2vec2 CLI and Demos, support ASR and Feature Extraction. @Zth9730
- Add Whisper. #2640 #2704 by @zxcd
- Fix gpu training hang. #2478 by @Zth9730
- Support U2++-based CLI and server. #2489 #2510 by @Zth9730
- Add wav2vec2-en. #2518 #2527 #2637 by @Zth9730
- Add wav2vec2-zh CLI. #2697 by @Zth9730
T2S
- Add seek for BytesIO. https://github.com/PaddlePaddle/PaddleSpeech/pull/2484 by @ZapBird
- Add mix finetune. https://github.com/PaddlePaddle/PaddleSpeech/pull/2525 https://github.com/PaddlePaddle/PaddleSpeech/pull/2647 by @lym0302
- Add streaming TTS fastdeploy serving. https://github.com/PaddlePaddle/PaddleSpeech/pull/2528 by @HexToString
- Add SSML for Chinese Text Frontend. https://github.com/PaddlePaddle/PaddleSpeech/pull/2531 by @david-95
- Add end-to-end Prosody Prediction pipeline (including using prosody labels in Acoustic Model). https://github.com/PaddlePaddle/PaddleSpeech/pull/2548 https://github.com/PaddlePaddle/PaddleSpeech/pull/2615 https://github.com/PaddlePaddle/PaddleSpeech/pull/2693 by @WongLaw
- Add Adversarial Loss for Chinese-English mixed TTS. https://github.com/PaddlePaddle/PaddleSpeech/pull/2588 by @lym0302
- Fix frontend bugs. https://github.com/PaddlePaddle/PaddleSpeech/pull/2539 https://github.com/PaddlePaddle/PaddleSpeech/pull/2606 by @yt605155624
- Add text normalization (TN) for English units. https://github.com/PaddlePaddle/PaddleSpeech/pull/2629 by @WongLaw
- Add male voice for TTS. https://github.com/PaddlePaddle/PaddleSpeech/pull/2660 by @lym0302
- Add double-byte character support for Chinese (zh) text normalization. https://github.com/PaddlePaddle/PaddleSpeech/pull/2661 by @david-95
- Add TTS Paddle-Lite x86 inference. https://github.com/PaddlePaddle/PaddleSpeech/pull/2636 https://github.com/PaddlePaddle/PaddleSpeech/pull/2667 by @yt605155624
- Add Greek characters and fix #2571. https://github.com/PaddlePaddle/PaddleSpeech/pull/2683 by @david-95
- Add Slim for TTS. https://github.com/PaddlePaddle/PaddleSpeech/pull/2729 by @yt605155624
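Note on the "Add seek for BytesIO" entry above: the sketch below is a general illustration of why an in-memory audio buffer usually needs rewinding before it is read. It uses only the Python standard library and does not reproduce the code in the pull request. The function name `synthesize_to_buffer` and its parameters are hypothetical.

```python
import io
import wave


def synthesize_to_buffer(num_frames: int = 160, sample_rate: int = 16000) -> io.BytesIO:
    """Write silent 16-bit mono PCM into an in-memory WAV file and rewind it.

    Hypothetical helper for illustration only; not part of PaddleSpeech.
    """
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(1)            # mono
        wav_file.setsampwidth(2)            # 2 bytes per sample (16-bit PCM)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"\x00\x00" * num_frames)
    # After writing, the stream position sits at the end of the data.
    # Without this seek, a consumer calling buffer.read() would get b"".
    buffer.seek(0)
    return buffer


wav_buffer = synthesize_to_buffer()
payload = wav_buffer.read()  # full WAV bytes, starting with the RIFF header
```

Rewinding inside the producer means callers can read the buffer directly without knowing how it was written.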
Audio
- Move paddlespeech/audio to paddleaudio. https://github.com/PaddlePaddle/PaddleSpeech/pull/2706 by @SmileGoat
Demo
- Add TTSAndroid demo. https://github.com/PaddlePaddle/PaddleSpeech/pull/2703 by @yt605155624
New Contributors
- @ZapBird made their first contribution in https://github.com/PaddlePaddle/PaddleSpeech/pull/2484
- @HexToString made their first contribution in https://github.com/PaddlePaddle/PaddleSpeech/pull/2528
- @dahu1 made their first contribution in https://github.com/PaddlePaddle/PaddleSpeech/pull/2554
- @kFoodie made their first contribution in https://github.com/PaddlePaddle/PaddleSpeech/pull/2664
- @zxcd made their first contribution in https://github.com/PaddlePaddle/PaddleSpeech/pull/2640
- @michael-skynorth made their first contribution in https://github.com/PaddlePaddle/PaddleSpeech/pull/2666
- @heyudage made their first contribution in https://github.com/PaddlePaddle/PaddleSpeech/pull/2688
Full Changelog: https://github.com/PaddlePaddle/PaddleSpeech/compare/r1.2.0...r1.3.0