PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,典型的应用包括:语音识别、语音翻译、语音合成等
#计算机科学#:robot: 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
翻译 - 文本到语音的深度学习
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
#大语言模型#vits2 backbone with multilingual-bert
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other lan...
翻译 - :stuck_out_tongue_closed_eyes:TensorflowTTS:Tensorflow 2的实时最新语音合成
#计算机科学#HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
A high-quality speech analysis, manipulation and synthesis system
General Speech Restoration
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
#计算机科学#DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
PyTorch Implementation of FastDiff (IJCAI'22)
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
#计算机科学#A fast, high-quality neural vocoder.
#计算机科学#Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)