#计算机科学#:robot: 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
翻译 - 文本到语音的深度学习
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other lan...
翻译 - :stuck_out_tongue_closed_eyes:TensorflowTTS:Tensorflow 2的实时最新语音合成
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
MelGAN implementation with Multi-Band and Full Band supports...
Ultrafast GAN based Vocoder for Text to Speech
#计算机科学#zero-shot realtime TTS system, fully offline, free and open source
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.
MelGAN Multi GPU Implementation.
MelGAN with catalyst framework
#计算机科学#SE-MelGAN - Speaker Agnostic Rapid Speech Enhancement