Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
多个SVC/TTS的C++推理库
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
#计算机科学#Persian/Farsi text to speech(TTS) training using coqui tts
#计算机科学#TTS models for Arabic (Tacotron2, FastPitch)
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Ultrafast GAN based Vocoder for Text to Speech
#计算机科学#zero-shot realtime TTS system, fully offline, free and open source
#计算机科学#SA-toolkit: Speaker speech anonymization toolkit in python
TTS for Arabic (FastPitch, MixerTTS) in the ONNX format
RADTTS + HiFiGAN vocoder
🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model
homework for deep generation. Combine FastSpeech2 with different vocoders ⭐REFERENCE (modify origin repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https:/...
Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases