#Computer Science#HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
OpenMusic: SOTA Text-to-music (TTM) Generation
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
#Computer Science#A Non-Autoregressive Transformer-based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aim...
#Computer Science#Vietnamese Text to Speech library
#Computer Science#A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimat...
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
#Computer Science#TTS models for Arabic (Tacotron2, FastPitch)
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
#Computer Science#PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to ...
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
A neural speech codec based on discrete WavLM representations
Train HiFi-GAN on TPU
In this repo, I developed a step-by-step pipeline for a standard multi-speaker Text-to-Speech system 😄 In general, I used PortaSpeech as the acoustic model and iSTFTNet as the vocoder...
#Computer Science#Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"
This is the experimental description of MnTTS2.
#Computer Science#TTS (FastPitch) for German (Thorsten voice / emotional)
Python package for NSF and NSF-HiFi-GAN (unofficial)