# multi-speaker

https://static.github-zh.com/github_avatars/r9y9?size=40

#Computer Science# PyTorch implementation of convolutional neural network-based text-to-speech synthesis models

Python 1.97 k
https://static.github-zh.com/github_avatars/ranchlai?size=40

Chinese Mandarin text-to-speech (TTS) synthesis based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets

Python 468
https://static.github-zh.com/github_avatars/keonlee9420?size=40

#Computer Science# A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aim...

Python 324
https://static.github-zh.com/github_avatars/aishoot?size=40

Two-talker Speech Separation with LSTM/BLSTM using the Permutation Invariant Training method.

Jupyter Notebook 308
https://static.github-zh.com/github_avatars/DrewThomasson?size=40

VoxNovel: generate audiobooks giving each character a different voice actor.

Python 204
https://static.github-zh.com/github_avatars/keonlee9420?size=40

#Computer Science# A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimat...

Python 146
https://static.github-zh.com/github_avatars/keonlee9420?size=40

#Computer Science# PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to ...

Python 48
https://static.github-zh.com/github_avatars/anton-jeran?size=40

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

Python 44
https://static.github-zh.com/github_avatars/hwRG?size=40

Multi-Speaker FastSpeech2 applicable to Korean, with detailed descriptions of training and synthesis.

Python 8
https://static.github-zh.com/github_avatars/nikitashvarts?size=40
Python 4
https://static.github-zh.com/github_avatars/ZoraizQ?size=40

Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs using the PRUS dataset.

Shell 3