集合主题趋势排行榜

multi-speaker

netease-youdao / EmotiVoice

#计算机科学#EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

PyTorch speech speech-synthesis tts multi-speaker text-to-speech 深度学习 prompt emotivoice 人工智能 Python emotion style

Python 8.34 k

1 年前

r9y9 / deepvoice3_pytorch

#计算机科学#PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

tts speech-synthesis end-to-end speech-processing 机器学习 PyTorch Python multi-speaker

Python 1.98 k

2 年前

ranchlai / mandarin-tts

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

tts tacotron PyTorch fastspeech2 multi-speaker

Python 480

3 年前

keonlee9420 / Comprehensive-Transformer-TTS

#计算机科学#A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aim...

text-to-speech unsupervised non-autoregressive multi-speaker tts PyTorch fastspeech transformer neural-tts fastspeech2 hifi-gan sota speech-synthesis 深度学习

Python 326

3 年前

DrewThomasson / VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

audiobooks epub generative-ai multi-speaker torch tts 声音克隆 Linux macOS Windows m4b

Python 315

4 个月前

aishoot / LSTM_PIT_Speech_Separation

Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.

speech-separation audio-separation multi-speaker speech-enhancement

Jupyter Notebook 309

4 年前

keonlee9420 / Comprehensive-E2E-TTS

#计算机科学#A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimat...

深度学习 fastspeech2 hifi-gan jets multi-speaker neural-tts non-autoregressive PyTorch sota speech-synthesis text-to-speech tts unsupervised end-to-end

Python 146

3 年前

Totoketchup / Adaptive-MultiSpeaker-Separation

Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem

深度学习 audio-separation speech-separation adaptive-learning Tensorflow source-separation multi-speaker

Jupyter Notebook 51

7 年前

anton-jeran / MULTI-AUDIODEC

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

codec multi-speaker overlap spatial-audio speech-enhancement speech-separation

Python 50

7 个月前

keonlee9420 / Comprehensive-Tacotron2

#计算机科学#PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to ...

text-to-speech tts tacotron tacotron2 PyTorch speech-synthesis autoregressive multi-speaker robustness efficiency neural-tts hifi-gan 深度学习

Python 48

2 年前

hwRG / FastSpeech2-Pytorch-Korean-Multi-Speaker

Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail.

fastspeech2 korean multi-speaker PyTorch transfer-learning tts

Python 8

3 年前

TheSeraphim / scribe-forge-ai

#自然语言处理#🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Mark...