GitHub Chinese Community
#multi-speaker

netease-youdao / EmotiVoice

#Computer Science# EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

PyTorch, speech, speech-synthesis, tts, multi-speaker, text-to-speech, deep learning, prompt, emotivoice, artificial intelligence, Python, emotion, style
Python 8.07k
1 year ago
r9y9 / deepvoice3_pytorch

#Computer Science# PyTorch implementation of convolutional neural network-based text-to-speech synthesis models

tts, speech-synthesis, end-to-end, speech-processing, machine learning, PyTorch, Python, multi-speaker
Python 1.98k
2 years ago
ranchlai / mandarin-tts

Chinese (Mandarin) text-to-speech (TTS) synthesis based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets

tts, tacotron, PyTorch, fastspeech2, multi-speaker
Python 477
3 years ago
keonlee9420 / Comprehensive-Transformer-TTS

#Computer Science# A Non-Autoregressive Transformer-based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aim...

text-to-speech, unsupervised, non-autoregressive, multi-speaker, tts, PyTorch, fastspeech, transformer, neural-tts, fastspeech2, hifi-gan, sota, speech-synthesis, deep learning
Python 326
3 years ago
aishoot / LSTM_PIT_Speech_Separation

Two-talker speech separation with LSTM/BLSTM using the Permutation Invariant Training (PIT) method.

speech-separation, audio-separation, multi-speaker, speech-enhancement
Jupyter Notebook 310
3 years ago
DrewThomasson / VoxNovel

VoxNovel: generate audiobooks, giving each character a different voice actor.

audiobooks, epub, generative-ai, multi-speaker, torch, tts, voice cloning, Linux, macOS, Windows, m4b
Python 280
24 days ago
keonlee9420 / Comprehensive-E2E-TTS

#Computer Science# A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modeling techniques. This project grows with the research community, aiming to achieve the ultimat...

deep learning, fastspeech2, hifi-gan, jets, multi-speaker, neural-tts, non-autoregressive, PyTorch, sota, speech-synthesis, text-to-speech, tts, unsupervised, end-to-end
Python 146
3 years ago
Totoketchup / Adaptive-MultiSpeaker-Separation

Adaptive and Focusing Neural Layers for the Multi-Speaker Separation Problem

deep learning, audio-separation, speech-separation, adaptive-learning, Tensorflow, source-separation, multi-speaker
Jupyter Notebook 51
7 years ago
anton-jeran / MULTI-AUDIODEC

The official implementation of a multi-channel, multi-speaker, multi-spatial neural audio codec architecture.

codec, multi-speaker, overlap, spatial-audio, speech-enhancement, speech-separation
Python 49
4 months ago
keonlee9420 / Comprehensive-Tacotron2

#Computer Science# PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to ...

text-to-speech, tts, tacotron, tacotron2, PyTorch, speech-synthesis, autoregressive, multi-speaker, robustness, efficiency, neural-tts, hifi-gan, deep learning
Python 48
2 years ago
hwRG / FastSpeech2-Pytorch-Korean-Multi-Speaker

Multi-speaker FastSpeech2 applicable to Korean, with a detailed description of training and synthesis.

fastspeech2, korean, multi-speaker, PyTorch, transfer-learning, tts
Python 8
3 years ago
nikitashvarts / CocktailPartySpeakerRecognition

#Computer Science# An Algorithm for Speaker Recognition in a Multi-Speaker Environment

speaker-recognition, multi-speaker, deep learning, lstm
Python 4
5 years ago
ZoraizQ / urdu-speech-recognition

Urdu speech recognition using Kaldi ASR, with triphone acoustic GMMs trained on the PRUS dataset.

speech-recognition, urdu, multi-speaker
Shell 4
4 years ago
TheSeraphim / scribe-forge-ai

#Natural Language Processing# 🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Mark...

audio-analysis, audio-processing, diarization, FFmpeg, huggingface, machine learning, multi-speaker, natural language processing, openai-whisper, Python, speaker-diarization, speech-recognition, speech-to-text, Whisper
Python 1
16 days ago
parisimaa / multi_speaker

#Computer Science#

audio-processing, multi-speaker, deep learning
MATLAB 0
3 years ago