vocoder · GitHub Topics

#计算机科学#🐸💬 - 一个深度学习的 TTS 语言合成库

Python text-to-speech 深度学习 speech PyTorch tts vocoder tacotron glow-tts melgan speaker-encoder hifigan speaker-encodings multi-speaker-tts tts-model speech-synthesis 声音克隆 voice-synthesis voice-conversion

Python 39.27 k

8 个月前

PaddlePaddle / PaddleSpeech

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库，用于语音和音频中的各种关键任务的开发，典型的应用包括：语音识别、语音翻译、语音合成等

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr speech-recognition 声音克隆 vocoder voice-recognition self-supervised-learning Whisper

Python 11.77 k

5 天前

mozilla / TTS

#计算机科学#:robot: 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

翻译 - 文本到语音的深度学习

深度学习 text-to-speech Python PyTorch tacotron tts speaker-encoder dataset-analysis tacotron2 tensorflow2 vocoder melgan glow-tts speech

Jupyter Notebook 9.78 k

1 年前

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

audio-generation audio-synthesis audioldm music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e voice-conversion audit fastspeech2 vits emilia maskgct vocoder

Python 8.93 k

10 小时前

fishaudio / Bert-VITS2

#大语言模型#vits2 backbone with multilingual-bert

bert bert-vits2 tts vits vits2 bert-vits 大语言模型 friendly interactive shell vocoder agent

Python 8.37 k

5 天前

TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other lan...

翻译 - ：stuck_out_tongue_closed_eyes：TensorflowTTS：Tensorflow 2的实时最新语音合成

speech-synthesis text-to-speech tensorflow2 melgan fastspeech real-time tts vocoder multi-speaker-tts fastspeech2 tacotron2 tflite

Python 3.91 k

9 个月前

jik876 / hifi-gan

#计算机科学#HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

speech-synthesis Generative Adversarial Network text-to-speech tts 深度学习 hifi-gan PyTorch vocoder

Python 2.1 k

9 个月前

kan-bayashi / ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

speech-synthesis neural-vocoder text-to-speech PyTorch wavenet realtime tts melgan vocoder hifigan

Jupyter Notebook 1.6 k

1 年前

mmorise / World

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

C++ 1.22 k

2 个月前

haoheliu / voicefixer

General Speech Restoration

speech-processing speech-synthesis speech-enhancement speech-analysis speech tts denoise super-resolution vocoder

Python 1.12 k

2 个月前

gemelo-ai / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

vocoder

Python 913

8 个月前

lmnt-com / diffwave

#计算机科学#DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

机器学习 text-to-speech 神经网络 Bukkit PyTorch speech-synthesis vocoder speech pretrained-models tts 深度学习

Python 821

1 年前

Rongjiehuang / FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder

Python 410

10 个月前

ivanvovk / WaveGrad

Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.

vocoder text-to-speech tts speech speech-synthesis probabilistic-models diffusion-models

Jupyter Notebook 402

4 年前

rishikksh20 / VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

vocoder Generative Adversarial Network melgan speech-synthesis text-to-speech speech-processing

Python 319

9 个月前

szechyjs / mbelib

P25 Phase 1 and ProVoice vocoder

C vocoder

C++ 285

4 年前

lmnt-com / wavegrad

#计算机科学#A fast, high-quality neural vocoder.

机器学习神经网络 speech-synthesis text-to-speech Bukkit PyTorch vocoder speech pretrained-models tts 深度学习

Python 281

2 年前

maum-ai / univnet

#计算机科学#Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

text-to-speech vocoder Generative Adversarial Network 深度学习 PyTorch tts speech-synthesis

Python 272

4 年前

rishikksh20 / iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

vocoder tts speech-synthesis

Python 245

2 年前

sh123 / codec2_talkie

Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)

kiss amateur-radio ham-radio vocoder bluetooth radio lora digital fm walkie-talkie aprs opus

Java 242

8 小时前