This is the GitHub page for publicly available emotional speech data.
Sharif Emotional Speech Database
Urdu Language Speech Emotional Corpus
💻 :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech 🔈
Korean Emotional End-to-End Neural Speech synthesizer, ML4audio, NIPS2017
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
Emotional Speech Conversion using Style Transfer and MUNIT
A large-scale validated database for Persian speech emotion detection.
微软VALL-E X 零样本语音合成模型的开源实现
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".
Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)
无需情感标注的情感可控语音合成模型,基于VITS
speech enhancement\speech seperation\sound source localization
Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems
Code for "MojiTalk: Generating Emotional Responses at Scale" https://arxiv.org/abs/1711.04090
You can find the speech algorithms you want here
whisper 是一个通用语音识别模型
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
A small speech recognizer
翻译 - PocketSphinx是一种轻量级的语音识别引擎,尽管在台式机上同样出色,但专为手持设备和移动设备进行了优化
A PyTorch-based Speech Toolkit
翻译 - 基于Pytorch的语音工具包
Android speech recognition and text to speech made easy