微软VALL-E X 零样本语音合成模型的开源实现
ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)
A modification on the Sharif Emotional Speech Database
EmoTa is an open-access Tamil Speech Emotion Recognition dataset with 936 utterances from 22 native speakers, covering five emotions (anger, happiness, sadness, fear, and neutrality). It supports emot...
#计算机科学#TTS (FastPitch) for German (Thorsten voice / emotional)
Applying deep learning to translate animation and re-generate audio.
EMOLIPS: TWO-LEVEL APPROACH FOR LIP-READING EMOTIONAL SPEECH
#大语言模型#A GUI program for chat with chatbot such as chatgpt.
This is a project dedicated to the classification of emotional speech and was created in class with Prof. Dr. Burkhardt at Technische Universität Berlin.
This is a project dedicated to the classification of emotional speech and was created in class with Prof. Dr. Burkhardt at Technische Universität Berlin.