#Computer Science# A fork of so-vits-svc4.0 (V1) that supports real-time inference and a graphical inference interface, and is compatible with its models.
Self-Supervised Speech Pre-training and Representation Learning Toolkit
#Computer Science# Phoneme segmentation using pre-trained speech models
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
An implementation of Speech Emotion Recognition based on the HuBERT model, trained with PyTorch and the HuggingFace framework, and fine-tuned on the RAVDESS dataset.
This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trai...
Cover Song Powered by SoftVC VITS
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at ICASSP 2023)
#Computer Science# Google Colab for testing SoftVC VITS Singing Voice Conversion, an AI capable of changing the singer within music files.
Code for our paper DistilALHuBERT: A Distilled Parameter Sharing Audio Representation Model
Unsupervised scoring of spoken utterances
Universal Pooling Method for Speaker Verification Utilizing Pre-trained Multi-layer Features, 2025 preprint
In this code, we have used common and well-known datasets such as the Toronto dataset available on Kaggle to create a sentiment analysis model from human voice. This model is designed based on the Ber...
A library that helps persist your context in your React Native apps
Speech keyword detection using the Wav2Vec model
Advanced Speech Emotion Recognition, based on ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets and 14 languages (Emotions: Disgust, Neutral, Kind, Anger, Surpr...