speech-representation · GitHub Topics

Self-Supervised Speech Pre-training and Representation Learning Toolkit

翻译 - 自我监督的语音预训练和表征学习工具包。

speech-representation mockingjay representation-learning apc tera self-supervised-learning speech-pretraining vq-apc wav2vec hubert wavlm

Python 2.37 k

1 个月前

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

acoustic codec gpt4o semantic speech-representation text-to-speech dac

Python 1.1 k

1 个月前

ddlBoJack / emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

pytorch-implementation speech-representation

Python 792

4 个月前

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

gpt-4o speech speech-representation streaming

287

5 个月前

mechanicalsea / lighthubert

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

neural-architecture-search PyTorch self-supervised-learning speech-representation

Python 70

3 年前

andi611 / Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

speech speech-representation mockingjay representation-learning feature-extraction sentiment-classification speaker-recognition PyTorch pytorch-implementation apc

Python 54

2 年前

vectominist / MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

asr ctc speech-recognition speech-representation hubert minimal PyTorch fairseq

Jupyter Notebook 50

2 年前

bshall / dusted

DUSTED: Spoken-Term Discovery using Discrete Speech Units

speech-representation

Jupyter Notebook 17

6 个月前

seorim0 / SE-using-SRL-Model

#计算机科学#Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings

深度学习深度神经网络 noise-reduction PyTorch self-supervised-learning speech-enhancement speech-representation Python

Python 14

2 个月前

jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch

Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining

speech-recognition semi-supervised-learning speech-representation

Python 12

4 年前