# wav2vec

s3prl/s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit


Python 2.35 k
2 days ago
oliverguhr

Live speech recognition using Facebook's wav2vec 2.0 model.

Python 341
1 year ago
arxyzan

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Python 176
2 years ago
shangeth

Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf

Python 65
4 years ago
robinhad

Training scripts for speech-to-text models for the Ukrainian language

Jupyter Notebook 35
2 years ago
lucasgris
Jupyter Notebook 33
3 years ago
bhattbhavesh91

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework, using Hugging Face's Transformers library

Jupyter Notebook 30
4 years ago
daanzu

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Python 24
4 years ago
notAI-tech
Python 12
4 years ago
phanxuanphucnd

A library version of the wav2vec 2.0 framework for the automatic speech recognition task.

Python 4
4 years ago
manhph2211

Building a speaker identification & verification pipeline for Vietnamese voices 😪

Jupyter Notebook 3
3 years ago
NabinAdhikari674

A repo to make installation and training of a wav2vec model easier

Python 2
4 years ago
oswaldoludwig

This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper "Compressing Wav2vec2 for Embedded Applications".

Shell 2
1 year ago
MarwaAbdelAal

#Natural Language Processing# ASR model that generates a transcription from audio waves, then corrects the word spelling

Python 1
3 years ago