# rnnt

https://static.github-zh.com/github_avatars/modelscope?size=40

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 9.78k
2 days ago
https://static.github-zh.com/github_avatars/upskyy?size=40

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

Python 103
3 years ago
https://static.github-zh.com/github_avatars/iamjanvijay?size=40
Cuda 68
4 years ago
https://static.github-zh.com/github_avatars/iamjanvijay?size=40

An implementation of RNN-Transducer loss in TF-2.0.

Python 45
2 years ago
https://static.github-zh.com/github_avatars/manhph2211?size=40

I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and AWS ...

Python 17
3 years ago
https://static.github-zh.com/github_avatars/tuanio?size=40
Python 15
3 years ago
https://static.github-zh.com/github_avatars/George0828Zhang?size=40

Pure PyTorch implementation of the loss described in "Online Segment to Segment Neural Transduction" https://arxiv.org/abs/1609.08194

Python 2
3 years ago
https://static.github-zh.com/github_avatars/Andersonjesusvital?size=40

Deep learning-based subtitle generation model that processes audio datasets to generate accurate text transcriptions. Includes audio feature extraction, encoder-decoder architecture, training pipeline...

0
23 days ago