pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
SincNet is a neural architecture for efficiently processing raw audio samples.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...
End-to-end speech recognition implementation based on TensorFlow (CTC, attention, and MTL training)
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
TensorFlow implementation of "Listen, Attend and Spell", authored by William Chan. This project uses the input pipeline and Estimator API of TensorFlow, which makes the training and evaluation truly en...
Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Python implementation of pre-processing for End-to-End speech recognition
Speech recognition on the TIMIT (or any other) dataset
This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation sup...
PyTorch-based phoneme recognition (TIMIT phoneme classification)
THEANO-KALDI-RNNs is a project implementing various recurrent neural networks (RNNs) for RNN-HMM speech recognition. The Theano code is coupled with the Kaldi decoder.
A simple automatic speech recognition (ASR) model in TensorFlow that lets you focus only on the deep neural network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirec...
Attention-based end-to-end ASR on TIMIT in PyTorch
Extract MFCC vectors and phones from the TIMIT dataset
Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.
Python/numpy/pandas convenience wrapper for the TIMIT database.