Extensions to YAML syntax for better python interaction
Backend of anti-fraud system based on speaker identification technology. 基于声纹识别的反诈系统后端
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.
Target speaker automatic speech recognition (TS-ASR)
Incremental learning for automatic speech recognition (ASR)
#自然语言处理#Record voice, transcribe a prompt, picturize the prompt, create variations, get description of a celebrity and upload, other use cases on KB
Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
#计算机科学#Processing EEG data using Speechbrain-MOABB and model tuning to get best results
[Research] A Perceptual Loss Based Complex Neural Beamforming for AmbiX 3D Speech Enhancement
Speaker verification of virtual assistants using ECAPA-TDNN model from SpeechBrain toolkit and transfer learning approach emphasizing on inter and intra comparision (text independent and dependent).
AudioSpeakerVerification: FastAPI-based API for Speaker Matching and Verification using SpeechBrain. Compare and verify speaker identities from audio files.
#计算机科学#A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models
Speech transcription and speech diarization
#计算机科学#Speech Emotion Recognition SE&R 2022
Dockerized Zeroc-ICE architecture processing voice commands from a Xamarin mobile application via an Automatic Speech Recognition (ASR) AI model using SpeechBrain.
Speech synthesis with conditioning on very small dataset. Using Nvidia's Tacotron2 and WaveGlow models with Pytorch.
A short test to determine the distribution of similarity scores for different SpeechBrain speaker identification models.
Research on speech processing, speaker identification and audio diarization