A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
翻译 - NeMo:用于对话式AI的工具包
A PyTorch-based Speech Toolkit
翻译 - 基于Pytorch的语音工具包
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
翻译 - 这是用于无界交错状态递归神经网络(UIS-RNN)算法的库,与论文《完全监督的说话人歧义》相对应。