speech-translation · GitHub Topics

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

翻译 - NeMo：用于对话式AI的工具包

machine-translation speaker-recognition asr tts generative-ai multimodal 深度学习 neural-networks speaker-diariazation speech-translation speech-synthesis large-language-models

Python 13.62 k

2 小时前

PaddlePaddle / PaddleSpeech

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库，用于语音和音频中的各种关键任务的开发，典型的应用包括：语音识别、语音翻译、语音合成等

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr speech-recognition 声音克隆 vocoder voice-recognition self-supervised-learning Whisper

Python 11.77 k

5 天前

espnet / espnet

#计算机科学#End-to-End Speech Processing Toolkit

翻译 - 端到端语音处理工具包

深度学习 end-to-end chainer PyTorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization text-to-speech

Python 8.99 k

5 小时前

huggingface / speech-to-speech

#计算机科学#Speech To Speech: an effort for an open-sourced and modular GPT4-o

人工智能 assistant language-model 机器学习 Python speech speech-synthesis speech-to-text speech-translation

Python 3.96 k

12 天前

microsoft / SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

speech-pretraining speech-recognition speech-synthesis speech-translation

Python 1.33 k

1 年前

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

speech speech-recognition speech-synthesis speech-to-text speech-translation translation all-in-one machine-translation streaming-audio text-to-speech asr tts voice text-to-audio non-autoregressive speech-enhancement audio-processing speech-processing

Python 1.06 k

8 个月前

zhangshaolei1998 / Awesome-Simultaneous-Translation

#自然语言处理#Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

machine-translation 自然语言处理 speech-translation Awesome Lists Bukkit streaming

577

10 个月前

Dadangdut33 / Speech-Translate

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

Python speech-translation tkinter-python translate Whisper

Python 567

1 年前

double22a / speech_dataset

#计算机科学#The dataset of Speech Recognition

asr speech-recognition 深度学习 dataset audio 深度神经网络 wav speech-to-text speech tts speech-synthesis voice-conversion speech-translation speech-enhancement speech-separation text-to-speech automatic-speech-recognition

412

4 个月前

echogarden-project / echogarden

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice i...

language-identification speech speech-alignment speech-recognition speech-synthesis speech-to-text speech-translation text-to-speech language-detection source-separation 命令行界面 Node.js

TypeScript 351

13 天前

kahne / SpeechTransProgress

#自然语言处理#Tracking the progress in end-to-end speech translation

natural-language-generation speech-translation 人工智能 machine-translation 自然语言处理 speech-processing

260

1 年前

MooreThreads / MooER

#大语言模型#MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not lim...

ChatGPT gpt-4o large-language-models speech-recognition speech-to-text speech-translation

Python 202

3 个月前