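A C++ port of OpenAI's Whisper speech recognition model.
#Computer Science# DeepSpeech is an open-source embedded (offline, on-device) speech recognition engine that can run on devices as small as a Raspberry Pi.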
🧠 Leon is your open-source personal assistant.
#Computer Science# Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Translate a video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation.
A PyTorch-based Speech Toolkit
#Android# Vosk is an offline speech recognition toolkit. It supports Python, Java, Node.JS, C#, and C++, and recognizes 20+ languages, including Chinese, English, and French.
A Deep-Learning-Based Chinese Speech Recognition System
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
#Large Language Models# Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.
#Computer Science# Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Voice Recognition to Text Tool: an offline, locally run tool that converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats
#Computer Science# 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Botium Speech Processing
#Computer Science# Build real-time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
Whisper.net. Speech to text made simple using Whisper Models
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.