faster-whisper · GitHub Topics

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

faster-whisper tts Whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp voice-cloning podcasts audiobook voice-conversion karaoke whisperx

Python 3.59 k

14 days ago

chenyme / Chenyme-AAVT

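A fully automatic (audio) video translation project. It uses Whisper to recognize speech, an AI large language model to translate the subtitles, and finally merges the subtitles into the video to produce the translated video.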

faster-whisper gpt-4 speech-recognition video-translation Whisper gpt-4o

Python 2.33 k

6 days ago
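
The recognize, translate, merge pipeline described for this project can be illustrated with a minimal sketch of the middle step: turning timed speech segments into an SRT subtitle file. This is not the project's code. The `(start, end, text)` segment shape, the function names, and the `translate` callable (a stand-in for an LLM call) are all assumptions made for illustration.

```python
from typing import Callable, Iterable, Tuple

# Hypothetical segment shape: (start seconds, end seconds, recognized text).
Segment = Tuple[float, float, str]


def srt_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS,mmm, the timestamp style used in SRT files."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(
    segments: Iterable[Segment],
    translate: Callable[[str], str] = lambda text: text,
) -> str:
    """Build SRT text; `translate` is a placeholder for a real translation step."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        line = translate(text.strip())
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{line}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```

The resulting string could then be written to a `.srt` file and burned into or muxed with the video by a separate tool; that last step is outside this sketch.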

CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

faster-whisper openai transcribe vad Whisper whisperx asr

Python 2.28 k

4 months ago

Purfview / whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

openai speech-to-text Whisper asr speech-recognition subtitles ctranslate2 faster-whisper whisperx uvr diarization speaker-diarization

1.92 k

1 day ago

speaches-ai / speaches

Docker Docker Compose faster-whisper openai-api openai-whisper-translation Whisper whisper-ai openai-whisper transcription

Python 1.68 k

1 day ago

umlx5h / LLPlayer

#LLM# The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!

asr Whisper yt-dlp C# language-learning media-player OCR player Video video-player WPF flyleaf faster-whisper llm ollama

C# 882

1 day ago

substratusai / kubeai

#LLM# AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Kubernetes llm openai-api autoscaler ollama vllm ollama-operator vllm-operator ai Whisper faster-whisper

Go 881

2 days ago

savbell / whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

openai Whisper dictation speech-recognition speech-to-text faster-whisper openai-api openai-whisper

Python 757

8 months ago

zh-plus / openlrc

Transcribe and translate voice into LRC files using Whisper and LLMs (GPT, Claude, et al.).

faster-whisper lyrics openai-api speech-to-text transcribe Whisper Python

Python 529

3 months ago
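
For readers unfamiliar with the LRC format mentioned above, here is a small sketch of how timed lines map to LRC text, using the common `[mm:ss.xx]` tag (minutes, seconds, hundredths). It is not taken from openlrc; the `(start, text)` input shape and both function names are assumptions for illustration.

```python
from typing import Iterable, Tuple


def lrc_timestamp(seconds: float) -> str:
    """Format seconds as an LRC time tag: [mm:ss.xx] with hundredths of a second."""
    total_cs = int(round(seconds * 100))
    minutes, rem = divmod(total_cs, 6000)
    secs, cs = divmod(rem, 100)
    return f"[{minutes:02d}:{secs:02d}.{cs:02d}]"


def segments_to_lrc(segments: Iterable[Tuple[float, str]]) -> str:
    """Join (start seconds, text) pairs into LRC lines, one tagged line per segment."""
    return "\n".join(
        f"{lrc_timestamp(start)}{text.strip()}" for start, text in segments
    )
```

Only the start time of each line is stored, which is why LRC suits lyrics and simple captions better than formats that also need an end time.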

reriiasu / speech-to-text

Real-time transcription using faster-whisper

faster-whisper speech-recognition Whisper voice-recognition openai speech-to-text

HTML 458

9 months ago

Evil0ctal / Fast-Powerful-Whisper-AI-Services-API

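#Web crawler# ⚡ A high-performance asynchronous API for automatic speech recognition (ASR) and translation. There is no need to purchase the Whisper API: it runs inference on locally hosted Whisper models, supports multi-GPU concurrency, and is designed for distributed deployment. It also includes built-in crawlers for social media platforms such as TikTok and Douyin, enabling seamless media processing across multiple social platforms and providing a powerful, scalable solution for automated processing of media content data.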

FastAPI openai-whisper speech-to-text whisper-ai asr speech-recognition douyin-api faster-whisper tiktok-api video-analysis crawler

Python 356

1 month ago

ycyy / faster-whisper-webui

a gradio webui for faster whisper

ai asr faster-whisper

Python 258

2 years ago

NavodPeiris / speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

ai automatic-speech-recognition faster-whisper speaker-diarization speaker-recognition speaker-verification transcription whisper-ai

Python 203

4 days ago
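
One common way to combine diarization with transcription, as described for speechlib above, is to label each transcript segment with the speaker whose turn overlaps it the most. The sketch below shows that idea only. It is not speechlib's implementation; the tuple shapes, the function names, and the `"unknown"` fallback label are assumptions.

```python
from typing import List, Tuple

# Hypothetical shapes for illustration:
TranscriptSegment = Tuple[float, float, str]  # (start, end, text)
SpeakerTurn = Tuple[float, float, str]        # (start, end, speaker name)


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    transcript: List[TranscriptSegment],
    turns: List[SpeakerTurn],
) -> List[Tuple[str, str]]:
    """Pair each transcript text with the speaker whose turn overlaps it most."""
    labeled = []
    for start, end, text in transcript:
        best_speaker, best_overlap = "unknown", 0.0
        for turn_start, turn_end, speaker in turns:
            shared = overlap(start, end, turn_start, turn_end)
            if shared > best_overlap:
                best_speaker, best_overlap = speaker, shared
        labeled.append((best_speaker, text))
    return labeled
```

Segments that overlap no speaker turn keep the fallback label, which makes gaps in the diarization output visible instead of silently assigning them to a neighbor.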