Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
faster_whisper GUI with PySide6
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
#大语言模型#turnkey self-hosted offline transcription and diarization service with llm summary
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
A simple GUI to use Whisper.
#计算机科学#Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.
Transcription from mp3 files to html with or without embedded player
a cross-platform and customizable vlc video player that can generate subtitles using WhisperX model
deploy whsiper on aws
#计算机科学#Transcribe Like a Pro, Without Paying a Penny!
A sleek, web-based audio player featuring synchronized subtitle display, speaker diarization support, and keyboard controls in a modern, responsive interface
#自然语言处理#This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).
#大语言模型#AI 驱动的视频译配工具. An AI powered tool to execute end-to-end video dubbing.
Generate fully aligned subtitles for any Video or Audio file on your local system for free using the amazing capabilities of WhisperX.
Code for our INTERSPEECH 2024 paper: Comparing ASR Systems in the Context of Speech Disfluencies.
A tool for automatically adding subtitles to short social media videos
#大语言模型#VideoWise is a video transcription and AI-powered analysis tool that helps users easily upload, transcribe, and interact with video content. Using WhisperX for high-quality transcriptions and Ollama f...