#计算机科学#A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2E, F5-TTS, CosyVoice), with Whisper audio processing, RVC voice changer, YouTube download,...
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
#大语言模型#Instant, controllable, local pre-trained AI models in Rust
an editor for spoken-word audio with automatic transcription
#Awesome#🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
「硬地骇客 - 两个月 $12000 ARR 实践之路」是由 硬地骇客 团队编著,本书是关于 Podwise 产品历程的忠实记录:内容包含 灵感 - 构建 - 发布 - 增长 - 复盘 五个章节。如果你觉得一个人读不够过瘾,欢迎加入「硬地骇客」官方知识星球与专家们一起讨论!Podwise 的故事才刚刚开始,我们也将在星球持续分享我们的认知,成功可能无法复制,但失败一定可以借鉴。现在就点击下方链接加...
#计算机科学#A python package to build AI-powered real-time audio applications
视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.
#大语言模型#Generate subtitles, summaries, and chapters from videos in seconds
#大语言模型#turnkey self-hosted offline transcription and diarization service with llm summary
#IOS#The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
#计算机科学#Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
OBS plugin for local speech recognition and captioning using AI
#面试#Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
#IOS#🎤 The easiest way to transcribe audio in Swift