transcription · GitHub Topics

BasedHardware / omi

AI wearables. Put it on, speak, transcribe, automatically

人工智能 App Flutter friend 移动 necklace omi Python summary transcription wearable bci C Next personas smartglasses

C 4.55 k

11 小时前

spotify / basic-pitch

#计算机科学#A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

lightweight 机器学习 MIDI music pitch-detection polyphonic transcription audio Python TypeScript

Python 3.83 k

3 个月前

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

faster-whisper tts Whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp 声音克隆 podcasts audiobook voice-conversion karaoke whisperx

Python 3.59 k

14 天前

pluja / whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

人工智能 Go subtitles sveltekit transcription Whisper ui Web app speech-recognition speech-to-text stt Web

Svelte 2.17 k

3 个月前

floneum / floneum

#大语言模型#Instant, controllable, local pre-trained AI models in Rust

人工智能大语言模型 Rust llamacpp kalosm candle llama mistral floneum-v3 dioxus transcription Whisper

Rust 1.83 k

1 天前

bugbakery / audapolis

an editor for spoken-word audio with automatic transcription

transcription video-editing audio-editing speech-to-text

TypeScript 1.73 k

2 年前

speaches-ai / speaches

Docker Docker Compose faster-whisper openai-api openai-whisper-translation Whisper whisper-ai openai-whisper transcription

Python 1.68 k

1 天前

sindresorhus / awesome-whisper

#Awesome#🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

人工智能 Awesome Lists gpt openai speech-to-text transcription

1.62 k

1 年前

hardhackerlabs / book

「硬地骇客 - 两个月 $12000 ARR 实践之路」是由硬地骇客团队编著，本书是关于 Podwise 产品历程的忠实记录：内容包含灵感 - 构建 - 发布 - 增长 - 复盘五个章节。如果你觉得一个人读不够过瘾，欢迎加入「硬地骇客」官方知识星球与专家们一起讨论！Podwise 的故事才刚刚开始，我们也将在星球持续分享我们的认知，成功可能无法复制，但失败一定可以借鉴。现在就点击下方链接加...

人工智能 book mindmap podcast summarizer transcription

MDX 1.27 k

15 天前

azuwis / pianotrans

Simple GUI for ByteDance's Piano Transcription with Pedals

piano transcription 人工智能

Nix 1.24 k

13 天前

juanmc2005 / diart

#计算机科学#A python package to build AI-powered real-time audio applications

speaker-diarization streaming-audio real-time 深度学习 transcription voice-activity-detection

Python 1.24 k

2 个月前

YaoFANGUK / video-subtitle-generator

视频音频生成字幕，生成srt文件。无需申请第三方API，本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.

Whisper audio2text generation srt subtitle transcription

Python 983

1 年前

transcriptionstream / transcriptionstream

#大语言模型#turnkey self-hosted offline transcription and diarization service with llm summary

自动化 diarization 大语言模型 speaker-diarization speech-recognition transcription Whisper ollama mistral-7b whisperx

Python 834

7 个月前

aschmelyun / subvert

#大语言模型#Generate subtitles, summaries, and chapters from videos in seconds

ChatGPT openai transcription translation video-editing Whisper

PHP 830

2 个月前

Saik0s / Whisperboard

#IOS#The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

openai iOS speech-recognition speech-to-text SwiftUI transcription tca Whisper whisper-cpp

Swift 821

7 个月前

mayeaux / generate-subtitles

#计算机科学#Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration

Express 机器学习 Node.js transcription translation Whisper gpu yt-dlp

JavaScript 787

2 年前