diarization · GitHub Topics

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

openai speech-to-text Whisper asr speech-recognition subtitles ctranslate2 faster-whisper whisperx uvr diarization speaker-diarization

1.92 k

1 day ago

R3gm / SoniTranslate

Synchronized Translation for Videos. Video dubbing

audio-processing diarization translation translate-audio translate-video video-dubbing asr automatic-dubbing document-translator dubbing speech-to-text stt text-to-speech tts

Python 1.1 k

2 months ago

transcriptionstream / transcriptionstream

#Large Language Models#Turnkey self-hosted offline transcription and diarization service with LLM summary

automation diarization large-language-models speaker-diarization speech-recognition transcription Whisper ollama mistral-7b whisperx

Python 834

7 months ago

microsoft / UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

PyTorch speech-recognition speech-processing speech diarization speech-separation speaker-verification

Python 455

1 year ago

revdotcom / reverb

Open source inference code for Rev's model

speech-recognition speech-to-text asr canary Docker Whisper Open Source speechrecognition diarization huggingface speaker-diarization deep-learning-neural-networks

Python 398

2 days ago

gong-io / gecko

Gecko - A Tool for Effective Annotation of Human Conversations

transcription diarization voice-detection

JavaScript 280

2 years ago

SuyashMore / MevonAI-Speech-Emotion-Recognition

#Computer Science#Identify the emotion of multiple speakers in an audio segment

artificial-intelligence convolutional-neural-networks machine-learning speech-processing emotion-recognition deep-learning diarization keras-tensorflow colab-notebook uis-rnn

C 168

2 years ago

thewh1teagle / sherpa-rs

Rust bindings to https://github.com/k2-fsa/sherpa-onnx

audio diarization embeddings Rust sherpa speech-recognition

Rust 150

21 days ago

cvqluu / simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

speech-to-text transcription diarization asr colab-notebook speaker-diarization

Python 145

1 year ago

desh2608 / dover-lap

Python package for combining diarization system outputs.

diarization

Python 86

2 years ago

bunyaminergen / Callytics

#Large Language Models#Callytics is an advanced call analytics solution that leverages speech recognition and large language model (LLM) technologies to analyze phone conversations from customer service and call centers.

diarization llama3 large-language-models openai Open Source speech-processing speech-recognition speech-to-text voice-activity-detection voice-recognition denoising sentiment-analysis summary topic-modeling transcription

Python 61

2 months ago

wq2012 / SimpleDER

#Computer Science#A lightweight library to compute Diarization Error Rate (DER).

speaker-diarization monitoring speech-processing speech-recognition diarization machine-learning

Python 59

2 years ago

thewh1teagle / pyannote-rs

pyannote audio diarization in Rust

asr diarization onnxruntime Rust speech-recognition Whisper

Rust 50

4 months ago

cvqluu / nn-similarity-diarization

Neural-network-based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")

PyTorch diarization neural-networks speech similarity kaldi lstm speaker-recognition speaker-diarization

Python 44

4 years ago

Picovoice / falcon

#Computer Science#On-device speaker diarization powered by deep learning

speaker-diarization deep-learning diarization on-device speaker-recognition

Python 43

1 month ago

JSchmie / ScrAIbe

Tool for automatic transcription and speaker diarization based on Whisper and pyannote.

diarization speech-to-text transcription

Python 42

3 months ago

desh2608 / spyder

Simple Python package for fast DER computation

diarization

C++ 32

2 years ago

exemplaryai / ai-engine

#Natural Language Processing#Easy-to-use multi-provider ASR/speech-to-text and NLP engine

asr natural-language-processing natural-language-understanding speech-recognition speech-to-text low-code automatic-speech-recognition diarization neural-networks stt Open Source speaker-recognition conversational-ai deep-learning speech language-models

2 years ago

chimechallenge / chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

diarization speech-processing speech-recognition speech-separation automatic-speech-recognition speech-enhancement

Python 21

2 months ago

shahruk10 / kaldi-tflite

Convert Kaldi feature extraction and nnet3 models into TensorFlow Lite models. Currently aimed at converting Kaldi's x-vector models and diarization pipelines to TensorFlow models.

Tensorflow kaldi speech tflite diarization

Python 20

3 years ago