”speech“ 的搜索结果

ASRT_SpeechRecognition

@nl8590687

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Tensorflow cnn ctc Python Keras

Python8.23 k

12 天前🇨🇳

Google Bing GitHub

DeepSpeech存档

Mozilla@mozilla

#计算机科学#DeepSpeech 是一款开源嵌入式（离线、设备上）语音识别引擎，最低可以在树莓派上运行

深度学习机器学习 neural-networks Tensorflow speech-recognition

C++26.6 k

3 个月前

speech

@awni

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python763

2 年前

fish-speech

@fishaudio

SOTA Open Source TTS

llama transformer tts valle vits

Python22.96 k

7 天前

speechbrain

@speechbrain

#计算机科学#A PyTorch-based Speech Toolkit

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement

Python10.45 k

1 个月前

speech_recognition

Anthony Zhang@Uberi

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python audio speech-recognition speech-to-text

Python8.86 k

4 天前

PaddleSpeech

@PaddlePaddle • 百度

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库，用于语音和音频中的各种关键任务的开发，典型的应用包括：语音识别、语音翻译、语音合成等

transformer conformer speech-translation streaming-asr speech-alignment

Python12.23 k

5 天前

speech-to-speech

Hugging Face@huggingface

#计算机科学#Speech To Speech: an effort for an open-sourced and modular GPT4-o

人工智能 assistant language-model 机器学习 Python

Python4.18 k

5 个月前

whisper

OpenAI@openai

whisper 是一个通用语音识别模型

Python88.32 k

10 天前

awesome-speech-recognition-speech-synthesis-papers

ponyzhang@zzw922cn

#新手入门#Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

automatic-speech-recognition papers 路线图 rnn cnn

3.07 k

2 年前

mlx-audio

@Blaizzy

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

apple-silicon audio-processing MLX multimodal speech-recognition

Python2.68 k

2 天前

VoiceCraft

@jasonppy

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook8.38 k

6 个月前

awesome-speech-enhancement

@WenzheLiu-Speech

speech enhancement\speech seperation\sound source localization

1.17 k

2 年前

csm

@SesameAILabs

A Conversational Speech Generation Model

Python13.88 k

4 个月前

NeMo

@NVIDIA-NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation speaker-recognition asr tts generative-ai

Python15.72 k

5 小时前

Orpheus-TTS

@canopyai

#大语言模型#Towards Human-Sounding Speech

大语言模型 tts realtime

Python5.56 k

4 个月前

ChatTTS

@2noise

#大语言模型#ChatTTS是专门为对话场景设计的文本转语音模型，例如LLM助手对话任务。它支持英文和中文两种语言

agent text-to-speech chat ChatGPT chattts

Python37.83 k

2 个月前

pocketsphinx

cmusphinx@cmusphinx

A small speech recognizer

speech-recognition Python C

C4.19 k

10 天前

speech-aligner

@open-speech

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

speech kaldi C++

C++409

5 年前