微软VALL-E X 零样本语音合成模型的开源实现
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
Instant voice cloning by MIT and MyShell.
Evaluation Protocol for Large-Scale Zero-Shot TTS Literature
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
Open-vocabulary Semantic Segmentation
Zero-Shot Learning with GCN (CVPR 2018)
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
☺️ One Shot Voice Cloning base on Unet-TTS
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
Robust fine-tuning of zero-shot models
Zero-shot Image-to-Image Translation [SIGGRAPH 2023]
"Zero-Shot" Super-Resolution using Deep Internal Learning
Official code for "Decoupling Zero-Shot Semantic Segmentation"
Official implementations for paper: Anydoor: zero-shot object-level image customization
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Code base for ZJL zero shot learning competition.
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
Zero-shot Composed Image Retrieval with Textual Inversion
[SIGGRAPH Asia 2022] Text2Light: Zero-Shot Text-Driven HDR Panorama Generation