#大语言模型#Janus-Series: Unified Multimodal Understanding and Generation Models
Voice Conversion With Just Nearest Neighbors
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization.
The official github repo for MixEval-X, the first any-to-any, real-world benchmark.
Voice Conversions for Japanesse