A slightly improved take on the official version for fine-tuning XTTS
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
A Gradio UI for XTTSv2 and RVC.
#Large language model# 🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoro
Talk to a local AI with a custom voice, based on the Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
#Large language model# Turn PDFs and EPUBs into audiobooks, and subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
Mantella is a Skyrim and Fallout 4 mod which allows you to naturally speak to NPCs using Whisper (speech-to-text), LLMs (text generation), and Piper / xVASynth / XTTS (text-to-speech).