#计算机科学#Gradio是一个开源的Python库,用于构建演示机器学习或数据科学,以及web应用程序。使用Gradio,您可以基于您的机器学习模型或数据科学工作流快速创建一个漂亮的用户界面,让用户可以”尝试“拖放他们自己的图像、粘贴文本、录制他们自己的声音,并通过浏览器与您的演示程序进行交互。
#计算机科学#stable diffusion webui colab
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
Easy Docker setup for Stable Diffusion with user-friendly UI
#大语言模型#Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2E, F5-TTS, CosyVoice), with Whisper audio processing, RVC voice changer, YouTube download,...
#大语言模型#InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...
#大语言模型#[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
#大语言模型#Build multimodal language agents for fast prototype and production
#大语言模型#A colab gradio web UI for running Large Language Models
#计算机科学#TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
#计算机科学#A sketch extractor for anime/illustration.
翻译 - 动漫/插画的素描提取器。
A Web UI for easy subtitle using whisper model.
Fast stable diffusion on CPU
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
#大语言模型#Ship RAG based LLM web apps in seconds.