gradio · GitHub Topics

深度学习 diffusion image-generation image2image img2img text2image txt2img 人工智能 ai-art gradio PyTorch stable-diffusion torch upscaling Web unstable

Python 151.04 k

1 个月前

gradio-app / gradio

#计算机科学#Gradio是一个开源的Python库，用于构建演示机器学习或数据科学，以及web应用程序。使用Gradio，您可以基于您的机器学习模型或数据科学工作流快速创建一个漂亮的用户界面，让用户可以”尝试“拖放他们自己的图像、粘贴文本、录制他们自己的声音，并通过浏览器与您的演示程序进行交互。

机器学习 models ui ui-components interface Python 数据科学数据可视化深度学习数据分析 gradio gradio-interface python-notebook deploy Hacktoberfest

Python 37.42 k

1 天前

camenduru / stable-diffusion-webui-colab

#计算机科学#stable diffusion webui colab

stable-diffusion stable-diffusion-web-ui stable-diffusion-webui colab colab-notebook colaboratory dreambooth lora 人工智能 gradio 深度学习 image-generation img2img PyTorch txt2img ai-art t2v text2video

Jupyter Notebook 15.82 k

6 个月前

Zeyi-Lin / HivisionIDPhotos

#人脸识别#⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Demo gradio idphoto 机器学习 matting 工具 cnn Docker face-recognition FastAPI mtcnn unet

Python 15.49 k

9 天前

DrewThomasson / ebook2audiobook

Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!

audiobooks Docker epub Linux macOS tts Windows xtts 声音克隆 gradio 中文 english multilingual colab-notebook kaggle

Python 9.47 k

6 天前

AbdBarho / stable-diffusion-webui-docker

Easy Docker setup for Stable Diffusion with user-friendly UI

Docker stable-diffusion gradio docker-compse PyTorch

Shell 7.11 k

8 个月前

modelscope / FunClip

#大语言模型#Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recognition video-clip video-subtitles subtitles-generator speech-to-text gradio gradio-python-llm 大语言模型

Python 4.4 k

1 个月前

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

faster-whisper tts Whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp 声音克隆 podcasts audiobook voice-conversion karaoke whisperx

Python 3.59 k

14 天前

ant-research / MagicQuill

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

aigc image-editing mllm gradio

Python 3.27 k

1 个月前

OpenGVLab / InternGPT

#大语言模型#InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...

ChatGPT foundation-model gpt gpt-4 gradio husky image-captioning langchain 大语言模型 multimodal vqa llama vicuna video-generation sam segment-anything click draggan

Python 3.22 k

8 个月前

OpenGVLab / Ask-Anything

#大语言模型#[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

captioning-videos ChatGPT gradio langchain video-question-answering video-understanding stablelm chat Video big-model foundation-models large-language-models

Python 3.21 k

3 个月前

om-ai-lab / OmAgent

#大语言模型#Build multimodal language agents for fast prototype and production

large-language-models multimodal-agent vision-and-language agent workflow 聊天机器人 gpt4 大语言模型 multimodal rag vlm gpt gradio llama llava openai Python gemini

Python 2.47 k

25 天前

camenduru / text-generation-webui-colab

#大语言模型#A colab gradio web UI for running Large Language Models

colab colab-notebook colaboratory gradio llama 大语言模型 alpaca koala lama llamas vicuna

Jupyter Notebook 2.1 k

1 年前

rsxdalv / tts-generation-webui

#计算机科学#TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)