GitHub 中文社区

回车: Github搜索 Shift+回车: Google搜索

©2025 GitHub中文社区论坛 GitHub官网网站地图 GitHub官方翻译

GitHub on X
GitHub on Facebook
GitHub on LinkedIn
GitHub on YouTube
GitHub on Twitch
GitHub on TikTok
GitHub’s organization on GitHub

集合主题趋势排行榜

#

blip

Website
Wikipedia

gokayfem / awesome-vlm-architectures

#Awesome#Famous Vision Language Models and Their Architectures

clip llava vlm multimodal blip cogvlm internlm vision-language-model Awesome Lists

Markdown 770

2 个月前

SkalskiP / awesome-foundation-and-multimodal-models

#自然语言处理#👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

blip clip foundational-models grounding-dino llava multimodal segment-anything 机器视觉自然语言处理 open-vocabulary-detection open-vocabulary-segmentation image-captioning

Python 611

1 年前

jina-ai / agentchain

#大语言模型#Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

人工智能大语言模型机器学习 multimodal nlproc langchain stable-diffusion blip Whisper

Python 604

2 年前

mertyg / vision-language-models-are-bows

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

multimodal PyTorch vision-language blip clip compositionality

Python 275

2 年前

outpoot / bliptext

The wiki where you edit a word every 30sec, with 2.1M Wikipedia articles ported to a custom markdown format. Real-time text editing, beautiful UI & more. Vandalize articles today!

blip Svelte sveltekit Wiki wikipedia

Svelte 134

12 天前

MikeWangWZHL / VidIL

Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners

blip clip gpt-3 vision-language

Python 114

3 年前

nick8592 / text-guided-image-colorization

This repository provides an interactive image colorization tool that leverages Stable Diffusion (SDXL) and BLIP for user-controlled color generation. With a retrained model using the ControlNet approa...

controlnet gradio image-colorization stable-diffusion blip

Python 87

5 个月前

microsoft / Data-Discovery-Toolkit

#计算机科学#A data discovery and manipulation toolset for unstructured data

blip Keras 机器学习 openai powerbi evaluation-metrics search

Jupyter Notebook 54

2 年前

cobanov / image-captioning

Image captioning using python and BLIP

image-captioning blip image-text-retrieval vision-language

Python 47

2 年前

takenet / blip-sdk-js

The Javascript SDK for BLiP

blip 聊天机器人

JavaScript 33

1 年前

BUAADreamer / SPN4CIR

[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives

blip blip2 clip data-generation image-retrieval llama llava multimodal-learning transformer cross-modal-retrieval

Python 30

6 个月前

YonLiud / Emergency-Caller-FiveM

FiveM Script to allow civilians to dial 911, giving out their location, name, and reason they called, adding a blip to the map too

fivem blip Hacktoberfest Lua

Lua 19

3 年前

entrpn / serving-model-cards

#计算机科学#Collection of OSS models that are containerized into a serving container

机器学习 vertex-ai blip clip dreambooth Google 云 ml-training stable-diffusion vertexai esrgan hugging-face huggingface huggingface-diffusers huggingface-transformers t5 dataflow

Python 16

2 年前

zer0int / CLIP-Interrogator-LongCLIP-hallucinwords

CLIP Interrogator, fully in HuggingFace Transformers 🤗, with LongCLIP & CLIP's own words and / or *your* own words!

blip blip2 clip

Python 16

3 个月前

CodeWizardsDev / wizard-blips

Free Advanced Fivem Blip System, Highly Customizable

blip fivem Lua Script

Lua 15

2 年前

eren23 / sam-clip-diffusion

SAM + CLIP + DIFFUSION for image to edit objects in images using plain text

clip sam blip diffusion huggingface huggingface-transformers image-editing inpainting segment-anything stable-diffusion transformer object-segmentation semantic-segmentation

Python 15

2 年前

ghostofpokemon / oCaption

oCaption: Leveraging OpenAI's GPT-4 Vision for Advanced Image Captioning

blip gpt openai openai-api sdxl vision

Python 11

1 年前

securade / sentinel

#计算机科学#Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras.

人工智能 blip cctv 机器视觉 generative-ai 机器学习 rtsp-stream surveillance video-analytics visual-question-answering vlm

Python 11

7 天前

neechbear / blip

Bash Library for Indolent Programmers

Bash Shell sh blip

Shell 10

3 年前

gyuilLim / Youtube-scene-search-with-text

Finding scenes that you want by text automatically

Jupyter Notebook 9

3 个月前

loading...