GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub

编程语言

”blip2“ 的搜索结果

Video-BLIP2-Preprocessor
@ExponentialML

A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it

Python138
1 年前

相关主题

blip2ChatGPTllamacross-modal-pretrainingvision-language-pretrainingvisual-language-learninggpt-4vocabulary

Google   Bing   GitHub

Image2Paragraph
@showlab

#大语言模型#[Image 2 Text Para] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

ChatGPTtoolboxgpt4
Python813
2 年前
sd-webui-blip2
@Tps-F

WebUI extension for using Blip2

Python98
1 年前
Video-LLaMA
@DAMO-NLP-SG

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

large-language-modelsvideo-language-pretrainingvision-language-pretrainingblip2llama
Python3.03 k
1 年前
chat-with-nerf
@sled-group

#大语言模型#[ICRA 2024] Chat with NeRF enables users to interact with a NeRF model by typing in natural language.

blip2ChatGPTgpt-4nerf
Python313
1 年前
image_text_retrieval_BLIP_BLIP2
@enrico310786

Experiments with LAVIS library to perform image2text and text2image retrieval with BLIP and BLIP2 models

Jupyter Notebook14
2 年前
Blip2
@wjm202

Python6
2 年前
BLIVA
@mlpc-ucsd

#大语言模型#(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

blip2聊天机器人instruction-tuningllama大语言模型
Python261
1 年前
stable-diffusion-webui-blip2-captioner
@p1atdev

BLIP2 captioning tool as an extension of AUTOMATIC's WebUI

automatic1111stable-diffusionstable-diffusion-webuistable-diffusion-webui-plugin
Python60
2 年前
NeuroClips
@gongzix

Official code base for NeuroClips

fmriblip2
MATLAB91
1 个月前
VLog
@showlab

#大语言模型#[CVPR 2025] Video Narration as Vocabulary & Video as Long Document

ChatGPTlangchain大语言模型Whisper
Python572
4 个月前