Search results for "vision-language-model"

tiny vision language model

Jupyter Notebook 5.88 k

7 days ago

image-captioning language-model vision-language-pretraining chatgpt large-language-models transformer moe vision-language-model vision-and-language vision-language-transformer

sglang

@sgl-project

#Large Language Models# SGLang is a fast serving framework for large language models and vision language models.

CUDA inference llama llava llm

Python 6.27 k

1 hour ago

VLM_survey

@jingyi0000

Collection of AWESOME vision-language models for vision tasks

2.55 k

3 days ago

MoE-LLaVA

@PKU-YuanGroup

Mixture-of-Experts for Large Vision-Language Models

large-vision-language-model mixture-of-experts moe multi-modal

Python 2 k

6 months ago

Awesome-Prompting-on-Vision-Language-Model

@JindongGu

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

399

1 month ago

CoOp

@KaiyangZhou

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1.79 k

6 months ago

MGM

@dvlab-research

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

generation large-language-models vision-language-model

Python 3.22 k

7 months ago

vision-language-models-are-bows

@mertyg

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Python 259

1 year ago

Vary

@Ucas-HaoranWei

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1.82 k

2 months ago

awesome-vision-language-pretraining-papers

@yuewang-cuhk

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

1.14 k

2 years ago

BriVL

@BAAI-WuDao

Bridging Vision and Language Model

Python 276

2 years ago

GeoChat

@mbzuai-oryx

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Python 455

1 day ago

prismer

NVIDIA Research Projects @NVlabs

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

image-captioning language-model multi-modal-learning multi-task-learning vision-language-model

Python 1.3 k

10 months ago

VisionLLM

@OpenGVLab

VisionLLM Series

Python 937

1 month ago

Vary-toy

@Ucas-HaoranWei

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 558

6 months ago

VALOR

@TXH-mercury

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Python 262

6 months ago

vilbert-multi-task

@facebookresearch • Meta

Multi Task Vision and Language

Jupyter Notebook 799

3 years ago

awesome-llm-and-aigc

@codingonion

🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.

510

4 months ago

InternLM-XComposer

@InternLM

#Large Language Models# InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

ChatGPT visual-language-learning multi-modality foundation gpt-4

Python 2.54 k

2 months ago

LAMA

@facebookresearch • Meta

LAnguage Model Analysis

Python 1.36 k

5 months ago

LAVIS

Salesforce @salesforce

#Computer Science# LAVIS - A One-stop Library for Language-Vision Intelligence

deep-learning deep-learning-library image-captioning salesforce vision-and-language

Jupyter Notebook 9.99 k

11 days ago

GLM

THUDM @THUDM

GLM (General Language Model)

Python 3.2 k

1 year ago

Qwen-VL

@QwenLM • Alibaba

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

large-language-models vision-language-model

Python 5.12 k

4 months ago

DeepSeek-VL

@deepseek-ai

DeepSeek-VL: Towards Real-World Vision-Language Understanding

vision-language-model vision-language-pretraining foundation-models

Python 2.1 k

7 months ago

Awesome_Prompting_Papers_in_Computer_Vision

@ttengwang

A curated list of prompt-based papers in computer vision and vision-language learning.

898

1 year ago

tirg

Google @google

deep learning, image retrieval, vision and language

Python 300

4 years ago
