vision-and-language“ 的搜索结果

#大语言模型#SGLang is a fast serving framework for large language models and vision language models.

Python17.96 k
13 分钟前

#Awesome#A curated list of awesome vision and language resources (still under construction... stay tuned!)

547
10 个月前
Jupyter Notebook818
4 年前

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook5.48 k
1 年前

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Python1.49 k
1 年前

Pretrain Vision and Large Language Models in Python, Published by Packt

Jupyter Notebook88
2 年前

#Awesome#Famous Vision Language Models and Their Architectures

Markdown1.01 k
7 个月前

deep learning, image retrieval, vision and language

Python305
4 年前

Strong and Open Vision Language Assistant for Mobile Devices

Python1.27 k
1 年前

Bridging Vision and Language Model

Python276
2 年前

tiny vision language model

Python8.42 k
5 天前
2.92 k
4 个月前

A curated list of prompt-based paper in computer vision and vision-language learning.

922
2 年前

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

540
1 年前

ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field

Python168
8 个月前

#计算机科学#Vision-and-Language Navigation in Continuous Environments using Habitat

Python546
8 个月前
1.16 k
3 年前

Ideas and thoughts about the fascinating Vision-and-Language Navigation

254
2 年前
Jupyter Notebook10.9 k
10 个月前

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python3.96 k
1 年前

Vision-Language Pre-training for Image Captioning and Question Answering

Python424
4 年前
loading...