GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub

编程语言

”vision-and-language“ 的搜索结果

sglang
@sgl-project

#大语言模型#SGLang is a fast serving framework for large language models and vision language models.

CUDAinferencellamallava大语言模型
Python15.93 k
1 小时前

相关主题

vision-and-languagevision-language-model深度学习multimodal-deep-learningfoundation-modelsclipAwesome Listsvlmllavaimage-captioning

Google   Bing   GitHub

awesome-vision-and-language
@sangminwoo

#Awesome#A curated list of awesome vision and language resources (still under construction... stay tuned!)

Awesome Listsvision-and-languagemultimodal-learning
540
8 个月前
Meta Research
vilbert-multi-task存档
@facebookresearch • Meta

Multi Task Vision and Language

Jupyter Notebook813
3 年前
Salesforce
BLIP
Salesforce@salesforce

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

vision-languagevision-and-language-pre-trainingimage-text-retrievalimage-captioningvisual-question-answering
Jupyter Notebook5.38 k
1 年前
ViLT
@dandelin

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

vision-and-language
Python1.48 k
1 年前
awesome-Vision-and-Language-Pre-training
@phellonchen

Recent Advances in Vision and Language Pre-training (VLP)

vision-and-language-pre-trainingvision-and-languagepretrainingmultimodal-deep-learning
292
2 年前
Packt
Pretrain-Vision-and-Large-Language-Models-in-Python
Packt@PacktPublishing

Pretrain Vision and Large Language Models in Python, Published by Packt

Jupyter Notebook88
2 年前
谷歌公司
tirg
谷歌公司@google

deep learning, image retrieval, vision and language

Python305
4 年前
MobileVLM
@Meituan-AutoML

Strong and Open Vision Language Assistant for Mobile Devices

Python1.24 k
1 年前
awesome-vlm-architectures
@gokayfem

#Awesome#Famous Vision Language Models and Their Architectures

clipllavavlm
Markdown921
5 个月前
BriVL
@BAAI-WuDao

Bridging Vision and Language Model

Python276
2 年前
moondream
@vikhyat

tiny vision language model

Python8.19 k
21 天前
VLM_survey
@jingyi0000

#计算机科学#Collection of AWESOME vision-language models for vision tasks

机器视觉深度学习knowledge-distillationsurveytransfer-learning
2.83 k
2 个月前
Awesome_Prompting_Papers_in_Computer_Vision
@ttengwang

A curated list of prompt-based paper in computer vision and vision-language learning.

prompt-learningadapterfew-shot-learningprompt-tuningzero-shot-learning
921
2 年前
awesome-vision-language-navigation
@eric-ai-lab

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

vision-and-languagenavigationembodied-agent
514
1 年前
vilmedic
@jbdel

ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field

Python168
6 个月前
awesome-vision-language-pretraining-papers
@yuewang-cuhk

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

vision-and-languagepretrainingmultimodal-deep-learningbert
1.15 k
3 年前
VLN-CE
@jacobkrantz

#计算机科学#Vision-and-Language Navigation in Continuous Environments using Habitat

人工智能机器视觉Robotics深度学习research
Python459
6 个月前
Thinking-VLN
@YicongHong

Ideas and thoughts about the fascinating Vision-and-Language Navigation

234
2 年前
VLP
@LuoweiZhou

Vision-Language Pre-training for Image Captioning and Question Answering

Python419
3 年前
Awesome-Foundation-Models
@uncbiag

A curated list of foundation models for vision and language tasks

foundation-modelsvision-transformerlarge-language-modelstransformer-modelsmultimodal-models
1.05 k
18 天前
Salesforce
LAVIS
Salesforce@salesforce

#计算机科学#LAVIS - A One-stop Library for Language-Vision Intelligence

深度学习deep-learning-libraryimage-captioningsalesforcevision-and-language
Jupyter Notebook10.73 k
8 个月前
DeepSeek-VL
@deepseek-ai

DeepSeek-VL: Towards Real-World Vision-Language Understanding

vision-language-modelvision-language-pretrainingfoundation-models
Python3.91 k
1 年前
loading...