Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
The official repo of Qwen-VL (通义千问-VL), a chat & pretrained large vision-language model proposed by Alibaba Cloud.
A curated list of vision-and-language pre-training (VLP). :-)
A paper list on large multi-modality models, parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, for preliminary insight.
Code and models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Awesome Vision-Language Pretraining Papers
Vision-Language Pretraining & Efficient Transformer Papers.
Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]
XLNet: Generalized Autoregressive Pretraining for Language Understanding
PyTorch original implementation of Cross-lingual Language Model Pretraining.
A large-scale 7B pretrained language model developed by BaiChuan-Inc.
Collection of AWESOME vision-language models for vision tasks
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Code accompanying the paper Pretraining Language Models with Human Preferences
Multi-Task Vision and Language
Bridging Vision and Language Model
LAVIS - A One-stop Library for Language-Vision Intelligence
Pipeline for pulling and processing online language model pretraining data from the web