PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Recent Advances in Vision and Language Pre-training (VLP)
Vision-Language Pre-training for Image Captioning and Question Answering
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense r...
A curated list of vision-and-language pre-training (VLP). :-)
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
[MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.
Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI (Nature Medicine). PLIP is a large-scale pre-trained model that can be used to extra...
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
Code for ALBEF: a new vision-language pre-training method
Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
Grounded Language-Image Pre-training
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
MASS: Masked Sequence to Sequence Pre-training for Language Generation
On Efficient Transformer-Based Image Pre-training for Low-Level Vision
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
OpenMMLab Pre-training Toolbox and Benchmark
CLIP (Contrastive Language-Image Pre-Training) in tensorflow
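Several of the listed repos (CLIP, SLIP, ALBEF, TCL) build on a symmetric image-text contrastive objective. The following is a minimal, dependency-free sketch of that loss over a precomputed similarity matrix; it is illustrative only, and the function names are my own, not taken from any of these codebases:

```python
import math

def softmax_xent(logits, target):
    # Cross-entropy of one row of logits against the target index,
    # computed with the usual max-subtraction for numerical stability.
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    z = sum(exps)
    return -math.log(exps[target] / z)

def clip_style_loss(sim):
    # sim[i][j]: scaled similarity between image i and text j;
    # matched (positive) pairs sit on the diagonal.
    n = len(sim)
    # image -> text: each image row should pick out its own caption
    img_to_txt = sum(softmax_xent(sim[i], i) for i in range(n)) / n
    # text -> image: each text column should pick out its own image
    txt_to_img = sum(softmax_xent([sim[j][i] for j in range(n)], i)
                     for i in range(n)) / n
    return (img_to_txt + txt_to_img) / 2
```

With uniform similarities the loss is log(n) (chance level), and it approaches zero as the diagonal entries dominate, which is the behavior the contrastive objective trains toward.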
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
Code release for SLIP: Self-supervision meets Language-Image Pre-training
Implementations of some self-supervised methods for pre-training vision models
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
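Several entries above (MASS, MPNet, the multi-modal masked autoencoder) share a masked-prediction pre-training objective: hide a subset of input tokens and train the model to reconstruct them. A minimal sketch of the masking step alone (illustrative; the function and mask token are my own placeholders, not any repo's API):

```python
import random

def mask_tokens(tokens, mask_token="[MASK]", ratio=0.15, seed=0):
    # BERT-style masking: pick a random subset of positions (about
    # `ratio` of the sequence, at least one) and replace them with a
    # mask token; return the masked sequence and the chosen positions,
    # which serve as the prediction targets.
    rng = random.Random(seed)
    n_mask = max(1, int(len(tokens) * ratio))
    positions = set(rng.sample(range(len(tokens)), n_mask))
    masked = [mask_token if i in positions else t
              for i, t in enumerate(tokens)]
    return masked, sorted(positions)
```

MASS masks contiguous spans rather than independent positions, and MPNet additionally permutes the prediction order; both are variations on this same hide-and-reconstruct recipe.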