”cross-modal-pretraining“ 的搜索结果

[ICLR 2023] Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?

Python100

5 个月前

crossmodal-retrieval blip2 cross-modal-pretraining vision-language-pretraining python large-language-models classification multimodal-learning llama paddlepaddle

Video-LLaMA

@DAMO-NLP-SG

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

large-language-models video-language-pretraining vision-language-pretraining blip2 llama

Python2.82 k

6 个月前

Entity-Graph-Enhanced-Cross-Modal-Pretraining-for-Instance-level-Product-Retrieval

@Xiaodongsuper

Python15

2 年前

Croc

@deepglint

Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension

Python18

1 个月前

Cross-Modal-Pretraining-with-BERT

@luomingshuang

4 年前

VLPCook

@mshukor

Official implementation of VLPCook: Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval

Jupyter Notebook11

2 年前

RLIP

@JacobYuan7

[NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Graph Generation.

Python72

6 个月前

cross-modality-pretraining

@walln

Graduate research exploring transformer behavior

Python1

3 年前

ECG-CMR

@Yukui-1999

Official code repository for the paper " Large-scale cross-modality pretrained model enhances cardiovascular state estimation and cardiomyopathy detection from electrocardiograms: An AI system develop...

Python3

9 天前

counting-probe

@Heidelberg-NLP

Counting dataset for Vision & Language models. Introduced in the paper "Seeing Past Words: Testing the Cross-Modal Capabilities of Pretrained V&L Models". https://arxiv.org/abs/2012.12352

3 年前

Awesome_Matching_Pretraining_Transfering

@Paranioar

The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

403

5 个月前

Cross-Modal-Center-Loss

@LongLong-Jing

Cross-Modal Center Loss for 3D Cross-Modal Retrieval (CVPR2021)

Python29

4 年前

XLM

@facebookresearch • Meta

PyTorch original implementation of Cross-lingual Language Model Pretraining.

翻译 - PyTorch最初执行跨语言模型预训练。

Python2.89 k

2 年前

ACMR-demo

@SRTP-cross-modal-retrieval

basic modal for cross-modal-retrieval

Python9

6 年前

pytorch_fnet

@AllenCellModeling

Three dimensional cross-modal image inference

Python152

4 年前

SCALE_code

@Xiaodongsuper

M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining CVPR 2022

Python32

2 年前

VideoX

Microsoft@microsoft

VideoX: a collection of video cross-modal models

Python986

6 个月前

cross_modal_adaptation

@linzhiqiu

Cross-modal few-shot adaptation with CLIP

Python320

9 个月前

deep-cross-modal-hashing

@WangGodder

Deep learning cross modal hashing in PyTorch

Python102

3 年前

VisualVoice

@facebookresearch • Meta

Audio-Visual Speech Separation with Cross-Modal Consistency

Python223

1 年前

xmuda

@valeoai

Cross-Modal Unsupervised Domain Adaptationfor 3D Semantic Segmentation

Python194

2 年前

DCMH

@WendellGul

PyTorch implementation for paper "Deep Cross-Modal Hashing"

Python108

3 年前

DCMH-CVPR2017

@jiangqy

source code for paper "Deep Cross-Modal Hashing"

MATLAB99

4 年前

Cross-Modal-Projection-Learning

@YingZhangDUT

TensorFlow Implementation of Deep Cross-Modal Projection Learning

Python93

5 年前

StacMR

@AndresPMD

Scene Text Aware Cross Modal Retrieval (StacMR)

Python23

3 年前

cross-modal-hasing-playground

@yolo2233

Python implementation of cross-modal hashing algorithms

Python22

2 年前

OMML

@njustkmg

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

multimodal multimodal-learning Python paddlepaddle PyTorch

Python564

2 年前🇨🇳