”cross-modal-retrieval“ 的搜索结果

#自然语言处理#本项目为CLIP模型的中文版本，使用大规模中文数据进行训练（~2亿图文对），旨在帮助用户快速实现中文领域的图文特征&相似度计算、跨模态检索、零样本图片分类等任务

chinese 机器视觉 multi-modal-learning 自然语言处理 PyTorch

Python4.59 k

4 个月前🇨🇳

multi-modal-learning pytorch transformers cross-modal-retrieval tden classification vision-language multi-modal image-text-retrieval paddlepaddle

Cross-Modal-Retrieval

@zhongzhh8

Cross-Modal Retrieval, triplet loss, Pytorch, Resnet18, Bert, Deep Hashing

Python103

5 年前

Awesome-Cross-Modal-Video-Moment-Retrieval

@yawenzeng

前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。

240

1 年前

Cross-Modal-Retrieval

@BMC-SDNU

Cross-Modal-Real-valuded-Retrieval

Python75

1 年前

CrossModalRetrieval

@jingliao132

Pytorch implementation of 'See, Hear, and Read: Deep Aligned Representations'

Python33

6 年前

crossModalRetrieval

@congjianluo

A demo of a cross-modal retrieval system

JavaScript26

5 年前

OMML

@njustkmg

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

multimodal multimodal-learning Python paddlepaddle PyTorch

Python564

2 年前🇨🇳

SSAH

@lelan-li

Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval(CVPR2018)

Python164

6 年前

xmodaler

@YehLi

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense r...

翻译 - X-modaler 是用于跨模态分析的多功能高性能代码库。

image-captioning video-captioning vision-and-language pretraining cross-modal-retrieval

Python1.03 k

2 年前