#自然语言处理#本项目为CLIP模型的中文版本,使用大规模中文数据进行训练(~2亿图文对),旨在帮助用户快速实现中文领域的图文特征&相似度计算、跨模态检索、零样本图片分类等任务
Cross-Modal Retrieval, triplet loss, Pytorch, Resnet18, Bert, Deep Hashing
前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。
Cross-Modal-Real-valuded-Retrieval
Pytorch implementation of 'See, Hear, and Read: Deep Aligned Representations'
A demo of a cross-modal retrieval system
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense r...
翻译 - X-modaler 是用于跨模态分析的多功能高性能代码库。
媒体计算实践作业:图像——文本跨模态搜索
basic modal for cross-modal-retrieval
Cross-Modal Center Loss for 3D Cross-Modal Retrieval (CVPR2021)
Scene Text Aware Cross Modal Retrieval (StacMR)
The baselines of cross-modal hashing retrieval.
Deep Supervised Cross-modal Retrieval (CVPR 2019, PyTorch Code)
Label Embedding Online Hashing for Cross-Modal Retrieval
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Graph Convolutional Network Hashing for Cross-Modal Retrieval, IJCAI2019
Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
Deep Hashing Algorithm for Cross-modal Images and Text Retrieval
PyTorch code for BagFormer: Better Cross-Modal Retrieval via bag-wise interaction