#Natural Language Processing#Reading list for research topics in multimodal machine learning
#Computer Science#An open-source framework for training large multimodal models.
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
A curated list of Multimodal Related Research.
#Computer Science#[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
#Face Recognition#ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
A Comparative Framework for Multimodal Recommender Systems
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.
#Computer Science#Papers, code and datasets about deep learning and multi-modal learning for video analysis
[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
A collection of resources on applications of multi-modal learning in medical imaging.
#Natural Language Processing#Multimodal model for text and tabular data, with HuggingFace transformers as the building block for text data
#Awesome#A curated list of awesome vision and language resources (still under construction... stay tuned!)
#Natural Language Processing#[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
#Natural Language Processing#Official PyTorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
#Natural Language Processing#Multi-modality pre-training
Multi-modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image captioning.
#Computer Science#Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem. ⭐ Star to support our work!