This repository was built in association with our position paper, "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the information...
Multimodal Sarcasm Detection Dataset
Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)
DataComp: In search of the next generation of multimodal datasets
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation
A curated list of AWESOME papers, datasets and tutorials within Multimodal Knowledge Graph.
Multimodal datasets.
MINT-1T: A one trillion token multimodal interleaved dataset.
Preprocessed Datasets for our Multimodal NER paper
✨✨Latest Advances on Multimodal Large Language Models
List of datasets, papers, and codes related to multimodal/multisource/multisensor remote sensing classification
Multimodal-GPT
This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.
#Large Language Models# AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Multimodal Unsupervised Image-to-Image Translation
#Computer Science# Jina is a deep-learning-based search framework that supports many data types, such as images, video, long text, and PDFs.
#Natural Language Processing# 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Translation - 🤗 Fast, efficient, open datasets and evaluation metrics for natural language processing and more, in PyTorch, TensorFlow, NumPy and Pandas
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
[ACL'19] [PyTorch] Multimodal Transformer
Toward Multimodal Image-to-Image Translation
#Computer Science# Meta-Transformer for Unified Multimodal Learning
#Computer Science# Represent, send, store and search multimodal data
Translation - The data structure for unstructured data
Emu Series: Generative Multimodal Models from BAAI