Multimodal contrastive pretraining for astronomical data
COntrastive Multimodal Pretraining for AutonomouS Systems
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
[ICLR 2024 Spotlight] This is the official code for the paper "SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training"
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Emu Series: Generative Multimodal Models from BAAI
The most impactful papers related to contrastive pretraining for multimodal models!
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]
Source codes of the paper "Hierarchical Pretraining on Multimodal Electronic Health Records".
#Computer Science# Large-scale pretraining for dialogue
Multimodal-GPT
XLNet: Generalized Autoregressive Pretraining for Language Understanding
This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.
PyTorch original implementation of Cross-lingual Language Model Pretraining.
#Large Language Models# AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Multimodal Unsupervised Image-to-Image Translation
#Natural Language Processing# A large-scale 7B pretraining language model developed by BaiChuan-Inc.
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
#Computer Science# Jina is a deep-learning-based search framework that supports many data types, such as images, video, long text, and PDFs.
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
[ACL'19] [PyTorch] Multimodal Transformer
Multimodal Sarcasm Detection Dataset
Toward Multimodal Image-to-Image Translation
#Computer Science# Meta-Transformer for Unified Multimodal Learning
#Computer Science# Represent, send, store and search multimodal data
The data structure for unstructured data