#计算机科学#An open-source framework for training large multimodal models.
A curated list of Multimodal Related Research.
翻译 - 精选的多模式相关研究清单。
#计算机科学#[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
翻译 - Pytorch的官方实施“ OmniNet:用于多模式多任务学习的统一体系结构”作者:Subhojeet Pramanik,Priyanka Agrawal,Aman Hussain