#计算机科学#A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
翻译 - 来自Facebook AI Research(FAIR)的视觉和语言多模式研究的模块化框架
#大语言模型#Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
OmniFusion — a multimodal model to communicate using text and images