#安卓# Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
#大语言模型# InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
#大语言模型# A family of lightweight multimodal models.