✨✨Latest Advances on Multimodal Large Language Models
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
#大语言模型#InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Mixture-of-Experts for Large Vision-Language Models
🔥 🔥 🔥 [NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies
#大语言模型#[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
[NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
#自然语言处理#Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
This is the offical repository of LLAVIDAL
Easy-to-use large vision language model pipeline for quantitative analysis