#大语言模型#Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding