✨✨Latest Advances on Multimodal Large Language Models
A list of papers on multimodal and large language models, kept as a personal record of papers I read from the daily arXiv listings.
#Large Language Models# Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Translation - NeMo: a toolkit for conversational AI
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
SEED-Story: Multimodal Long Story Generation with Large Language Model
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Code for the paper "Evaluating Large Language Models Trained on Code"
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
#Computer Science# An open-source framework for training large multimodal models.
A guidance language for controlling large language models.
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"
#Large Language Models# Adding guardrails to large language models.
Practical course about Large Language Models.
Test generation using large language models
Notebooks for Large Language Models (LLMs) Specialization
#Large Language Models# A unified evaluation framework for large language models
Finetuning large language models for GDScript generation.