MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
📖 A curated list of resources dedicated to hallucination in multimodal large language models (MLLMs).
Official implementation of the paper "MagicQuill: An Intelligent Interactive Image Editing System".
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Happy experimenting with MLLMs and LLMs!
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer
A novel Multimodal Large Language Model (MLLM) architecture designed to structurally align visual and textual embeddings.
Accepted by the IJCAI-24 Survey Track.
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Dataset and code for our ACL 2024 paper "Multimodal Table Understanding". We propose the first large-scale multimodal IFT and pre-training dataset for table understanding and develop a generalist tabular MLLM.
Personal project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train...
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLMs). It covers datasets, tuning techniques, in-context learning, and more.