LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
#自然语言处理#Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context l...
This project demonstrates the potential of the Mediapipe library for multimodal machine learning applications, specifically in the context of hand gesture recognition within a Unity3D simulation.
✨✨Latest Advances on Multimodal Large Language Models
This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training data, instruction fine-tuning data, and In-Context learning dat...
[CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"
Code and dataset for EMNLP23 Findings paper "In-context Learning for Few-shot Multimodal Named Entity Recognition"
Real-Time Multimodal Emotion Classification System in E-Learning Context
Radiology Objects in COntext (ROCO): A Multimodal Image Dataset
#自然语言处理#Reading list for research topics in multimodal machine learning
#大语言模型#Research Trends in LLM-guided Multimodal Learning.
Paper List for In-context Learning 🌷
#大语言模型#Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates. - Professor Yu Liu
A technical report on convolution arithmetic in the context of deep learning
#计算机科学#Meta-Transformer for Unified Multimodal Learning
#Awesome#A Survey on multimodal learning research.
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.
#大语言模型#Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.
#计算机科学#Course Material for the machine learning in financial context bootcamp
Code for Continual Learning of Context-dependent Processing in Neural Networks
#自然语言处理#[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Teaching and learning deep learning in the context of digital image processing
A multimodal face liveness detection module that can be used in the context of face anti-spoofing
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
#计算机科学#a unified environment for supervised learning and reinforcement learning in the context of quantitative trading
Multimodal deep learning for Alzheimer's disease dementia assessment