🔥🔥🔥 Latest Papers, Code, and Datasets on Vid-LLMs.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
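The SlowFast recipe behind that codebase is easy to convey in a few lines: one pathway sees a temporally sparse clip, the other a dense one. A minimal PyTorch sketch of the input sampling only (the `alpha=4` speed ratio and the clip shape are assumptions; PySlowFast's configs define the real values):

```python
import torch

def sample_pathways(frames: torch.Tensor, alpha: int = 4):
    """Split one decoded clip into SlowFast's two pathway inputs.

    frames: (C, T, H, W) RGB clip. The Fast pathway keeps every frame;
    the Slow pathway subsamples the temporal axis by `alpha`.
    """
    fast = frames
    slow = frames[:, ::alpha]
    return slow, fast

clip = torch.randn(3, 32, 224, 224)    # toy clip: 32 frames of 224x224 RGB
slow, fast = sample_pathways(clip)
print(slow.shape, fast.shape)          # (3, 8, 224, 224) and (3, 32, 224, 224)
```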
A collection of recent video understanding datasets, under construction!
Code and models of the paper "ECO: Efficient Convolutional Network for Online Video Understanding", ECCV 2018
A deep learning library for video understanding research.
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
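The shift itself is small enough to sketch: move a fraction of channels one step along the temporal axis in each direction, at zero extra FLOPs. A minimal PyTorch version (the `fold_div=8` split mirrors the paper's default; how the shift is fused into residual blocks differs in the official code):

```python
import torch

def temporal_shift(x: torch.Tensor, fold_div: int = 8) -> torch.Tensor:
    """Zero-padded temporal shift over (N, T, C, H, W) features."""
    n, t, c, h, w = x.size()
    fold = c // fold_div
    out = torch.zeros_like(x)
    out[:, :-1, :fold] = x[:, 1:, :fold]                   # 1/8 of channels look ahead
    out[:, 1:, fold:2 * fold] = x[:, :-1, fold:2 * fold]   # 1/8 look behind
    out[:, :, 2 * fold:] = x[:, :, 2 * fold:]              # the rest stay in place
    return out
```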
Official code for the Goldfish model (long video understanding) and MiniGPT4-video (short video understanding)
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
[ECCV 2024] Video Foundation Models & Data for Multimodal Understanding
Long-Term Feature Banks for Detailed Video Understanding
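The core mechanism is attention from short-term clip features onto a bank of features precomputed over the whole video. A simplified sketch (the paper's feature bank operator is a non-local block; plain scaled dot-product attention here only conveys the shape of the computation):

```python
import torch
import torch.nn.functional as F

def feature_bank_read(clip_feats: torch.Tensor, bank: torch.Tensor) -> torch.Tensor:
    """Attend current-clip features over a long-term feature bank.

    clip_feats: (N, D) features of the short clip being processed.
    bank:       (M, D) features cached from clips across the whole video.
    Returns (N, D) long-term context to fuse with the short-term features.
    """
    attn = F.softmax(clip_feats @ bank.t() / bank.size(1) ** 0.5, dim=-1)  # (N, M)
    return attn @ bank                                                    # (N, D)
```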
PaddlePaddle models for the YouTube-8M Video Understanding Challenge
A collection of papers and notes on video person re-identification
Official repository of the paper "VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding"
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
TensorFlow code for finetuning the I3D model on UCF101.
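In outline, such finetuning swaps the Kinetics classification head for a 101-way one and trains on UCF101 clips. A hedged Keras sketch: the small Conv3D trunk below is a runnable stand-in for the real Kinetics-pretrained I3D network, and the clip shape and hyperparameters are assumptions, not the repo's actual setup:

```python
import tensorflow as tf

# Stand-in trunk: in practice this is the Kinetics-pretrained I3D network.
backbone = tf.keras.Sequential([
    tf.keras.layers.Conv3D(64, 7, strides=2, padding="same", activation="relu"),
    tf.keras.layers.GlobalAveragePooling3D(),
])
backbone.trainable = False          # with real pretrained weights, freeze the trunk

inputs = tf.keras.Input(shape=(64, 224, 224, 3))       # (T, H, W, C) RGB clip
logits = tf.keras.layers.Dense(101)(backbone(inputs))  # UCF101 has 101 classes
model = tf.keras.Model(inputs, logits)
model.compile(
    optimizer=tf.keras.optimizers.Adam(1e-4),
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=["accuracy"],
)
# model.fit(...) with a UCF101 clip pipeline then trains the new head.
```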
video-understanding: Video Classification, Action Recognition, Video Datasets
1st place solution to Kaggle's 2018 YouTube-8M Video Understanding Challenge
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.
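The temporally-adaptive idea reduces to modulating one shared base kernel with a factor predicted per frame. A heavily simplified PyTorch sketch (the real TAdaConv uses a more elaborate calibration branch; the linear calibration and shapes here are assumptions):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TAdaConv2dSketch(nn.Module):
    """Toy temporally-adaptive conv: a shared base kernel scaled,
    per frame, by a calibration factor predicted from that frame."""

    def __init__(self, c_in: int, c_out: int, k: int = 3):
        super().__init__()
        self.base = nn.Parameter(torch.randn(c_out, c_in, k, k) * 0.01)
        self.calib = nn.Linear(c_in, c_out)    # per-frame channel calibration
        self.c_out, self.k = c_out, k

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        n, t, c, h, w = x.shape                # x: (N, T, C, H, W)
        outs = []
        for i in range(t):
            frame = x[:, i]                                  # (N, C, H, W)
            alpha = self.calib(frame.mean(dim=(2, 3)))       # (N, C_out)
            weight = self.base.unsqueeze(0) * alpha.view(n, self.c_out, 1, 1, 1)
            # per-sample adaptive kernels via the grouped-conv trick
            y = F.conv2d(frame.reshape(1, n * c, h, w),
                         weight.reshape(n * self.c_out, c, self.k, self.k),
                         padding=self.k // 2, groups=n)
            outs.append(y.reshape(n, self.c_out, h, w))
        return torch.stack(outs, dim=1)                      # (N, T, C_out, H, W)

x = torch.randn(2, 8, 16, 14, 14)
print(TAdaConv2dSketch(16, 32)(x).shape)       # torch.Size([2, 8, 32, 14, 14])
```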
A new multi-shot video understanding benchmark, Shot2Story, with comprehensive video summaries and detailed shot-level captions.