#大语言模型#[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
#计算机科学#OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
翻译 - OpenMMLab的下一代操作理解工具箱和基准
#计算机科学#Video classification tools using 3D ResNet
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
[ICLR2022] official implementation of UniFormer
翻译 - UniFormer 的正式实施
#计算机科学#Papers, code and datasets about deep learning and multi-modal learning for video analysis
#计算机科学#Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
Video Classification using 2 stream CNN
To classify video into various classes using keras library with tensorflow as back-end.
#计算机科学#deep learning sex position classifier
[ICCV 2019 (Oral)] Temporal Attentive Alignment for Large-Scale Video Domain Adaptation (PyTorch)
Simplest and fastest image and text annotation tool.
Tutorial about 3D convolutional network
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.
Explore Action Recognition
Exploration of different solutions to action recognition in video, using neural networks implemented in PyTorch.
#计算机科学#Implementation Code of the paper Optical Flow Guided Feature, CVPR 2018
Classify UCF101 videos using one frame at a time with a CNN(InceptionV3)