#Large Language Models#[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance
#Computer Science#OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
#Computer Science#Video classification tools using 3D ResNet
Tutorial for video classification/action recognition using 3D CNN and CNN+RNN on UCF101
[ICLR 2022] Official implementation of UniFormer
#Computer Science#Papers, code and datasets about deep learning and multi-modal learning for video analysis
#Computer Science#Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
Video classification using a two-stream CNN
Classifies videos into various classes using the Keras library with TensorFlow as the backend.
[ICCV 2019 (Oral)] Temporal Attentive Alignment for Large-Scale Video Domain Adaptation (PyTorch)
#Computer Science#Deep learning sex position classifier
Simplest and fastest image and text annotation tool.
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.
Tutorial on 3D convolutional networks
Explore Action Recognition
#Computer Science#Implementation code for the paper Optical Flow Guided Feature, CVPR 2018
Classify UCF101 videos using one frame at a time with a CNN (InceptionV3)
Exploration of different solutions to action recognition in video, using neural networks implemented in PyTorch.