An open-source toolbox for action understanding based on PyTorch
Translation: an open-source toolbox for action understanding based on PyTorch
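Built on a modular design, this toolbox provides a rich set of video algorithm implementations along with industrial-grade video algorithm optimizations and applications. These include action localization and recognition, behavior analysis, smart cover generation, video annotation, and video tagging for industries such as security, sports, internet, and media. It covers techniques such as action recognition and video classification, action localization, action detection, and multimodal text-video retrieval.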
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Awesome papers & datasets specifically focused on long-term videos.
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
Temporal action detection: benchmark results, feature downloads, etc.
[TIP 2022] End-to-end Temporal Action Detection with Transformer
A curated publication list on weakly-supervised temporal action localization
[ECCV 2022] Official PyTorch implementation of the paper "Zero-Shot Temporal Action Detection via Vision-Language Prompting"
[CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection
[CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark
A single-stage temporal action detection toolbox based on PyTorch
[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points
End-to-End Streaming Video Temporal Segmentation
[ECCV 2022] Official PyTorch implementation of the paper "Semi-Supervised Temporal Action Detection with Proposal-Free Masking"
[AAAI 2022] DCAN: Improving Temporal Action Detection via Dual Context Aggregation
[ECCV 2022] Official PyTorch implementation of the paper "Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning"
[AAAI 2022] Temporal Action Proposal Generation with Background Constraint
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models