OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
翻译 - OpenMMLab视频感知工具箱。它通过统一的框架支持单对象跟踪(SOT),多对象跟踪(MOT),视频对象检测(VID)。
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers
翻译 - [CVPR2021 ORAL]与变压器的端到端视频实例分段
Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and IDOL(ECCV Oral))
SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)
翻译 - SeqFormer:令人沮丧的简单视频实例分割模型