OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), and Video Instance Segmentation (VIS) within a unified framework.
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers
Next-generation video instance recognition framework built on top of Detectron2, supporting InstMove (CVPR 2023), SeqFormer (ECCV Oral), and IDOL (ECCV Oral)
[ICCV 2021] Instances as Queries
Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation, NeurIPS 2021 Spotlight
Mask-Free Video Instance Segmentation [CVPR 2023]
SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral), a frustratingly simple model for video instance segmentation
[NeurIPS'21] Unified tracking framework with a single appearance model. It supports Single Object Tracking (SOT), Video Object Segmentation (VOS), Multi-Object Tracking (MOT), Multi-Object Tracking an...
SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation (ECCV2020)
Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral
Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency (AAAI 2021)
Code release for "UniVS: Unified and Universal Video Segmentation with Prompts as Queries" (CVPR2024)
DVIS: Decoupled Video Instance Segmentation Framework
DeVIS: Making Deformable Transformers Work for Video Instance Segmentation
Code release for "STMask: Spatial Feature Calibration and Temporal Fusion for Effective One-stage Video Instance Segmentation" (CVPR2021)
Awesome video instance segmentation papers