Abstract:Image segmentation for video analysis plays an essential role in different research fields such as smart city, healthcare, computer vision and geoscience, and remote sensing applications. In this regard, a significant effort has been devoted recently to developing novel segmentation strategies; one of the latest outstanding achievements is panoptic segmentation. The latter has resulted from the fusion of semantic and instance segmentation. Explicitly, panoptic segmentation is currently under study to help gain a more nuanced knowledge of the image scenes for video surveillance, crowd counting, self-autonomous driving, medical image analysis, and a deeper understanding of the scenes in general. To that end, we present in this paper the first comprehensive review of existing panoptic segmentation methods to the best of the authors' knowledge. Accordingly, a well-defined taxonomy of existing panoptic techniques is performed based on the nature of the adopted algorithms, application scenarios, and primary objectives. Moreover, the use of panoptic segmentation for annotating new datasets by pseudo-labeling is discussed. Moving on, ablation studies are carried out to understand the panoptic methods from different perspectives. Moreover, evaluation metrics suitable for panoptic segmentation are discussed, and a comparison of the performance of existing solutions is provided to inform the state-of-the-art and identify their limitations and strengths. Lastly, the current challenges the subject technology faces and the future trends attracting considerable interest in the near future are elaborated, which can be a starting point for the upcoming research studies. The papers provided with code are available at: <a class="link-external link-https" href="https://github.com/elharroussomar/Awesome-Panoptic-Segmentation" rel="external noopener nofollow">this https URL</a>

Large-scale Video Panoptic Segmentation in the Wild: A Benchmark

Video Object Segmentation in Panoptic Wild Scenes

Aerial-PASS: Panoramic Annular Scene Segmentation in Drone Videos

YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark

ViP-DeepLab: Learning Visual Perception with Depth-aware Video Panoptic Segmentation

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Panoptic Segmentation: A Review

Occluded Video Instance Segmentation: A Benchmark

Towards Open-Vocabulary Video Instance Segmentation

3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation

3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation

Temporal Pixel-Level Semantic Understanding Through the VSPW Dataset

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation

Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation

A Large Scale Urban Surveillance Video Dataset for Multiple-Object Tracking and Behavior Analysis

STEP: Segmenting and Tracking Every Pixel

PANDA: A Gigapixel-level Human-centric Video Dataset

Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning

Towards Open-Vocabulary Video Semantic Segmentation