Amodal Layout Completion in Complex Outdoor Scenes.

Jingyu Wu,Zejian Li,Shengyuan Zhang,Lingyun Sun
DOI: https://doi.org/10.1007/978-3-031-20497-5_3
2022-01-01
Abstract:A layout is a group of bounding boxes with labels annotating objects in complex scenes. However, manually labelled layouts often annotate only visible parts of objects (modal layout) instead of the whole body including both visible and invisible parts (amodal layout). Modal layouts are caused by occlusion in scenes, while amodal layouts contain more accurate information of objects’ relative positions and sizes. In this paper, we investigate the influence of modal layout on the layout-to-image generation. Specifically, to recover an amodal layout from a modal layout and improve the generation quality, we propose Amodal Layout Completion Network (ALCN) regressing amodal bounding boxes from potential occluded boxes. Following a divide-and-conquer strategy, we divide the modal layout of a scene into occlusion groups of bounding boxes, which are processed by ALCN individually. Furthermore, we propose four challenging IoU variants to measure completion performances for different completion conditions. Experiment results show the ALCN achieves state-of-the-art layout completion performances in most cases and improves the layout-to-image generation performance.
What problem does this paper attempt to address?