DQ-HorizonNet: Enhancing Door Detection Accuracy in Panoramic Images Via Dynamic Quantization

Cing-Jia Lin, Jheng-Wei Su, Kai-Wen Hsiao, Ting-Yu Yen, Chih-Yuan Yao, Hung-Kuo Chu
DOI: https://doi.org/10.1109/cvprw63382.2024.00135
2024-01-01
Computer Vision and Pattern Recognition
Abstract: This paper introduces DQ-HorizonNet, a novel learning-based method that incorporates vertical features to enhance door detection in indoor panoramic images. It builds upon HorizonNet, which excels at estimating 3D indoor layouts from panoramic images by using 1D vectors to identify boundaries. We identify a key limitation: HorizonNet’s dense, column-wise prediction output is ill-suited to object detection because it requires complex post-processing to separate true positives from numerous false-positive predictions. DQ-HorizonNet addresses this issue through dynamic quantization, which clusters the column-wise outputs and assigns learning targets dynamically, improving accuracy via a U-axis distance cost matrix that measures the discrepancy between predictions and ground truth. Tested on the extensive Zillow Indoor Dataset (ZInD), our model significantly outperforms existing methods, including the original HorizonNet and the transformer-based DETR network, demonstrating its ability to accurately detect doors in panoramic indoor imagery. The code can be found at https://github.com/Lontoone/DQ-HorizonNet/.
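The idea of assigning learning targets through a U-axis distance cost matrix can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the normalized U coordinates in [0, 1), the wrap-around distance at the panorama seam, and the brute-force minimum-cost assignment are all assumptions made for illustration only.

```python
from itertools import permutations


def u_distance(u_a, u_b):
    """Horizontal distance between two normalized U coordinates in [0, 1).

    Assumption of this sketch: the panorama wraps around, so columns near
    0 and near 1 are treated as close to each other.
    """
    d = abs(u_a - u_b)
    return min(d, 1.0 - d)


def cost_matrix(pred_us, gt_us):
    """cost[i][j] is the U-axis distance between prediction i and door j."""
    return [[u_distance(p, g) for g in gt_us] for p in pred_us]


def assign_targets(pred_us, gt_us):
    """Return {gt_index: pred_index} with minimal total U-axis distance.

    Hypothetical helper: brute force over assignments, so it is only
    suitable for a handful of doors and clustered predictions, and it
    assumes len(pred_us) >= len(gt_us).
    """
    cost = cost_matrix(pred_us, gt_us)
    best, best_total = None, float("inf")
    for perm in permutations(range(len(pred_us)), len(gt_us)):
        total = sum(cost[p][g] for g, p in enumerate(perm))
        if total < best_total:
            best, best_total = perm, total
    return {g: p for g, p in enumerate(best)}


# Four clustered predictions (U centers) and two ground-truth doors.
matches = assign_targets([0.10, 0.52, 0.97, 0.30], [0.50, 0.02])
```

In this sketch, predictions left unmatched would be treated as background during training; how the paper handles them is not stated in the abstract.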