CSCPR: Cross-Source-Context Indoor RGB-D Place Recognition

Jing Liang, Zhuo Deng, Zheming Zhou, Min Sun, Omid Ghasemalizadeh, Cheng-Hao Kuo, Arnie Sen, Dinesh Manocha
2024-07-25
Abstract: We present a new algorithm, Cross-Source-Context Place Recognition (CSCPR), for RGB-D indoor place recognition that integrates global retrieval and reranking into a single end-to-end model. Unlike prior approaches that primarily focus on the RGB domain, CSCPR is designed to handle RGB-D data. We extend Context-of-Clusters (CoCs) to handle noisy colorized point clouds and introduce two novel modules for reranking: the Self-Context Cluster (SCC), which enhances feature representation, and the Cross Source Context Cluster (CSCC), which matches query-database pairs based on local features. We also present two new datasets, ScanNetIPR and ARKitIPR. Our experiments demonstrate that CSCPR significantly outperforms state-of-the-art models, by at least 36.5% in Recall@1 on the ScanNet-PR dataset and by 44% on the new datasets. Code and datasets will be released.
Computer Vision and Pattern Recognition, Robotics
What problem does this paper attempt to address?
The paper addresses RGB-D indoor place recognition and proposes a new algorithm, Cross-Source-Context Place Recognition (CSCPR). Its main goal is to perform indoor place recognition on RGB-D data with a single end-to-end model that combines global retrieval and reranking. Specifically, the paper attempts to address the following problems:

1. **Insufficient research on RGB-D indoor place recognition**: Most existing methods focus on RGB images. Depth information is highly informative for place recognition in indoor environments, yet it has received comparatively little study.
2. **Existing methods rely solely on global retrieval**: Most current methods use only global retrieval for place recognition and omit reranking, a step that can improve recognition accuracy by comparing local features.
3. **Lack of effective RGB-D indoor place recognition datasets**: Datasets exist for object classification, segmentation, and other tasks, but high-quality datasets built specifically for training RGB-D indoor place recognition models remain scarce.

To address these issues, the paper presents the following contributions:

- **New end-to-end architecture**: CSCPR integrates global retrieval with reranking and strengthens RGB-D feature representation by extending Context-of-Clusters (CoCs) to noisy colorized point clouds.
- **Innovative reranking method**: Beyond global retrieval, it introduces two reranking modules. The Self-Context Cluster (SCC) enhances feature representation, and the Cross Source Context Cluster (CSCC) matches query-database pairs based on local features.
- **Construction of new datasets**: To compensate for the shortcomings of existing datasets, the paper proposes two new large-scale RGB-D indoor place recognition datasets, ScanNetIPR and ARKitIPR. These datasets improve training data quality by selecting positive and negative sample pairs according to point cloud overlap.
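The two-stage flow described above (global retrieval followed by reranking on local features) can be sketched as follows. This is a minimal illustration, not the CSCPR implementation: in the paper both stages are learned within one end-to-end model, whereas here cosine similarity and a mean best-match score are hand-written stand-ins. All names (`global_retrieve`, `local_match_score`, `retrieve_and_rerank`, and the `"global"`/`"local"` dictionary keys) are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def global_retrieve(query_global, db_globals, top_k):
    """Stage 1: return indices of the top_k database entries by global-descriptor similarity."""
    scored = [(cosine(query_global, g), i) for i, g in enumerate(db_globals)]
    scored.sort(key=lambda s: (-s[0], s[1]))  # best score first, ties broken by index
    return [i for _, i in scored[:top_k]]


def local_match_score(query_locals, cand_locals):
    """Assumed stand-in for a learned matcher: mean best cosine match per query local feature."""
    if not query_locals or not cand_locals:
        return 0.0
    best = [max(cosine(q, c) for c in cand_locals) for q in query_locals]
    return sum(best) / len(best)


def retrieve_and_rerank(query, database, top_k=3):
    """Stage 2: reorder the stage-1 candidates by their local-feature match score."""
    candidates = global_retrieve(query["global"], [d["global"] for d in database], top_k)
    return sorted(
        candidates,
        key=lambda i: (-local_match_score(query["local"], database[i]["local"]), i),
    )


# Toy data: entry 0 has the closest global descriptor, entry 1 the closest local features.
query = {"global": [1.0, 0.0], "local": [[1.0, 0.0], [0.0, 1.0]]}
database = [
    {"global": [1.0, 0.0], "local": [[1.0, 1.0]]},
    {"global": [0.9, 0.1], "local": [[1.0, 0.0], [0.0, 1.0]]},
    {"global": [0.0, 1.0], "local": [[1.0, 0.0]]},
]
print(global_retrieve(query["global"], [d["global"] for d in database], 2))  # [0, 1]
print(retrieve_and_rerank(query, database, top_k=2))  # [1, 0]
```

In the toy data, global retrieval alone ranks entry 0 first, while reranking on local features promotes entry 1. This is the kind of correction the reranking stage is meant to provide.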
Experimental results show that CSCPR significantly outperforms existing state-of-the-art methods in Recall@1, by at least 36.5% on the ScanNet-PR dataset and by 44% on the two new datasets. The paper also reports detailed ablation studies that verify the contribution of each component.
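For readers unfamiliar with the metric, Recall@K is the fraction of queries for which at least one true match appears among the top K retrieved results. The sketch below shows one common way to compute it. The function name `recall_at_k`, the input layout, and the choice to skip queries that have no ground-truth positive are assumptions made for illustration; the paper's exact evaluation protocol is not described in this summary.

```python
def recall_at_k(ranked_lists, positives, k=1):
    """Fraction of queries with at least one true match among their top-k results.

    ranked_lists: one list of database indices per query, best match first.
    positives:    one set of ground-truth matching indices per query.
    Queries with an empty positive set are skipped (an assumed convention).
    """
    hits = 0
    evaluated = 0
    for ranked, pos in zip(ranked_lists, positives):
        if not pos:
            continue  # no ground-truth match exists for this query
        evaluated += 1
        if any(idx in pos for idx in ranked[:k]):
            hits += 1
    return hits / evaluated if evaluated else 0.0


# Three queries: only the first has its true match ranked first.
ranked = [[3, 1, 2], [0, 2, 1], [2, 0, 1]]
truth = [{3}, {2}, {1}]
print(recall_at_k(ranked, truth, k=1))  # 0.3333333333333333
print(recall_at_k(ranked, truth, k=3))  # 1.0
```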