Fusion of optical and LiDAR images for urban objects recognition
W. Liao, F. Van Coillie, H. Zhang, S. Gautama, W. Philips
DOI: https://doi.org/10.3990/2.410
2016-01-01
Abstract: Nowadays, advanced sensor technology and image processing algorithms allow us to measure different aspects of objects on the Earth’s surface: spectral characteristics in optical images, height information in LiDAR data, and spatial information generated by image processing technologies such as the commercial software eCognition®. However, automatic recognition of objects in remotely sensed scenes remains challenging. A single technology might not be sufficient to obtain reliable classification results (Debes, 2014). Multisensor data, once combined, can contribute to a more comprehensive interpretation of objects on the ground. For example, spectral reflectance in an optical image cannot distinguish objects under shadow, whereas LiDAR data can often detect them easily. On the other hand, LiDAR data alone may fail to discriminate between objects of similar height.

Stacking multi-source data is a widely applied data fusion technique for classification. Such methods first apply feature extraction to each individual data source and then concatenate all features into one stacked vector for classification. While these methods are appealing for their simplicity, they do not always perform better than a single data source. This is because the values of different components in the stacked feature vector can be significantly unbalanced, so the information contained in the different data sources is not equally represented or measured. Furthermore, the increased dimensionality of the stacked features, combined with the limited number of labelled samples, may lead to the “curse of dimensionality” (Liao, 2015).

Therefore, we present a local graph fusion method to fuse a true orthophoto and a LiDAR image for urban object recognition. First, object-based spatial and height information are generated from the true orthophoto and the LiDAR image, respectively.
Second, we build a local fusion graph within a sliding window, in which only data points with similar spatial and height characteristics are connected. Finally, we solve the multisensor data fusion problem by projecting the multisensor data into a subspace in which the advantages of the different data sources are well exploited. Experimental results on fusing a true orthophoto and a LiDAR image from the 'ISPRS Test Project on Urban Classification and 3D Building Reconstruction' demonstrate the potential of the proposed method. Compared with methods that use only a single data source or simply stack the sources, our approach significantly improves overall classification accuracy. Both the method’s details and the results of a comprehensive test will be presented at GEOBIA 2016.
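The local fusion graph described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how similarity is measured, so the sketch assumes one scalar spatial feature and one scalar height value per pixel and connects two pixels when both absolute differences fall within a tolerance. The names `windows`, `local_fusion_graph`, `tol_spatial` and `tol_height`, and the toy values, are hypothetical. The subspace projection step is not shown.

```python
from itertools import combinations


def windows(rows, cols, size, step):
    """Yield the pixel coordinates covered by each sliding-window position."""
    for r0 in range(0, rows - size + 1, step):
        for c0 in range(0, cols - size + 1, step):
            yield [(r, c)
                   for r in range(r0, r0 + size)
                   for c in range(c0, c0 + size)]


def local_fusion_graph(spatial, height, size, step, tol_spatial, tol_height):
    """Return undirected edges between pixels that share a window and are
    similar in BOTH sources (threshold rule assumed for illustration)."""
    rows, cols = len(spatial), len(spatial[0])
    edges = set()
    for pixels in windows(rows, cols, size, step):
        # Pixels are listed in row-major order, so each pair (p, q) has p < q
        # and the same edge found in overlapping windows is stored only once.
        for p, q in combinations(pixels, 2):
            d_spatial = abs(spatial[p[0]][p[1]] - spatial[q[0]][q[1]])
            d_height = abs(height[p[0]][p[1]] - height[q[0]][q[1]])
            if d_spatial <= tol_spatial and d_height <= tol_height:
                edges.add((p, q))
    return edges


# Toy 2 x 3 scene (made-up values): left pixels are ground, right column is a
# roof, and pixel (1, 1) resembles the roof spatially but lies at ground height.
spatial = [[10, 12, 50],
           [11, 53, 52]]
height = [[20, 21, 90],
          [22, 23, 91]]

edges = local_fusion_graph(spatial, height, size=2, step=1,
                           tol_spatial=5, tol_height=5)
print(sorted(edges))
```

In this toy scene, pixel (1, 1) remains unconnected: it matches the roof pixels in the spatial feature but not in height, and it matches the ground pixels in height but not in the spatial feature. Requiring similarity in both sources is what separates this graph from one built on a single source or on a stacked vector.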