Automated Rock Detection From Mars Rover Image via Y-Shaped Dual-Task Network With Depth-Aware Spatial Attention Mechanism

Chaohua Ma,Yuan Li,Junying Lv,Zhouxuan Xiao,Wuming Zhang,Linshan Mo
DOI: https://doi.org/10.1109/tgrs.2024.3371684
IF: 8.2
2024-03-15
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Extracting rocks from Mars rover images using convolutional neural networks (CNNs) is a semantic segmentation problem gaining increasing attention in planetary science and artificial intelligence. However, this task still faces the challenges of inaccurate extraction of rock boundaries and small rocks. An important reason is that the textural features of the Mars rover images are not discriminative enough between the target and the background. To obtain 3-D information to enhance the rock extraction without introducing additional data, we designed a dual-task branch network with a Y-shaped encoder–decoder structure. In our network, the primary semantic segmentation task branch is used to decode textural features into semantic features and output rock extraction results; the auxiliary task depth estimation branch decodes textural features into depth features in 3-D and transmits them to the primary branch through the spatial attention module to enhance the identification ability of semantic features for boundaries and small targets. In addition, another reason that impedes accurate Mars rock extraction is the lack of high-quality annotated training datasets. Therefore, we created a dataset containing 6325 image pairs with corresponding annotations and depth information, SimMars6K. Ablation and comparison experiments based on this dataset and two actual datasets show that our method achieves 83.1% IoU and -score on the simulated dataset and outperforms other methods on actual datasets with 2% improvement in recall. Transfer learning experiment shows that the pretraining on our simulated data can bring up to an additional 5% gain on mean intersection over union (mIoU) and 6% gain on mean pixel accuracy (mPA) for the general model.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?