Abstract: Rapid and accurate delineation of open-pit granite mining areas is pivotal for effective production planning and environmental impact assessment. Advances in remote sensing, including satellite imagery, LiDAR, and unmanned aerial vehicles (UAVs), have changed how mining areas are monitored and managed. At the same time, deep learning-based automatic recognition is gradually replacing manual visual interpretation for open-pit mining area extraction. Exploiting the capacity of UAVs for real-time, low-risk remote sensing, this study uses UAV-derived orthophotos for mining area extraction. Central to the proposed approach is the novel Gather–Injection–Perception (GIP) module, designed to overcome the information loss that conventional feature pyramid modules typically incur during feature fusion, and thereby to enrich semantic features. The network also introduces a Boundary Perception (BP) module to address blurred boundaries and imprecise localization in mining areas. This module uses attention mechanisms to accentuate high-frequency boundary details in the feature map and draws on both high- and low-dimensional feature maps for deep supervised learning. In a series of comparative experiments on a purpose-built dataset of images of the research area, the proposed method outperforms the compared methods, achieving 90.67% precision, 92.00% recall, a 91.33% F1-score, and 84.04% IoU. These results demonstrate the effectiveness of the proposed model for extracting open-pit granite mining areas and suggest a new direction for applying UAV data in mining scenes.
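
The four reported figures can be cross-checked against one another. The following is a minimal sketch, assuming the standard pixel-wise definitions for binary segmentation (the abstract does not state which definitions were used); the function names are illustrative and do not come from the paper. Under these definitions, F1 is the harmonic mean of precision and recall, and IoU follows from F1 as IoU = F1 / (2 - F1).

```python
def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given as fractions in [0, 1])."""
    return 2.0 * precision * recall / (precision + recall)


def iou_from_f1(f1: float) -> float:
    """IoU implied by F1 for a single binary class: IoU = F1 / (2 - F1)."""
    return f1 / (2.0 - f1)


def metrics_from_counts(tp: int, fp: int, fn: int) -> dict:
    """Compute all four metrics from pixel counts.

    tp, fp, fn are counts of true-positive, false-positive, and
    false-negative pixels for the mining-area class (hypothetical inputs).
    """
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1_from_pr(precision, recall),
        "iou": tp / (tp + fp + fn),
    }


if __name__ == "__main__":
    # Precision and recall as reported in the abstract.
    f1 = f1_from_pr(0.9067, 0.9200)
    iou = iou_from_f1(f1)
    print(f"F1  = {100 * f1:.2f}%")
    print(f"IoU = {100 * iou:.2f}%")
```

With the reported precision and recall, the implied values round to the reported 91.33% F1-score and 84.04% IoU, so the four figures are mutually consistent under these definitions.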

Open-Pit Granite Mining Area Extraction Using UAV Aerial Images and the Novel GIPNet