Abstract: The field of multi-source remote sensing observation is becoming increasingly dynamic through the integration of various remote sensing data sources. However, existing deep learning methods face challenges in differentiating between internal and external relationships and in capturing fine spatial features. These models often struggle to capture comprehensive information across remote sensing data bands and to handle the inherent differences in the size, structure, and physical properties of different remote sensing datasets. To address these challenges, this paper proposes a novel geometric-algebra-based spectral–spatial hierarchical fusion network (GASSF-Net), which uses geometric algebra (GA) for the first time to process multi-source remote sensing images. By simultaneously leveraging the real and imaginary components of GA to express structural information, it handles these images more holistically. The method captures the internal and external relationships between remote sensing image features and spatial information, and fuses the features of different remote sensing data to improve classification accuracy. GASSF-Net uses GA to represent pixels from different bands as multivectors, thus capturing the intrinsic relationships between spectral bands while preserving spatial information. The network begins by deeply mining the spectral–spatial features of a hyperspectral image (HSI) using pairwise covariance operators. These features are then extracted through two branches: a geometric-algebra-based branch and a real-valued network branch. Additionally, the geometric-algebra-based network extracts spatial information from light detection and ranging (LiDAR) data to complement the elevation information lacking in the HSI. Finally, a geometric-algebra-based cross-fusion module is introduced to fuse the HSI and LiDAR data for improved classification. Experiments conducted on three well-known datasets, Trento, MUUFL, and Houston, demonstrate that GASSF-Net significantly outperforms traditional methods in terms of classification accuracy and model efficiency.
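The abstract does not define the pairwise covariance operators. As a rough illustration of the general idea only, the sketch below computes the covariance between every pair of spectral bands over the pixels of a small HSI patch. The function name `band_covariance`, the patch size, and the use of NumPy are assumptions made for this example and are not taken from GASSF-Net itself.

```python
import numpy as np


def band_covariance(patch):
    """Covariance between every pair of spectral bands in one patch.

    patch: array of shape (H, W, B) holding B spectral bands.
    Returns a (B, B) symmetric matrix. Hypothetical helper, not the paper's operator.
    """
    h, w, b = patch.shape
    # Treat each pixel as one observation of a B-dimensional spectrum.
    pixels = patch.reshape(h * w, b).astype(np.float64)
    # Center each band, then form the unbiased sample covariance.
    centered = pixels - pixels.mean(axis=0, keepdims=True)
    return centered.T @ centered / (h * w - 1)


# Synthetic 7x7 patch with 5 bands (illustrative data only).
rng = np.random.default_rng(0)
patch = rng.normal(size=(7, 7, 5))
cov = band_covariance(patch)
```

Entry `cov[i, j]` summarizes how bands `i` and `j` vary together across the patch, which is one simple way to expose inter-band relationships before further feature extraction.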

Using Line Segments to Train Multi-Stream Stacked Autoencoders for Image Classification

Deep Dual-Stream Network with Scale Context Selection Attention Module for Semantic Segmentation

Integrating Pixels and Segments: A Deep-Learning Method Inspired by the Informational Diversity of the Visual Pathways

Representation Learning Via a Semi-Supervised Stacked Distance Autoencoder for Image Classification

EMMCNN: An ETPS-Based Multi-Scale and Multi-Feature Method Using CNN for High Spatial Resolution Image Land-Cover Classification

DSNet: Multi-resolution Dense Encoder and Stack Decoder Network for Aerial Image Segmentation

Semantic Segmentation Based On Stacked Discriminative Autoencoders And Context-Constrained Weakly Supervised Learning

Learning Geometric Invariance Features and Discrimination Representation for Image Classification via Spatial Transform Network and XGBoost Modeling

Semisupervised Stacked Autoencoder with Cotraining for Hyperspectral Image Classification

TSGCNet: Discriminative Geometric Feature Learning with Two-Stream Graph Convolutional Network for 3D Dental Model Segmentation

A deep learning based framework for remote sensing image ground object segmentation

Two-Stream Graph Convolutional Network for Intra-oral Scanner Image Segmentation

Deep Transport Network for Unsupervised Video Object Segmentation

Dual-Path Geometry-Aware Network for Semantic Segmentation of High-Resolution Aerial Images

GeoSegNet: Point Cloud Semantic Segmentation via Geometric Encoder-Decoder Modeling

Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation

GASSF-Net: Geometric Algebra Based Spectral-Spatial Hierarchical Fusion Network for Hyperspectral and LiDAR Image Classification

Remote Sensing Image Segmentation Using Vision Mamba and Multi-Scale Multi-Frequency Feature Fusion

Multi-scale and Discriminative Part Detectors Based Features for Multi-label Image Classification

Video object segmentation via couple streams and feature memory