Graph Enhancement and Transformer Aggregation Network for RGB-Thermal Crowd Counting
Yi Pan,Wujie Zhou,Meixin Fang,Fangfang Qiang
DOI: https://doi.org/10.1109/lgrs.2024.3362820
IF: 5.343
2024-02-16
IEEE Geoscience and Remote Sensing Letters
Abstract:Crowd counting has received significant attention in recent years due to its practical applications. In order to address the specific characteristics of RGB and thermal images, we have developed the graph enhancement and transformer aggregation network (GETANet) for generating representative density maps. Our approach incorporates several innovative modules to enhance accuracy. First, we introduced a position-adaptive module (PAM) that effectively counts individuals' positions and integrates features extracted from the main framework. Furthermore, we leveraged the advantages of graph convolutional networks (GCNs), which integrate spatial information and exploit relationships between nodes. Specifically, we designed a dual GCN module that further improves the model's performance by considering the spatial context and relationships among individuals in the crowd. To capture global image information and improve overall performance, we integrated a vision transformer into our model architecture. The vision transformer effectively captures global dependencies and enhances the model's ability to understand complex crowd scenes. Additionally, we designed a transformer information aggregation module (TIAM) that integrates information from multiple levels, resulting in a highly precise prediction map. Through comprehensive experiments on benchmark datasets, such as RGBT-CC and DroneRGBT, our GETANet demonstrated its effectiveness in RGB-thermal crowd counting tasks. Moreover, GETANet showcased remarkable generalization results on the ShanghaiTech-RGBD dataset. Our code has been made publicly available on GitHub at https://github.com/panyi95/GETANet.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics