Few-Shot Object Counting With Dynamic Similarity-Aware in Latent Space

Jinghui He,Bo Liu,Fan Cao,Jian Xu,Yanshan Xiao
DOI: https://doi.org/10.1109/tgrs.2024.3350383
IF: 8.2
2024-02-02
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Few-shot object counting (FSOC) estimates object quantities in query images using a few of support information. Unlike traditional counting methods, FSOC prioritizes more discriminative and generalized similarity measures between query and support data. This facilitates counting objects from new categories without extensive dataset creation or costly retraining. However, existing approaches often rely on fixed similarity rules, leading to spatial information loss. Limited training data can yield sparse similarity feature distribution, hampering the model's learning and its ability to handle objects with large intraclass differences. In this study, we introduce a novel FSOC network named DSALVANet that comprises the dynamic similarity-aware module (DSAM) and the latent variable augmentation module (LVAM). DSAM establishes adaptive metric rules for support features to find similar regions in the metric space for accurate object counting. LVAM utilizes prior similarity knowledge from DSAM to model the latent distribution of the density map, improving the decoder's robustness by sampling diverse latent variables during training. Extensive experiments on the FSOC benchmark and remote-sensing datasets demonstrate our method's effectiveness and state-of-the-art performance. The code and model are available at DSALVANet.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?