Context-based modeling for accurate logo detection in complex environments

Zhixiang Jia,Sujuan Hou,Peng Li
DOI: https://doi.org/10.1016/j.jvcir.2024.104061
IF: 2.887
2024-02-01
Journal of Visual Communication and Image Representation
Abstract:Logo detection involves the tasks of locating and classifying logo objects in images and videos, and has been widely applied in the real world. However, most existing approaches rely on general object detection strategies that do not fully utilize the unique characteristics of logos. This can lead to sub-optimal performance in complex environments, especially when logos are small or have varying sizes and shapes. We observe that logos belonging to the same category often share similar context information, such as background dependency of the logo. This motivates us to incorporate contextual information to improve logo detection. Our proposed method, Context-based Modeling Enhancement Network (CME-Net), aims to enhance the distinctive region feature of logos using contextual information. We achieve this by modeling both the logo and its background region to extract their contextual information. This contextual information serves as a guide for enhancing the saliency of the distinctive regions within the logo image. To further improve the accuracy of detection, we have implemented a scale feature balance strategy. This strategy solves the problem of losing scale information caused by enhancement, ensuring that all scales are appropriately considered. Additionally, noise generated during the enhancement process is also effectively suppressed. By effectively leveraging contextual information, our method successfully tackles the challenge of accurately locating logo objects. Our extensive experiments on four public benchmark datasets demonstrate that CME-Net improves the accuracy of logo detection in complex environments.
computer science, information systems, software engineering
What problem does this paper attempt to address?