Abstract:Specular highlight removal plays a pivotal role in multimedia applications, as it enhances the quality and interpretability of images and videos, ultimately improving the performance of downstream tasks such as content-based retrieval, object recognition, and scene understanding. Despite significant advances in deep learning-based methods, current state-of-the-art approaches often rely on additional priors or supervision, limiting their practicality and generalization capability. In this paper, we propose the Dual-Hybrid Attention Network for Specular Highlight Removal (DHAN-SHR), an end-to-end network that introduces novel hybrid attention mechanisms to effectively capture and process information across different scales and domains without relying on additional priors or supervision. DHAN-SHR consists of two key components: the Adaptive Local Hybrid-Domain Dual Attention Transformer (L-HD-DAT) and the Adaptive Global Dual Attention Transformer (G-DAT). The L-HD-DAT captures local inter-channel and inter-pixel dependencies while incorporating spectral domain features, enabling the network to effectively model the complex interactions between specular highlights and the underlying surface properties. The G-DAT models global inter-channel relationships and long-distance pixel dependencies, allowing the network to propagate contextual information across the entire image and generate more coherent and consistent highlight-free results. To evaluate the performance of DHAN-SHR and facilitate future research in this area, we compile a large-scale benchmark dataset comprising a diverse range of images with varying levels of specular highlights. Through extensive experiments, we demonstrate that DHAN-SHR outperforms 18 state-of-the-art methods both quantitatively and qualitatively, setting a new standard for specular highlight removal in multimedia applications.

Facial Highlight Removal with Cross-Context Attention and Texture Enhancement

SIHRNet: a fully convolutional network for single image highlight removal with a real-world dataset

Towards Mask-robust Face Recognition.

Highlight detection and removal based on chromaticity

Highlight Removal of Multi-View Facial Images

Highlight Removal Network Based on an Improved Dichromatic Reflection Model.

From Coarse to Fine: A Two-Stage Network for Dense Haze removal

Highlight mask‐guided adaptive residual network for single image highlight detection and removal

Specular Highlight Removal In Facial Images

Dual-Hybrid Attention Network for Specular Highlight Removal

A Multi-Task Network for Joint Specular Highlight Detection and Removal

Specular highlight removal by federated generative adversarial network with attention mechanism

In-the-Wild Facial Highlight Removal via Generative Adversarial Networks.

Frequency‐Aware Facial Image Shadow Removal through Skin Color and Texture Learning

Towards High-Quality Specular Highlight Removal by Leveraging Large-Scale Synthetic Data

M2-Net: Multi-stages Specular Highlight Detection and Removal in Multi-scenes

Text-Aware Single Image Specular Highlight Removal

Specular highlight removal of light field image combining dichromatic reflection with exemplar patch filling

Facial mask attention network for identity-aware face super-resolution

HR-CycleGAN: Face highlight reduction based on improved cycle-consistent adversarial networks

Specular Highlight Removal for Real‐world Images