Efficient Joint Rectification of Photometric and Geometric Distortions in Document Images.

Hao Tang,Junyuan Guo,Teng Wang,Yanwei Yu,Chao Wang
DOI: https://doi.org/10.1109/ICASSP48485.2024.10447446
2024-01-01
Abstract:Document images captured with cameras often exhibit photometric and geometric distortions. Here, we propose a novel learning-based approach for efficient joint rectification of document images. Inspired by the strong correlation between visual shadows and physical deformations, we design a shared encoder architecture to fully leverage structured document features. A cross-attention module is introduced to facilitate information exchange between deformation and coordinate domains. Our method effectively addresses both geometric and photometric distortions in an end-to-end manner, making it highly valuable for applications involving camera-captured document images.
What problem does this paper attempt to address?