Abstract:Single image rectification of document deformation is a challenging task. Although some recent deep learning-based methods have attempted to solve this problem, they cannot achieve satisfactory results when dealing with document images with complex deformations. In this article, we propose a new efficient framework for document flattening. Our main insight is that most layout primitives in a document have rectangular outline shapes, making unwarping local layout primitives essentially homogeneous with unwarping the entire document. The former task is clearly more straightforward to solve than the latter due to the more consistent texture and relatively smooth deformation. On this basis, we propose a layout-aware deep model working in a divide-and-conquer manner. First, we employ a transformer-based segmentation module to obtain the layout information of the input document. Then a new regression module is applied to predict the global and local UV maps. Finally, we design an effective merging algorithm to correct the global prediction with local details. Both quantitative and qualitative experimental results demonstrate that our framework achieves favorable performance against state-of-the-art methods. In addition, the current publicly available document flattening datasets have limited 3D paper shapes without layout annotation and also lack a general geometric correction metric. Therefore, we build a new large-scale synthetic dataset by utilizing a fully automatic rendering method to generate deformed documents with diverse shapes and exact layout segmentation labels. We also propose a new geometric correction metric based on our paired document UV maps. Code and dataset will be released at https://github.com/BunnySoCrazy/LA-DocFlatten .

Exploiting Vector Fields For Geometric Rectification Of Distorted Document Images

Printer Forensics Based on Page Document's Geometric Distortion

Document Forgery Detection Using Distortion Mutation of Geometric Parameters in Characters

Printer Identification Using Page Geometric Distortion On Text Lines

Restoring Camera-Captured Distorted Document Images

Unified Optical Distortion Correction Method for Imaging Systems Using A Concise Geometrical Transformation Model

Restoring Warped Document Image Through Segmentation and Full Page Interpolation

Blurring and lens distortion free scanning for large area painting and calligraphy

Geometric Representation Learning for Document Image Rectification

Arbitrary Warped Document Image Restoration Based on Segmentation and Thin-Plate Splines.

A Cylindrical Surface Model to Rectify the Bound Document Image.

Efficient Joint Rectification of Photometric and Geometric Distortions in Document Images.

Effective Document Image Rectification via a Deep Learning Framework

A Document Rectification Approach Dealing with Both Perspective Distortion and Warping Based on Text Flow Curve Fitting

A method to restore Chinese warped document images based on binding characters and building curved lines

A Scheme for Automatic Text Rectification in Real Scene Images.

DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction

A Page Content Independent Book Dewarping Method To Handle 2d Images Captured By A Digital Camera

Layout-aware Single-image Document Flattening

Multiple Geometry Transform Estimation from Single Camera-Captured Text Image

DocScanner: Robust Document Image Rectification with Progressive Learning