Boundary Attention: Learning curves, corners, junctions and grouping

Mia Gaia Polansky,Charles Herrmann,Junhwa Hur,Deqing Sun,Dor Verbin,Todd Zickler
2024-09-17
Abstract:We present a lightweight network that infers grouping and boundaries, including curves, corners and junctions. It operates in a bottom-up fashion, analogous to classical methods for sub-pixel edge localization and edge-linking, but with a higher-dimensional representation of local boundary structure, and notions of local scale and spatial consistency that are learned instead of designed. Our network uses a mechanism that we call boundary attention: a geometry-aware local attention operation that, when applied densely and repeatedly, progressively refines a pixel-resolution field of variables that specify the boundary structure in every overlapping patch within an image. Unlike many edge detectors that produce rasterized binary edge maps, our model provides a rich, unrasterized representation of the geometric structure in every local region. We find that its intentional geometric bias allows it to be trained on simple synthetic shapes and then generalize to extracting boundaries from noisy low-light photographs.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to address the problem of robustly inferring precise, non-rasterized contours (boundaries) from discrete images in computer vision. Specifically, the authors focus on how to detect geometric structures such as curves, corners, junctions, and local groupings from images in the presence of noise, ensuring these structures are non-rasterized, highly accurate, and rich in geometric information. ### Main Contributions 1. **Introduction of a bottom-up feedforward network**: This network can decompose an image into a local geometric primitive field containing curves, corners, junctions, and local groupings. 2. **Proposal of a new local geometric primitive parameterization method**: Along with a new self-attention mechanism called Boundary Attention. 3. **Training the model with simple synthetic data**: Then generalizing it to real-world low-light photographs. ### Method Overview - **Boundary Attention Mechanism**: This is a geometry-aware local attention operation that, through dense and repetitive application, progressively refines the local boundaries within each overlapping patch of the image. - **Non-rasterized Representation**: The model's output is a dense non-rasterized geometric primitive field that can be used to generate various global maps, such as boundary distance maps, binarized boundary maps, boundary-aware color smoothing maps, and spatial affinity maps. - **Adaptive Window Function**: The size of each patch can be adaptively adjusted based on the local geometric structure and noise conditions of the input image. ### Experimental Results - **Performance on Synthetic Data**: After training on simple synthetic data, the model generalizes well to real-world photographs, including those with low light and high noise conditions. - **Comparison with Existing Methods**: In multiple benchmark tests, the model provides more details in low-noise conditions and shows greater robustness in high-noise conditions. ### Conclusion By combining the advantages of geometric modeling and deep learning, this paper proposes an efficient and robust method for extracting precise non-rasterized boundaries from discrete images. This method excels in handling complex geometric structures and noise, showing great potential for a wide range of applications.