YOLO-table: disclosure document table detection with involution

Daqian Zhang,Ruibin Mao,Runting Guo,Yang Jiang,Jing Zhu
DOI: https://doi.org/10.1007/s10032-022-00400-z
2022-05-04
International Journal on Document Analysis and Recognition
Abstract:As financial document automation becomes more general, table detection is receiving more and more attention as an important part of document automation. Disclosure documents contain both bordered and borderless tables of varying lengths, and there is currently no model that performs well on these types of documents. To solve this problem, we propose a table detection model based on YOLO-table. We introduce involution into the backbone of the network to improve the network's ability to learn table spatial layout features and design a simple Feature Pyramid Network to improve model effectiveness. In addition, this paper proposes a table-based augment method. We experiment on a disclosure document dataset, and the results show that the F1-measure of the YOLO-table reaches 97.3%. Compared with YOLOv3, our method improves the accuracy by 2.8% and the speed by 1.25 times. It also evaluates the ICDAR2013 and ICDAR2019 Table Competition datasets and achieves state-of-the-art performance.
computer science, artificial intelligence
What problem does this paper attempt to address?