TableDet: An end-to-end deep learning approach for table detection and table image classification in data sheet images

Johan Fernandes,Murat Simsek,Burak Kantarci,Shahzad Khan
DOI: https://doi.org/10.1016/j.neucom.2021.10.023
IF: 6
2022-01-01
Neurocomputing
Abstract:Global supply chains are kept viable through the information shared through billions of electronic documents, many of which extensively use tables to display critical information. Making effective supply chain decisions requires the extraction of data from these tables which is hindered by the variations in layouts and styles of tables. In this paper, we propose Table Det: a deep learning based methodology to solve table detection and table image classification in data sheet images in a single inference as the first stage of the table text extraction pipeline. TableDet utilizes Cascade R-CNN with Complete IOU (CIOU) loss and a deformable convolution backbone as its underlying architecture to capture the variations in scales and orientations of tables. It also detects text and figures to enhance its table detection performance. We demonstrate the effectiveness of training TableDet with a dual-step transfer learning process and fine-tuning it with Table Aware Cutout (TAC) augmentation strategy. We achieved the highest F1 score for table detection against state-of-the-art solutions on ICDAR 2013 (complete set), ICDAR 2017 (test set) and ICDAR 2019 (test set) with 100%, 99.3% and 95.1% respectively. For the table image classification task we attained 100% recall and above 85% precision on three test sets. This classification capability ensures that all images with tables would be promoted to the next step in the table text extraction pipeline, with a small number of images without tables making it through.
computer science, artificial intelligence
What problem does this paper attempt to address?