Automatic Table Boundary Detection and Performance Evaluation in Fixed-Layout Documents

FANG Jing,GAO Liangcai,QIU Ruiheng,TANG Zhi
DOI: https://doi.org/10.13209/j.0479-8023.2013.008
2013-01-01
Abstract:The authors propose a novel and effective table boundary detection method via visual separators and geometric content layout information,which is effective for both Chinese and English documents.Additionally,due to the lack of automatic evaluation system for table boundaries detection,the authors also provide a publicly available large-scale dataset,composed of same amount of Chinese and English pages make ground-truth and propose mobile reading oriented performance measurements.Evaluation and comparison with two other open source table boundary detection projects demonstrates effectiveness of the proposed method and practicality of the evaluation suit.
What problem does this paper attempt to address?