GraphLED: A graph-based approach to process and visualise linked engineering documents

Vanessa Telles da Silva, Lucas de Angelo Martins Ribeiro, Willian Borges de Lemos, Sílvia Silva da Costa Botelho, Nelson Lopes Duarte Filho, Marcelo Rita Pias
DOI: https://doi.org/10.48550/arXiv.2302.08905
2023-02-16
Abstract: The architecture, engineering and construction (AEC) sector relies extensively on documents that support product and process development. Organisations must therefore handle big data comprising hundreds, or even thousands, of strongly interlinked technical documents, including CAD designs of industrial plants, equipment purchase orders, quality certificates, and part material analyses. However, analysing such records is daunting for users, because sifting through hundreds of documents to establish valuable relationships quickly becomes complicated. This paper addresses how knowledge extracted from linked engineering documents contributes to industrial digitalisation under IT/OT convergence. The proposed system, GraphLED, performs data processing, graph-based modelling, and colourful visualisation of related documents. The graph-based approach ensures an improved understanding of linked information because the graph structure offers a promising tool to model the underlying data properties of engineering documents. Preliminary system validation indicates that quality improvements are possible in the OCR-based data (85.9% of ambiguous text data removed). This work has the potential to benefit industry by improving the reliability and resilience of industrial production systems through automated summaries of large quantities of documents and their linkage.
Digital Libraries