Construction risk identification using a multi-sentence context-aware method

Nan Gao, Ali Touran, Qi Wang, Nicholas Beauchamp
DOI: https://doi.org/10.1016/j.autcon.2024.105466
IF: 10.3
2024-05-16
Automation in Construction
Abstract: Knowledge of risk events with potentially negative consequences from previous projects is essential for risk identification in the early stages of new infrastructure projects. However, historical risk events are usually scattered across various sources and reports, which makes collecting such risk information time-consuming and expensive. To expand the current risk data sources and facilitate the extraction of risk events, the paper presents a synthetic approach that uses Natural Language Processing (NLP) techniques to automatically identify and extract risk-related sentences from news articles. A supervised Multi-sentence Context-aware Risk Identification (MCRI) model is devised to exploit both sentence-level and multi-sentence-level context to improve sentence classification performance. The MCRI model outperformed several baseline models, with a risk-class F1-score of 87.1% and an accuracy of 86.7%. This paper provides a baseline for future studies aimed at automating the extraction of project-level risk information within the construction domain.
construction & building technology; engineering, civil