A data-driven framework for conceptual cost estimation of infrastructure projects using XGBoost and Bayesian optimization
Jiashu ZhangJingfeng YuanAmin MahmoudiWenying JiQiushi Fanga Department of Construction and Real Estate,School of Civil Engineering,Southeast University,Nanjing,Chinab Department of Civil,Environmental and Infrastructure Engineering,George Mason University,Fairfax,VA,USAc Department of Electrical and Electronic Engineering,The University of Manchester,Manchester,UKJiashu Zhang is a PhD candidate in the Department of Construction and Real Estate,School of Civil Engineering,Southeast University. He also has a multidisciplinary background in electrical and electronic engineering. His main research interests include project management,machine learning,and low-carbon city management.Jingfeng Yuan is a professor in the Department of Construction and Real Estate,School of Civil Engineering,Southeast University. Dr. Yuan received his PhD in Southeast University. His main research direction is project management,intelligent construction,and big data analytics.Amin Mahmoudi is a researcher at the Southeast University,Nanjing,China. He was selected as the World's Top 2% Scientists by Stanford University. He published nearly 55 papers in top-tier journals,including Information Sciences,Resources Policy,Expert Systems with Applications,Business Strategy and the Environment,IEEE Transactions on Engineering Management,etc. He developed a method titled Ordinal Priority Approach (OPA) for intelligent decision-making,which was used widely by scholars in various fields of study. He also developed web-based software titled "OPA Solver" that can be employed for decision-making in academia and industry. He is an editor in SN Operations Research Forum,The Journal of Grey System,Modern Supply Chain Research & Applications,Journal of Project Management,and Management Science Letters. He also published two books in the field of project management in 2013 and 2016. His main research interest is project management,supply chain management,decision science,and data analysis.Wenying Ji is an assistant professor in the Department of Civil,Environmental & Infrastructure Engineering,George Mason University. Dr. Ji received his PhD in Construction Engineering and Management from the University of Alberta. Dr. Ji is an interdisciplinary scholar focused on the integration of advanced data analytics and complex system modeling to enhance the overall performance of infrastructure systems. His e-mail address is hi Fang obtained his master degree in the Department of Electrical and Electronic Engineering,The University of Manchester. His main research direction is machine learning and digital signal processing.
DOI: https://doi.org/10.1080/13467581.2023.2294871
IF: 0.904
2024-01-05
Journal of Asian Architecture and Building Engineering
Abstract:Cost estimation is a key component of project plans, yet it is challenging to provide reliable and efficient estimations using conventional methods in the conceptual phase of infrastructure projects. This study proposes a framework that integrates feature selection, extreme gradient boosting (XGBoost), Bayesian optimization (BO), and SHapley Additive exPlanations (SHAP) to provide conceptual cost estimations and explain the results for early decision-making. Correlation analysis and forward search are combined to select the key features. XGBoost is developed as the estimator and enhanced by BO in accuracy and efficiency. Model explanations were presented using SHAP. The framework is demonstrated through a case study of electric substations containing 605 samples. The results show that the proposed framework can provide satisfactory performance on conceptual cost estimations, where BO-XGBoost outperforms the benchmark models (with R2 ~0.9567, adjusted R2 ~0.9549, RMSE ~ 0.8690, and MAE ~ 0.4875). SHAP reveals how the features contribute to the cost based on both global and local explanations. The framework provides a guideline for more accurate, efficient, and explainable cost estimations in the conceptual phase of infrastructure projects. It can support the government and project planners in early decision-making, including reliable project budget and plan alternatives selection.
construction & building technology