Machine Learning: New Ideas and Tools in Environmental Science and Engineering

Shifa Zhong,Kai Zhang,Majid Bagheri,Joel G. Burken,April Gu,Baikun Li,Xingmao Ma,Babetta L. Marrone,Zhiyong Jason Ren,Joshua Schrier,Wei Shi,Haoyue Tan,Tianbao Wang,Xu Wang,Bryan M. Wong,Xusheng Xiao,Xiong Yu,Jun-Jie Zhu,Huichun Zhang
DOI: https://doi.org/10.1021/acs.est.1c01339
2021-08-17
Abstract:The rapid increase in both the quantity and complexity of data that are being generated daily in the field of environmental science and engineering (ESE) demands accompanied advancement in data analytics. Advanced data analysis approaches, such as machine learning (ML), have become indispensable tools for revealing hidden patterns or deducing correlations for which conventional analytical methods face limitations or challenges. However, ML concepts and practices have not been widely utilized by researchers in ESE. This feature explores the potential of ML to revolutionize data analysis and modeling in the ESE field, and covers the essential knowledge needed for such applications. First, we use five examples to illustrate how ML addresses complex ESE problems. We then summarize four major types of applications of ML in ESE: making predictions; extracting feature importance; detecting anomalies; and discovering new materials or chemicals. Next, we introduce the essential knowledge required and current shortcomings in ML applications in ESE, with a focus on three important but often overlooked components when applying ML: correct model development, proper model interpretation, and sound applicability analysis. Finally, we discuss challenges and future opportunities in the application of ML tools in ESE to highlight the potential of ML in this field.This article has not yet been cited by other publications.
environmental sciences,engineering, environmental
What problem does this paper attempt to address?
This paper discusses the application and potential of Machine Learning (ML) in the field of Environmental Science and Engineering (ESE). With the development of environmental monitoring technology, the volume and complexity of data have increased dramatically, requiring more advanced data analysis methods. Machine Learning, with its powerful pattern recognition capabilities, can handle complex data patterns, and thus has gradually gained attention in the field of ESE. The paper first demonstrates how Machine Learning solves complex environmental science problems through five examples, and then summarizes four main applications of Machine Learning in ESE: prediction, feature importance extraction, anomaly detection, and discovery of new substances or chemicals. The paper then discusses the key knowledge that needs to be considered when applying Machine Learning, such as model development, proper interpretation, and applicability analysis, and points out the current limitations, such as the "black box" nature of models and the lack of interpretability and applicability analysis. In addition, the paper also highlights that despite some successes of Machine Learning in ESE, there are still challenges, such as researchers lacking proper knowledge in using Machine Learning and neglecting model interpretation and applicability analysis. The paper calls for ESE researchers to not only adopt more advanced Machine Learning methods actively but also to participate in the improvement and development of these methods. To sum up, this paper aims to enhance the understanding of Machine Learning among ESE researchers, highlight its potential in solving environmental problems, and raise awareness of issues to consider and future research directions when applying Machine Learning.