Managing Data Lineage of O&G Machine Learning Models: The Sweet Spot for Shale Use Case

Raphael Thiago, Renan Souza, L. Azevedo, E. Soares, Rodrigo Santos, Wallas Santos, Max De Bayser, M. Cardoso, M. Moreno, Renato Cerqueira
DOI: https://doi.org/10.48550/arXiv.2003.04915
2020-03-11
Abstract: Machine Learning (ML) has taken on a growing role and become essential in several industries. However, questions about training data lineage, such as "where did the dataset used to train this model come from?"; the introduction of several new data protection laws; and the need to meet data governance requirements have hindered the adoption of ML models in the real world. In this paper, we discuss how data lineage can be leveraged to benefit the ML lifecycle when building ML models that discover sweet spots for shale oil and gas production, a major application in the Oil and Gas (O&G) industry.
Databases; Computers and Society; Distributed, Parallel, and Cluster Computing; Machine Learning