Machine learning with model selection to predict TOC from mineralogical constituents: case study in the Sichuan Basin

C. M. Saporetti,D. L. Fonseca,L. C. Oliveira,E. Pereira,L. Goliatt
DOI: https://doi.org/10.1007/s13762-022-04081-3
2022-04-04
International Journal of Environmental Science and Technology
Abstract:The total organic carbon content from rock samples is the fundamental quantitative and qualitative indicator of the existing organic matter in a reservoir. Generally, it is calculated manually through the analysis of rock samples of origin. However, this procedure demands time and resources since it depends on samples obtained from several intervals of wells in source rocks. Consequently, efforts on research have been conducted to assist this task. Machine learning approaches arise as an alternative to producing estimates for total organic carbon grounded on data well logs and stratigraphic analysis. Given this context, the present paper proposes using machine learning techniques to automate total organic carbon estimation. In order to provide flexibility to the model, a grid search procedure was combined with cross-validation to perform the model selection. This computational approach allows finding models that produced the best generalization capacity. Three methods were applied: Support Vector Machines, Extreme Learning Machine, and Ridge Regression. The proposed methodology was validated on core samples of the shale gas field YuDongNan area, Sichuan Basin. The Support Vector Machine method outperformed the other methods in several metrics analyzed, producing accurate predictions, showing that the approach present in this paper can be used as a surrogate model to assist geologists and petrologists in estimating total organic carbon values.
environmental sciences
What problem does this paper attempt to address?