Prediction of igneous lithology and lithofacies based on ensemble learning with data optimization

Ruiyi Han,Zhuwen Wang,Zhitao Zhang,Xinru Wang,Yitong Cui,Yuhuang Guo
DOI: https://doi.org/10.1190/geo2022-0782.1
IF: 3.264
2024-02-18
Geophysics
Abstract:Igneous rocks are widely developed in various Mesozoic and Cenozoic continental and marine basins. Igneous reservoirs are the key reservoirs for current oil and gas development. Accurate prediction of lithology and lithofacies is a prerequisite for the effective exploration of igneous reservoirs. Igneous lithology and lithofacies are complex and correlated. The existing single-label igneous rock identification methods only consider the prediction of individual properties, and less consideration is given to the correlation of reservoir properties. Therefore, lithology and lithofacies prediction based on conventional logging data is regarded as a typical class-imbalanced multilabel classification problem when considering both attribute correlation in algorithms and evaluation metrics. To solve this problem, an ensemble method of data optimization combined with multigrained cascade forest (CF) is used in this study to develop a new multilabel lithology and lithofacies prediction model based on data from nine conventional logs in the eastern depressional reservoirs of the Liaohe Basin, and satisfactory results are obtained. The imbalance problem of the conventional logging data sets is first solved by using K-means and synthetic minority oversampling technique methods; then, the model is trained by scenario transformation and stripping with multigrained CF; and next, a multilabel classification evaluation index with multiple perspectives is introduced. The differences between the model and typical intelligent algorithms such as CF, adaptive boosting, random forest, and support vector machine are compared in simulated wells, and the new model is found to have obvious advantages. The model is finally applied to an actual well, and the accurate prediction results illustrate that the new model designed and trained for the class imbalance multilabel classification problem in this paper has application value in the lithology and lithofacies multilabel prediction of igneous rocks and also provides a theoretical basis for more complex multilabel reservoir evaluation using machine learning in the future.
geochemistry & geophysics
What problem does this paper attempt to address?