Lithofacies identification of shale formation based on mineral content regression using LightGBM algorithm: A case study in the Luzhou block, South Sichuan Basin, China

Yi Liu,Runhua Zhu,Shuo Zhai,Na Li,Chengyong Li
DOI: https://doi.org/10.1002/ese3.1579
IF: 4.0349
2023-10-01
Energy Science & Engineering
Abstract:This study is based on logging data and core testing data, and uses machine‐learning algorithms to construct a regression model for the relative content of shale minerals. The regression prediction model and mineral ternary diagrams are used to classify and identify lithofacies. Lithofacies form the basis for evaluating shale gas fields and play an important role in gas reservoir enrichment. The accurate identification of shale lithofacies is key for exploration and development. Based on well‐logged data, the accuracy of mineral content prediction using machine‐learning regression models is not ideal. Therefore, feature derivation was introduced to enhance the correlation between minerals and lithofacies and improve the data expression ability. Four machine‐learning models for mineral regression were established based on feature‐derived data sets: LightGBM, XGBoost, artificial neural network, and support vector machine. By calculating the evaluation metrics of each model, we found that LightGBM had the best prediction performance. To compare and confirm the accuracy of the model in identifying lithofacies, this study established a new method, MT‐LightGBM, which combines the LightGBM mineral content regression model with mineral ternary diagrams to identify lithofacies. By using the MT‐LightGBM model and LightGBM classification models to identify the target lithofacies, it was found that the accuracy of lithofacies identification of MT‐LightGBM reached 94%. This accuracy is high and is of great significance for understanding and evaluating underground shale reservoirs.
energy & fuels
What problem does this paper attempt to address?