Abstract:The strong heterogeneity characteristics of deep-buried clastic low-permeability reservoirs may lead to great risks in hydrocarbon exploration and development, which makes the accurate identification of reservoir lithofacies crucial for improving the obtained exploration results. Due to the very limited core data acquired from deep drilling, lithofacies logging identification has become the most important method for comprehensively obtaining the rock information of deep-buried reservoirs and is a fundamental task for carrying out reservoir characterization and geological modeling. In this study, a machine learning method is introduced to lithofacies logging identification, to explore an accurate lithofacies identification method for deep fluvial-delta sandstone reservoirs with frequent lithofacies changes. Here Sangonghe Formation in the Central Junggar Basin of China is taken as an example. The K-means-based synthetic minority oversampling technique (K-means SMOTE) is employed to solve the problem regarding the imbalanced lithofacies data categories used to calibrate logging data, and a probabilistic calibration method is introduced to correct the likelihood function. To address the situation in which traditional machine learning methods ignore the geological deposition process, we introduce a depositional prior for controlling the vertical spreading process based on a Markov chain and propose an improved Bayesian inversion process for training on the log data to identify lithofacies. The results of a series of experiments show that, compared with the traditional machine learning method, the new method improves the recognition accuracy by 20%, and the predicted petrographic vertical distribution results are consistent with geological constraints. In addition, SMOTE and probabilistic calibration can effectively handle data imbalance problems so that different categories can be adequately learned. Also the introduction of geological prior has a positive impact on the overall distribution, which significantly improves the accuracy and recall rate of the method. According to this comprehensive analysis, the proposed method greatly enhanced the identification of the lithofacies distributions in the Sangonghe Formation. Therefore, this method can provide a tool for logging lithofacies interpretation of deep and strongly heterogeneous clastic reservoirs in fluvial-delta and other depositional environments.

A Tri-Training method for lithofacies identification under scarce labeled logging data

An automatic identification method of imbalanced lithology based on Deep Forest and K-means SMOTE

A novel hybrid method of lithology identification based on k-means++ algorithm and fuzzy decision tree

Semi-supervised Learning for Lithology Identification Using Laplacian Support Vector Machine

Sequential data-driven cross-domain lithology identification under logging data distribution discrepancy

Lithology identification of logging data based on improved neighborhood rough set and AdaBoost

Lithology Identification by Adaptive Feature Aggregation under Scarce Labels

Lithofacies logging identification for strongly heterogeneous deep-buried reservoirs based on improved Bayesian inversion: The Lower Jurassic sandstone, Central Junggar Basin, China

Borehole lithology modelling with scarce labels by deep transductive learning

A real-time intelligent lithology identification method based on a dynamic felling strategy weighted random forest algorithm

Prediction of igneous lithology and lithofacies based on ensemble learning with data optimization

A gradient boosting decision tree algorithm combining synthetic minority oversampling technique for lithology identification

Evaluation of Active Learning Algorithms for Formation Lithology Identification

Lithology identification from well-log curves via neural networks with additional geologic constraint

Subsurface Lithofacies Identification with Meta Learning

Bayesian discriminant analysis of lithofacies integrate the Fisher transformation and the kernel function estimation

Integrating deep learning and logging data analytics for lithofacies classification and 3D modeling of tight sandstone reservoirs

Integrated Carbonate Lithofacies Modeling Based on the Deep Learning and Seismic Inversion and Its Application

Classification with noisy labels through tree-based models and semi-supervised learning: A case study of lithology identification

Application and Comparison of Machine Learning Methods for Mud Shale Petrographic Identification

Unsupervised Domain Adaptation Using Maximum Mean Discrepancy Optimization for Lithology Identification