Abstract:Transformer-based neural network architectures achieve state-of-the-art results in different domains, from natural language processing (NLP) to computer vision (CV). The key idea of Transformers, the attention mechanism, has already led to significant breakthroughs in many areas. The attention has found their implementation for time series data as well. However, due to the quadratic complexity of the attention calculation regarding input sequence length, the application of Transformers is limited by high resource demands. Moreover, their modifications for industrial time series need to be robust to missing or noised values, which complicates the expansion of the horizon of their application. To cope with these issues, we introduce the class of efficient Transformers named Regularized Transformers (Reguformers). We implement the regularization technique inspired by the dropout ideas to improve robustness and reduce computational expenses. The focus in our experiments is on oil&gas data, namely, well logs, a prominent example of multivariate time series. The goal is to solve the problems of similarity and representation learning for them. To evaluate our models for such problems, we work with an industry-scale open dataset consisting of well logs of more than 20 wells. The experiments show that all variations of Reguformers outperform the previously developed RNNs, classical Transformer model, and robust modifications of it like Informer and Performer in terms of well-intervals' classification and the quality of the obtained well-intervals' representations. Moreover, the sustainability to missing and incorrect data in our models exceeds that of others by a significant margin. The best result that the Reguformer achieves on well-interval similarity task is the mean PR~AUC score equal to 0.983, which is comparable to the classical Transformer and outperforms the previous models.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: How to overcome the high computational complexity, large resource requirements, and sensitivity to missing or noisy data of the traditional Transformer model when processing time - series of oil well data. Specifically, the author proposes a new Transformer variant - Reguformer, aiming to improve the robustness and efficiency of the model by introducing regularization techniques and applying it to the similarity learning and representation learning tasks of well - intervals. ### Problem Background Oil well drilling is a time - consuming and expensive process and plays a crucial role in oil exploration and preventing drilling accidents. The similarity of oil wells and their intervals can help decision - makers judge the rationality of drilling, for example, by comparing oil wells with known characteristics to reconstruct the characteristics of new oil wells. In addition, through appropriate similarity learning methods, low - dimensional representations of oil well intervals can be obtained to estimate their lithological and physical characteristics. ### Existing Challenges 1. **High Computational Complexity**: Traditional Transformer models require a large amount of computational time and memory resources when processing long sequences. 2. **Sensitivity to Low - Quality Data**: When the input data has missing or noisy values, the Transformer model is prone to errors. 3. **Special Requirements of Industrial Time - Series**: Industrial time - series data usually contains a large amount of noise and missing values, which pose higher requirements for the robustness of the model. ### Solutions To solve the above problems, the author proposes the following innovations: 1. **Propose a New Transformer Variant (Reguformer)**: - Based on the idea of dropout, introduce regularization techniques to reduce computational overhead and improve the robustness of the model. - Reguformer reduces computational complexity through the sparse attention mechanism, only focusing on the key parts of the sequence. 2. **Self - Supervised Learning Method**: - Does not require expert - annotated data. The model can be trained through the self - supervised learning method to obtain high - quality representations of oil well intervals. - Use Siamese and Triplet loss functions for contrastive learning to evaluate the similarity between different intervals. 3. **Experimental Verification**: - Experiments were carried out on a log dataset containing more than 20 oil wells. The results show that all variants of Reguformer perform better than previous RNNs, the classic Transformer and its improved versions (such as Informer and Performer) in classification tasks and representation learning tasks. - Especially in dealing with missing and incorrect data, Reguformer shows significant advantages. ### Experimental Results - **Best Results**: In the oil well interval similarity task, Reguformer achieved an average PR AUC score of 0.983, which is comparable to the classic Transformer (PR AUC of 0.984) and far exceeds LSTM (0.951), Informer (0.974) and Performer (0.97). - **Robustness**: The performance of Reguformer in dealing with missing and incorrect data is significantly better than other models. Through these innovations, the author has successfully improved the efficiency and accuracy of oil well data processing, providing new tools and methods for data analysis in the petroleum industry.

Robust representations of oil wells' intervals via sparse attention mechanism

Development of Deep Transformer-Based Models for Long-Term Prediction of Transient Production of Oil Wells

Porosity and Permeability Prediction Using a Transformer and Periodic Long Short-Term Network

Rough Transformers for Continuous and Efficient Time-Series Modelling

A Transformer and LSTM-Based Approach for Blind Well Lithology Prediction

Transformer-Based Models Aid Prediction of Transient Production of Oil Wells

Long-Term, Multi-Variate Production Forecasting Using Non-Stationary Transformer

Attention as Robust Representation for Time Series Forecasting

Non-stationary Transformers: Exploring the Stationarity in Time Series Forecasting

Resformer: Combine quadratic linear transformation with efficient sparse Transformer for long-term series forecasting

Robformer: A robust decomposition transformer for long-term time series forecasting

Adaptive Spatio-Temporal Neural Networks Based on Machine Learning for Missing Well Log Prediction

Base on temporal convolution and spatial convolution transformer for fluid prediction through well logging data

Missing Well Log Reconstruction Using A Sequence Self-attention Deep Learning Framework

ContiFormer: Continuous-Time Transformer for Irregular Time Series Modeling

Probing the limit of hydrologic predictability with the Transformer network

Porosity prediction through well logging data: A combined approach of convolutional neural network and transformer model (CNN-transformer)

Time Series Representation Models

WaveRoRA: Wavelet Rotary Route Attention for Multivariate Time Series Forecasting

Non-stationary Transformers: Rethinking the Stationarity in Time Series Forecasting

A Transformer-based Framework For Multi-variate Time Series: A Remaining Useful Life Prediction Use Case