Transfer Learning Soft Sensor Modeling Based on Two-dimensional Domain-adaption Stochastic Configuration Network

Xiaogang Deng, Jing Zhang, Lumeng Huang, Yue Zhao, Ping Wang
DOI: https://doi.org/10.1109/jsen.2024.3487840
IF: 4.3
2024-01-01
IEEE Sensors Journal
Abstract: Stochastic configuration networks (SCNs) are widely used in soft sensor modeling because they generalize well and determine their model structure automatically. Classical SCN-based soft sensors are usually effective when an industrial process involves only a single operation mode. In practice, however, the operation mode often varies because of factors such as market demand, raw material changes, and ambient temperature. Abundant labeled samples are collected in the historical modes, but in a new operation mode labeled samples are scarce and cannot adequately support the training of a soft sensor model. How to make full use of the historical modes to assist soft sensor modeling of the new mode is therefore a meaningful and challenging problem. To handle this problem, this paper proposes a transfer learning soft sensor modeling method based on a two-dimensional domain-adaption SCN (TD-DASCN). In this method, a domain-adaption SCN modeling framework is designed for transfer learning soft sensor development by fusing the abundant labeled samples from historical modes (source domain) with a few labeled samples from new modes (target domain). To reduce the difference in data distribution between the source and target domains, feature alignment is performed with the geodesic flow kernel method. To avoid possible negative transfer, the source domain loss function is constrained according to the degree to which the source domain samples contribute to the transfer. Finally, the effectiveness of the proposed method is verified on two industrial cases. Compared with the basic DASCN soft sensor method, the proposed method reduces the average prediction RMSE by 30.0% and 9.1% in the two tested cases, respectively.
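The abstract does not give the form of the constrained source domain loss, so the following is only a minimal sketch of one way such an objective could look: a squared-error loss in which each source sample carries its own weight and the target samples are added at a fixed scale. The function names, the per-sample weights, and the `target_scale` parameter are all assumptions made for illustration and are not taken from the paper. The two helpers show how an RMSE and a relative RMSE reduction, such as the 30.0% and 9.1% figures quoted above, are typically computed.

```python
import math


def weighted_transfer_loss(source_errors, source_weights, target_errors,
                           target_scale=1.0):
    """Hypothetical fused loss: weighted source term plus scaled target term.

    source_errors / target_errors are prediction residuals (y - y_hat).
    source_weights are assumed per-sample contribution weights; a weight
    of 0 removes a source sample that would cause negative transfer.
    """
    if len(source_errors) != len(source_weights):
        raise ValueError("each source error needs exactly one weight")
    source_term = sum(w * e * e for e, w in zip(source_errors, source_weights))
    target_term = target_scale * sum(e * e for e in target_errors)
    return source_term + target_term


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(squared / len(y_true))


def relative_reduction(baseline_rmse, proposed_rmse):
    """Fractional RMSE reduction of a proposed model against a baseline."""
    return (baseline_rmse - proposed_rmse) / baseline_rmse
```

Under these assumptions, setting a source weight close to zero down-weights a historical sample that does not match the new mode, while the target term keeps the few labeled target samples in the objective.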