Compression and regularized optimization of modules stacked residual deep fuzzy system with application to time series prediction

Yunxia Liu,Xiao Lu,Wei Peng,Chengdong Li,Haixia Wang
DOI: https://doi.org/10.1016/j.ins.2022.06.088
IF: 8.1
2022-08-01
Information Sciences
Abstract:The double-input-rule-modules stacked deep fuzzy method (DIRM-DFM) has attracted much attention because of its interpretability and prediction accuracy. However, when confronted with high-dimensional data and large numbers of fuzzy rules, the original DIRM-DFM has three limitations. First, the identity mapping that appears in the depth layer causes degradation of the prediction performance. Second, redundant fuzzy rules increase the complexity of the structure. Third, the overfitting phenomenon in the parameter-learning process imposes restrictions on the generalization ability of the system. Therefore, achieving structural simplification and high-accuracy prediction remains challenging. In this paper, a compression and regularized optimization scheme for the modules stacked residual deep fuzzy system (CDIRM-RDFS) is presented to address the above-mentioned limitations. To overcome the first limitation, a residual approximation mechanism was developed to approximate the actual output layer by layer. To obtain more compact fuzzy modules, singular value decomposition was conducted to eliminate the unimportant fuzzy rules, thus surmounting the second limitation. Finally, to prevent overfitting, regularization terms combining the l1 norm with the l2 norm were added to the loss function to penalize the parameters in the learning process and thereby solve the third problem. The performance of the proposed CDIRM-RDFS was validated using various types of datasets, including seasonal and non-seasonal as well as cyclical and non-cyclical time series. Compared with some existing popular shallow and deep models, the experimental results indicate that the proposed methodology can simplify the system architecture while improving the accuracy, effectiveness, and robustness of the prediction model.
computer science, information systems
What problem does this paper attempt to address?