Feature-Level Attention-Guided Multitask CNN for Fault Diagnosis and Working Conditions Identification of Rolling Bearing

Huan Wang,Zhiliang Liu,Dandan Peng,Mei Yang,Yong Qin
DOI: https://doi.org/10.1109/tnnls.2021.3060494
IF: 14.255
2021-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract:Accurate and real-time fault diagnosis (FD) and working conditions identification (WCI) are the key to ensuring the safe operation of mechanical systems. We observe that there is a close correlation between the fault condition and the working condition in the vibration signal. Most of the intelligent FD methods only learn some features from the vibration signals and then use them to identify fault categories. They ignore the impact of working conditions on the bearing system, and such a single-task learning method cannot learn the complementary information contained in multiple related tasks. Therefore, this article is devoted to mining richer and complementary globally shared features from vibration signals to complete the FD and WCI of rolling bearings at the same time. To this end, we propose a novel multitask attention convolutional neural network (MTA-CNN) that can automatically give feature-level attention to specific tasks. The MTA-CNN consists of a global feature shared network (GFS-network) for learning globally shared features and <span class="mjpage"><svg xmlns:xlink="http://www.w3.org/1999/xlink" width="2.066ex" height="2.176ex" style="vertical-align: -0.338ex;" viewBox="0 -791.3 889.5 936.9" role="img" focusable="false" xmlns="http://www.w3.org/2000/svg"><g stroke="currentColor" fill="currentColor" stroke-width="0" transform="matrix(1 0 0 -1 0 0)"> <use xlink:href="#MJMATHI-4B" x="0" y="0"></use></g></svg></span> task-specific networks with feature-level attention module (FLA-module). This architecture allows the FLA-module to automatically learn the features of specific tasks from globally shared features, thereby sharing information among different tasks. We evaluated our method on the wheelset bearing data set and motor bearing data set. The results show that our method has a better performance than the state-of-the-art deep learning methods and strongly prove that our multitask learning mechanism can improve the results of each task.<svg xmlns="http://www.w3.org/2000/svg" style="display: none;"><defs id="MathJax_SVG_glyphs"><path stroke-width="1" id="MJMATHI-4B" d="M285 628Q285 635 228 637Q205 637 198 638T191 647Q191 649 193 661Q199 681 203 682Q205 683 214 683H219Q260 681 355 681Q389 681 418 681T463 682T483 682Q500 682 500 674Q500 669 497 660Q496 658 496 654T495 648T493 644T490 641T486 639T479 638T470 637T456 637Q416 636 405 634T387 623L306 305Q307 305 490 449T678 597Q692 611 692 620Q692 635 667 637Q651 637 651 648Q651 650 654 662T659 677Q662 682 676 682Q680 682 711 681T791 680Q814 680 839 681T869 682Q889 682 889 672Q889 650 881 642Q878 637 862 637Q787 632 726 586Q710 576 656 534T556 455L509 418L518 396Q527 374 546 329T581 244Q656 67 661 61Q663 59 666 57Q680 47 717 46H738Q744 38 744 37T741 19Q737 6 731 0H720Q680 3 625 3Q503 3 488 0H478Q472 6 472 9T474 27Q478 40 480 43T491 46H494Q544 46 544 71Q544 75 517 141T485 216L427 354L359 301L291 248L268 155Q245 63 245 58Q245 51 253 49T303 46H334Q340 37 340 35Q340 19 333 5Q328 0 317 0Q314 0 280 1T180 2Q118 2 85 2T49 1Q31 1 31 11Q31 13 34 25Q38 41 42 43T65 46Q92 46 125 49Q139 52 144 61Q147 65 216 339T285 628Z"></path></defs></svg>
computer science, artificial intelligence, theory & methods,engineering, electrical & electronic, hardware & architecture
What problem does this paper attempt to address?