• CN: 11-2187/TH
  • ISSN: 0577-6686

Journal of Mechanical Engineering ›› 2020, Vol. 56 ›› Issue (9): 102-117.doi: 10.3901/JME.2020.09.102

Previous Articles     Next Articles

An Empirical Analysis about the Generalization Performance of Data-driven Fault Diagnosis Methods

ZHENG Huailiang, WANG Rixin, YANG Yuantao, YIN Jiancheng, XU Minqiang   

  1. Deep Space Exploration Research Center, Harbin Institute of Technology, Harbin 150001
  • Received:2019-04-08 Revised:2019-11-21 Online:2020-05-05 Published:2020-05-29

Abstract: In recent years, data-driven fault diagnosis methods have been widely researched. A prerequisite of ensuring those methods' effectiveness is that the data for training diagnosis models and data to be tested should be collected from the same machine and the working environment. However, it is very difficult to satisfy this prerequisite in the actual diagnosis problem, and only the historical fault data collected from other same-type machines or different operating conditions are available for training the diagnosis model. The validity of conventional data-driven fault diagnosis methods for the actual diagnosis scenarios between datasets with potential discrepancy has rarely been discussed yet. The possible factors that dominate the generalization performance of fault diagnosis methods are first analyzed theoretically, and then multiple cross-dataset diagnosis tasks are organized. Based on them the empirical analysis about the generalization performance of several data-driven fault diagnosis methods is conducted. It is found that the distribution discrepancy between datasets is a major factor to influence the generalization performance. Meanwhile, the further fundamental reason for generalization performance declines is also explained from the perspective of signal characteristics under both model difference and operating condition difference. The discussion is conducive to inspiring the studies of data-driven diagnosis methods that can handle actual diagnosis scenarios.

Key words: fault diagnosis, data-driven, generalization performance, empirical analysis

CLC Number: