• CN: 11-2187/TH
  • ISSN: 0577-6686

Journal of Mechanical Engineering ›› 2022, Vol. 58 ›› Issue (23): 196-207.doi: 10.3901/JME.2022.23.196

Previous Articles     Next Articles

Clustering Method of High-dimensional Time Series SCADA Data from Wind Turbines Based on Operational Laws and TICC Algorithm

XIAO Zhao1, DENG Jiewen1, LIU Xiaoming2, DUAN Shuyong2, XU Shouliang3   

  1. 1. School of Mechanical Engineering, Hunan University of Science and Technology, Xiangtan 411201;
    2. State Key Laboratory of Reliability and Intelligence of Electrical Equipment, Hebei University of Technology, Tianjin 300401;
    3. Huadian Zhengzhou Mechanical Design Institute Co., Ltd., Zhengzhou 450046
  • Received:2022-03-07 Revised:2022-10-10 Online:2022-12-05 Published:2023-02-08

Abstract: Aiming at the working condition identification of high-dimensional SCADA time series data from wind turbines, a new time series clustering method is proposed by combining the operational laws of wind turbines and the TICC algorithm. A small set of key parameters such as wind velocity, rotational speed and pitch angle are selected from the high-dimensional SCADA data as the cluster datasets. The repeated patterns in temporal data of the key parameters and the theoretical operational conditions of wind turbines are analyzed. Some historical data of key parameters are used to obtain the optimal feature parameters by the TICC algorithm. The optimal feature parameters are then taken as the input of the TICC algorithm which clusters new time series data of the key parameters. Finally, the SCADA time series data of non-key parameters are segmented and clustered based on the clustering results of the key parameters. The SCADA data of 2.5 MW doubly-fed induction generator wind turbine are used to validate the present method. Performance comparison of TICC algorithm, FCM algorithm, GMM algorithm and K-Means algorithm with respect to SCADA data cluster are analyzed. The validation example and comparative study show that the proposed method takes full advantage of the theoretical knowledge and the TICC algorithm. It can efficiently segment and cluster high-dimensional time series SCADA data while the clustering results are consistent with the theoretical results.

Key words: wind turbine, SCADA data, TICC algorithm, time series clustering

CLC Number: