• CN: 11-2187/TH
  • ISSN: 0577-6686

Journal of Mechanical Engineering ›› 2019, Vol. 55 ›› Issue (17): 133-144.doi: 10.3901/JME.2019.17.133

Previous Articles     Next Articles

Two-stage Unsupervised Feature Selection Method Oriented to Manufacturing Procedural Data

ZHANG Jie1,2, SHENG Xia1, ZHANG Peng1, QIN Wei1, ZHAO Xinming1   

  1. 1. Institute of Intelligent Manufacturing and Information Engineering, Shanghai Jiao Tong University, Shanghai 200240;
    2. College of Mechanical Engineering, Donghua University, Shanghai 201620
  • Received:2018-09-19 Revised:2019-02-03 Online:2019-09-05 Published:2020-01-07

Abstract: In a modernized manufacturing workshop, myriads of data are incessantly produced and a large part of those are stored in the industrial big data platform of the modern manufacturing enterprise in the form of structuralized unlabeled raw data. Those manufacturing data are of great latent exploitative value, because of their characteristics of high-noise and high-redundancy, however, direct analysis and utilization of them are impossible. Aiming at reducing the redundancy of manufacturing procedural data and excavating their local structure, a two-stage unsupervised feature selection method is proposed. In the first stage of the method, subset of the original feature set generated by genetic algorithm(GA) is utilized as the input features of radius basis function neural network(RBFNN), to reconstruct the unabridged original feature set. The ratio of dimensionality reduction and reconstructional accuracy are calculated jointly as the fitness function of GA, which is optimized by iteration to learn a low-dimensional representation of high-dimensional features, removing redundant and noisy features of the origin feature set. In the second stage, Laplacian score(LS) is employed to evaluate the locality preserving power of the remaining features, unearthing features which are prone to improving the performance of classification. By comparing with other unsupervised feature selection method, the method proposed here is proven more effective in reducing the redundancy of manufacturing data and simultaneously enhancing the performance of classification.

Key words: unsupervised feature selection, genetic algorithm, radius basis function neural network, Laplacian score, manufacturing procedural data

CLC Number: