• CN:11-2187/TH
  • ISSN:0577-6686

Journal of Mechanical Engineering ›› 2023, Vol. 59 ›› Issue (12): 17-27. doi: 10.3901/JME.2023.12.017

• Special Column: Manufacturing Big Data Analysis and Decision-Making

A Multi-Lossless Linear Model Learning Method with Data Privacy-Preserving

HUA Feng1,2, WANG Yasen1,2, JIN Junyang3, YUAN Ye1,2,3,4

  1. School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074;
  2. State Key Laboratory of Digital Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan 430074;
  3. HUST-Wuxi Research Institute, Wuxi 214000;
  4. School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074
  • Received: 2022-09-08 Revised: 2023-01-31 Online: 2023-06-20 Published: 2023-08-15
  • Corresponding author: YUAN Ye (corresponding author), male, born in 1986, PhD, professor, doctoral supervisor. His main research interests are data-driven modelling theory and its industrial applications. E-mail: yye@hust.edu.cn
  • About the authors: HUA Feng, male, born in 1995, PhD candidate. His main research interest is data-driven mechanistic modelling in industrial applications. E-mail: feng_hua@hust.edu.cn; WANG Yasen, male, born in 1995, PhD candidate. His main research interests are machine learning and system identification. E-mail: arthinw@hust.edu.cn; JIN Junyang, male, born in 1991, PhD. His main research interests are machine learning and network topology inference. E-mail: jj415@alumni.cam.ac.uk
  • Funding:
    Supported by the National Natural Science Foundation of China (92167201, 62203182).

Abstract: With the development of industrial big data technologies, manufacturing enterprises collect and analyse production data to improve tasks such as prediction and diagnosis. However, these enterprises are constrained by technical bottlenecks in modelling and computing power, which make it difficult for them to analyse data efficiently on their own. When they collaborate with other parties, they bear the risk of information leakage and also find it difficult to guarantee that model performance is lossless. To address these scenarios, a multi-party lossless linear model learning method with data privacy preservation is proposed. First, a multi-party collaborative computing framework is built, and a one-way data encryption algorithm is designed to protect the privacy of data that leaves the data holder's site; each collaborator then trains a linear model separately on the encrypted data. Second, the relationship between the linear model and the dataset is analysed, and a lossless aggregation algorithm for linear models is proposed. Finally, the method is validated on typical industrial-scenario data. The experimental results show that the proposed framework obtains a global model with lossless performance while protecting the privacy of the data holders.

Key words: industrial big data, privacy-preserving machine learning, federated learning, data analysis
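
The abstract does not spell out the encryption or aggregation algorithms, so the following is only a minimal sketch of what "lossless aggregation" can mean for linear models, not the authors' method. It assumes ordinary least squares, a random orthogonal matrix as a stand-in for the one-way encryption, and that each party reports its locally trained weights together with the Gram matrix of its masked data. All names (`train_local`, `aggregate`, `mask`) are hypothetical.

```python
import numpy as np

def train_local(X_masked, y):
    """Fit a least-squares model on one party's masked data.

    Returns the Gram matrix of the masked features and the local weights.
    """
    gram = X_masked.T @ X_masked
    weights = np.linalg.solve(gram, X_masked.T @ y)
    return gram, weights

def aggregate(local_results):
    """Combine local models, weighting each by its Gram matrix.

    Since gram_k @ weights_k equals X_k^T y_k, the result matches the
    least-squares solution on the pooled masked data.
    """
    gram_sum = sum(g for g, _ in local_results)
    moment_sum = sum(g @ w for g, w in local_results)
    return np.linalg.solve(gram_sum, moment_sum)

rng = np.random.default_rng(0)
n_features = 4
true_w = rng.normal(size=n_features)

# Three hypothetical data holders, each with private (X_k, y_k).
parties = []
for n_rows in (50, 80, 30):
    X = rng.normal(size=(n_rows, n_features))
    y = X @ true_w + 0.1 * rng.normal(size=n_rows)
    parties.append((X, y))

# Assumption: a random orthogonal mask shared only among the data holders
# stands in for the paper's one-way encryption.
mask, _ = np.linalg.qr(rng.normal(size=(n_features, n_features)))

local_results = [train_local(X @ mask, y) for X, y in parties]
w_masked = aggregate(local_results)
w_global = mask @ w_masked  # data holders map the model back to raw features

# Reference: least squares on the pooled raw data (never shared in practice).
X_all = np.vstack([X for X, _ in parties])
y_all = np.concatenate([y for _, y in parties])
w_central, *_ = np.linalg.lstsq(X_all, y_all, rcond=None)

print("max abs difference:", np.max(np.abs(w_global - w_central)))
```

In this sketch the aggregated model agrees with the centrally trained one up to floating-point error, which is the sense of "lossless" illustrated here. A linear mask of this kind is far weaker than a real encryption scheme and is used only to keep the example short.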

CLC number: