• CN:11-2187/TH
  • ISSN:0577-6686

Journal of Mechanical Engineering ›› 2020, Vol. 56 ›› Issue (20): 47-58. doi: 10.3901/JME.2020.20.047

• Materials Science and Engineering •


Control of the Deep Drawing Process Based on the Integration of Deep Reinforcement Learning and Finite Element Simulation

GUO Peng, ZHANG Xinyan, YU Jianbo

  1. School of Mechanical Engineering, Tongji University, Shanghai 201804
  • Received: 2019-08-18 Revised: 2020-04-18 Online: 2020-10-20 Published: 2020-12-18
  • About the authors: GUO Peng, male, born in 1994. His research focuses on production system control. E-mail: guopeng19940821@163.com. ZHANG Xinyan, female, born in 1974, lecturer and supervisor of master's students. Her research focuses on logistics system planning and analysis. E-mail: alicezxy@tongji.edu.cn. YU Jianbo (corresponding author), male, born in 1978, Ph.D., professor and doctoral supervisor. His research covers intelligent prognostics, maintenance and reliability of equipment, quality control of complex manufacturing processes, machine learning, and design optimization of production systems. E-mail: jbyu@tongji.edu.cn
  • Funding:
    Supported by the National Natural Science Foundation of China (71777173).

Control of Deep Drawing Process Based on Integration of Deep Reinforcement Learning and Finite Element Method

GUO Peng, ZHANG Xinyan, YU Jianbo   

  1. School of Mechanical Engineering, Tongji University, Shanghai 201804
  • Received: 2019-08-18 Revised: 2020-04-18 Online: 2020-10-20 Published: 2020-12-18

Abstract: The blank holder force during sheet metal deep drawing is a key parameter determining finished-product quality. Traditional blank holder force control methods usually require modeling the highly nonlinear deep drawing process, so their control results deviate considerably from reality. A control model for the sheet metal deep drawing process based on the integration of deep reinforcement learning and finite element simulation is proposed. It exploits the predictive power of deep neural networks to extract state information from the drawing process and make reliable predictions, and it combines this with the decision-making ability of reinforcement learning to learn and optimize the blank holder force control policy, thereby avoiding both the fitting of an accurate system dynamics model and the acquisition of prior knowledge. In addition, targeting the crack and wrinkle defects common in sheet metal deep drawing, a forming-quality evaluation function is established to provide the reward signal that guides the learning process, and finite element simulation constitutes the environment model of the deep reinforcement learning. Experiments show that the deep reinforcement learning model can effectively optimize the blank holder force control policy and improve product quality. The proposed control model uses model-free deep reinforcement learning to avoid fitting a system model of the deep drawing process and improves the control performance of the blank holder force policy; combined with a recurrent neural network, it also addresses the partial observability of the sheet metal deep drawing process.

Keywords: sheet metal deep drawing, quality control, deep reinforcement learning, finite element simulation, optimal control

Abstract: The blank holder force in the deep drawing process is the key parameter determining the quality of finished products. Traditional blank holder force control methods often need to model the highly nonlinear deep drawing process, resulting in large deviations of the control results from the real situation. A control model based on the integration of deep reinforcement learning and the finite element method is proposed: it uses the prediction ability of deep neural networks to extract the state information of the deep drawing process and predict the system state, uses the decision-making ability of reinforcement learning to optimize the control policy, and avoids both the fitting of the highly nonlinear process dynamics and the acquisition of prior knowledge. Besides, according to the common quality defects in the deep drawing process, namely cracks and wrinkles, an evaluation function of deep drawing performance is established to provide the reward signal guiding the deep reinforcement learning, and the environment model of the deep reinforcement learning is constructed with finite element simulation. Experiments show that the deep reinforcement learning model can effectively optimize the blank holder force control policy and improve product quality. The proposed blank holder force control model uses model-free deep reinforcement learning to avoid fitting a model of the deep drawing process and improves the control effect of the blank holder force control policy, while the use of a recurrent neural network addresses the partial observability of the deep drawing process.
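The closed loop described above can be sketched in a few lines of Python. Everything below is an illustrative stand-in, not the paper's code: the toy defect model, the blank holder force (BHF) range, and the constant-BHF search are all hypothetical. Only the structure follows the abstract: a simulator acts as the RL environment, the action at each drawing step is a BHF value, and the reward is a forming-quality score penalizing crack and wrinkle tendencies.

```python
def forming_reward(crack_index, wrinkle_index):
    """Forming-quality evaluation: lower defect indices yield higher reward."""
    return -(crack_index + wrinkle_index)

class ToyFEMEnv:
    """Stand-in for the FEM environment; a real setup would drive a finite
    element simulation of the drawing process instead of this toy model."""
    def __init__(self, steps=10):
        self.steps = steps
        self.t = 0

    def reset(self):
        self.t = 0
        return 0.0  # initial observation (e.g. normalized punch travel)

    def step(self, bhf):
        self.t += 1
        # Toy defect behavior: too little BHF wrinkles, too much BHF cracks.
        wrinkle = max(0.0, 1.0 - bhf / 50.0)
        crack = max(0.0, (bhf - 80.0) / 50.0)
        obs = self.t / self.steps
        done = self.t >= self.steps
        return obs, forming_reward(crack, wrinkle), done

# Baseline: search over constant-BHF schedules. A DRL agent would instead
# learn a state-dependent BHF policy from the per-step reward signal.
env = ToyFEMEnv()
best_bhf, best_ret = None, float("-inf")
for bhf in range(10, 121, 10):
    env.reset()
    ret, done = 0.0, False
    while not done:
        _, r, done = env.step(float(bhf))
        ret += r
    if ret > best_ret:
        best_bhf, best_ret = float(bhf), ret
print(best_bhf)
```

In the approach the abstract describes, a learned policy replaces the constant-BHF search, and a recurrent network summarizes the observation history so the agent can act under the partial observability of the drawing process.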

Key words: deep drawing, quality control, deep reinforcement learning, finite element analysis, optimal control
