Real-time Scheduling Simulation Optimization Method for Smart Production Lines Based on Digital Twin and Reinforcement Learning

doi:10.3901/JME.260224

Abstract

Production scheduling is a long-standing research focus in industry because it sets the pace at which a production line operates. As intelligent manufacturing evolves, smart production scheduling has become an active research frontier. Multi-source stochastic disturbances, such as production task variations and the coupling of manufacturing resources, make it difficult to balance scheduling efficiency and accuracy during dynamic production. To address this challenge, a real-time scheduling simulation optimization method based on digital twin (DT) and reinforcement learning (RL) is proposed. DT technology is used to build high-fidelity models of production lines, establishing a hierarchical virtual production simulation environment. An improved Q-Learning algorithm is developed to build a scheduling optimization agent; it incorporates triple state space reconstruction, a multi-dimensional reward function, and a dual exploration strategy to mitigate the curse of dimensionality and the limited robustness of traditional algorithms. Furthermore, a hierarchical execution control architecture based on a perception-decision-execution loop spanning the production simulation process is established to achieve deep fusion between the DT and the intelligent simulation agent. A case study on an aerospace product final assembly line demonstrates the effectiveness of the proposed method. The results show that the execution distances yielded by five classical scheduling rules are 6.38% to 16.50% higher than those of the proposed method, indicating a substantial improvement in the collaborative efficiency of manufacturing resources.
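For readers unfamiliar with Q-Learning as a scheduling agent, the following is a minimal sketch of the standard tabular algorithm on a toy single-machine sequencing problem. It is not the improved algorithm proposed in the paper: the triple state space reconstruction, multi-dimensional reward function, and dual exploration strategy are not reproduced, and plain epsilon-greedy exploration stands in for them. The function names, the toy processing times, the reward (negative completion time of the chosen job), and all hyperparameters are assumptions made for illustration only.

```python
import random


def train_q_table(proc_times, episodes=3000, alpha=0.5, gamma=1.0,
                  epsilon=0.2, seed=0):
    """Standard tabular Q-Learning on a toy single-machine problem.

    State  : frozenset of jobs not yet scheduled (hypothetical encoding).
    Action : index of the job to run next.
    Reward : negative completion time of the chosen job, so maximizing
             return minimizes the total completion time.
    """
    rng = random.Random(seed)
    q = {}  # (state, action) -> estimated value, defaults to 0.0
    for _ in range(episodes):
        remaining = frozenset(range(len(proc_times)))
        clock = 0
        while remaining:
            actions = sorted(remaining)
            if rng.random() < epsilon:
                job = rng.choice(actions)  # explore
            else:
                # exploit: pick the job with the highest current estimate
                job = max(actions, key=lambda a: q.get((remaining, a), 0.0))
            clock += proc_times[job]
            reward = -clock
            nxt = remaining - {job}
            best_next = max((q.get((nxt, a), 0.0) for a in nxt), default=0.0)
            old = q.get((remaining, job), 0.0)
            # Q-Learning update: move the estimate toward the bootstrapped target
            q[(remaining, job)] = old + alpha * (reward + gamma * best_next - old)
            remaining = nxt
    return q


def greedy_schedule(q, proc_times):
    """Read out the job order implied by the learned Q-table."""
    remaining = frozenset(range(len(proc_times)))
    order = []
    while remaining:
        job = max(sorted(remaining), key=lambda a: q.get((remaining, a), 0.0))
        order.append(job)
        remaining = remaining - {job}
    return order


def total_completion_time(order, proc_times):
    """Sum of job completion times for a given processing order."""
    clock = total = 0
    for job in order:
        clock += proc_times[job]
        total += clock
    return total


if __name__ == "__main__":
    times = [4, 2, 7, 1]  # illustrative processing times, not from the paper
    q_table = train_q_table(times)
    schedule = greedy_schedule(q_table, times)
    print(schedule, total_completion_time(schedule, times))
```

In this toy setting the state space has only 2^n entries, so a lookup table suffices. On a real production line the state space grows much faster, which is the curse of dimensionality the abstract refers to.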

Key words: smart production lines, production scheduling, digital twin, reinforcement learning, simulation optimization

CLC Number: TG156

YANG Zehao, DONG Wei, HUANG Sihan, YIN Yanchao, DONG Liyang, ZHENG Zujie. Real-time Scheduling Simulation Optimization Method for Smart Production Lines Based on Digital Twin and Reinforcement Learning[J]. Journal of Mechanical Engineering, 2026, 62(5): 12-25.

URL: http://www.cjmenet.com.cn/EN/10.3901/JME.260224

http://www.cjmenet.com.cn/EN/Y2026/V62/I5/12
