• CN: 11-2187/TH
  • ISSN: 0577-6686

Journal of Mechanical Engineering ›› 2023, Vol. 59 ›› Issue (12): 78-88.doi: 10.3901/JME.2023.12.078

Previous Articles     Next Articles

Interactive Operation Agent Scheduling Method for Job Shop Based on Deep Reinforcement Learning

CHEN Ruiqi1, LI Wenxin1,2, WANG Chuanyang1, YANG Hongbing1   

  1. 1. School of Mechanical and Electric Engineering, Soochow University, Suzhou 215000;
    2. School of Management, Shanghai University, Shanghai 200230
  • Received:2022-07-02 Revised:2023-02-06 Online:2023-06-20 Published:2023-08-15

Abstract: Job shop scheduling problem(JSSP) is difficult to obtain high-quality solution quickly due to NP hard attribute, and rescheduling occurs frequently due to the random disturbances of production scenarios. Based on deep reinforcement learning, a novel interactive operation agent(IOA) scheduling model framework is proposed. Through analysis of the constraint relationship between process route and processing equipment among operations, the processing processes in job shop are constructed as operation agents. The interaction mechanism between operation agents is designed, and each agent can interact with each other and update its own feature vector according to their relationship. Further, a deep neural network is constructed based on the operation characteristics and the earliest processing time to fit the action value function. As a result, the scheduling model can generate the scheduling strategy according to the system state and the characteristics of each operation agent. Double DQN algorithm is used to train IOA scheduling model, and the introduction of empirical playback mechanism effectively breaks the correlation between sequence training samples. The trained model can quickly generate high-quality scheduling scheme, and effectively execute rescheduling production strategy in case of machine failure. Experimental results show that the proposed IOA scheduling method is superior to greedy algorithm and heuristic scheduling rules, and has good robustness and generalization ability.

Key words: Job shop scheduling, deep reinforcement learning, operation agents, machine failure, double DQN

CLC Number: