• CN: 11-2187/TH
  • ISSN: 0577-6686

Journal of Mechanical Engineering ›› 2023, Vol. 59 ›› Issue (8): 224-234.doi: 10.3901/JME.2023.08.224

Previous Articles     Next Articles

TD3 Algorithm Improving and Lane-merging Strategy Learning for Autonomous Vehicles

ZHANG Zhi-yong1,2, HUANG Da-yang2, HUANG Cai-xia1,3, HU Lin2, DU Rong-hua2   

  1. 1. Hunan Province Key Laboratory of Intelligent Manufacturing Technology for High-performance Mechanical Equipment, Changsha University of Science and Technology, Changsha 410114;
    2. College of Automobile and Mechanical Engineering,Changsha University of Science and Technology, Changsha 410114;
    3. Hunan Provincial Key Laboratory of Automotive Power and Transmission System,Hunan Institute of Technology, Xiangtan 411104
  • Received:2022-02-07 Revised:2022-10-25 Online:2023-04-20 Published:2023-06-16

Abstract: To enhance the comprehensive performance of automotive lane-merging, the Q-value estimation method of twin delayed deep deterministic policy gradient(TD3) algorithm and the reward function are improved. The automotive lane-merging model is formalized as the Markov decision process, and the influences of Q-value underestimated by TD3 algorithm on lane-merging strategy are analyzed. A Q-value estimation method based on weighted average of sample variance is proposed to enhance the Q-value estimation accuracy, when two Q-value estimation samples are obtained by performing Monte Carlo dropout on the dual target critic network. With giving priority to the completion of the lane-merging, a more perfect reward function is designed considering the safety,comfort and traffic efficiency. Based on the improved TD3 algorithm and the reward function, a lane-merging strategy of autonomous vehicles is learned and verified with BARK simulator. The results show that the improved TD3 algorithm significantly enhances the accuracy of Q-value estimation. Combined with the established reward function, the safety and ride comfort of lane-merging are improved while ensuring traffic efficiency.

Key words: autonomous vehicle, reinforcement learning, lane-merging strategy, Q-value estimation

CLC Number: