[1] REHER J, COUSINEAU E A, HEREID A, et al. Realizing dynamic and efficient bipedal locomotion on the humanoid robot DURUS[C]//IEEE International Conference on Robotics and Automation. IEEE, 2016: 1794-1801.
[2] 丁长涛. 随机不确定扰动下平面双足机器人动力学响应特性研究[D]. 杭州: 浙江大学, 2016.
    DING Changtao. Study on dynamic response characteristics of planar biped robot under randomly uncertain disturbance[D]. Hangzhou: Zhejiang University, 2016.
[3] ZHOU Shihua, SONG Guiqiu, REN Zhaohui, et al. Nonlinear dynamic analysis of coupled gear-rotor-bearing system with the effect of internal and external excitations[J]. Chinese Journal of Mechanical Engineering, 2016, 30(2): 281-292.
[4] 陈奇石. 强化学习在仿人机器人行走稳定控制上的研究及实现[D]. 广州: 华南理工大学, 2016.
    CHEN Qishi. Research and implementation of reinforcement learning on walking stability control of humanoid robots[D]. Guangzhou: South China University of Technology, 2016.
[5] 王文玺, 肖世德, 孟祥印, 等. 基于Agent的递阶强化学习模型与体系结构[J]. 机械工程学报, 2010, 46(2): 76-82.
    WANG Wenxi, XIAO Shide, MENG Xiangyin, et al. Agent-based hierarchical reinforcement learning model and architecture[J]. Journal of Mechanical Engineering, 2010, 46(2): 76-82.
[6] SHAO Shiyu, SUN Wenjun, YAN Ruqiang, et al. A deep learning approach for fault diagnosis of induction motors in manufacturing[J]. Chinese Journal of Mechanical Engineering, 2017, 30(6): 1347-1356.
[7] 侯文擎, 叶鸣, 李巍华. 基于改进堆叠降噪自编码的滚动轴承故障分类[J]. 机械工程学报, 2018, 54(7): 87-96.
    HOU Wenqing, YE Ming, LI Weihua. Fault classification of rolling bearings based on improved stack noise reduction self-coding[J]. Journal of Mechanical Engineering, 2018, 54(7): 87-96.
[8] HWANG K S, LIN J L, LI J S. Biped balance control by reinforcement learning[J]. Journal of Information Science and Engineering, 2016, 32(4): 1041-1060.
[9] SILVA I J, PERICO D H, HOMEM T P D, et al. Using reinforcement learning to improve the stability of a humanoid robot: Walking on sloped terrain[C]//2015 12th Latin American Robotics Symposium and 2015 3rd Brazilian Symposium on Robotics (LARS-SBR). IEEE, 2015: 210-215.
[10] WU W, GAO L. Posture self-stabilizer of a biped robot based on training platform and reinforcement learning[J]. Robotics and Autonomous Systems, 2017, 98: 42-55.
[11] WANG S, BRAAKSMA J, BABUSKA R, et al. Reinforcement learning control for biped robot walking on uneven surfaces[C]//International Joint Conference on Neural Networks. July 16-21, 2006, Vancouver, BC, Canada. IEEE, 2006: 4173-4178.
[12] 刘道远. 基于Q学习的欠驱动双足机器人行走控制研究[D]. 杭州: 浙江大学, 2013.
    LIU Daoyuan. Research on walking control of under-actuated biped robot based on Q-learning[D]. Hangzhou: Zhejiang University, 2013.
[13] XIE Z. Feedback control for Cassie with deep reinforcement learning[C]//2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Oct 1-5, 2018, Madrid, Spain. IEEE, 2018: 1241-1246.
[14] UYANIK I, ANKARALI M M, COWAN N J, et al. Identification of a vertical hopping robot model via harmonic transfer functions[J]. Transactions of the Institute of Measurement and Control, 2016, 38(5): 501-511.
[15] ZHANG Junyuan, JIN Yang, XIE Lisha, et al. Establishment and validation for the theoretical model of the vehicle airbag[J]. Chinese Journal of Mechanical Engineering, 2015, 29(3): 27-34.
[16] DIAS J E A, PEREIRA G A S, PALHARES R M. Longitudinal model identification and velocity control of an autonomous car[J]. IEEE Transactions on Intelligent Transportation Systems, 2015, 16(2): 776-786.
[17] QIN S J, LIN W, LJUNG L. A novel subspace identification approach with enforced causal models[J]. Automatica, 2005, 41(12): 2043-2053.
[18] STIMAC G, BRAUT S. Comparative analysis of PSO algorithms for PID controller tuning[J]. Chinese Journal of Mechanical Engineering, 2014, 28(5): 928-936.
[19] VAZQUEZ S, RODRIGUEZ J, RIVERA M, et al. Model predictive control for power converters and drives: Advances and trends[J]. IEEE Transactions on Industrial Electronics, 2017, 64(2): 935-947.
[20] PINSKER J E, LEE J B, DASSAU E, et al. Response to comment on Pinsker et al. Randomized crossover comparison of personalized MPC and PID control algorithms for the artificial pancreas[J]. Diabetes Care, 2016, 39(1): 1135-1142.