• CN: 11-2187/TH
  • ISSN: 0577-6686

Journal of Mechanical Engineering ›› 2025, Vol. 61 ›› Issue (21): 204-212.doi: 10.3901/JME.2025.21.204

Previous Articles    

Gait Switching Method for Humanoid Robot Integrating Vision-language Model and Proximal Policy Optimization Algorithm

DU Guofeng1, SHAO Shibo1, LI Shanglin1, LIN Chengran2, CAO Zhengcai2   

  1. 1. College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029;
    2. State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150006
  • Received:2025-02-28 Revised:2025-06-23 Published:2025-12-27

Abstract: Gait switching is the core of humanoid robots’ ability to achieve seamless locomotion across multiple terrains. Existing methods predominantly rely on proprioception and lack the ability to perceive external environmental features. To address this, a gait switching method is proposed by integrating the semantic mapping capabilities of vision-language models (VLMs) with the adaptive learning characteristics of the proximal policy optimization (PPO) algorithm. First, human-like gait sequences are generated through motion retargeting using a linear mapping. Then, a reward-shaped PPO algorithm trains gait primitives to construct a multi-terrain gait library. Next, a gait scheduler based on a VLM is designed to dynamically match suitable gait primitives. After that, polynomial functions are constructed via Lagrange interpolation to constrain joint trajectories, enabling smooth and adaptive gait transitions. Finally, experiments on autonomous gait switching in representative scenarios validate the effectiveness of the proposed method.

Key words: humanoid robot, gait switching, vision-language model, reinforcement learning

CLC Number: