• CN: 11-2187/TH
  • ISSN: 0577-6686

Journal of Mechanical Engineering ›› 2022, Vol. 58 ›› Issue (24): 289-299.doi: 10.3901/JME.2022.24.289

Previous Articles     Next Articles

Research on 3D Object Detection Based on Laser Point Cloud and Image Fusion

LIU Yonggang1,2, YU Fengning1, ZHANG Xinjie2, CHEN Zheng3, QIN Datong1   

  1. 1. State Key Laboratory of Mechanical Transmissions, College of Mechanical and Vehicle Engineering, Chongqing University, Chongqing 400044;
    2. State Key Laboratory of Automotive Simulation and Control, Jinlin University, Changchun 130025;
    3. Faculty of Transportation Engineering in Kunming University of Science and Technology, Kunming 650500
  • Received:2022-01-19 Revised:2022-09-26 Online:2022-12-20 Published:2023-04-03

Abstract: At present, 3D object detection based on the fusion of lidar and camera has received extensive attention. However, most fusion algorithms are difficult to accurately detect small target objects such as pedestrians and cyclists. Therefore, a feature fusion network based on the self-attention mechanism is proposed, which fully considers the local feature information to achieve accurate 3D object detection. Firstly, to reduce the spatial search range of the point cloud, the Faster-RCNN is improved to form a candidate box. Then, the frustum point cloud was extracted according to the projection relationship between the lidar and the camera. Secondly, a Self-Attention PointNet based on the self-attention mechanism is proposed to segment the original point cloud data within the scope of the frustum. Finally, while using the PointNet and T-Net to predict the 3D bounding box parameters, the regularization term is considered in the loss function to achieve higher convergence accuracy. The KITTI dataset is used for verification and testing. The results show that this method is obviously superior to F-PointNet and the detection accuracy of cars, pedestrians, and cyclists has been greatly improved, and it has higher accuracy than mainstream 3D object detection networks.

Key words: lidar, 3D object detection, point cloud fusion, attention mechanism, deep learning

CLC Number: