基于图像和点云实例匹配的智能车目标检测和跟踪

doi:10.3901/JME.2024.22.302

机械工程学报 ›› 2024, Vol. 60 ›› Issue (22): 302-310.doi: 10.3901/JME.2024.22.302

扫码分享

基于图像和点云实例匹配的智能车目标检测和跟踪

李尚杰, 殷国栋, 耿可可, 刘帅鹏

东南大学机械工程学院南京 211189

收稿日期:2024-01-21 修回日期:2024-07-06 出版日期:2024-11-20 发布日期:2025-01-02
作者简介:李尚杰,男,1998年出生。主要研究方向为智能车环境感知、目标检测与跟踪、多传感器融合。E-mail:shangjie-li@seu.edu.cn;殷国栋(通信作者),男,1976年出生,博士,教授,博士研究生导师。主要研究方向为智能网联汽车、无人驾驶与智能辅助驾驶系统、车路协同、新能源汽车控制系统、车辆动力学及其控制等。E-mail:ygd@seu.edu.cn
基金资助:
长三角科技创新体联合攻关(2023CSJGG1600)、国家自然科学基金(52272414)和国家重点研发计划(2023YFD2000303)资助项目。

Object Detection and Tracking Based on Image and Point Clouds Instance Matching for Intelligent Vehicles

LI Shangjie, YIN Guodong, GENG Keke, LIU Shuaipeng

School of Mechanical Engineering, Southeast University, Nanjing 211189

Received:2024-01-21 Revised:2024-07-06 Online:2024-11-20 Published:2025-01-02
About author:10.3901/JME.2024.22.302

摘要/Abstract

摘要： 针对智能车的环境感知任务，为了结合相机图像中丰富的语义信息与激光雷达点云中准确的位置信息，提出一种基于图像和点云实例匹配的融合检测方法。通过实例分割网络预测图像中目标的实例掩膜，通过透视投影变换将点云投影至图像平面，根据每个目标的实例掩膜提取属于该目标的点云，然后利用聚类算法去除点云中的噪声，并利用凸包轮廓逼近算法拟合点云的三维轮廓，实现对目标的融合检测。在所提出的融合检测方法的基础上，设计跟踪门实现多目标的数据关联与管理，基于卡尔曼滤波对目标进行跟踪并估计各目标的运动状态。试验结果表明，该方法能够有效地对图像数据和点云数据进行信息融合，从而准确快速地对目标的位置、尺寸、方向角进行拟合并对目标的速度进行估计，且在不同试验场景中表现出鲁棒性。

关键词: 智能车, 目标检测, 目标跟踪, 传感器融合, 实例分割, 透视投影

Abstract: In the environment perception task of intelligent vehicles, in order to combine the rich semantic information in the camera image with the accurate spatial information in the lidar point clouds, a fusion detection method based on image and point clouds instance matching is proposed. To achieve the fusion detection, the instance masks of the targets in the image are predicted by the instance segmentation network, the point clouds are projected to the image plane through perspective projection transformation, the point clouds belonging to the target are extracted according to the instance mask of each target, and then the clustering algorithm is used to remove the noise, and the convex hull approximating algorithm is used to fit the 3D bounding box of the target. Based on the fusion detection method, a gate is designed to realize multi-target data association and management, and the Kalman filter is used to track the target and estimate the motion state of the target. The experimental results show that the method can effectively fuse the information from image data and point clouds data, accurately and quickly fit the position, size, and direction of the target and estimate the speed of the target, and show robustness in different experimental scenarios.

Key words: intelligent vehicle, object detection, object tracking, sensor fusion, instance segmentation, perspective projection

中图分类号:

TG156

李尚杰, 殷国栋, 耿可可, 刘帅鹏. 基于图像和点云实例匹配的智能车目标检测和跟踪[J]. 机械工程学报, 2024, 60(22): 302-310.

LI Shangjie, YIN Guodong, GENG Keke, LIU Shuaipeng. Object Detection and Tracking Based on Image and Point Clouds Instance Matching for Intelligent Vehicles[J]. Journal of Mechanical Engineering, 2024, 60(22): 302-310.

参考文献

[1] PENG P，GENG K，YIN G，et al. Adaptive multi-modal fusion instance segmentation for CAEVs in complex conditions：Dataset，framework and verifications[J]. Chinese Journal of Mechanical Engineering，2021，34(1)：1-11.
[2] LIU L，OUYANG W，WANG X，et al. Deep learning for generic object detection：A survey[J]. International Journal of Computer Vision，2020，128(2)：261-318.
[3] REN S，HE K，GIRSHICK R，et al. Faster R-CNN：Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence，2017，39(6)：1137-1149.
[4] REDMON J，DIVVALA S，GIRSHICK R，et al. You only look once：Unified，real-time object detection[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition，2016：779-788.
[5] LIU W，ANGUELOV D，ERHAN D，et al. SSD：Single shot multibox detector[C]// European Conference on Computer Vision，2016：21-37.
[6] HE K，GKIOXARI G，DOLLÁR P，et al. Mask R-CNN[C]// Proceedings of the IEEE International Conference on Computer Vision，2017：2961-2969.
[7] LI Y，QI H，DAI J，et al. Fully convolutional instance- aware semantic segmentation[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition，2017：2359-2367.
[8] BOLYA D，ZHOU C，XIAO F，et al. YOLACT：Real- time instance segmentation[C]// Proceedings of the IEEE International Conference on Computer Vision，2019：9157-9166.
[9] ARNOLD E，AL-JARRAH O Y，DIANATI M，et al. A survey on 3d object detection methods for autonomous driving applications[J]. IEEE Transactions on Intelligent Transportation Systems，2019，20(10)：3782-3795.
[10] FENG D，HAASE-SCHÜTZ C，ROSENBAUM L，et al. Deep multi-modal object detection and semantic segmentation for autonomous driving：Datasets，methods，and challenges[J]. IEEE Transactions on Intelligent Transportation Systems，2020，22(3)：1341-1360.
[11] STONE L D，STREIT R L，CORWIN T L，et al. Bayesian multiple target tracking[M]. Boston：Artech House，2013.
[12] 彭湃，耿可可，殷国栋，等. 基于传感器融合里程计的相机与激光雷达自动重标定方法[J]. 机械工程学报，2021，57(20)：206-214. PENG Pai，GENG Keke，YIN Guodong，et al. Automatic recalibration of camera and LiDAR using sensor fusion odometry[J]. Journal of Mechanical Engineering，2021，57(20)：206-214.
[13] FERGUSON D，DARMS M，URMSON C，et al. Detection，prediction，and avoidance of dynamic obstacles in urban environments[C]// IEEE Intelligent Vehicles Symposium，2008：1149-1154.
[14] BRADSKI G，KAEHLER A. Learning OpenCV：Computer vision with the OpenCV library[M]. Sebastopol：O’Reilly Media，2008.

[1]	关海杰, 王博洋, 龚建伟, 陈慧岩. 面向异构车辆的统一运动规划方法[J]. 机械工程学报, 2024, 60(18): 288-298.
[2]	张彧, 檀祖冰, 曹东璞, 陈龙. 基于视觉和惯性测量单元的里程计关键技术研究综述[J]. 机械工程学报, 2024, 60(10): 3-21.
[3]	林晨, 何智成, 黄怡菲, 林智桂, 付广, 黄晋. 多级参数融合网络的驾驶场景目标检测方法研究[J]. 机械工程学报, 2024, 60(10): 64-75.
[4]	褚端峰, 彭赛骞, 胡海洋, 皮大伟. 预见性驾驶风险场模型[J]. 机械工程学报, 2024, 60(10): 160-170.
[5]	陈晓明, 李柏, 范丽丽, 王涯舟, 张坦探, 张友民, 曹东璞. 基于半空间约束理论的自动泊车高性能轨迹优化方法[J]. 机械工程学报, 2024, 60(10): 273-288.
[6]	赵林峰, 丰肖, 方婷, 王宁, 陈无畏, 王慧然. 基于前车轨迹预测的智能车辆高速主动避撞方法[J]. 机械工程学报, 2024, 60(10): 289-301.
[7]	毛杨坤, 段现银, 林昕, 傅盈西, 朱锟鹏. 基于目标检测的选区激光熔融成形过程熔池与飞溅监测[J]. 机械工程学报, 2023, 59(9): 335-348.
[8]	彭湃, 耿可可, 王子威, 柳智超, 殷国栋. 智能汽车环境感知方法综述[J]. 机械工程学报, 2023, 59(20): 281-303.
[9]	贾寒冰, 刘鹏, 张雷, 王震坡. 基于规则与机器学习融合的换道决策建模方法研究[J]. 机械工程学报, 2022, 58(4): 212-221.
[10]	梁军, 韩冬冬, 盘朝奉, 陈龙, 陈逢强, 杜万兵. 基于移动机器人的智能车库关键技术综述[J]. 机械工程学报, 2022, 58(3): 1-20.
[11]	刘永刚, 于丰宁, 章新杰, 陈峥, 秦大同. 基于激光点云与图像融合的3D目标检测研究[J]. 机械工程学报, 2022, 58(24): 289-299.
[12]	赵子婧, 刘宏哲, 曹东璞. 基于Libra R-CNN改进的交通标志检测算法[J]. 机械工程学报, 2021, 57(22): 255-265.
[13]	李乃鹏, 蔡潇, 雷亚国, 徐鹏程, 王文廷, 王彪. 一种融合多传感器数据的数模联动机械剩余寿命预测方法[J]. 机械工程学报, 2021, 57(20): 29-37,46.
[14]	彭湃, 耿可可, 殷国栋, 庄伟超, 刘帅鹏, 徐利伟. 基于传感器融合里程计的相机与激光雷达自动重标定方法[J]. 机械工程学报, 2021, 57(20): 206-214.
[15]	王博洋, 龚建伟, 张瑞增, 陈慧岩. 基于真实驾驶数据的运动基元提取与再生成[J]. 机械工程学报, 2020, 56(16): 155-165.

基于图像和点云实例匹配的智能车目标检测和跟踪

Object Detection and Tracking Based on Image and Point Clouds Instance Matching for Intelligent Vehicles

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价