• CN:11-2187/TH
  • ISSN:0577-6686

Journal of Mechanical Engineering (机械工程学报), 2024, Vol. 60, Issue 10: 64-75. doi: 10.3901/JME.2024.10.064

• Intelligent Perception and Behavior Prediction •

多级参数融合网络的驾驶场景目标检测方法研究

林晨1, 何智成1,2, 黄怡菲3, 林智桂2, 付广2, 黄晋4   

  1. 湖南大学整车先进设计制造技术全国重点实验室 长沙 410082;
    2. 上汽通用五菱汽车股份有限公司技术中心 柳州 545007;
    3. 澳门大学机电工程系 澳门 999078;
    4. 清华大学车辆与运载学院 北京 100084
  • 收稿日期:2023-12-05 修回日期:2024-02-05 出版日期:2024-05-20 发布日期:2024-07-24
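  • About the authors: LIN Chen, male, born in 1995, PhD candidate. His main research interest is environment perception technology for intelligent vehicles.
    E-mail:linchen132@hnu.edu.cn
    HE Zhicheng (corresponding author), male, born in 1983, PhD, professor, doctoral supervisor. His main research interests are intelligent vehicles and intelligent control, and advanced structures and intelligent design.
    E-mail:hezhicheng815@163.com
  • Funding:
    Supported by the Joint Funds of the National Natural Science Foundation of China (U20A20285), the Guangxi Science and Technology Major Project (2021AA04004), and the Liuzhou Science and Technology Major Project (Y2021AA0101A033).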

Research on Detection Method for Driving Scenarios Based on Multi-stage Parameter Fusion Network

LIN Chen1, HE Zhicheng1,2, HUANG Yifei3, LIN Zhigui2, FU Guang2, HUANG Jin4   

  1. State Key Laboratory of Advanced Design and Manufacturing Technology for Vehicle, Hunan University, Changsha 410082;
    2. SGMW Automobile Co., Ltd., Liuzhou 545007;
    3. Department of Electromechanical Engineering, University of Macau, Macao 999078;
    4. School of Vehicle and Mobility, Tsinghua University, Beijing 100084
  • Received:2023-12-05 Revised:2024-02-05 Online:2024-05-20 Published:2024-07-24

摘要: 基于深度学习的目标检测方法在智能车载控制器应用时很难同时满足检测精度与速度的要求。因此,提出一种多级参数融合的驾驶场景目标检测方法,实现检测速度和精度的同步提升。首先,设计出一种多级分支结构用于构建模型,同时,为提高模型的推理速度,引入一种多级参数融合的方法,即将多级结构层等效为单一的卷积-批标准化层,在保证模型泛化能力不变的条件下,大幅度减小模型的参数量。其次,为增加模型的检测精度,提出一种SSIoU(Soft scaled intersection of union)边界框损失计算方法以及一种联合半锚框的标签关联算法,提高模型对驾驶场景的适应能力。最后,开展基于DAIR-V2X-V数据集的试验验证,结果表明,所提出的多级参数融合模型,相比于目前先进的YOLO(You only look once)算法,检测精度(Mean average precision, mAP)提高了9.89%,推理速度(Frames per second, FPS)提高了51.89%。

关键词: 智能汽车, 目标检测, 参数融合, SSIoU, YOLO算法

Abstract: Deep-learning-based object detection methods struggle to meet accuracy and speed requirements at the same time when deployed on intelligent vehicle controllers. A multi-stage parameter fusion object detection method for driving scenarios is therefore proposed to improve detection speed and accuracy simultaneously. First, a multi-stage branch structure is designed to build the model. To speed up inference, a multi-stage parameter fusion method is introduced that converts the multi-stage structure layers into a single equivalent convolution-batch normalization layer, greatly reducing the number of model parameters while leaving the model's generalization ability unchanged. Second, to improve detection accuracy, a bounding-box loss named SSIoU (Soft scaled intersection of union) and a united semi-anchor-free label assignment algorithm are proposed, enhancing the model's adaptability to driving scenarios. Finally, experiments on the DAIR-V2X-V dataset show that, compared with the current state-of-the-art YOLO (You only look once) algorithm, the proposed multi-stage parameter fusion model improves detection accuracy (mean average precision, mAP) by 9.89% and inference speed (frames per second, FPS) by 51.89%.
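
The abstract does not give the fusion rules, so the following is only a minimal sketch of the general idea behind merging parallel convolution-batch normalization branches into one layer. It assumes 1x1 convolutions (a matrix-vector product per pixel), inference-time batch normalization with fixed statistics, and branches whose outputs are summed. All function names and numeric values are hypothetical, and this is not the authors' implementation.

```python
import math

def conv1x1(weight, bias, x):
    """Apply a 1x1 convolution to one pixel: y = W x + b."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weight, bias)]

def batch_norm(y, gamma, beta, mean, var, eps=1e-5):
    """Inference-time batch normalization with fixed running statistics."""
    return [g * (yi - m) / math.sqrt(v + eps) + bt
            for yi, g, bt, m, v in zip(y, gamma, beta, mean, var)]

def fold_bn(weight, bias, gamma, beta, mean, var, eps=1e-5):
    """Fold batch normalization into the preceding convolution, per output channel."""
    fused_w, fused_b = [], []
    for w_row, b, g, bt, m, v in zip(weight, bias, gamma, beta, mean, var):
        scale = g / math.sqrt(v + eps)
        fused_w.append([w * scale for w in w_row])
        fused_b.append((b - m) * scale + bt)
    return fused_w, fused_b

def merge_branches(folded):
    """Sum the folded kernels and biases of parallel branches of equal shape."""
    n_out, n_in = len(folded[0][0]), len(folded[0][0][0])
    w = [[sum(br[0][o][i] for br in folded) for i in range(n_in)] for o in range(n_out)]
    b = [sum(br[1][o] for br in folded) for o in range(n_out)]
    return w, b

# Two hypothetical parallel branches (illustrative values only).
branches = [
    dict(weight=[[0.5, -1.0], [2.0, 0.25]], bias=[0.1, -0.2],
         gamma=[1.5, 0.8], beta=[0.0, 0.3], mean=[0.2, -0.1], var=[1.0, 4.0]),
    dict(weight=[[-0.3, 0.7], [1.1, -0.6]], bias=[0.0, 0.4],
         gamma=[0.9, 1.2], beta=[-0.1, 0.2], mean=[0.05, 0.3], var=[2.0, 0.5]),
]
x = [1.0, -2.0]

# Training-time form: run every branch, then add the outputs.
outputs = [batch_norm(conv1x1(p["weight"], p["bias"], x),
                      p["gamma"], p["beta"], p["mean"], p["var"]) for p in branches]
reference = [sum(vals) for vals in zip(*outputs)]

# Inference-time form: a single fused layer.
fused_w, fused_b = merge_branches([fold_bn(**p) for p in branches])
fused_out = conv1x1(fused_w, fused_b, x)

max_diff = max(abs(a - b) for a, b in zip(reference, fused_out))
print("max difference between multi-branch and fused outputs:", max_diff)
```

Because convolution and inference-time batch normalization are both affine, the fused layer reproduces the multi-branch output up to floating-point rounding while storing one kernel instead of several.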

Key words: intelligent vehicle, object detection, parameter fusion, SSIoU, YOLO algorithm

CLC number: