Driving Behavior Recognition Based on Human Feature Point Extraction and Multidimensional Time Series Classification

doi:10.3901/JME.2025.15.233

Abstract

Abstract: Driver behavior recognition is crucial for both improving driving safety and developing intelligent transportation. Due to the large differences in different driving environments and driver characteristics, it is difficult for driving behavior recognition models based on end-to-end deep learning to maintain high generalization performance under different datasets. To address the above problems, a driver behavior recognition method based on human feature point extraction and multi-dimensional time series classification is proposed. YOLOv8 and distillation for whole-body pose estimators（DWPose） are used to track the driver region and extract the driver human feature point matrix. The feature point matrix is normalized, smoothed and dimensionally transformed. Multidimensional time series classification models based on Transformer, Informer, temporal convolutional neural network（TCN） and attention mechanism-long and short-term memory networks（LSTM-Attention） are established, respectively. The results show that the Informer model has the highest recognition accuracy and the TCN model has the highest operational efficiency. When trained with Driver-100-Day, Informer’s test accuracies on the Driver-100-Day, Driver-100-Night, and State Farm Driver 2 datasets are 90.82%, 88.77%, and 80.67%, respectively, which is higher than that of CNN-Transformer by 24.56%, 72.02% and 67.57%. The proposed method shows a major improvement in generalization compared to the model based on single frame image input and is able to arrive at higher recognition efficiency and accuracy.

Key words: intelligent driving, driving behavior recognition, human feature points, time series classification, generalization performance

CLC Number:

U471

LI Zhao, ZHAO Xia, ZHAO Jikang, FU Rui, WANG Chang. Driving Behavior Recognition Based on Human Feature Point Extraction and Multidimensional Time Series Classification[J]. Journal of Mechanical Engineering, 2025, 61(15): 233-246.

References

[1] KOAY H V,CHUAH J H,CHOW C O,et al. Detecting and recognizing driver distraction through various data modality using machine learning:A review, recent advances, simplified framework and open challenges (2014-2021)[J]. Engineering Applications of Artificial Intelligence,2022,115:105309.
[2] QU Y,HU H,LIU J,et al. Driver state monitoring technology for conditionally automated vehicles:Review and future prospects[J]. IEEE Transactions on Instrumentation and Measurement,2023,72:1-20.
[3] 张波,王文军,魏民国,等.基于机器视觉的驾驶人使用手持电话行为检测[J].吉林大学学报,2015,45(5):1688-1695.ZHANG Bo,WANG Wenjun, WEI Minguo, et al.Detection handheld phone use by driver based on machine vision[J]. Journal of Jilin University, 2015, 45(5):1688-1695.
[4] 程文冬,付锐,马勇,等.驾驶人在手机通话行为中的认知分心图像识别研究[J].中国公路学报,2021,34(5):168-181.CHENG Wendong,FU Rui,MA Yong,et al. Research on driver's cognitive distraction in mobile phone call behavior based on image recognition[J]. China Journal of Highway and Transport,2021,34(5):168-181.
[5] LI W,HUANG J,XIE G,et al. A survey on vision-based driver distraction analysis[J]. Journal of Systems Architecture,2021,121:102319.
[6] WANG J,CHAI W,VENKATACHALAPATHY A,et al.A survey on driver behavior analysis from in-vehicle cameras[J]. IEEE Transactions on Intelligent Transportation Systems,2022,23(8):10186-10209.
[7] XING Y, LÜC, WANG H, et al. Driver activity recognition for intelligent vehicles:A deep learning approach[J]. IEEE Transactions on Vehicular Technology,2019,68(6):5379-5390.
[8] 贺宜,鲁曼可,高嵩,等.基于Mobile ViT-CA模型的营运车辆驾驶人分心行为检测[J].中国公路学报,2024,37(1):194-204.HE Yi,LU Manke,GAO Song,et al,Distracted behavior detection of commercial vehicle drivers based on the mobilevit-ca model[J]. China Journal of Highway and Transport,2024,37(1):194-204.
[9] 柳长源,虎浩媛,毕晓君.双线性融合网络的驾驶员分心行为识别[J].北京邮电大学学报,2022,45(2):79-84.LIU Changyuan, HU Haoyuan, BI Xiaojun. Driver distraction recognition using bilinear fusion networks[J].Journal of Beijing University of Posts and Telecommunications,2022,45(2):79-84.
[10] LI B,CHEN J,HUANG Z,et al. A new unsupervised deep learning algorithm for fine-grained detection of driver distraction[J]. IEEE Transactions on Intelligent Transportation Systems,2022,23(10):19272-19284.
[11] 张斌,付俊怡,夏金祥.基于类间距优化的分心驾驶行为识别模型训练方法[J].汽车工程,2022,44(2):225-232.ZHANG Bin,FU Junyi,XIA Jinxiang. A metric space optimized method for driver distraction recognition model training[J]. Automotive Engineering, 2022, 44(2):225-232.
[12] LU M,HU Y,LU X. A pose-aware dynamic weighting model using feature integration for driver action recognition[J]. Engineering Applications of Artificial Intelligence,2022,113:104918.
[13] LIU D, YAMASAKI T, WANG Y, et al. Toward extremely lightweight distracted driver recognition with distillation-based neural architecture search and knowledge transfer[J]. IEEE Transactions on Intelligent Transportation Systems,2023,24(1):764-777.
[14] 曹立波,杨洒,艾昌硕,等.基于深度学习的分心驾驶行为检测方法[J].汽车技术,2023(6):49-54.CAO Libo,YANG Sa,AI Changshuo,et al. Distracted driving behavior detection based on deep learning[J].Automobile Technology,2023(6):49-54.
[15] JEGHAM I,ALOUANI I,BEN KHALIFA A,et al. Deep learning-based hard spatial attention for driver in-vehicle action monitoring[J]. Expert Systems with Applications,2023,219:119629.
[16] 尹智帅,钟恕,聂琳真,等.基于人体姿态估计的分心驾驶行为检测[J].中国公路学报,2022,35(6):312-323.YIN Zhishuai, ZHONG Shu, NIE Linzhen, et al.Distracted driving behavior detection based on human pose estimation[J]. China Journal of Highway and Transport,2022,35(6):312-323.
[17] RAMIS S,BUADES J M,PERALES F J,et al. A novel approach to cross dataset studies in facial expression recognition[J]. Multimedia Tools and Applications,2022,81(27):39507-39544.
[18] LIU Z,LI Y,YAO L,et al. Side-aware meta-learning for cross-dataset listener diagnosis with subjective tinnitus[J].IEEE Transactions on Neural Systems and Rehabilitation Engineering,2022,30:2352-2361.
[19] PRAJOD P,ANDRÉE J A E P. On the generalizability of ECG-based stress detection models[C]//202221st IEEE International Conference on Machine Learning and Applications (ICMLA). New York:IEEE,2022:549-554.
[20] WANG J,LI W,LI F,et al. 100-Driver:A large-scale,diverse dataset for distracted driver classification[J]. IEEE Transactions on Intelligent Transportation Systems,2023,24(7):7061-7072.
[21] ALOTAIBI M, ALOTAIBI B. Distracted driver classification using deep learning[J]. Signal,Image and Video Processing,2020,14(3):617-624.
[22] STACCHIO L,ANGELI A,LISANTI G,et al. Analyzing cultural relationships visual cues through deep learning models in a cross-dataset setting[J]. Neural Computing and Applications,2024:36(20):11727-11742.
[23] GARRUCHO L,KUSHIBAR K,JOUIDE S,et al.Domain generalization in deep learning based mass detection in mammography:A large-scale multi-center study[J]. Artificial Intelligence in Medicine,2022,132:102386.
[24] MOU L,CHANG J,ZHOU C,et al. Multimodal driver distraction detection using dual-channel network of CNN and transformer[J]. Expert Systems with Applications,2023,234:121066.
[25] BAHETI B, TALBAR S, GAJRE S. Towards computationally efficient and realtime distracted driver detection with mobilevgg network[J]. IEEE Transactions on Intelligent Vehicles,2020,5(4):565-574.