Explainable Boosting Machine: A Contemporary Glass-Box Model to Analyze Work Zone-Related Road Traffic Crashes

Author:

Alahmadi Raed1ORCID,Almujibah Hamad2ORCID,Alotaibi Saleh3ORCID,Elshekh Ali. E. A.2,Alsharif Mohammad4ORCID,Bakri Mudthir5ORCID

Affiliation:

1. Department of Civil Engineering, College of Engineering, Al-Baha University, Al-Baha P.O. Box 1988, Saudi Arabia

2. Department of Civil Engineering, College of Engineering, Taif University, Taif P.O. Box 11099, Saudi Arabia

3. Department of Civil and Environmental Engineering, Faculty of Engineering—Rabigh Branch, King Abdulaziz University, Jeddah P.O. Box 21589, Saudi Arabia

4. Department Architecture, College of Engineering, Al-Baha University, Al-Baha P.O. Box 1988, Saudi Arabia

5. Department of Civil Engineering, College of Engineering, Qassim University, Unaizah P.O. Box 56452, Saudi Arabia

Abstract

Examining the factors contributing to work zone crashes and implementing measures to reduce their occurrence can significantly improve road safety. In this research, we utilized the explainable boosting machine (EBM), a modern glass-box machine learning (ML) model, to categorize and predict work zone-related crashes and to interpret the various contributing factors. The issue of data imbalance was also addressed by utilizing work zone crash data from the state of New Jersey, comprising data collected over the course of two years (2017 and 2018) and applying data augmentation strategies such synthetic minority over-sampling technique (SMOTE), borderline-SMOTE, and SVM-SMOTE. The EBM model was trained using augmented data and Bayesian optimization for hyperparameter tuning. The performance of the EBM model was evaluated and compared to black-box ML models such as combined kernel and tree boosting (KTBoost, python 3.7.1 and KTboost package version 0.2.2), light gradient boosting machine (LightGBM version 3.2.1), and extreme gradient boosting (XGBoost version 1.7.6). The EBM model, using borderline-SMOTE-treated data, demonstrated greater efficacy with respect to precision (81.37%), recall (82.53%), geometric mean (75.39%), and Matthews correlation coefficient (0.43). The EBM model also allows for an in-depth evaluation of single and pairwise factor interactions in predicting work zone-related crash severity. It examines both global and local perspectives, and assists in assessing the influence of various factors.

Publisher

MDPI AG

Subject

Public Health, Environmental and Occupational Health,Safety Research,Safety, Risk, Reliability and Quality

Reference37 articles.

1. (2023, September 15). FHWA Work Zone Facts and Statistics, Available online: https://ops.fhwa.dot.gov/wz/resources/facts_stats.htm#ftn2.

2. Meta-analysis of the effect of road work zones on crash occurrence;Theofilatos;Accid. Anal. Prev.,2017

3. Driver responses to differing urban work zone configurations;Morgan;Accid. Anal. Prev.,2010

4. In-depth analysis of drivers’ merging behavior and rear-end crash risks in work zone merging areas;Weng;Accid. Anal. Prev.,2015

5. A Tobit model for analyzing speed limit compliance in work zones;Debnath;Saf. Sci.,2014

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3