Abstract
The identification of causative factors and implementation of measures to mitigate work zone crashes can significantly improve overall road safety. This study introduces a Self-Paced Ensemble (SPE) framework, which is utilized in conjunction with the Shapley additive explanations (SHAP) interpretation system, to predict and interpret the severity of work-zone-related crashes. The proposed methodology is an ensemble learning approach that aims to mitigate the issue of imbalanced classification in datasets of significant magnitude. The proposed solution provides an intuitive way to tackle issues related to imbalanced classes, demonstrating remarkable computational efficacy, praiseworthy accuracy, and extensive adaptability to various machine learning models. The study employed work zone crash data from the state of New Jersey spanning a period of two years (2017 and 2018) to train and evaluate the model. The study compared the prediction outcomes of the SPE model with various tree-based machine learning models, such as Light Gradient Boosting Machine, adaptive boosting, and classification and regression tree, along with binary logistic regression. The performance of the SPE model was superior to that of tree-based machine learning models and binary logistic regression. According to the SHAP interpretation, the variables that exhibited the highest degree of influence were crash type, road system, and road median type. According to the model, on highways with barrier-type medians, it is expected that crashes that happen in the same direction and those that happen at a right angle will be the most severe crashes. Additionally, this study found that severe injuries were more likely to result from work zone crashes that happened at night on state highways with localized street lighting.
Original language | English (US) |
---|---|
Article number | 9076 |
Journal | Sustainability (Switzerland) |
Volume | 15 |
Issue number | 11 |
DOIs | |
State | Published - Jun 2023 |
All Science Journal Classification (ASJC) codes
- Computer Science (miscellaneous)
- Geography, Planning and Development
- Renewable Energy, Sustainability and the Environment
- Environmental Science (miscellaneous)
- Energy Engineering and Power Technology
- Hardware and Architecture
- Computer Networks and Communications
- Management, Monitoring, Policy and Law
Keywords
- Shapley additive explanations
- machine learning
- self-paced ensemble
- work zones crashes