1. Introduction
Passenger vehicles (PVs), as motor vehicles designed to transport people and their belongings, have significantly enhanced the convenience of human mobility [
1,
2,
3]. PVs are primarily categorized into two major groups: passenger cars and light trucks [
4]. Among various vehicle types, PVs not only constitute the majority of the vehicle fleet but also account for the highest proportion of traffic accidents [
5]. According to the 2024 analysis conducted by the National Highway Traffic Safety Administration on the 2022 Crash Report Sampling System data [
6], more than 64% of police-reported crashes involved two-vehicle collisions, with PV-PV crashes accounting for the highest proportion at 73.4%. Drivers are the most severely affected individuals in traffic accidents, representing 67.7% of all road traffic casualties in 2021. Therefore, understanding the factors influencing collision occurrences and injury severity is crucial for enhancing traffic safety.
PV-PV collisions constitute a complex dynamic process, as illustrated in
Figure 1, encompassing dual-vehicle and driver characteristics, environmental conditions, causal factors, and pre-collision and collision dynamics [
7]. accident severity prediction and feature analysis involve four key steps [
8]: data acquisition, preprocessing, feature selection, and model selection. First, obtaining a reliable and comprehensive accident dataset is foundational [
9]. Second, preprocessing steps like outlier removal, missing value imputation, encoding, and balancing ensure data suitability for analysis [
10]. Third, feature selection addresses redundancy and enhances prediction performance and interpretability [
11]. Finally, selecting an appropriate model tailored to the dataset and research objectives is critical for effective analysis.
Recent studies have focused on identifying and quantifying factors influencing accident injury severity, primarily through statistical methods. For instance, Dongdong Song et al. (2023) utilized a random parameters bivariate probability model to analyze UK truck-car collision data (2017–2019), identifying key factors affecting driver injury severity and proposing safety improvements [
12]. Similarly, Hongren Gong et al. (2022) applied a Bayesian multivariate random parameters logit model to U.S. GES and CRSS two-vehicle collision data (2010–2018), uncovering significant associations between factors and injury risk, along with heterogeneity and temporal instability in effects [
13]. In another study, Joon-Ki Kim et al. (2013) analyzed California single-vehicle crash data (2003–2004) using a mixed logit model, highlighting the influence of age and gender on injury severity and the presence of heterogeneity [
14]. Donald Mathew Cerwick et al. (2014) compared mixed logit and latent class models using multi-vehicle heavy truck crash data from Iowa (2007–2012), finding that while the latent class model fit slightly better, the mixed logit model yielded probability predictions closer to observed values, with similar marginal effects for most variables [
15].
Statistical methods have been widely used due to their strong interpretability, aiding in understanding the mechanisms behind accident heterogeneity and severity correlations. However, their dependence on linear assumptions and independence constraints severely limits their ability to model the complex, nonlinear interactions among accident-related factors. [
16]. Traffic accidents, as complex systems, involve multiple collinear and higher-order interactions among influencing factors [
17,
18], such as vehicle dynamics and environmental conditions. The results obtained from traditional methods heavily depend on the validity of their assumptions. If these assumptions deviate from reality—such as neglecting nonlinear interactions—the reliability and generalizability of the analysis are significantly compromised [
19,
20,
21]. In contrast, machine learning (ML) methods excel in nonlinear modeling and can effectively capture intricate relationships among accident features without relying on strict statistical assumptions. They can extract latent patterns from high-dimensional data, substantially improving the accuracy of accident severity prediction [
22,
23].
In recent years, ML techniques have emerged as a promising alternative to traditional data analysis methods for injury severity analysis, offering the ability to extract valuable insights from large-scale, complex, and heterogeneous datasets. Ensemble machine learning (EML) and deep learning (DL) models are widely used in traffic accident severity prediction, but their applications differ. DL models are mainly used for macroscopic accident analysis, utilizing spatial-temporal data from images, videos, or sequences to predict accident occurrence and severity, aiding traffic management in intervention and emergency response. Yu et al. (2021) developed a Deep Spatio-Temporal Graph Convolutional Network using heterogeneous traffic data in Beijing to improve accident risk prediction, the model enhances prediction accuracy, enabling early warnings of potential hazards and supporting safer route selection [
24].Fares Alhaek et al. (2024) proposed a CNN-BiLSTM-based DL method for traffic accident severity prediction. Experiments on UK city data showed it outperformed baselines [
25]. However, due to the structured nature of accident severity datasets, Tabular is the most basic form of traffic accident datasets. EML models has demonstrated its superiority in handling datasets with tabular data format. EML’s strength lies in its ability to combine multiple base models, enabling it to capture complex patterns and relationships within the data more effectively than single models. In the field of accident severity prediction, this advantage is particularly evident. For example, various ensemble machine learning models have outperformed other machine learning models. Models like Random Forest [
26], XGBoost [
27,
28], LightGBM [
29], CatBoost [
30] and AdaBoost [
31] are more suitable for microscopic accident severity prediction. These models offer better interpretability, which means it is easier to understand how different factors contribute to accident severity.
EML algorithms improve predictive performance by integrating multiple decision trees. Although a single decision tree offers inherent interpretability, the depth and number of nodes in ensemble models significantly increase in high-dimensional accident feature spaces, leading to excessive model complexity and a notable reduction in interpretability [
32]. Despite their superior predictive performance, the "black-box" nature of ensemble models hinders the interpretation of accident causation mechanisms. Explainable ML techniques, SHAP [
33], provide a viable solution to this issue. SHAP, based on Shapley values from game theory, quantifies feature contributions and reveals interaction effects. Its additive feature attribution property aligns well with tree-based ensemble models, supporting both global and local interpretability [
34]. Widely integrated with ensemble machine learning, SHAP has become a preferred tool for analyzing the relationship between accident severity and contributing factors, offering a new paradigm for passenger vehicle collision analysis. It not only ensures the accuracy of the prediction model but also enables in - depth interpretation of the model and feature analysis.
Current research on accident severity prediction faces several challenges. First, natural accident datasets suffer from severe class imbalance [
6,
12,
13], where fatal accident cases are significantly underrepresented compared to non-injury cases. This imbalance poses challenges for traditional resampling methods in effectively balancing severity levels [
35]. Moreover, accident datasets often contain a mix of categorical and numerical features, which limits the applicability of Euclidean distance-based oversampling techniques such as Synthetic Minority Over-sampling Technique (SMOTE) [
36]. Thus, addressing data imbalance at the data collection stage is crucial. Second, existing studies primarily rely on empirical heuristics for feature selection [
12], lacking rigorous scientific justification. Consequently, the representativeness and rationality of selected features remain inadequately explained.
To address the issue of class imbalance in accident severity datasets, this study first integrates the CRSS and FARS datasets from the NHTSA. This integration reduces the severity imbalance ratio from 175.1 in CRSS and 10.2 in FARS to 5.1 in the combined dataset. Furthermore, a combination of oversampling and undersampling techniques is employed to achieve a balanced distribution of severity categories. For feature selection, traditional methods often rely on Pearson correlation coefficients to eliminate highly correlated and irrelevant variables. However, this approach is not suitable for accident datasets dominated by categorical features, as categorical values represent discrete labels rather than numerical relationships. o enhance the rationality of feature extraction, this study adopts the mRMR algorithm based on mutual information [
37]. This algorithm has been widely applied in Disease Diagnosis [
38,
39]. Compared with other feature extraction methods, it demonstrates the best performance while requiring the fewest number of features [
40]. Subsequently, this study applies ensemble machine learning algorithms—including CatBoost, AdaBoost, XGBoost, Random Tree, LightGBM, and decision trees—to accident severity prediction. By comparing the predictive performance of different models, the best-performing model is selected for SHAP-based interpretability analysis, facilitating insights into feature importance and improving the explainability of accident severity predictions.
The contributions of this paper can be summarized as follows:
(1) Integrated two distinct accident datasets to address imbalance issues and applied the mRMR algorithm for key feature extraction, reducing redundancy and improving model accuracy across all severity levels.
(2) Proposed a novel micro-level analytical approach for PV-PV accidents, incorporating dual-vehicle factors to simultaneously enhance prediction accuracy and model interpretability, advancing beyond traditional macro-level methods.
(3) Performed a systematic evaluation of state-of-the-art EML models for PV-PV accident severity prediction and integrated the optimal ensemble model with SHAP interpretation, enabling comprehensive feature analysis and addressing a critical research gap in traffic safety analytics.
The remainder of this paper is organized as follows.
Section 2 describes the methodologies, encompassing data preprocessing techniques, feature extraction methods, predictive modeling approaches, and explainable AI algorithms.
Section 3 presents the dataset’s statistical characteristics and feature analysis.
Section 4 details the experimental results, including model performance evaluation and critical factor analysis.
Section 5 discusses the research implications and study limitations. Finally,
Section 6 concludes the paper and proposes future research directions.
3. Data Description
This study uses crash data from the CRSS and FARS databases maintained by the NHTSA from 2016 to 2022. The CRSS provides a nationally representative sample of police-reported crashes, including those resulting in property damage, injuries, or fatalities, while the FARS records fatal crashes where at least one fatality occurs within 30 days of the accident. The two databases collectively include 594,773 crash cases, covering single-vehicle, two-vehicle, and multi-vehicle collisions, with PV-PV collisions being the most frequent, accounting for 225,461 cases. A detailed breakdown of the raw data is presented in
Table 1. More information on these databases can be found at
NHTSA CRSS and
NHTSA FARS.
3.1. Descriptive Statistics of Accident Data
To ensure data completeness and reliability, accident cases missing critical feature information were removed. Since this study focuses on predicting the injury severity of the subject vehicle’s driver, certain attributes of the other party in the collision were considered less relevant. For instance, the seat belt usage of the other driver is unlikely to significantly impact the subject driver’s injury severity. Consequently, some cases were retained despite incomplete information on the other driver, resulting in instances where only one driver in a collision was analyzed.
After preprocessing, the final dataset comprises 116,998 accident cases and 182,296 drivers, ensuring that each retained case contains sufficient information for injury severity prediction. A detailed summary of the dataset after preprocessing is presented in
Table 2,
Table 3 and
Table 4.
After applying the mRMR algorithm, the feature set was reduced from 65 to 33, achieving a balance between relevance and redundancy. At this selection point, feature redundancy is minimized, and further increasing the number of features does not yield noticeable improvements in predictive accuracy. The retained features encompass key accident-related factors, ensuring both robust model performance and meaningful interpretability in accident analysis. The correlation among the selected features, assessed using Cramér’s V [
50], is visualized in 6. This refinement enhances model performance by ensuring the inclusion of only the most relevant and non-redundant features, which are categorized into three groups:
1)
Environmental Features (7 features): These features are related to weather conditions and accident location.
Table 2 presents all 7 environmental features.
2)
Collision and Pre-Collision Features (8 features): These features describe the driver’s evasive actions before impact and the collision dynamics.
Table 3 lists six of these features, while the initial impact areas of both vehicles (IMP_ARE1 and IMP_ARE2) are shown in
Figure 4. For better visualization, the definition of the initial impact area of a vehicle is illustrated in
Figure 3(a).
3)
Participant Features (18 features): These features include driver and vehicle-related characteristics for both parties involved in the collision.
Table 4 presents 14 of these features, the age distributions of the two drivers (DRV_AGE1 and DRV_AGE2) are illustrated in
Figure 5, and the vehicle brand statistics of the involved vehicles are shown in
Figure 3 (b).
This selection process ensures that the retained features contribute meaningfully to accident severity prediction while minimizing redundancy and noise in the dataset.
3.2. Feature Statistical Description
Building on the statistical analysis of environmental factors, accidents most frequently occur during daytime, under clear weather, on straight and level roads, and on two-way undivided roadways outside intersections—scenarios where vehicles spend most of their travel time. While crash frequency is high due to greater exposure, severe injuries are less common, as good visibility, well-maintained pavement, and predictable traffic flow enhance safety. However, two-way undivided roads stand out as an exception, exhibiting a relatively higher fatality risk despite their prevalence, likely due to the increased likelihood of high-speed head-on collisions.
Shifting focus to collision dynamics, pre-collision features play a crucial role in accident severity. Braking upon detecting danger is associated with a lower risk of fatal injuries (8%), whereas vehicle instability before impact—such as skidding laterally—correlates with a substantially higher severity risk (70.9%), likely due to factors such as slippery roads, excessive speed, and driver errors. Accident type (ACC_TYPE) also influences injury outcomes: rear-end collisions, though the most frequent, are linked to a relatively lower fatal injury risk (4.4%), while head-on collisions present the highest risk due to greater collision energy. Impact location further affects injury severity, with side impacts at the 9, 10, and 3 o’clock positions exhibiting significantly higher fatal injury risks (43.5%, 38.1%, and 35.3%, respectively), likely due to limited energy absorption and intrusion into the driver’s space.
It is important to note that since this study integrates two datasets with different class distributions—one covering all accident types and the other focusing on high-severity cases—the reported percentages indicate relative risk levels rather than absolute probabilities. Being struck by the front end (12 o’clock position) of the opposing vehicle poses the highest risk (28%). Similarly, higher pre-crash speeds (TRV_SPD1, TRV_SPD2) are strongly correlated with increased injury severity, while rollovers and loss-of-control crashes—often involving multiple impacts with objects such as the ground, trees, or guardrails—further elevate the risk of severe injuries.
An analysis of driver and vehicle characteristics identifies additional factors influencing injury severity. Male drivers exhibit a higher risk of fatal injury than female drivers, with Cramér’s V [
50] analysis revealing significant associations between gender and vehicle attributes, including VEH_AGE1, VEH_BDY1, VEH_MKE1, and TRV_SPD. Furthermore, impairment due to alcohol consumption or drug influence is a critical factor associated with increased injury severity.
Finally, protective measures and vehicle attributes play a key role in reducing injury severity. Seat belt use and airbag deployment significantly lower injury severity, with non-deployed airbags typically seen in low-energy collisions that result in minor injuries. Heavier vehicles generally offer better protection due to lower collision accelerations, with vehicle weight also linked to structural attributes like height, width, and length that affect crash safety. Additionally, collisions involving minivans or cargo vans tend to result in less severe injuries, possibly due to factors such as professional drivers and lower urban speeds.
4. Experimental Results
To predict accident severity, five EML models and a Decision Tree model were employed, treating the prediction as a multi-class classification task. Model performance was evaluated using multiple metrics, as shown in
Figure 6. Since accuracy alone does not provide a complete assessment, particularly in imbalanced classification problems, precision, recall, and F1-score were also used to ensure a more comprehensive evaluation.
Despite merging the CRSS and FARS databases, class imbalance persisted, with severity levels distributed as follows: Level 0 (45.3%), Level 1 (26.9%), Level 2 (8.8%), and Level 3 (19%). To mitigate this, random oversampling and undersampling techniques were applied, using the average class size as a benchmark. Classes exceeding the benchmark were downsampled, while those below it were upsampled. For synthetic data generation, categorical features remained unchanged, while numerical features were modified by adding random noise within ±10% of the original values to enhance data variability while preserving statistical integrity.
Bayesian optimization was employed to determine the optimal hyperparameters for each model, offering advantages such as reduced computational cost and efficient exploration of the hyperparameter space. Model performance was evaluated both before and after data balancing, with hyperparameters tuned separately for each scenario. The dataset was split into 60% training, 20% validation, and 20% testing sets.
Following data balancing, all models demonstrated significant performance improvements. As shown in
Table 5, prior to balancing, XGBoost achieved the highest accuracy (79.52%) and precision (71.8%), while CatBoost excelled in recall (68.6%) and F1-score (68.69%). After balancing, XGBoost outperformed all models across all four metrics (Accuracy = 84.9%, Precision = 84.85%, Recall = 84.9%, F1-score = 84.87%), followed by Random Forest (Accuracy = 83.74%, Precision = 83.63%, Recall = 83.77%, F1-score = 83.66%). However, Decision Tree exhibited a slight accuracy decrease of 0.73% after balancing, despite notable improvements in precision, recall, and F1-score. This suggests that the model initially favored predicting the majority class in the imbalanced dataset. The redistribution of class instances reduced this bias, enhancing its ability to classify minority classes correctly, while causing a slight accuracy drop.
4.1. Accident Feature Analysis
Based on the model performance evaluation, XGBoost demonstrated superior performance across all metrics and was therefore selected as the target model for interpretability analysis in this study. The SHAP method, based on Shapley values (as detailed in
Section 2), was employed to systematically interpret its decision-making mechanisms.
Global SHAP analysis quantitatively assessed the contributions of individual features to predictions across the four accident severity classes.
Figure 7(a-d) illustrates the SHAP value distributions for different severity categories. The horizontal axis represents the range of SHAP values, indicating feature influence intensity (positive values support the prediction of the corresponding severity class, while negative values indicate opposing effects). The vertical axis lists feature variables, with scatter point colors representing feature magnitudes (for numerical variables) or encoded categorical labels (for nominal variables)
Analytical results indicate that TRV_SPD1 and TRV_SPD2 make the greatest contribution to Level 0 predictions, suggesting that the travel speeds of both collision parties play a crucial role in determining accident severity by influencing kinetic energy transfer. Lower velocities (cool-toned colors) increase the likelihood of Level 0 predictions, whereas higher velocities (warm-toned colors) lower the probability of being classified as Level 0.
Further investigation reveals that most features exhibit nonlinear relationships between their values and SHAP contributions. Most features exhibit a bimodal distribution, where lower-value ranges correspond to predominantly negative SHAP values, transitioning to positive values at higher ranges. Additionally, certain features display polarity reversal across different severity classes. This phenomenon suggests that the model captures context-dependent variations in feature contributions. Notably, vehicle speed parameters are strongly correlated with accident severity, supporting the validity of the model’s decision-making process and its consistency with fundamental collision dynamics principles.
4.1.1. Feature Importance Analysis
Feature importance analysis is essential for understanding model decision-making and guiding accident analysis. While SHAP value distributions show how individual feature values impact predictions, they do not quantify the overall significance of features. To address this, the mean absolute SHAP value (Mean(|SHAP|)) is used to measure the average contribution of each feature across all predictions. The feature importance ranking results are shown in
Figure 8, with the feature importance for another eight types of accidents presented in
Figure A1. TRV_SPD1, TRV_SPD2, ACC_TYPE, and IMP_ARE2 are identified as the most influential features in the model. However, globally important features do not always align with those most critical for specific severity levels. For instance, the primary predictors for Level 3 severity are TRV_SPD2, DRV_AGE1, and SAF_ARBG.
Moreover, to assess the impact of feature importance on model performance, features were progressively removed in ascending order of importance, followed by model retraining and evaluation. The relationship between feature count and model accuracy, illustrated in
Figure 9, demonstrates that retaining high-importance features results in significant performance improvements, while low-importance features contribute less substantially.
Although global rankings identify key predictors, they may overlook features with strong localized effects. For example, DRV_ALC1 ranks 22nd globally but exhibits widely dispersed SHAP values, indicating that despite its low frequency, it has a substantial impact on accident severity. This underscores the necessity of considering both overall and scenario-specific importance for a more comprehensive assessment of accident severity determinants.
4.1.2. Feature Dependency Analysis
In this study, feature dependency analysis is performed using the SHAP dependence plot, which captures the marginal effect of individual features on the target variable while illustrating the distribution and variability of SHAP values. Each point on the plot represents the predicted target value for a specific observation in the dataset. The analysis focuses on Level3 severity, representing fatal accidents—the most critical crash severity category. Thus, feature dependency analysis primarily examines feature behavior at this level to provide deeper insights into their contributions to fatal crash outcomes. Specifically, four key features—TRAV_SPD1, IMP_ARE1, VEH_MKE1, and VEH_BDY1—are analyzed. Additionally, the system autonomously identifies the feature with the strongest interaction within the dataset, as shown in
Figure 10.
Figure 10(a) presents the interaction between TRV_SPD1 and DRV_SEX1. As TRV_SPD1 increases, the SHAP value shifts from negative to positive along the y-axis, with a transition threshold between 40 mph and 55 mph, indicating a critical speed range for predicting fatal accidents. The scatter point colors represent driver gender, revealing nearly identical SHAP distributions for male and female drivers. Although DRV_SEX1 exhibits a globally bimodal impact, its narrow SHAP distribution suggests a negligible overall influence compared to speed, whose broader distribution signifies a more substantial effect.
Figure 10(b) illustrates the interaction between IMP_ARE1 and IMP_ARE2, underscoring the strong association between impact location and fatal crash outcomes. Collisions at the 9 o’clock and 10 o’clock positions contribute most significantly to fatal injury predictions, aligning with real-world crash dynamics. These zones, located near the driver, absorb less impact energy, leading to severe structural deformation and reduced survival space. The effect is exacerbated when these areas are struck by another vehicle’s 11 o’clock or 12 o’clock positions, where impact forces are typically highest.
Figure 10(c) examines the influence of VEH_MKE1 and VEH_WGT1, demonstrating brand-specific variations in vehicle safety performance. The x-axis represents encoded vehicle brand labels (refer to
Figure 4), allowing for a direct comparison of brand influence on accident severity. Scatter point colors denote vehicle curb weight, highlighting how certain manufacturers specialize in distinct vehicle categories. For example, Hummer (15) focuses on off-road vehicles, while RAM (10) primarily produces light trucks—both featuring high curb weights and reinforced body structures. Conversely, Volkswagen (13) and Tesla primarily manufacture passenger cars with lower curb weights but high safety ratings, aligning with their real-world safety reputations.
Figure 10(d) explores the relationship between VEH_BDY1 and VEH_WGT1, emphasizing the significant variation in crash survivability across vehicle body types. Light trucks exhibit lower susceptibility to fatal crashes compared to passenger vehicles (PVs), which are more prone to severe injuries. Scatter point colors indicate that light trucks generally have higher curb weights than PVs. Among body types, pickup trucks demonstrate the highest safety performance, whereas convertibles rank lowest, underscoring the critical role of vehicle structure and weight in crash outcomes.
4.1.3. Bivariate SHAP Contribution Analysis
Given that SHAP values exhibit additivity, the sum of individual feature SHAP values equals their cumulative contribution to the model’s prediction. This property allows for the evaluation of the combined impact of two or more features on the target variable by aggregating their SHAP values. In this study, Bivariate SHAP Contribution Analysis investigates the joint influence of feature pairs on model predictions by analyzing the sum of their SHAP values. This approach uncovers feature interactions and their collective impact on accident severity classification. By visualizing the aggregated SHAP contributions through a scatter heatmap, patterns emerge that show how specific feature combinations lead to higher or lower predicted severity levels, providing deeper insights into interdependent factors driving model decisions.
Figure 11 (a-d) displays the joint contribution of TRV_SPD1 & TRV_SPD2 to injury severity levels ("Level 0" to "Level 3"). The color of each scatter point reflects the magnitude of the SHAP value, illustrating how increasing vehicle speeds lead to higher predicted severity. Similarly,
Figure 11 (e-h) depicts the combined effect of DRV_AGE1 & VEH_AGE1, showing that drivers over 80 years old are more likely to experience Level 3 injuries, with the age threshold decreasing as vehicle age increases, reflecting improvements in vehicle safety technologies.
Figure 11 (i-l) demonstrates the joint influence of VEH_WGT1 & VEH_WGT2, revealing that heavier vehicles typically sustain lower injury severity compared to lighter vehicles in collisions.
This analysis effectively reveals the joint influence of feature pairs on accident severity outcomes. The clustering of colors within the heatmaps highlights distinct support, rejection, and transition regions. Red regions indicate the support domain, where predictions are more likely to align with the corresponding category, while deep blue regions represent the rejection domain, where predictions are less likely to belong to the specified category. Light green and light yellow regions correspond to transition zones, where predictions exhibit intermediate tendencies and neither strongly support nor reject the predicted outcomes. The clarity of these clusters underscores the contribution of these feature interactions in enhancing model prediction accuracy.
4.1.4. Instance-Level SHAP Interpretation
Instance-Level SHAP Interpretation decomposes the SHAP values for a single prediction instance, revealing the specific contribution of each feature to the final prediction and explaining why the model made that particular decision.
Figure 12 presents the SHAP value waterfall plot for a randomly selected accident case from the dataset, where the actual injury severity is classified as "Level 3." The x-axis represents the SHAP values, while the y-axis lists the features in descending order of importance. The feature labeled ’Others’ represents the cumulative SHAP values of lower-contributing features, ensuring a more concise visualization.
The SHAP value waterfall plot provides an intuitive and transparent representation of the model’s decision-making process, illustrating how individual features contribute to the final prediction. The baseline value, , representing the expected model output across the dataset, is computed as 0.360, 0.568, 0.555, and 0.465 for injury severity levels 0 to 3, respectively. By aggregating the SHAP values of each feature relative to this baseline, the cumulative SHAP value for each severity level is obtained. The final predicted probabilities for each severity category are then derived using the Softmax function.
This instance-level interpretation enhances the explainability of the model by highlighting the key factors that influence the classification outcome. Such insights are particularly valuable for understanding the underlying risk factors in individual accident cases and can aid in data-driven decision-making for traffic safety analysis and policy development.
6. Conclusions
This study employed interpretable machine learning algorithms to predict driver injury severity and analyze accident characteristics in PV-PV collisions. The proposed prediction model achieved high accuracy, with superior precision, recall, and F1-score across all severity levels compared to existing studies. Notably, the model demonstrated significantly improved accuracy in predicting severe injury and fatality levels, which are the most critical categories in accident prediction and analysis. This improvement stems from the integration of the CRSS and FARS datasets, which mitigated class imbalance in high-severity cases, and the application of the mRMR algorithm, which reduced redundancy and refined feature selection.
By leveraging SHAP-based interpretable machine learning, this study provides a high-performance predictive model for injury analysis and accident causation studies in PV-PV collisions. Focusing on micro-level accident cases, the study identified the most influential features associated with accident occurrence and progression, ensuring a more precise and reliable prediction framework.
Global and local SHAP analyses revealed the contribution and directional impact of various accident-related features on injury severity prediction. Through multiple visualization techniques, the study examined both individual feature effects and combined feature interactions from different perspectives. These insights enhance the understanding of key accident factors, supporting more effective injury prevention strategies, risk assessment, and safety interventions. The proposed approach offers a robust framework for future research on accident severity prediction and interpretable machine learning applications in traffic safety analysis.
Figure 1.
Schematic diagram of a traffic accident scene. The accident occurred as a sedan was overtaking a cargo van on the left. A pedestrian suddenly entered the lane, and due to the obstruction from the roadside greenery, the sedan driver did not notice the pedestrian in time. After noticing the pedestrian, the driver immediately braked and swerved left, causing a rear-end collision with the cargo van.
Figure 1.
Schematic diagram of a traffic accident scene. The accident occurred as a sedan was overtaking a cargo van on the left. A pedestrian suddenly entered the lane, and due to the obstruction from the roadside greenery, the sedan driver did not notice the pedestrian in time. After noticing the pedestrian, the driver immediately braked and swerved left, causing a rear-end collision with the cargo van.
Figure 2.
Technical framework diagram of this study.
Figure 2.
Technical framework diagram of this study.
Figure 4.
Stacked chart of collision characteristics and severity. (a) Annual distribution. (b) Initial impact area of this vehicle (IMP_ARE1). (c) Driver age of this vehicle (DRV_AGE1). (d) Days of the week. (e) Initial impact area of the other vehicle (IMP_ARE2). (f) Driver age of the other vehicle (DRV_AGE2).
Figure 4.
Stacked chart of collision characteristics and severity. (a) Annual distribution. (b) Initial impact area of this vehicle (IMP_ARE1). (c) Driver age of this vehicle (DRV_AGE1). (d) Days of the week. (e) Initial impact area of the other vehicle (IMP_ARE2). (f) Driver age of the other vehicle (DRV_AGE2).
Figure 5.
Cramér’s V correlation matrix of selected accident features.
Figure 5.
Cramér’s V correlation matrix of selected accident features.
Figure 6.
Performance Metrics for Multiclass Classification Models.
Figure 6.
Performance Metrics for Multiclass Classification Models.
Figure 7.
Global SHAP value distribution of accident features.
Figure 7.
Global SHAP value distribution of accident features.
Figure 8.
Global feature importance ranking.
Figure 8.
Global feature importance ranking.
Figure 9.
Relationship between feature numbers and model performance.
Figure 9.
Relationship between feature numbers and model performance.
Figure 10.
Global SHAP value distribution of accident features.
Figure 10.
Global SHAP value distribution of accident features.
Figure 11.
Scatter plot of combined SHAP impact on accident severity. (a–d) TRV_SPD1 and TRV_SPD2, (e–h) DRV_AGE1 and VEH_AGE1, and (i–l) VEH_WGT1 and VEH_WGT2, with all plots showing combined SHAP values for injury severity levels (Level 0–Level 3).
Figure 11.
Scatter plot of combined SHAP impact on accident severity. (a–d) TRV_SPD1 and TRV_SPD2, (e–h) DRV_AGE1 and VEH_AGE1, and (i–l) VEH_WGT1 and VEH_WGT2, with all plots showing combined SHAP values for injury severity levels (Level 0–Level 3).
Figure 12.
Local SHAP waterfall plot for a single multi-class prediction case (True label: ’Level 3’).
Figure 12.
Local SHAP waterfall plot for a single multi-class prediction case (True label: ’Level 3’).
Table 1.
Accident data statistics from the CRSS and FARS databases (2016–2022).
Table 1.
Accident data statistics from the CRSS and FARS databases (2016–2022).
| Database |
Participants |
Case Number |
|
Driver numbers for different accident vehicles |
|
Total
|
PV-PV
|
Total
|
PV
|
Motorcycle
|
Truck
|
Bus
|
Others
|
| CRSS |
1 |
121288 |
- |
|
121288 |
107169 |
8585 |
3768 |
418 |
1348 |
| 2 |
204916 |
175910 |
|
409832 |
379053 |
9341 |
14218 |
1590 |
5630 |
| >2 |
26938 |
- |
|
87755 |
83664 |
883 |
2226 |
122 |
860 |
| FARS |
1 |
134982 |
- |
|
134982 |
111851 |
14360 |
5462 |
370 |
2939 |
| 2 |
86718 |
49551 |
|
173436 |
132888 |
18678 |
17512 |
580 |
3778 |
| >2 |
19931 |
- |
|
69080 |
57565 |
3685 |
6361 |
177 |
1292 |
| CRSS+FARS |
- |
594773 |
225461 |
|
996373 |
872190 |
55532 |
49547 |
3257 |
15847 |
Table 2.
Statistical description of accident environmental characteristics.
Table 2.
Statistical description of accident environmental characteristics.
| Variables |
(%) |
Injury Severity (%) |
Variables |
(%) |
Injury Severity (%) |
|
0
|
1
|
2
|
3
|
0
|
1
|
2
|
3
|
| Database |
|
Divided, Unprotected (2) |
27.7 |
50.2 |
31.2 |
6.6 |
12 |
| CRSS |
63.3 |
68.7 |
27.7 |
3.2 |
0.4 |
Divided, Protected (3) |
13.4 |
58 |
22.8 |
5.6 |
13.6 |
| FARS |
36.7 |
5 |
25.5 |
18.4 |
51.1 |
One-Way (4) |
7.4 |
72.5 |
19.9 |
3.1 |
4.6 |
| CRSS+FARS |
100 |
45.3 |
26.9 |
8.8 |
19 |
Roadway Profile (TWY_PRFL)
|
|
Location at rural or urban (RUR_URBN)
|
|
Level (1) |
85.9 |
47.5 |
26.9 |
8.1 |
17.5 |
| Rural (1) |
67.9 |
53 |
25.6 |
7.7 |
13.7 |
Grade (2) |
7.2 |
36.1 |
27.7 |
11.4 |
24.8 |
| Urban (2) |
32.1 |
29.1 |
29.5 |
11.3 |
30.2 |
Uphill (3) |
2.4 |
28.8 |
25.7 |
15 |
30.5 |
|
Light condition (LGT_COND)
|
|
Downhill (4) |
2.7 |
31.2 |
24.5 |
13.6 |
30.7 |
| Daylight (1) |
69.8 |
50.6 |
26.8 |
7.2 |
15.4 |
Hillcrest (5) |
1.5 |
23.2 |
27.3 |
15.2 |
34.3 |
| Dawn (2) |
1.6 |
34 |
25 |
12 |
29.1 |
Sag (6) |
0.2 |
26.6 |
31.2 |
11.6 |
30.7 |
| Dusk (3) |
2.6 |
47.6 |
24.8 |
9.3 |
18.3 |
Roadway Alignment (TWY_ALGN)
|
| Dark-Lighted (4) |
15 |
41.1 |
30.5 |
9.4 |
19 |
Straight (1) |
91.6 |
47.2 |
27.4 |
8.1 |
17.4 |
| Dark-Not Lighted (5) |
11 |
19.2 |
22.9 |
17.5 |
40.4 |
Curve Right (2) |
4.6 |
31.1 |
19.1 |
14.9 |
35 |
|
Atmospheric Conditions (ATM_COND)
|
|
Curve Left (3) |
3.8 |
18 |
24 |
19.5 |
38.5 |
| Clear (1) |
74.4 |
46.3 |
26.6 |
8.6 |
18.5 |
Type of Intersection (TWY_INTT)
|
| Cloudy (2) |
14.8 |
43.1 |
27.6 |
9.3 |
20.1 |
Four-Way Intersection (1) |
33.7 |
45.9 |
33.3 |
6.8 |
14 |
| Rain (3) |
8.6 |
44.5 |
27.6 |
8.6 |
19.2 |
T-Intersection (2) |
14.4 |
46.2 |
34.1 |
6.2 |
13.5 |
| Snow (4) |
1.2 |
41.3 |
26.8 |
11.1 |
20.9 |
Roundabout (3) |
0.2 |
84.4 |
13.4 |
0.6 |
1.6 |
| Fog (5) |
0.6 |
17.7 |
24.6 |
17.6 |
40.1 |
Five-Point, or more (4) |
0.2 |
55.5 |
29.7 |
5.9 |
8.9 |
| Other (6) |
0.3 |
22 |
26.4 |
17.3 |
34.4 |
Y-Intersection (5) |
0 |
47.2 |
33.3 |
2.8 |
16.7 |
|
Trafficway Description (TWY_TYPE)
|
|
Other Type (6) |
0.4 |
29.8 |
29.8 |
11.3 |
29.1 |
| Not Divided (1) |
51.5 |
35.6 |
26.6 |
11.6 |
26.2 |
Not an Intersection (7) |
51.1 |
44.7 |
20.6 |
10.9 |
23.9 |
Table 3.
Statistical description of collision and pre-collision characteristics.
Table 3.
Statistical description of collision and pre-collision characteristics.
| Variables |
(%) |
Injury Severity (%) |
Variables |
(%) |
Injury Severity (%) |
|
0
|
1
|
2
|
3
|
0
|
1
|
2
|
3
|
|
Attempted Avoidance Maneuver (DRV_MANU)
|
|
Intersect Paths ( Straight Path) (7) |
14.3 |
26.1 |
40.6 |
9.9 |
23.5 |
| No Avoidance Maneuver (1) |
23.7 |
62.6 |
17.6 |
5.7 |
14.2 |
Opposite Direction (Angle Sideswipe) (8) |
4.2 |
12.4 |
28.4 |
16.5 |
42.8 |
| Braking (2) |
3.5 |
51.6 |
32.8 |
7.5 |
8.2 |
Head-On (9) |
14.7 |
2.2 |
16.1 |
27.1 |
54.6 |
| Unknown (3) |
68 |
40.7 |
29.5 |
9.3 |
20.5 |
Other Types (10) |
0.3 |
8.2 |
23.4 |
10.9 |
57.5 |
| Accelerating (4) |
0.1 |
39.5 |
29.3 |
10.9 |
20.4 |
Travel Speed of This Vehicle (TRV_SPD1)
|
|
| Braking and Steering (5) |
1 |
24.7 |
35.9 |
18.3 |
21 |
0 mph |
16.2 |
84.6 |
11.3 |
1.4 |
2.7 |
| Releasing Brakes (6) |
0 |
22.9 |
41.7 |
2.1 |
33.3 |
0-20 mph |
19 |
82.2 |
8.8 |
2.2 |
6.8 |
| Braking and Unknown Steering Direction (7) |
0.2 |
13.4 |
42.9 |
18.8 |
25 |
21-40 mph |
27.1 |
38.1 |
41.5 |
7.2 |
13.2 |
| Steering (8) |
3.5 |
20.9 |
28.6 |
19 |
31.5 |
41-60 mph |
28.2 |
15.5 |
34.2 |
16.2 |
34.1 |
| Accelerating and Steering (9) |
0 |
26.1 |
19.5 |
16.3 |
38.2 |
>60 mph |
9.4 |
13.7 |
25.9 |
17.4 |
43.1 |
|
Pre-Impact Stability (PIM_STAB)
|
|
Travel Speed of Other Vehicle (TRV_SPD2)
|
|
| Tracking (1) |
92.6 |
48 |
27.1 |
8.2 |
16.7 |
0 mph |
10 |
73 |
21.2 |
2.4 |
3.4 |
| Skidding Longitudinally (2) |
1.4 |
26.5 |
31.1 |
14.5 |
27.9 |
0-20 mph |
18.9 |
79.3 |
15.9 |
2.4 |
2.3 |
| Skidding Laterally (3) |
1.1 |
5.6 |
8.5 |
14.9 |
70.9 |
21-40 mph |
30 |
46.8 |
35.8 |
6.4 |
11 |
| Other (4) |
4.9 |
10.2 |
24.9 |
17.2 |
47.7 |
41-60 mph |
30.9 |
23.1 |
28.4 |
14.7 |
33.8 |
|
Crash Type (ACC_TYPE)
|
|
>60 mph |
10.1 |
18.2 |
21.4 |
16.2 |
44.2 |
| Rear End (1) |
32.1 |
74.9 |
18.6 |
2.1 |
4.4 |
Damage Area Count (DMG_ARCT)
|
|
| Same Direction (Angle,Sideswipe) (2) |
7 |
79 |
11 |
1.8 |
8.2 |
1 |
55.9 |
52.9 |
26.1 |
6.9 |
14 |
| Miss Control (3) |
1.2 |
90.1 |
6.5 |
1.1 |
2.4 |
2-5 |
35.2 |
44.1 |
30.9 |
8.4 |
16.6 |
| Turn Into Path (4) |
12.2 |
42.1 |
37.2 |
5.9 |
14.8 |
6-9 |
4.7 |
4.7 |
20.1 |
22.2 |
53 |
| Turn Across Path (5) |
13.5 |
36 |
43.8 |
7.9 |
12.2 |
10-12 |
4.2 |
1.1 |
10.4 |
21.9 |
66.6 |
| Opposite direction (Forward Impact) (6) |
0.4 |
1.7 |
34.7 |
21.9 |
41.7 |
|
|
|
|
|
|
Table 4.
Statistical description of vehicle and driver characteristics for both parties in the collision.
Table 4.
Statistical description of vehicle and driver characteristics for both parties in the collision.
| Variables |
(%) |
Injury Severity (%) |
Variables |
(%) |
Injury Severity (%) |
|
0
|
1
|
2
|
3
|
0
|
1
|
2
|
3
|
|
The gender of this driver (DRV_SEX1)
|
|
<1 |
4.7 |
54.8 |
21.5 |
6.5 |
17.2 |
| Male (1) |
55.9 |
44.7 |
23.7 |
9.5 |
22 |
1-10 |
55.9 |
49.3 |
24.7 |
7.6 |
18.4 |
| Female (2) |
44.1 |
46.2 |
30.8 |
7.9 |
15.2 |
11-20 |
34.1 |
39.6 |
29.8 |
10.5 |
20.1 |
|
Drug involvement of this driver (DRV_DUG1)
|
|
>20 |
5.3 |
32.4 |
35.3 |
12.7 |
19.7 |
| No (1) |
96.6 |
46.8 |
27.2 |
8.3 |
17.7 |
The Weight of This Vehicle (VEH_WGT1)
|
| Yes (2) |
3.4 |
3.9 |
17.1 |
22.5 |
56.6 |
1700-2700 lbs. |
8.5 |
38.2 |
21.3 |
8.2 |
32.5 |
|
Alcohol Test Result of this driver (DRV_ALC1)
|
|
2701-3700 lbs. |
51.9 |
44.8 |
25.9 |
8.4 |
21 |
| 0 mg/dL |
93.7 |
48 |
27.4 |
8.2 |
16.5 |
3701-4700 lbs. |
26.3 |
48.3 |
28.3 |
9.2 |
14.2 |
| 1-80 mg/dL |
1 |
2.7 |
14.8 |
15.9 |
66.5 |
4701-5700 lbs. |
11.2 |
47.4 |
30.6 |
10.1 |
11.9 |
| >80 mg/dL |
5.4 |
7.7 |
18.7 |
19 |
54.6 |
>5700 lbs. |
2.1 |
40.7 |
34.6 |
10.9 |
13.9 |
|
The Other Driver’s Gender (DRV_SEX2)
|
|
The Weight of Other Vehicle (VEH_WGT2)
|
| Male (1) |
58 |
41 |
26.6 |
9.8 |
22.7 |
1700-2700 lbs. |
8 |
47.77 |
35.4 |
9.45 |
7.38 |
| Female (2) |
42 |
51.4 |
27.2 |
7.5 |
13.9 |
2701-3700 lbs. |
50.1 |
48.77 |
28.8 |
8.91 |
13.52 |
|
Drug Involvement of Other Driver (DRV_DUG2)
|
|
3701-4700 lbs. |
27 |
44.53 |
24.96 |
8.47 |
22.04 |
| No (1) |
96.6 |
46.7 |
26.9 |
8.3 |
18.2 |
4701-5700 lbs. |
12.4 |
35.65 |
19.91 |
8.51 |
35.93 |
| Yes (2) |
3.4 |
7.8 |
25.9 |
22.9 |
43.5 |
>5700 lbs. |
2.5 |
25.54 |
14.95 |
9.9 |
49.61 |
|
Alcohol Test Result of other driver (DRV_ALC2)
|
|
Vehicle Body Type of This Vehicle (VEH_BDY1)
|
| 0 mg/dL |
93.7 |
47.7 |
26.8 |
8 |
17.6 |
Pickup (1) |
15.7 |
45 |
27.7 |
10.4 |
16.9 |
| 1-80 mg/dL |
0.9 |
6.6 |
29 |
23 |
42 |
SUV (2) |
30 |
50.2 |
28.7 |
8.4 |
12.7 |
| >80 mg/dL |
5.4 |
11.6 |
27.2 |
21.2 |
40.1 |
Minivan (3) |
2.1 |
69.1 |
27.8 |
2.8 |
0.3 |
|
Seat Belt Type and Usage Status (SAF_REST)
|
|
Cargo Van (4) |
0.3 |
80.8 |
17.5 |
1.5 |
0.2 |
| Not Used (1) |
12.7 |
9.9 |
14.8 |
16.9 |
58.4 |
VAN (5) |
1.9 |
11 |
30.2 |
18.8 |
40 |
| Two-Point (2) |
0.9 |
41.3 |
35.2 |
8.4 |
15.1 |
Sedan (6) |
40.7 |
43.2 |
25.8 |
8.5 |
22.5 |
| Three-Point (3) |
84.7 |
50.9 |
28.4 |
7.5 |
13.2 |
Coupe (7) |
3.7 |
38.6 |
22.9 |
9.4 |
29.1 |
| Others (4) |
1.7 |
34 |
33 |
16 |
17 |
Wagon (8) |
0.7 |
41.8 |
23 |
6.6 |
28.7 |
|
Air bag deployment (SAF_ARBG)
|
|
Hatchback (9) |
4.1 |
41.1 |
24.4 |
7.9 |
26.6 |
| Not Deployed (1) |
58 |
71.2 |
17.8 |
3 |
8.1 |
Convertible (10) |
0.8 |
39.1 |
21.8 |
8.5 |
30.6 |
| Curtain (2) |
0.1 |
10.2 |
30.5 |
15.3 |
44.1 |
Vehicle Body Type of Other Vehicle (VEH_BDY2)
|
| Side (3) |
1.2 |
19 |
39 |
8.2 |
34 |
Pickup (1) |
17.3 |
34.9 |
20.8 |
9.2 |
35.1 |
| Front (4) |
1.2 |
18.8 |
38.7 |
8.2 |
34.4 |
SUV (2) |
29.8 |
48.2 |
25 |
7.8 |
19 |
| Combined (5) |
16 |
10.1 |
39 |
16.7 |
34.2 |
Minivan (3) |
2.1 |
67.1 |
28.9 |
3.6 |
0.4 |
| Other (6) |
24.4 |
8.7 |
39.8 |
17.4 |
34 |
Cargo Van (4) |
0.4 |
68.3 |
27.6 |
3.9 |
0.2 |
|
The Age of This Vehicle (VEH_AGE1)
|
|
VAN (5) |
2 |
9.2 |
23 |
18 |
49.8 |
| <1 |
5 |
53.7 |
30.3 |
6.9 |
9.1 |
Sedan (6) |
39.5 |
48.3 |
29.9 |
9.1 |
12.8 |
| 1-10 |
56.8 |
49.7 |
29.3 |
7.8 |
13.1 |
Coupe (7) |
3.6 |
42.1 |
31.4 |
10.5 |
16 |
| 11-20 |
32.7 |
39.5 |
23.6 |
10.4 |
26.6 |
Wagon (8) |
0.7 |
46.1 |
31.7 |
10 |
12.2 |
| >20 |
5.4 |
26.7 |
17.3 |
12.1 |
43.9 |
Hatchback (9) |
3.8 |
48.8 |
30.9 |
8.6 |
11.7 |
|
The Age of Other Vehicle (VEH_AGE2)
|
|
Convertible (10) |
0.8 |
41.9 |
34.3 |
10.3 |
13.6 |
Table 5.
Statistical Performance of Accident Severity Prediction Models.
Table 5.
Statistical Performance of Accident Severity Prediction Models.
| Models |
Injury |
Imbalanced Data (%) |
Balanced Data (%) |
|
Severity
|
Precision
|
Recall
|
F1-score
|
Accuracy
|
Precision
|
Recall
|
F1-score
|
Accuracy
|
| |
Level0 |
87.93 |
90.81 |
89.34 |
|
85.19 |
87.19 |
86.17 |
|
| |
Level1 |
72.17 |
74.02 |
73.08 |
|
79.12 |
76.99 |
78.04 |
|
| XGBoost |
Level2 |
51.55 |
20.95 |
29.79 |
|
86.66 |
85.88 |
86.27 |
|
| |
Level3 |
75.57 |
88.07 |
81.34 |
|
88.42 |
89.55 |
88.98 |
|
| |
Average |
71.80 |
68.46 |
68.39 |
79.52 |
84.85 |
84.90 |
84.87 |
84.90 |
| |
Level0 |
87.85 |
90.07 |
88.95 |
|
84.69 |
86.85 |
85.76 |
|
| |
Level1 |
71.01 |
74.28 |
72.61 |
|
79.74 |
74.47 |
77.01 |
|
| Random Forest |
Level2 |
54.96 |
15.33 |
23.97 |
|
86.00 |
87.88 |
86.93 |
|
|
Level3 |
72.74 |
88.33 |
79.78 |
|
88.11 |
89.75 |
88.92 |
|
| |
Average |
71.64 |
67.01 |
66.33 |
78.81 |
84.63 |
84.74 |
84.66 |
83.74 |
| |
Level0 |
88.16 |
90.20 |
89.17 |
|
84.38 |
86.43 |
85.39 |
|
| |
Level1 |
71.60 |
74.19 |
72.87 |
|
75.83 |
75.05 |
75.44 |
|
| CatBoost |
Level2 |
49.39 |
23.59 |
31.93 |
|
81.07 |
79.31 |
80.18 |
|
| |
Level3 |
75.82 |
86.42 |
80.77 |
|
84.68 |
85.38 |
85.03 |
|
| |
Average |
71.24 |
68.60 |
68.69 |
79.22 |
81.49 |
81.54 |
81.51 |
81.52 |
| |
Level0 |
87.88 |
90.28 |
89.06 |
|
83.87 |
86.03 |
84.93 |
|
| |
Level1 |
71.18 |
72.30 |
71.74 |
|
73.84 |
74.27 |
74.06 |
|
| LightGBM |
Level2 |
42.18 |
24.21 |
30.76 |
|
77.72 |
73.80 |
75.71 |
|
| |
Level3 |
75.32 |
83.86 |
79.36 |
|
81.04 |
82.67 |
81.85 |
|
| |
Average |
69.14 |
67.66 |
67.73 |
78.31 |
79.12 |
79.19 |
79.14 |
79.15 |
| |
Level0 |
88.81 |
84.61 |
86.66 |
|
85.03 |
77.86 |
81.29 |
|
| |
Level1 |
63.38 |
77.87 |
69.88 |
|
69.39 |
78.04 |
73.46 |
|
| AdaBoost |
Level2 |
51.16 |
20.92 |
29.70 |
|
80.52 |
74.09 |
77.17 |
|
| |
Level3 |
77.01 |
82.07 |
79.46 |
|
80.65 |
83.89 |
82.24 |
|
| |
Average |
70.09 |
66.37 |
66.43 |
76.62 |
78.90 |
78.47 |
78.54 |
78.45 |
| |
Level0 |
84.50 |
86.85 |
85.66 |
|
80.26 |
81.56 |
80.91 |
|
| |
Level1 |
63.95 |
63.95 |
63.95 |
|
64.50 |
65.64 |
65.07 |
|
| Decision Tree |
Level2 |
37.56 |
24.15 |
29.40 |
|
65.36 |
66.99 |
66.17 |
|
|
Level3 |
69.71 |
70.59 |
70.15 |
|
75.53 |
71.07 |
73.23 |
|
| |
Average |
63.93 |
61.39 |
62.29 |
72.00 |
71.42 |
71.32 |
71.34 |
71.27 |