Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

Investigating influential factors through crash frequency models considering excess zeros and heterogeneity: New insights into mountain freeway safety

  • Liang Zhang,

    Roles Data curation, Formal analysis, Funding acquisition, Validation, Visualization, Writing – original draft

    Affiliation School of Traffic and Transportation Engineering, Changsha University of Science and Technology, Changsha, China

  • Zhongxiang Huang ,

    Roles Conceptualization, Data curation, Funding acquisition, Investigation, Resources, Supervision, Writing – original draft, Writing – review & editing

    mehzx@126.com

    Affiliation School of Traffic and Transportation Engineering, Changsha University of Science and Technology, Changsha, China

  • Lei Zhu,

    Roles Data curation, Investigation, Project administration, Software, Writing – review & editing

    Affiliation School of Traffic and Transportation Engineering, Changsha University of Science and Technology, Changsha, China

  • Songtao Yang

    Roles Methodology, Project administration, Resources, Software, Supervision

    Affiliation School of Traffic and Transportation Engineering, Changsha University of Science and Technology, Changsha, China

Abstract

The use of statistical modeling methods to quantify crash causation on mountain freeways is limited by crash data availability and technical challenges posed by excess zeros and heterogeneity, resulting in a lack of significant targeting of proactive crash prevention measures on mountain freeways. We collected multidimensional crash-related information on mountainous freeways in China, including road design characteristics, traffic conditions, pavement performance, and weather conditions. To overcome the challenges of excess zeros and heterogeneity on modeling techniques, we innovatively developed two new models: the Random Parameter Negative Binomial Lindley (RPNB-L) model and the Random Parameter Negative Binomial Generalized Exponential (RPNB-GE) model. The goodness-of-fit indicates that the RPNB-L and RPNB-GE models stand out among the six competing models, suggesting that the Lindley and GE distributions are conducive to portraying the multi-zero attributes, while the regression coefficients randomization treatment provides a deeper portrayal of heterogeneous effects. Moreover, the analysis reveals a considerable number of causes for crash frequency on mountainous freeways in China. These include several interesting results, such as special segments like tunnels and interchanges, Pavement Damage Condition Index (PCI) and the stormy rainfall (TR), which have not been extensively studied in previous research. The research results provided important reference values for the selection of active safety countermeasures for mountain freeways.

Introduction

In China, road traffic crashes account for over 60,000 fatalities annually. Of particular concern are mountainous freeways characterized by complex terrain and variable weather patterns. Studies have revealed that per 100 kilometers, mountainous freeways demonstrate significantly higher safety risks compared to conventional freeways, with crash incidence rates, fatality rates, and injury rates measuring 3.0 times, 5.1 times, and 3.8 times greater, respectively [1]. In response to the critical issue of road safety, the Chinese government issued the Highway Project Safety Evaluation Specification (hereafter referred to as the Specification) in 2015. This document employs both qualitative and quantitative methods to systematically assess traffic safety conditions during various stages of freeway projects, including feasibility studies, preliminary design, construction drawing design, and project delivery and evaluation. In contrast to the comprehensive framework established in the U.S. Highway Safety Manual (HSM) [2], the current Specification, while identifying critical procedural elements for these stages, demonstrates three notable limitations: (1) absence of systematic methodological frameworks, (2) insufficient practical implementation mechanisms, and (3) critical technological gaps in road safety management. Particularly evident is the lack of advanced analytical capabilities for freeway crash investigations, including the application of crash prediction modeling techniques for probabilistic risk assessment and countermeasure evaluation. This paper aims to fill the gap by proposing a crash frequency modeling approach for Chinese mountainous freeways, which will provide a more complete and effective solution for the crash risk analysis and safety management of such freeways.

An appropriate crash frequency model is critical for road safety assessment and improvement. As a result of recent advances in data collection and modelling methods, a growing number of security factors are better understood and new security improvements are being implemented [1,3,4]. However, traffic crashes are complex events that involve the interaction of multiple risk factors, making understanding the causes of crashes sufficiently challenging. In particular, several challenges remain concerning the application of crash modeling methods to freeway scenarios in China: (1) The traffic safety analysis of Chinese freeways predominantly emphasizes geometric design features and traffic conditions. This focus is inconsistent with the reality that crash causes involve multi-dimensional factors related to driver, vehicle, road, and environment interactions. Such a narrow perspective inevitably undermines the reliability of safety evaluations [5,6]. (2) The absence of systematic characterization methods for key intrinsic attributes within crash databases-such as excess zeros and heterogeneity-raises concerns regarding the accuracy of crash frequency models and the reliability of analyses measuring safety effects [7,8]. (3) Limited by the deficiencies of modeling database and technology, the measurement and evaluation results of mountain freeways have gaps with the actual situation, and even completely contradictory conclusions have emerged [8,9].

Based on the questions raised above, this paper makes three key contributions (As shown in Fig 1): (1) A comprehensive crash frequency modeling dataset reflecting the real conditions of Chinese mountainous freeways has been developed, incorporating various risk factors such as road design characteristics, traffic volume and composition, pavement performance, and weather conditions. This dataset provides a standardized template for data collection in the safety management process of Chinese mountainous freeways. (2) A set of advanced crash frequency models has been developed by incorporating new distributions and parameter randomization techniques to address excess zeros and heterogeneity, thus expanding the library of freeway traffic safety analysis methods. (3) The safety factors relevant to Chinese mountainous freeways are identified from a fresh perspective, offering new insights into the analysis of crash causes. For instance, the safety effects of factors like climbing lanes and heavy rainfall warrant further investigation.

The structure of this study is as follows: The second section reviews the relevant literature. The third section details the database preparation and analysis process. The fourth section presents the proposed crash frequency models. The fifth section discusses the test results of these models and analyzes the causes of the crashes. Finally, the conclusion and future research directions are provided.

Literature review

This paper summarized the latest research findings related to crash frequency modeling techniques and analysis of impact factors.

Modeling technique of crash frequency

Modeling approach that deals with excess zeros.

The low probability events of accidents lead to an excessive number of zero-value samples (i.e. the dependent variable in the sample is zero) in the modelled dataset. This characteristic leads to a significant deviation between the actual accident frequency distribution and the traditional Poisson or negative binomial distribution, a discrepancy that is overlooked by conventional accident modeling methods. Studies [10,11] have shown that zero-inflated Poisson (ZIP) and zero-inflated negative binomial (ZINB) models, using a two-stage estimation process, offer substantial advantages in fitting data with multiple zero-value characteristics. However, these models have been criticized for focusing solely on maximizing statistical fit accuracy, while neglecting the underlying mechanisms of crash occurrence, which makes their application controversial [12].

Recently, new advancements in modeling approaches have been introduced to better handle excess zeros. These methods involve embedding distributions that are capable of handling small-count observations in traditional distributions such as Poisson or Negative Binomial distributions. This combination of distributions enhances the ability to handle zero-count observations, leading to improved model performance [13]. Prominent examples include Negative Binomial Lindley (NB-L) models [1419] Negative Binomial Crack (NB-CR) models [20], Generalized Poisson Gaussian (PGG) Models [21] Poisson Weighted Exponential (PWE) Models [22], Poisson Inverse Gaussian (PIG) models [23,24], Negative Binomial Generalized Exponential (NB-GE) models [25,26] and Negative Binomial with Dirichlet Process [27]. Relevant studies also verified the advancement of such improved methods. For instance, Rusli et al. [15] compared the performance of the NB and the NB-L models using two crash databases with zero observation values of 89% and 90% respectively, which showed that the NB-L model had better goodness of fit and prediction accuracy.

Modeling approach that deals with heterogeneity.

The complexity of crash causes makes it impossible to account for all influencing factors, especially as the interactions between unobserved and observed factors vary across space and time-this phenomenon is known as heterogeneity [23,24]. Fixing the coefficients of all variables (e.g., Fixed Parameters models such as Poisson and Negative Binomial models) leads to biased estimates [24]. To address this limitation, several studies [2530] introduced Random Effects (RE) models, embedding random terms into the link function to capture heterogeneity across samples. These studies demonstrated that RE models outperform Fixed Parameters models. However, such models cannot fully explain the complex impacts of unobserved heterogeneity by merely embedding a random term [3032]. In view of this defect, the randomization of regression coefficients, known as Random Parameters (RP) models, was proposed to better characterize heterogeneity across samples [3338]. The RP models do not regard the error term in the link function as the only random component, but allows the regression coefficient of each variable to change among samples, which more comprehensively describes the heterogeneity. The above research results also verified that the RP models have significant advantages over the traditional Fixed Parameters models and RE models in the field of road safety analysis, which provides a reliable reference for the selection of the crash frequency models of Chinese mountain freeways.

Cause analysis of freeway crashes

Design features.

The road design features associated with crash frequency mainly include horizontal and vertical alignment indicators, cross-sectional indicators, speed limits and segment types. Several studies [30,31,3941] have shown that the curvature, slope and latitude are positively correlated with the collision frequency, that is, the higher the above indicators, the higher the crash risk of the segments. However, there is some debate regarding the traffic safety effects of cross-sectional indicators. The studies [4244] found that adding lanes would help reduce single-vehicle and multi-vehicle crashes, while Hou et al. [31] drew the opposite conclusion that three-lane and four-lane would lead to the probability of lane changing and overtaking compared with two-lane, thus increasing the risk of crashes. With respect to the safety effects of speed limits, freeways with higher speed limit values tend to have higher crash risks [43,45], and in particular, the speed limit difference between cars and trucks also had significant positive correlations with crash frequency [45]. Special segments of freeways, such as interchanges, tunnels, and service areas, have also been shown to have significantly higher crash frequency than those of ordinary freeway segments [14].

Traffic conditions.

Traffic condition indexes that influence traffic safety primarily include traffic flow, traffic composition, and traffic speed. The conclusion that traffic volume was significantly positively correlated with crash frequency has been confirmed by numerous studies [5,6,4654]. Regarding traffic composition, Ding et al. [4] and Wen et al. [55] found that freeway segments with higher proportions of conventional cars and heavy vehicles tend to have less stable traffic systems, thereby increasing the crash risk. This is because the presence of heavy vehicles and conventional cars reduces the responsiveness of traffic flow, making it harder for drivers to adjust to environmental change. In addition, Yu et al. (2013) [53], Wen et al. [56] and Cai et al. [7] have shown that high average speeds of traffic flow on mountain freeways lead to a reduction in crash frequency, as drivers are more likely to react and respond to changing conditions when traveling at higher speeds. This is especially true during the dry season, when the effects of higher speeds on crash frequency are more pronounced.

Pavement performance.

In recent years, with the progress of detection methods, pavement performance indicators have been proved to have significant safety effects. Flask et al. [57] and Alnawmasi et al. [58] shown that vehicle subvert accidents are likely to occur when driving in segments with deep ruts. In addition, the interactions between the friction coefficient of pavement and rains significantly affected the traffic safety of freeways, that is, increasing the road friction coefficient significantly reduces the occurrence of crashes in rainy days [31]. The traffic safety effects of the International Flatness Index were controversial, Chen et al. [34] have shown that increased pavement flatness improves driving stability and reduces crash risks. In contrast, Behnood and Mannering [39] believed that uneven pavement instead forces to drive more carefully, thus contributing to improved traffic safety.

Weather conditions.

In recent years, the effects of meteorological conditions such as high winds, rainfall, temperature, and visibility on traffic safety on motorways have been thoroughly explored by numerous scholars [12,4446,49]. The mainstream conclusions include: precipitation and visibility have significant negative impacts on traffic safety on freeways [44], especially heavy rainstorms greatly increase the probability of accidents [42]. Developed countries have further studied the traffic safety effects of micro-weather database or weather conditions at the time of crashes [12]. However, such studies have high requirements on the distance between meteorological observation stations and data collection systems, while road systems in general low- and middle-income countries hardly have the hardware and software conditions to meet the requirements.

Summary of research and work direction

Research on freeway safety in developed countries, such as Europe and America, has been ongoing for many years and has yielded fruitful results. However, there have been few targeted studies on the specific driving culture and driving environment of Chinese mountain freeways. Based on this, the following problems should be addressed gradually.

  • The quality of the databases needs to be further improved.
    Although European and American countries have specifically discussed the traffic safety effects of freeway design characteristics, traffic conditions, pavement performance, and weather conditions, little research has been conducted to incorporate all these potential risk factors into the traffic safety analysis of Chinese mountain freeways.
  • Excess zeros and heterogeneity should be given adequate attention.
    Excess zeros and heterogeneity have been the main factors impacting the performance of crash models. The vast differences in freeway design concepts and management regulations between China and Europe and the United States make the validity of transplanting advanced crash models from Europe and the United States to Chinese mountainous freeways yet to be verified. Thus, it is an urgent necessity to provide reliable and improved modeling ideas for excess zeros and heterogeneity based on the actual conditions of mountain freeways in China.
  • The reliability of traffic safety analysis of Chinese mountain freeways needs to be improved.
    At present, traffic safety analysis of Chinese mountain freeways is partly based on practitioners’ long-term working experience, and partly based on the results of qualitative safety evaluation methods of inference. Only a few are summarized based on the estimation results of a series of traffic safety econometric models. There is still scope for improvement in the traffic safety analysis methods for the mountainous freeways in China.

Date preparation

Research objects

We collected the crashes and potential risk factors of the Kaiping-Zhanjiang segments of the G15 freerway and the Lianzhou-Sanshui segments of the G55 freeway, which are located in the mountainous areas of southern China. The study objects are respectively undertaken by five management companies for daily operation, with a total length of 611km, and the roadway contains 50 interchanges, 8 service areas, 985 bridges and 50 tunnels (The basic conditions of each expressway are shown in Table 1).

Sample division methods

The quality of the crash modeling dataset was largely influenced by the temporal and spatial division of the samples [7]. Regarding the time scale, we chose to divide the data into ‘quarterly’ units for the following reasons: (1) potential risk factors such as traffic conditions, meteorological conditions, and pavement performance on mountainous freeways in China exhibit significant seasonal clustering and correlation [30], and (2) the quarterly division criterion prevented the generation of excessive zero-value samples compared to the micro-division criterion, represented by ‘month’, thus reducing some of the limitations of the crash model [32]. Based on the meteorological characteristics in southern China, February-April, May-July, August-October, and November-January of the following year were designated as the 1st-4th quarters of the year, respectively. This study collected crash and risk factor data over 8 quarters between 2015 and 2016.

Referring to the clue provided by Hou et al. [6], the spatial division of the samples was performed using the uncertain segment length method. This approach focused on maintaining consistency of indicators within the same segment, while allowing for variability in segment length. It effectively addresses concerns about data authenticity that may arise from secondary processing of indicators. Specifically, we truncated the segments at locations where the segment type, plane geometry, slope, and number of lanes changed according to specific rules [33]: (1) Plane geometry indicators were classified into curved and straight sections, with smooth curves included in the curved category; (2) Locations with a longitudinal slope gradient change of  ≥ 1% were used as cut-off points; and (3) Segment types included basic segments, interchanges, service areas, and tunnel segments (the extent of each segment type is shown in Fig 2).

A total of 5568 (i.e. 696*8) samples were obtained by the above spatio-temporal division method.

Description of variables

The modeling variables involved 26 risk factors, including traffic crashes, design characteristics, traffic conditions, pavement performance and weather conditions. With the above established samples as the carrier, the representation indicators of each variable were matched to the samples to form a complete modeling database. The description and statistical characteristics of the database are shown in Table 2.

The variables used for the model developments are extracted through the following task pipeline:

  1. (1) The traffic crashes were obtained from the database maintained in each freeway management center, which detail the locations, causes, related vehicle information, severity of the crashes and casualties. A total of 4155 crashes were recorded.
  2. (2) Road design indicators were mainly obtained through data collection and field investigation, including the construction design drawings and record documents provided by freeway management companies. Regarding the field research, the information of geometric features and traffic facilities were obtained by manual observation and validated with driving recorders. A total of 13 risk factors related to design indicators were recorded.
  3. (3) China’s networked freeway toll collection system categorizes vehicles into five types based on their number of axles, wheels, wheelbase, and height (as shown in Table 3). In this study, the quarterly percentages of the three types of vehicles were excluded, considering their covariance with other variables. The formulas for calculating the quarterly average daily traffic (QADT) and the weighted proportion of each type of vehicle are shown in Eqs (1)–(5).(1) (2) (3) (4) (5)
    where, represent the number of vehicles of category 1 to 5 passing through the toll section in quarter t (t = 1 , 2 , … 8). represents the number of days in the quarter t. represents the average daily standard traffic flow of segment k in quarter t. , , and represent the proportions of class 1, 2, 4 and 5 vehicles in segment k and quarter t, respectively.
  4. (4) The spatial and temporal collection frequencies of pavement performance indicators were 20/50m and 3–9 months, respectively. Therefore, the pavement performance metrics were matched to the samples by interpolation and averaging processes [7].
  5. (5) According to the Standard for Evaluation of Highway Technical Condition (JTG H20-2007) [23], the pavement condition index (PCI) characterizes the degree of pavement damage, and a higher PCI represents a smaller degree of pavement damage. Ride Quality Index (RQI) characterizes the smoothness of the pavement, and the higher its value, the better the smoothness of the pavement. Skidding Resistance Index (SRI) is converted from the lateral force coefficient of the pavement, where the higher its value, the better the slip resistance of the pavement.
  6. (6) Collected meteorological data, such as annual/monthly rainfall, average/maximum/ minimum temperatures, and other easily obtainable weather conditions data, are widely used in road traffic safety studies [7,55,56]. For example, the results of Wen et al. [56] showed that an increase in monthly average wind speed, monthly average daily precipitation, and monthly average visibility were detrimental to freeway traffic safety. Cai et al. [7] showed that the higher the ratio of light/moderate and heavy/stormy rainfall, the higher the crash frequency in freeway tunnels, with the most pronounced effect observed for heavy/stormy rainfall. Based on this, The weather condition variables selected in this paper include the percentage of small/moderate rain days in a quarter (SMR), the percentage of torrential rain days in a quarter (TR), the percentage of days with no sustained wind in a quarter (WD), and the per-centage of days with wind power  ≥  4 in a quarter (WP). The weather data used in this study were obtained from the Guangdong Meteorological Data Center and recorded at the county (district) level, and are covered by data from 41 meteorological stations, with an average distance of 36 km between stations. The shortest and longest distances between a meteorological station and a freeway are 18 km and 34 km, respectively.
  7. (7) No collinear pairs used in modeling after the collinearity test.

Methodology

Innovation crash models considering excess zeros

The necessity of introducing the new distribution was resolved by comparing the probability density profile distributions (shown in Fig 3). The Lindley and GE distributions have similar characteristics, i.e., the mean is close to 0 and the observations far from 0 have a long-tailed characteristic, which fits extremely well with the trend of the distribution of the actual crash frequencies. Therefore, the introduction of the two new distributions allows for an overall shift of the single NB distribution curve towards zero values, which is effective in capturing more zero-valued samples while retaining the advantage of the logic of the NB distribution in explaining crashes.

thumbnail
Fig 3. Resemblance of Lindley and GE probability distributions to the frequency distribution of observed crash counts with excess zeros, where θ represents the Lindley parameter and ω, v represent the scale parameter and shape parameter of GE distribution.

(A) Resemblance of Lindley to crash frequency distribution. (B) Resemblance of GE to crash frequency distribution.

https://doi.org/10.1371/journal.pone.0320135.g003

The core distribution of the NB-L model can be expressed as a hierarchy of NB distribution, Bernoulli distribution and Gamma distribution. Assuming that the crash frequency of segment i in time t follows the NB distribution, the probability distribution of is:

(6)

where, α is the overdiscretized parameter of NB distribution; is the mixed distribution parameter of segment i in time t, which can be decomposed into the product of random parameter and random parameter :

(7)

where, there is exponential correlation between and crash risk factor vector :

(8)

where, β is the regression coefficient vector corresponding to . While follows Lindley distribution, and its probability density function is:

(9)

where is the Lindley parameter in segment i and quarter t.

Eqs (6)–(9) are the conventional expression forms of NB-L model. Lindley distribution can be regarded as a mixture of Gamma distribution and Bernoulli distribution. To facilitate modeling, the NB-L model should be further formulated as a hierarchical model of each distribution.

According to Rusli et al. [15] and Saengthong et al. [20], combined with Gamma distribution and Bernoulli distribution, the hierarchical expression of Lindley distribution is as follows:

(10)

In summary, the complete hierarchical structure expression of NB-L model is summarized as follows:

(11)

The core mixture distribution of the NB-GE model can be expressed as a hierarchy of NB and GE distributions. The probability distribution equation for the NB model is:

(12)

where, is the mixed distribution parameter of segment i in time t, which can decompose the product of and , while .

(13)(14)

where, , respectively represent the scale parameter and shape parameter of GE distribution in segment i within time t.

In summary, the complete hierarchical structure expression of NB-GE model is summarized as:

(15)

Innovation crash models considering heterogeneity

The regression coefficients β, over discrete parameter α and intercept terms £ of the NB-L and NB-GE models are fixed in all segments and quarters, which cannot fully capture the influence of unobserved factors on the crash frequency. Based on the NB-L model, the following discuss how to set the β, α and £ as random parameters to reflect the heterogeneity through their random changes among samples. The models with α and £ randomization are called Random Effect Negative Binomial Lindley (£-RENB-L and α-RENB-L) models, and the model with β randomization is called Random Parameter Negative Binomial Lindley (RPNB-L) model.

According to the instructions provided by Rusli et al. [15], we set  +  and assume that follows the Beta distribution to achieve the purpose of normalization and randomization of α. Therefore, the hierarchical structure of α-RENB-L model can be concluded as follows: as:

(16)

where, is the over-discretized parameter of NB distribution of segment i in quarter t; £ is the intercept term in the link function.

£-RENB-L model assumes that the intercept term in the link function of the NB-L model follows a Normal distribution N ( c , d ) . Specifically, the hierarchical structure of £-RENB-L model is as follows:

(17)

The RPNB-L model introduces a new random component , and assumes that follows a Normal distribution to randomize the regression coefficients. Specifically, the hierarchy of RPNB-L model is as follows:

(18)

In parameter estimation of RPNB-L model, significant variables whose standard deviation of regression coefficient are significantly close to 0 are defined as random variables, that is, there is heterogeneity in the influence of this variable on crashes. On the contrary, it is considered that there is no heterogeneity in the safety effect of this risk factor.

Parameter estimation method and performance test index

Based on the clues provided in the literature [24], the candidate models were used for Bayesian parameter estimation using WinBUGS software. Referring to Wen et al. [56], This study used the N ( 0 . 01 , 10000 )  as the prior distribution of regression coefficients β and for all models. Similarly, the Lindley parameter θ is assigned Gamma ( 0 . 3 , 0 . 5 ) . The scale parameter ω and shape parameter v of GE distribution are assigned Gamma ( 0 . 01 , 0 . 01 ) . The prior distribution of random variable of α-RENB-L model was set as B ( 0 . 5 , 0 . 5 ) . The intercept random variable of £-RENB-L model is set as N ( 0 . 01 , 10000 ) . In addition, a chain is set for 100000 iterations and the first 70000 iterations are discarded. The convergence of MCMC simulation was judged according to Gelman-Rubin statistic in WinBUGS.

Following the clue provided by reference [23,59], Deviation Information Criterion (DIC) is used as an assessment of goodness-of-fit, while Mean Absolute Deviation (MAD) and Root Mean Square Error (RMSE) together are used as an assessment of prediction accuracy. Specifically, DIC assesses the goodness-of-fit by combining the complexity penalty term and the bias. The MAD denotes the absolute value of the difference between the actual observations and the posterior mean, while the RMSE de-notes the square root of the difference between the actual observations and the posterior mean. The specific calculation method of each performance test index is as follows:

(19)(20)(21)

where represents the a posteriori mean deviation of the parameters, which is used to measure the fitting accuracy of the model, and Pd is the effective parameter, which is used to measure the complexity of the model. N, I, T, and C represent the number of samples, the number of road segments, the number of quarters, and the number of parameters included in the model, respectively. denotes the predicted crash frequency for segment i in quarter t. denotes the actual crash frequency for segment i in quarter t.

In addition, Elastic Coefficients and Marginal Effects were further introduced to quantify the safety effects of significant factors [7]. Specifically, the Elasticity Coefficient was used to describe the percentage change in crash frequency for each 1% increase in the continuous significant variables. The Marginal Effect represented the change degree of the average crash frequency per kilometer for each unit increase of the discrete significant variables. The calculation methods of safety effect quantitative indexes are as follows:

(22)(23)

where, is the Elasticity Coefficient of continuous variable x on crash frequency, is the value of x in segment i and time period t, β is the regression coefficient of , is the Marginal Effect of discrete variable y on crash frequency, and L is the average length of modeling sample.

Results and analysis

Super parameters and performance test indicators

The test results of all candidate models are shown in Table 5 and Fig 4.

  1. (1) Firstly, the effectiveness of the improved models based on excess zeros was considered. The DIC, MAD and RMSE indexes of the NB-L model were 11005, 1.894 and 2.863, respectively, which were better than those of the NB-GE and NB models. The DIC, MAD and RMSE indexes of the NB-GE model were 11375, 1.902 and 2.908, respectively, which were superior to those of the NB model. These results corroborate the findings of Rusli et al. [15] and Cai et al. [7], highlighting the significant advantages of the Lindley and GE distributions in fitting multi-zero-valued crash samples.
  2. (2) When heterogeneity is taken into account, the RPNB-L model achieved the best performance among all candidate models, with the lowest DIC, MAD, and RMSE values. The goodness-of-fit indices for α-RENB-L and £-RENB-L models have little difference, which were all lower than that of RPNB-L model but better than that of NB-L, NB-GE and NB models. NB model had the worst goodness of fit because it neither considers excess zeros nor heterogeneity. Such results are consistent with Khanal et al. [60] and Hou et al. [33], that the randomization of coefficients was a better way to deal with heterogeneity than the randomization of α and £.
thumbnail
Fig 4. Comparison of the goodness-of-fit of candidate models.

https://doi.org/10.1371/journal.pone.0320135.g004

Estimation results of regression coefficients

The estimated results for each candidate model are presented in Table 6. As shown in the Table 6, the NB, NB-GE, NB-L, α-RENB-L, £-RENB-L, and RPNB-L models identified 16, 18, 17, 17, 18, and 18 risk factors, respectively, which were significant at the 95% Bayesian Credible Interval (BCI). All of the significant factors had consistent safety effects across the candidate models, as evidenced by the sign of their regression coefficients. Moreover, the risk factors that were significantly positively correlated with crash frequency included ln(SL), ST, CL, PT, DS, CUR„ SLO, SLL, ln(QADT), IIVE, VVE, and SMR. On the other hand, the risk factors that were significantly negatively correlated with crash frequency included CUL, IVVE, PCI, RQI, SRI, TR, WD, and WP.

Estimation results of random parameters

The distributions of random parameters and are shown in Fig 5. follows B ( 17 . 476 , 5 . 031 ) , and follows N ( − 12 . 964 , 4 . 243 ) , indicating that α-RENB-L and £-RENB-L models can effectively describe partial heterogeneity.

The estimation results of the RPNB-L model indicated that the regression coefficients of CUR, CUL, In(QADT), and VVE are all random parameters, as presented in Table 4 and Fig 6. The coefficients of these four variables vary randomly between samples, reflecting unobserved heterogeneity. For example, the regression coefficient for the random variable CUR was detected as following a normal distribution with a mean of 2.746 and a standard deviation of 1.274 (shown in Fig 5a), indicating that CUR was significantly positively correlated with crash frequency in 98.44 % of the samples, while it was significantly negatively correlated with crash frequency in the remaining 1.56 % of the samples. According to Ahmed et al. [52], CUR and CUL interact with unobserved fatigue driving related factors, while ln(QADT) and VVE interacts with weather conditions related factors and design speed.

thumbnail
Fig 5. The distributions of random parameters and .

(A) Parameter random distribution in the α-RENB-L model. (B) Parameter random distribution in the £-RENB-L model.

https://doi.org/10.1371/journal.pone.0320135.g005

thumbnail
Fig 6. Distribution status of random coefficients in the RPNB-L model.

(A) The distribution of regression coefficients in CUR. (B) The distribution of regression coefficients in CUL. (C) The distribution of regression coefficients in ln(QADT). (D) The distribution of regression coefficients in VVE.

https://doi.org/10.1371/journal.pone.0320135.g006

thumbnail
Table 4. Statistical characteristics of random variables.

https://doi.org/10.1371/journal.pone.0313772.t004

Analysis of traffic safety effects

Combined with the Elastic Coefficients and Marginal Effect of RPNB-L model (as shown in Table 7 and Fig 7.), the safety effects of factors related to mountain freeway in China were deeply analyzed.

thumbnail
Table 5. Hyperparameters and performance indexes of candidate models.

https://doi.org/10.1371/journal.pone.0313772.t005

thumbnail
Table 6. Coefficient estimation results of candidate models.

https://doi.org/10.1371/journal.pone.0313772.t006

Traffic safety effects of design features.

The elasticity coefficient of ln(SL) was 1.09, indicating an approximately linear positive correlation with crash frequency. The artificial segment length had no meaningful safety effects; that is, longer segments lead to more natural crashes without the influence of other factors [5,61]. According to the Marginal Effect of ST, the crash frequency of tunnel segments, service areas, and interchange segments was, on average, 0.23 crashes/km higher than that of basic segments. The presence of abundant tunnel segments on China’s mountainous freeways, characterized by frequent sudden environmental changes (e.g., rapid changes in vision, lighting, etc.), increases the probability of nervous driving and sharp reactions to light and dark transitions, thereby raising crash risks [46]. Additionally, the presence of frequent diversion and merging behaviors in the service areas and interchange segments with high traffic volumes inevitably increases the interaction behavior of vehicles thereby increasing the risk of crashes [1,7]. The Marginal Effect of CL showed that the provision of climbing lanes reduced crash frequency by an average of 0.59 crashes/km, as climbing lanes separate slower heavy vehicles from the general traffic flow, thereby increasing the stability of arterial traffic [3]. The Marginal Effect of PT revealed that crash frequency in cement concrete segments was, on average, 0.1 crashes/km higher than in asphalt concrete segments. This suggests that the lower noise levels and greater smoothness of asphalt pavement contribute to improved safety. It is also important to note that the basic segments in this study are asphalt-paved, while tunnel and service segments are paved with cement concrete, which exacerbates the negative safety effects of cement concrete pavements [5,6]. The Marginal Effect of DS indicated that segments with a design speed of 120 km/h had an average crash probability of 0.02 crashes/km higher than segments with a design speed of 100 km/h. In China’s mountainous freeways, higher design speeds lead to more complex vehicle interactions, thus increasing crash probability [56,62]. Among the flat and longitudinal indicators, each 1% increase in the CUR and SLO indicators will increase the crash frequency by 0.14% and 0.11% respectively. A 1% increase in the CUL indicator would reduce the crash frequency by 0.41%. The sharp curved segments resulted in severely restricted sight distances and compressed the turning reflection time of high-speed vehicles, greatly reducing the tolerance of driving safety and thus increasing the risk of crashes [63]. A longer curve length reduces curvature, giving the driver more time to react and maneuver, thereby enhancing safety levels [22]. Steep segments demand high braking performance to ensure vehicle safety. Additionally, in traffic systems with a high volume of heavy vehicles, the speed differential between heavy and standard vehicles exacerbates traffic flow instability in longitudinal slope segments, which is prone to crashes [64].

thumbnail
Table 7. Elasticity coefficients and marginal effects of RPNB-L model.

https://doi.org/10.1371/journal.pone.0313772.t007

thumbnail
Fig 7. Histogram of elasticity coefficients and marginal effects of risk factors for the RPNB-L model.

https://doi.org/10.1371/journal.pone.0320135.g007

Traffic safety effects of traffic conditions.

The results indicated that an increase in traffic volume leads to a higher crash risk. Specifically, a 1% increase in ln(QADT) results in a 4.82% increase in crash frequency. This result was verified to be applicable to freeways, urban trunk roads and rural trunk roads [8,13,42]. Strong interactions within traffic systems with high traffic flow contribute to negative safety effects. Regarding traffic composition, increases in IVE and VVE were significantly associated with higher crash frequency, whereas IVVE showed the opposite effect. Specifically, a 1% increase in IVE and VVE results in a 6.93% and 2.50% increase in crash frequency, respectively, while a 1% increase in IIVE leads to a 1.04% decrease in crash frequency. Potential reasons [7] for such results were that (1) The number of Class I vehicles in this study accounts for the majority (74.1%) of the total number of vehicles, whose influences on crashes were dominant. Segments with more class 1 vehicles are bound to have more interaction effects, frequent overtaking and lane change behaviors, thus increasing the possibility of crashes. (2) The slow speeds, large volumes and slow braking responses of heavy vehicles lead to the large speed difference between vehicles, limited vision of standard cars and other unstable factors in the traffic system, which have safety risks. (3) The negative correlation between Class IV vehicles and the crash frequency may be attributed to the drivers’ route familiarity and driving skills.

Traffic safety effects of pavement performance.

Both RQI and SRI were significantly and negatively correlated with crash frequency. Specifically, each 1% increase in RQI and SRI was associated with a 1.8% and 1.68% decrease in crash frequency, respectively. These findings align with those of Ahmed et al. [52] and Alnawmasi et al. [58] and reflect real-world conditions, where good driving comfort and skid resistance typically contribute to enhanced driving safety. This highlights the critical importance of routine road maintenance for improving traffic safety.

Analysis of traffic safety effects of weather conditions.

The impact of precipitation levels on crashes varied. SMR showed a significant positive correlation with crash frequency, while TR demonstrated a significant negative correlation. Specifically, a 1% increase in the SMR and TR indicators resulted in a 0.69% increase and a 0.62% decrease in crash frequency, respectively. The reasons for these results include: (1) Rainy conditions reduced drivers’ visibility and limited the space available for safe operations; (2) Rainy weather decreased the adhesion between the road surface and tires, particularly in segments with higher volumes of heavy vehicles, increasing the likelihood of loss of control for these vehicles. One possible explanation for the negative safety effects of TR is that heavy rain disrupts travel plans, leading to lower traffic volumes on freeways during heavy rain, which in turn reduces the probability of crashes [53]. Additionally, both WD and WP are negatively correlated with crash frequency. Specifically, a 1% increase in the WD and WP indices would result in a 0.99% and 0.69% decrease in crash frequency, respectively. Although there are no studies on the safety effects of these factors, these findings are consistent with the conditions of China’s mountainous freeways. Regarding driver behavior, some drivers avoid driving in complex mountainous freeway environments during sustained and strong wind conditions, which leads to reduced traffic volumes and, ultimately, a decrease in crash frequency.

Conclusions

This paper gradually adopted the NB-GE and NB-L models, considering excess zeros, and also the α-RENB-L, £-RENB-L and RPNB-L models, considering both excess zeros and heterogeneity, to analyze the causes of crashes on mountainous freeways in China, including freeway design features, traffic volume and composition, pavement performances, and weather conditions. This study is the first to apply innovative crash frequency models, considering both excess zeros and heterogeneity, to analyze the safety of mountain freeways in China. It represents the first comprehensive safety evaluation of Chinese mountain freeways based on multidimensional information of human-vehicular-road-environment.

The findings of this study offer novel insights for policy-making by freeway administrators. Specifically: (1) The risk implications of large vehicles in safety management require urgent attention. We strongly recommend conducting a comprehensive benefit-cost analysis of safety improvement measures targeting large vehicle control on mountainous highways in China. Should the benefit-cost ratio prove favorable, practical intervention strategies for large vehicle regulation should be prioritized to effectively mitigate economic losses and reduce traffic-related casualties. (2) Concerning the safety effects of pavement performance, maintenance measures targeting enhanced surface evenness, improved skid resistance, or reduced pavement distress demonstrate quantifiable safety benefits. A critical implication of this discovery lies in budgetary optimization: under fiscal constraints, highway authorities should strategically prioritize these pivotal performance indicators while selectively allocating residual funds to secondary pavement characteristics based on cost-effectiveness analyses. (3) The integration of high-resolution meteorological data significantly enhances the predictability of safety management systems, enabling transportation agencies to deploy adaptive control strategies. For instance, proactive traffic control protocols could be activated in critical zones (e.g., tunnels, intersections, and service areas) prior to light-to-moderate rainfall events. Notably, this study revealed a counterintuitive safety improvement during heavy precipitation-a phenomenon potentially attributable to reduced travel demand and heightened driver vigilance under extreme weather conditions. Consequently, resource-constrained authorities should prioritize preventive measures during light/moderate rainfall phases while maintaining robust emergency response capabilities for heavy precipitation scenarios.

The results of this study contribute to a better understanding of crash causation on China’s mountain freeways and to the development of reliable safety countermeasures for the world’s largest mountain highway system. The following topics warrant further study:

  1. (1) To improve the reliability of safety analysis, modeling methods based on internal attributes such as temporal-spatial correlation and endogeneity of crash data structures need to be studied.
  2. (2) The safety effects of overpasses, tunnels, and service areas need to be further refined. This includes analyzing traffic volume of on-ramp and off-ramp, acceleration and deceleration lane length, interchange length, ramp speed limit, tunnel length, tunnel spacing, and other relevant factors.

Acknowledgments

Thanks to all the related personnel who participated in this research.

References

  1. 1. Wang X, Li S, Li X, Wang Y, Zeng Q. Effects of geometric attributes of horizontal and sag vertical curve combinations on freeway crash frequency. Accid Anal Prev. 2023;186:107056. pmid:37027898.
  2. 2. AASHTO BDS. American Association of State Highway and Transportation Officials. Washington, DC; 2010. p. 4.
  3. 3. Mahmud A, Gayah VV, Paleti R. Estimation of crash type frequency accounting for misclassification in crash data. Accid Anal Prev. 2023;184:106998. pmid:36780867.
  4. 4. Ding H, Lu Y, Sze NN, Antoniou C, Guo Y. A crash feature-based allocation method for boundary crash problem in spatial analysis of bicycle crashes. Anal Meth Accid Res. 2023;37:100251.
  5. 5. Hou Q, Meng X, Huo X, Cheng Y, Leng J. Effects of freeway climbing lane on crash frequency: Application of propensity scores and potential outcomes. Phys A: Stat Mech Its Appl. 2019;517:246–56.
  6. 6. Hou Q, Huo X, Leng J. A correlated random parameters tobit model to analyze the safety effects and temporal instability of factors affecting crash rates. Accid Anal Prev. 2020;134:105326. pmid:31675667.
  7. 7. Cai M, Tang F, Fu X. A Bayesian bivariate random rarameters and spatial-temporal negative binomial Lindley model for jointly modeling crash frequency by severity: Investigation for Chinese freeway tunnel safety. IEEE Access 2022;10(1):38045–64.
  8. 8. Feng M, Wang X, Li Y. Analyzing single-vehicle and multi-vehicle freeway crashes with unobserved heterogeneity. J Transp Saf Secur. 2023;15(1):59–81.
  9. 9. Zhou Z, Meng F, Song C, Sze NN, Guo Z, Ouyang N. Investigating the uniqueness of crash injury severity in freeway tunnels: A comparative study in Guizhou, China. J Safety Res. 2021;77:105–13. pmid:34092300.
  10. 10. Lee T, Cunningham C, Rouphail N. Movement-based intersection crash frequency modeling. J Transp Saf Secur. 2022;15(5):493–514.
  11. 11. Lord D, Washington S, Ivan JN. Further notes on the application of zero-inflated models in highway safety. Accid Anal Prev 2007;39(1):53–7. pmid:16949027.
  12. 12. Zhang X, Wen H, Yamamoto T, Zeng Q. Investigating hazardous factors affecting freeway crash injury severity incorporating real-time weather data: Using a Bayesian multinomial logit model with conditional autoregressive priors. J Safety Res. 2021;762:48–55. pmid:33653556.
  13. 13. Ding H, Lu Y, Sze N, Chen T, Guo Y, Lin Q. A deep generative approach for crash frequency model with heterogeneous imbalanced data. Anal Meth Accid Res. 2022;34:100212.
  14. 14. Shaon M, Qin X, Shirazi M, Lord D, Geedipally S. Developing a random parameters negative binomial-Lindley model to analyze highly over-dispersed crash count data. Anal Meth Accid Res. 2018;18:33–44.
  15. 15. Rusli R, Haque MM, Afghari AP, King M. Applying a random parameters Negative Binomial Lindley model to examine multi-vehicle crashes along rural mountainous highways in Malaysia. Accid Anal Prev. 2018;119:80–90. pmid:30007211.
  16. 16. Wang W, Yang Y, Yang X, Gayah VV, Wang Y, Tang J, et al. A negative binomial Lindley approach considering spatiotemporal effects for modeling traffic crash frequency with excess zeros. Accid Anal Prev. 2024;207:107741. pmid:39137658.
  17. 17. Dzinyela R, Shirazi M, Das S, Lord D. The negative Binomial-Lindley model with Time-Dependent Parameters: Accounting for temporal variations and excess zero observations in crash data. Accid Anal Prev. 2024;207:107711. pmid:39084005.
  18. 18. Gil-Marin JK, Shirazi M, Ivan JN. Assessing the Negative Binomial-Lindley model for crash hotspot identification: Insights from Monte Carlo simulation analysis. Accid Anal Prev. 2024;199:107478. pmid:38458009.
  19. 19. Hasan T, Abdel-Aty M. Short-term Safety Performance Functions by Random Parameters Negative Binomial-Lindley model for Part-time Shoulder Use. Accid Anal Prev. 2024;199:107498. pmid:38359671.
  20. 20. Saengthong P, Bodhisuwan W, Thongteeraparp A. The zero inflated negative binomial–Crack distribution: Some properties and parameter estimation. Songklanakarin J Sci Technol. 2015;37(6):701–11.
  21. 21. Saha D, Alluri P, Dumbaugh E, Gan A. Application of the Poisson-Tweedie distribution in analyzing crash frequency data. Accid Anal Prev. 2020;137:105456. pmid:32036107.
  22. 22. Zamani H, Ismail N, Faroughi P. Poisson-weighted exponential univariate version and regression model with applications. J Math Stat. 2014;10(2):148–54.
  23. 23. Zha L, Lord D, Zou Y. The Poisson inverse Gaussian (PIG) generalized linear regression model for analyzing motor vehicle crash data. J Transp Saf Secur. 2014;8(1):18–35.
  24. 24. Vangala P, Lord D, Geedipally SR. Exploring the application of the Negative Binomial–Generalized Exponential model for analyzing traffic crash data with excess zeros. Anal Meth Accid Res. 2015;7:29–36.
  25. 25. Heydari S, Fu L, Joseph L, Miranda-Moreno L. Bayesian nonparametric modeling in transportation safety studies: Applications in univariate and multivariate settings. Anal Meth Accid Res. 2016;12:18–34.
  26. 26. Rusli RB. Traffic safety along rural mountainous highways in Malaysia. Queensland University of Technology; 2017.
  27. 27. Faden A, Abdel-Aty M, Mahmoud N, Hasan T, Rim H. Multivariate Poisson-Lognormal models for predicting peak-period crash frequency of joint on-ramp and merge segments on freeways. Transport Res Rec. 2024;2678(3):133–147.
  28. 28. Mannering FL, Shankar V, Bhat CR. Unobserved heterogeneity and the statistical analysis of highway accident data. Anal Meth Accid Res. 2016;11:1–16.
  29. 29. Islam M, Alnawmasi N, Mannering F. Unobserved heterogeneity and temporal instability in the analysis of work-zone crash-injury severities. Anal Meth Accid Res. 2020;28:100130.
  30. 30. Chen E, Tarko A. Modeling safety of highway work zones with random parameters and random effects models. Anal Meth Accid Res. 2014;1:86–95.
  31. 31. Hou Q, Tarko AP, Meng X. Investigating factors of crash frequency with random effects and random parameters models: New insights from Chinese freeway study. Accid Anal Prev. 2018;120:1–12. pmid:30075358.
  32. 32. Hou Q, Meng X, Leng J, Yu L. Application of a random effects negative binomial model to examine crash frequency for freeways in China. Phys A: Stat Mech Its Appl. 2018;509:937–44.
  33. 33. Hou Q, Huo X, Leng J, Mannering F. A note on out-of-sample prediction, marginal effects computations, and temporal testing with random parameters crash-injury severity models. Anal Meth Accid Res. 2022;33:100191.
  34. 34. Chen S, Chen Y, Xing Y. Comparison and analysis of crash frequency and rate in cross-river tunnels using random-parameter models. J Transp Saf Secur. 2022;14(2):280–304.
  35. 35. Khanal B, Zahertar A, Lavrenz S. Heterogeneity in crash data: A cross comparison between ordered probability model and its variant using crashes at suburban type arterial. Accid Anal Prevent. 2024;200:107524.
  36. 36. Li J, Li C, Zhao X. Optimizing crash risk models for freeway segments: A focus on the heterogeneous effects of road geometric design features, traffic operation status, and crash units. Accid Anal Prev. 2024;205:107665. pmid:38901161.
  37. 37. Gao D, Zhang X. Injury severity analysis of single-vehicle and two-vehicle crashes with electric scooters: A random parameters approach with heterogeneity in means and variances. Accid Anal Prev. 2024;195:107408. pmid:38043213.
  38. 38. Islam ASMM, Shirazi M, Lord D. Grouped Random Parameters Negative Binomial-Lindley for accounting unobserved heterogeneity in crash data with preponderant zero observations. Anal Meth Accid Res. 2023;37:100255.
  39. 39. Behnood A, Mannering F. The effect of passengers on driver-injury severities in single-vehicle crashes: A random parameters heterogeneity-in-means approach. Anal Meth Accid Res. 2017;14:41–53.
  40. 40. Wang X, Azati Y, Quddus M, Cai B, Zhang X. Statistical analysis of traffic crashes on mountainous freeway tunnel sections. Transp Res Rec. 2024;2678(2):1–10.
  41. 41. Ahern Z, Corry P, Shirazi M, Paz A. A comprehensive multi-objective framework for the estimation of crash frequency models. Accid Anal Prev. 2025;210:107844. pmid:39626506.
  42. 42. Haddad A, Mondal A, Eluru N, Bhat C. A novel integrated approach to modeling and predicting crash frequency by crash event state. Anal Meth Accid Res. 2024;41:100319.
  43. 43. Putra IGB, Kuo P-F, Lord D, Chiu C-S, Chen MKT, Hsu T-P. Using spatial analysis and variable selection of traffic violations to determine crash frequency. Transportmetr A: Transp Sci. 2024;26–26.
  44. 44. Ogungbire A, Pulugurtha SS. Effectiveness of data imbalance treatment in weather-related crash severity analysis. Transp Res Rec. 2024;03611981241239962.
  45. 45. Kumar Pathivada B, Banerjee A, Haleem K. Impact of real-time weather conditions on crash injury severity in Kentucky using the correlated random parameters logit model with heterogeneity in means. Accid Anal Prev. 2024;196:107453. pmid:38176321.
  46. 46. Wei Z, Das S, Wu Y, Li Z, Zhang Y. Modeling the lagged impacts of hourly weather and speed variation factors on the segment crash risk of rural interstate freeways: Applying a space-time-stratified case-crossover design. Accid Anal Prev. 2024;195:107411. pmid:38016324.
  47. 47. Fanyu M, Sze N, Cancan S, Tiantian C, Yiping Z. Temporal instability of truck volume composition on non-truck-involved crash severity using uncorrelated and correlated grouped random parameters binary logit models with space-time variations. Anal Meth Accid Res. 2021;31:100168.
  48. 48. Li J, Li C, Zhao X. Optimizing crash risk models for freeway segments: A focus on the heterogeneous effects of road geometric design features, traffic operation status, and crash units. Accid Anal Prev. 2024;205:107665. pmid:38901161.
  49. 49. Yu R, Abdel-Aty M. Analyzing crash injury severity for a mountainous freeway incorporating real-time traffic and weather data. Saf Sci. 2014;63:50–6.
  50. 50. Cheng W, Gill GS, Ensch JL, Kwong J, Jia X. Multimodal crash frequency modeling: Multivariate space-time models with alternate spatiotemporal interactions. Accid Anal Prev. 2018;113:159–70. pmid:29407663.
  51. 51. Wang J, He S, Zhai X, Wang Z, Fu X. Estimating mountainous freeway crash rate: Application of a spatial model with three-dimensional (3D) alignment parameters. Accid Anal Prev. 2022;170:106634. pmid:35344798.
  52. 52. Ahmed S, Alnawmasi N, Anastasopoulos P, Mannering F. The effect of higher speed limits on crash-injury severity rates: A correlated random parameters bivariate tobit approach. Anal Meth Accid Res. 2022;34:100213.
  53. 53. Yu R, Abdel-Aty M. Utilizing support vector machine in real-time crash risk evaluation. Accid Anal Prev. 2013;51:252–9. pmid:23287112.
  54. 54. Ahmed MM, Abdel-Aty M, Lee J, Yu R. Real-time assessment of fog-related crashes using airport weather data: A feasibility analysis. Accid Anal Prev. 2014;72:309–17. pmid:25108899.
  55. 55. Wen H, Ma Z, Chen Z, Luo C. Analyzing the impact of curve and slope on multi-vehicle truck crash severity on mountainous freeways. Accid Anal Prev. 2023;181:106951. pmid:36586161.
  56. 56. Wen H, Zhang X, Zeng Q, Sze NN. Bayesian spatial-temporal model for the main and interaction effects of roadway and weather characteristics on freeway crash incidence. Accid Anal Prev. 2019;132:105249. pmid:31415995.
  57. 57. Flask T, Schneider IV W. A Bayesian analysis of multi-level spatial correlation in single vehicle motorcycle crashes in Ohio. Saf Sci. 2013;53(1):1–10.
  58. 58. Alnawmasi N, Mannering F. The impact of higher speed limits on the frequency and severity of freeway crashes: Accounting for temporal shifts and unobserved heterogeneity. Anal Meth Accid Res. 2022;34:100205.
  59. 59. Aguilar C, Russo BJ, Mohebbi A, Akbariyeh S. Analysis of factors affecting the frequency of crashes on interstate freeways by vehicle type considering multiple weather variables. J Transp Saf Secur. 2021;14(6):973–1001.
  60. 60. Gu Z, Peng B, Xin Y. Higher traffic crash risk in extreme hot days? A spatiotemporal examination of risk factors and influencing features. Int J Disaster Risk Reduct. 2025;116:105045.
  61. 61. Feknssa N, Venkataraman N, Shankar V, Ghebrab T. Unobserved heterogeneity in ramp crashes due to alignment, interchange geometry and truck volume: Insights from a random parameter model. Anal Meth Accid Res. 2023;37:100254.
  62. 62. Khan M, Ahmed M. A novel deep learning approach to predict crash severity in adverse weather on rural mountainous freeway. J Transp Saf Secur. 2023;15(8):795–825.
  63. 63. Song L, Li S, Yang Q, Liu B, Lyu N, David Fan W. Partially temporally constrained modeling of speeding crash-injury severities on freeways and non-freeways before, during, and after the stay-at-home order. Accid Anal Prev. 2025;211:107917. pmid:39793299.
  64. 64. Jin J, Huang H, Li Y, Dong Y, Zhang G, Chen J. Variable speed limit control strategy for freeway tunnels based on a multi-objective deep reinforcement learning framework with safety perception. Expert Syst Appl. 2025;267:126277.