Analysis of Stopping Behavior at Rural T-Intersections Using Naturalistic Driving Study Data

Abstract

Rural intersections account for around 30% of crashes in rural areas and 6% of all fatal crashes, representing a significant but poorly understood safety problem. Crashes at rural intersections are also problematic since high speeds on intersection approaches are present which can exacerbate the impact of a crash. Additionally, rural areas are often underserved with EMS services which can further contribute to negative crash outcomes. This paper describes an analysis of driver stopping behavior at rural T-intersections using the SHRP 2 Naturalistic Driving Study data. Type of stop was used as a safety surrogate measure using full/rolling stops compared to non-stops. Time series traces were obtained for 157 drivers at 87 unique intersections resulting in 1277 samples at the stop controlled approach for T-intersections. Roadway (i.e. number of lanes, presence of skew, speed limit, presence of stop bar or other traffic control devices), driver (age, gender, speeding), and environmental characteristics (time of day, presence of rain) were reduced and included as independent variables. Results of a logistic regression model indicated drivers were less likely to stop during the nighttime. However presence of intersection lighting increased the likelihood of full/rolling stops. Presence of intersection skew was shown to negatively impact stopping behavior. Additionally drivers who were traveling over the posted speed limit upstream of the intersection approach were less likely to stop at the approach stop sign.

Share and Cite:

Oneyear, N. , Hallmark, S. , Goswamy, A. , Thapa, R. and Basulto-Elias, G. (2023) Analysis of Stopping Behavior at Rural T-Intersections Using Naturalistic Driving Study Data. Journal of Transportation Technologies, 13, 208-221. doi: 10.4236/jtts.2023.132010.

1. Introduction

1.1. Background

Rural intersections account for 30% of crashes in rural areas and 6% of all fatal crashes, representing a significant but poorly understood safety problem. Crashes at rural intersections are particularly problematic when high speeds on intersection approaches are present. Additionally, motor vehicle crash injury rates are higher in rural versus urban areas due in part to increased emergency medical service (EMS) times, reliance on volunteer EMS personnel, and increased transport time to definitive care [1] . EMS response times in rural areas are 1.6 to 2 times longer than in urban areas [2] [3] , and fatal injury crash rates are two times higher in rural than in urban areas [1] [4] .

Inappropriate gap selection has been found to be a major contributing cause of crashes at rural intersections. In a study conducted in 2003 [5] [6] , inappropriate gap selection accounted for 56% of all right-angle crashes at rural Minnesota through-stop intersections. Right-angle collisions, the result of drivers selecting a gap that is too small or failing to observe traffic control, account for between 36% to 50% of crashes at intersections on high-speed divided highways, while such collisions account for only 28% of crashes at intersections on other types of roads [7] .

Drivers failing to stop on the minor approach have been found to account for 25% of right angle crashes [6] . Retting et al. [8] found that crashes where drivers failed to stop at stop signs were more likely to result in injuries than crashes where drivers stopped. Characteristics correlated to failure to yield right of way include age [9] [10] , speeding, vision obstruction, and inattention/distraction [11] .

Roadway characteristics also play a significant role in intersection crashes. Intersections located on or near horizontal and vertical curves tend to have higher crash rates than intersections on tangent segments [12] [13] [14] [15] . Barua et al. [16] evaluated crashes at rural undivided intersections in Alberta, Canada, and found that crash risk is higher during the fall than the winter (possibly due to harvesting), at nighttime, at offset intersections, at T-intersections, and on horizontal or sag curves. Leckrone et al. [17] evaluated minor approach stop-controlled intersections in Indiana and found that the presence of acceleration lanes for both left and right turns, median width, and a nearly perpendicular intersection angle resulted in a lower likelihood of a severe crash. Additionally, they found that the crash risk was lower at T-intersections than at intersections with four approaches.

1.2. Objective

The Second Strategic Highway Research Program (SHRP2) conducted a large-scale naturalistic driving study (NDS) using instrumented vehicles, which provides a significant amount of on-road driving data for a range of drivers. The present study utilized data from the SHRP2 NDS as well as the SHRP2 Roadway Information Database (RID) to observe driver behavior at rural intersections firsthand using video, kinematic vehicle and driver, and roadway data to determine how roadway, driver, environmental, and vehicle factors interact to affect driver safety at rural intersections. The overarching objective of this study was to better understand how drivers react at rural intersections. This paper summarizes results of analyses to assess stopping behavior at T-intersections (3-approaches). Type of stop was used as a surrogate for intersection safety. It was assumed a full/rolling stop was less likely to result in a conflict or crash than a non-stop. The purpose of the analysis was to better understand driver behavior at rural intersections so that agencies can better address intersection crashes.

2. Description of Data

The second Strategic Highway Research Program (SHRP2) Naturalistic Driving Study (NDS) data is the largest dataset of its kind. The participating states were Florida, Indiana, New York, North Carolina, Pennsylvania and Washington (six US states). The vehicles of naïve drivers were equipped with a Data Acquisition System which included various sensors which extracted kinematic vehicle data such as speed, acceleration, GPS data, and radar. Four cameras were positioned to collect video from the forward roadway, rear roadway, driver face, and over the driver’s shoulder. Over the three years of the study approximately 3400 participants drove 5 million trips logging over 30 million data miles.

The SHRP2 Roadway Information Database (RID) was collected simultaneously with the NDS. Mobile data collection was conducted on over 12,500 center line mile across the six NDS states. Existing roadways and supplemental data were also acquired from public and private sources. These data came from several sources including the NDS states’ Department of Transportation (DOT), Highway Performance Monitoring System (HPMS), covering most roadways for each study state.

Data Reduction

A set of rural intersections within the six States covered by the SHRP2 NDS (Florida, Indiana, New York, North Carolina, Pennsylvania, and Washington) was identified using the RID and other sources such as Google Earth. These intersections were selected to represent a cross-section of geometric features (e.g., skew angle) and intersection countermeasures (e.g., overhead beacons or on-pavement signing) for all-way stop, two-way stop-controlled, and T-intersections.

Ultimately, 87 minor stop controlled T-intersections were utilized in the mode. Time series traces, forward roadway video, and static driver characteristics (e.g., age, gender) were requested and provided by the subcontractor that archives the SHRP2 NDS data. These data were reduced, and 1277 time series traces reflecting a range of driver ages and genders were viable and were utilized in the analysis. Time series traces provide kinematic vehicle data, such as speed, acceleration, and global positioning system (GPS) location, at 0.1-second intervals and represent one driver trip through one intersection.

Roadway characteristics (such as skew angle, number of approaches, and type of countermeasure) were extracted from the RID or Google Street View and were confirmed for each time series trace using the forward roadway video. Environmental characteristics (such as time of day and ambient conditions) and the presence of opposing vehicles were also reduced from the forward roadway video.

Roadway features, such as the location of a stop sign or advance warning sign, were identified and mapped to vehicle position within each of the time series traces. As a result, the distance of the vehicle from a particular characteristic, such as an upstream advance warning sign, could be determined. Data were reduced along the corresponding approach from the location of the stop sign to 600 meters upstream and approximately 5 meters downstream. The maximum speed within that distance was identified, along with whether the driver was 2.24 or 4.47 m/s (5 or 10 mph) over the posted speed limit at any point within the 600 meters. An example of intersection characteristics present is shown in Table 1.

Additionally, the stopping behavior was coded by determining minimum speed within 5 meters upstream or downstream of the stop location. Type of stop was then coded using the following criteria:

• Full stop: speed was reduced to approximately zero.

• Rolling: vehicle speed was greater than zero but less than approximately 2.24 m/s (5 mph).

• Non-stop: vehicle speed was greater than approximately 2.24 m/s (5 mph).

3. Methodology

3.1. Variables Utilized

The dependent variable was type of stop. Initially, models were developed using full, rolling, and no-stop as separate dependent variables. The models for full and rolling stops were similar and the number of full stops was small. As a result, full stop and rolling were combined (n = 821) and compared against no-stops (n

Table 1. Description of intersection characteristics.

= 456). The model included 1277 time series traces at T-intersections. This represented 157 unique drivers at 87 unique intersections. Data were reduced only for the stop-controlled approach. A unique Intersection ID was assigned to each intersection and used as a random effects variable in the model to account for repeated samples at the same intersection. Similarly, a unique Driver ID was assigned to each driver and used as a random effects variable. The corresponding State was also assigned a unique State ID and also included as a random effect. Metrics for independent variables are provided in Table 2. Categorical independent variables are shown in Table S1 which is provided as Supplementary Material.

3.2. Approach

Ordered logit models [18] , also known as proportional odds models, were initially developed to assess the effect of driver, environmental, and roadway factors on the three types of stopping behavior. Listed in order from safest to least safe, the type of stop include full, rolling, and no stop. The types were assumed to be of a discrete, ordinal nature classified on a three-point scale ranging from full stop to no stop. Consequently, these data are well suited for analysis using proportional odds or ordered logit model. This model can be derived by defining a latent variable z, which can be specified as a linear function for each observation such that

z = β X + ε (1)

where, X is the vector of variables determining the discrete ordering, β is the vector of estimable parameters, and ε is a random disturbance term. With the use of this equation, the observed safety level outcome y for each driver is defined as

y = 1 if zμ_0

y = 2 if μ0 < zμ1

y = 3 if μ1 < zμ2

y = …

y = 1 if zμI1

Table 2. Independent variables with all traces.

where, the estimable threshold parameters μ define y, which corresponds to integer ordering, and I is the highest integer-ordered response. The variable μ represents parameters that are jointly estimated using the model parameters β. If the error term is assumed to be distributed as standard normal across observations, an ordered probit model results. Setting the lower threshold μ0 equal to zero results in the outcome probabilities

P ( y = i ) = Φ ( μ i β X ) Φ ( μ i + 1 β X ) (2)

where, μi and μi+1 represent the upper and lower thresholds for response category i, and Φ(.) is the standard normal cumulative function. Estimation was done using standard maximum likelihood methods. Each variable was first examined individually and then simultaneously with other variables until the models providing the best balance of model fit and explanatory power were identified. AIC was utilized to assess model fit, and the “anova()” function in R was used to assess the significance of additional variables.

The “clmm()” function in the “ordinal” package of R was used to fit the models. The dependent variable was type of stop, with full stop < rolling stop < no stop. The variables for relevant interactions (e.g., overhead flashing beacons at night) were evaluated to determine whether they needed to be included in the model. Once the best fit models were found, the model assumptions were tested. It was found that the proportional odds ratio assumption was violated in both models, and therefore an ordinal model was not appropriate.

The data were then visualized through Andrews curves using the “andrews()” function in the “andrews” package of R to determine if one type of stop differed greatly from the others. It was found that the no-stop data differed from the data for the other two stop types. Therefore, it was determined that the data could be modeled with the dependent variable collapsed to two levels, full/rolling stop or no-stop, which greatly simplified the analysis.

Logistic regression was used to model binary responses (in this case, no stop versus full/rolling stop). Mixed effects models have two components, fixed and random. The fixed effects are included to explain the relationship between the dependent variable (in this case, stopping behavior) and a set of independent variables. The random effects are included to control for the dependency among a group of observations within the same group or cluster. In this intersection analysis, approach ID and driver ID are the random effects.

Note that the full stop and rolling stop classes could have been separated, resulting in three possible stopping behavior classes: no stop, rolling stop, and full stop. Two popular models for problems with multiple classes (unlike logistic regression models, which are used for problems with two classes) are multinomial regression and ordered logit regression. Both models were fit. While the former yielded results similar to those of the logistic regression model, logistic regression was chosen because it is more parsimonious. Meanwhile, order logistic regression relies heavily on an assumption of proportional odds, which was not met.

Logistic mixed effects regression was then used to model the probability (odds) of a driver making no stop at a rural intersection, indexed by i in the random variables αi and yi, which follows a Bernoulli distribution for the probability of no stop, pi.

log ( p i / ( 1 p i ) ) = β 0 + B 1 x 1 + + β n x n + α i + γ i (3)

α i ~ N o r m a l ( 0 , σ d 2 )

γ i ~ N o r m a l ( 0 , σ c 2 )

One of the benefits of the logistic regression model is that the output of the model is easily interpreted odds ratios. Odds ratios are the probability that an event happens in relation to the probability that it does not happen.

The odds ratios were obtained by exponentiation of the ordered logit coefficients. An ordered logit model estimates a single equation (regression coefficients) over the values of the dependent variable. There is a direct relationship between the coefficients produced by a logit model and the odds ratios produced by a logistic model.

A logit function is defined as the log base e (log) of the odds:

logit ( p ) = log ( odds ) = log ( p / q ) (4)

The range is negative infinity to positive infinity. In regression, it is easiest to model unbounded outcomes. Logistic regression is, in reality, an ordinary regression using the logit function as the response variable. The logit transformation allows for a linear relationship between the response variable and the coefficients, as follows:

logit ( p ) = a + b X (5)

or

log ( p q ) = a + b X . (6)

Equation (6) can be expressed as an odds ratio by eliminating the log. This is done by taking e to the power for both sides of the equation using e^(log(p/q)) = e(a+bX) or p/q = e(a+bX).

The logistic regression model with mixed effects was adjusted in R . The analysis was conducted via a Bayesian implementation using the package “brms” for fitting, plotting, and summarizing the models. The priors for the parameters were non-informative, the convergence of the chains were assessed using trace plots and R values, and the model fit was assessed using posterior predictive checks.

4. Results

A binomial mixed effects logistic regression model was developed with the dependent variable having possible values of full/rolling stop or no-stop. This was done using the “brm()” function of the “brms” package in R. Relevant interactions (e.g., overhead flashing beacons at night) were tested to determine whether they needed to be included in the model.

The final model was developed using the top fifteen most important variables as determined by a random forest analysis. A random forest analysis is a machine learning regression technique that considers complex relationships between the dependent and the independent variables. It does not provide insights about the mechanism of the model, and for this reason a random forest was not used as the final model. Random forests can, however, provide a ranking of the most important variables. The random forests were fitted using the package “randomForest” in R.

The best fit model is shown in Table 3. Since the model’s implementation is Bayesian, the table does not present p-values but rather 90% credible intervals, which indicate that the corresponding parameter lies within that interval with a probability of 90%. If zero is included within the credible interval, it means that zero is a likely value for the parameter (i.e., that variable might not be significant).

The model shows the likelihood of not stopping. Positive values indicate an increase in not stopping. Drivers who were traveling over the posted speed limit (speeding) were 2.23 times more likely to not stop than drivers traveling at or below the speed limit. Values less than zero indicate they are less likely to not

Table 3. Independent variables for final model for stopping behavior at two-way stop-controlled intersections with all traces.

stop. The inverse of values less than zero can be used to describe the likelihood of stopping. For instance, when a vehicle was present on the major approach, drivers were 0.111 times less likely to not stop. Alternatively, they were 1/0.122476 = 55.53 times more likely to engage in rolling/full stop. When lighting was present at the intersection, drivers were 2.32 times more likely to engage in a rolling/full stop. Time of day was also significant. Drivers were 2.47 times more likely during the daytime and 1.60 times more likely at night to engage in a rolling/full stop than at dawn/dusk.

Interactions were present between intersection skew and turning movement and between turning movement and presence of an advance intersection warning sign. These relationships can be better described graphically. Figure 1 shows the credible sets for the interaction between turning movement and intersection skew. The probability of not stopping is represented on the y-axis, and skew is represented in the x-axis.

As the figure shows, left-turning drivers had a low probability of not stopping for all skew scenarios, including left skew, right skew, and no skew (skew direction is from the perspective of the driver). However, left turning drivers were more likely to not stop when left skew was present. Right-turning drivers were more likely to not stop when either no skew (62%) or right skew (70%) was present. They were less likely to not stop (approximately 28%) when left skew was present. However, the credible sets are rather large so results should be used with caution.

Figure 2 shows the relationship between presence of an advance intersection warning sign, turning movement, and the probability of a driver not stopping.

Figure 1. Interaction between turning movement and intersection skew for T-intersections.

Figure 2. Interaction between turning movement and presence of an advance warning sign for T-intersections.

As the figure shows, right-turning vehicles at T-intersections had a high probability of not stopping regardless of the presence of advance signing. Left-turning drivers had a low probability of not stopping when no advance signing was present but a 25% probability of not stopping when advance signing was present. This result was unexpected, since the purpose of the signing is to warn drivers of an upcoming intersection. However, it should be noted that advance signing and other countermeasures are placed at locations where a problem with safety or driver behavior already exists. As a result, the presence of a countermeasure may be a surrogate for a problem location. In these cases, a before-and-after analysis may yield more representative results.

5. Discussion

5.1. Summary

Stopping behavior at rural T-intersection approaches was modeled using the SHRP 2 naturalistic study data. Logistic regression was used to model intersection behavior using type of stop (full/rolling versus no-stop) as the dependent variable. Type of stop was used as a surrogate for intersection safety. It was assumed a full/rolling stop was less likely to result in a conflict or crash than a non-stop. The purpose of the analysis was to better understand driver behavior at rural intersections so that agencies can better address intersection crashes.

The most influential variable in terms of the odds ratio was the presence of a vehicle on the major approach at the time of the arrival of the subject vehicle at the stop bar.

Daytime and nighttime driving were associated with a higher likelihood of a rolling/full stop than driving at dawn or dusk. As a result, nighttime driving is more problematic than daytime. However, the model also indicated presence of lighting was associated with a higher likelihood of a rolling/full stop. This suggests improved visibility increases driver recognition of the presence of an intersection.

Interactions were present between intersection skew and turning movement. Left-turning drivers were more likely to not stop when any type of skew was present than when no skew was present. Interactions were also present between presence of an advance intersection warning sign and turning movement. Right-turning vehicles at T-intersections had a high probability of not stopping regardless of the presence of advance signing. Results suggest intersection reconfiguration may have a positive safety impact.

Left-turning drivers were more likely to not stop when an advance warning sign was present, which was noted in both models. This result was unexpected, since the purpose of the signing is to warn drivers of an upcoming intersection. However, it should be noted that advance signing and other countermeasures are placed at locations where a problem with safety or driver behavior already exists. As a result, the presence of a countermeasure may be a surrogate for a problem location. In these cases, a before-and-after analysis may yield more representative results.

Driver behavior was also a major contributing factor. Drivers who were traveling over the posted speed limit upstream of the T-intersections were 2.24 times more likely to not stop than drivers traveling at or below the speed limit upstream of the intersection. This suggests speed management could have positive impacts on intersection behavior as well.

5.2. Conclusions

This study showed naturalistic driving study data can be used to better understand driver behavior at rural intersections. In particular, the results were able to show a relationship between upstream speed and stopping behavior. This type of information is not available in crash data showing the utility of NDS. Other driver characteristics, such as distraction and glance location, were also included in the model. These characteristics showed up as relevant in models for all-way stop intersections (which is the subject of another paper) but not for the model for T-intersection described in this paper. Distraction and glance location could only be reduced for a subset of the traces modeled due to resource constraints. As a result, a larger sample size could show the impact of these characteristics.

Additionally, the results show key intersection characteristics such as lighting improved stopping behavior. Other intersection characteristics, such as skew were correlated with drivers not stopping. This provides information for agencies in addressing rural intersection crashes.

Acknowledgements

The team would like to thank the Iowa DOT for funding a pilot stage of this research and the Federal Highway Administration for funding the research. They would also like to thank the staff of the Center for Transportation Research and Education (CTRE) at Iowa State University for their assistance in providing access to the RID and to Virginia Tech Transportation Institute for providing the SHRP 2 NDS.

Supplementary Material

Table S1. Categorical independent variables for stopping model at T-intersections with all traces.

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

References

[1] Zwerling, C., Peek-Asa, C., Whitten, P.S., Choi, S.W., Sprince, N.L. and Jones, M.P. (2005) Fatal Motor Vehicle Crashes in Rural and Urban Areas: Decomposing Rates into Contributing Factors. Injury Prevention, 11, 24-28.
https://doi.org/10.1136/ip.2004.005959
[2] Gonzalez, R.P., Cummings, G.R., Phelan, H.A., Mulekar, M.S. and Rodning, C.B. (2009) Does Increased Emergency Medical Services Prehospital Time Affect Patient Mortality in Rural Motor Vehicle Crashes? A Statewide Analysis. The American Journal of Surgery, 197, 30-34.
https://doi.org/10.1016/j.amjsurg.2007.11.018
[3] NHTSA (2006) Traffic Crashes Take Their Toll on America’s Rural Roads. National Highway Traffic Safety Administration, Washington DC.
[4] FHWA (2019) Safety. Rural Fatalities.
https://safety.fhwa.dot.gov/local_rural/rural_fatal.cfm
[5] Preston, H., Storm, R., Donath, M. and Shankwitz, C. (2004) Review of Minnesota’s Rural Intersection Crashes: Methodology for Identifying Intersections for Intersection Decision Support. Minnesota Department of Transportation, Saint Paul, MN.
[6] Harder, K.A., loomfield, J.B. and Chihak, B.J. (2003) Reducing Crashes at Controlled Rural Intersections. Minnesota Department of Transportation, Saint Paul, MN.
https://conservancy.umn.edu/bitstream/handle/11299/904/1/200315.pdf
[7] Alexander, L., Cheng, P.L., Donath, M., Gorjestani, A., Menon, A. and Shankwitz, C. (2007) Intersection Decision Support Surveillance System: Design, Performance and Initial Driver Behavior Quantization. Minnesota Department of Transportation, Saint Paul, MN.
https://www.lrrb.org/media/reports/200730.pdf
[8] Retting, R.A., Weinstein, H.B. and Solomon, M.G. (2003) Analysis of Motor-Vehicle Crashes at Stop Signs in Four U.S. Cities. Journal of Safety Research, 34, 485-489.
https://doi.org/10.1016/j.jsr.2003.05.001
[9] McGwin, G. and Brown, D.B. (1999) Characteristics of Traffic Crashes among Young, Middle-Aged, and Older Drivers. Accident Analysis and Prevention, 31, 181-198.
https://doi.org/10.1016/S0001-4575(98)00061-X
[10] Keay, L., Jasti, S., Munoz, B., Turano, K.A., Munro, C., Duncan, D.D., Baldwin, K., Bandeen-Roche, K.J., Gower, E.W. and West, S.K. (2009) Urban Differences in Older Drivers’ Failure to Stop at Stop Signs. Accident Analysis and Prevention, 41, 995-1000.
https://doi.org/10.1016/j.aap.2009.06.004
[11] Campbell, B.N., Smith, J.D. and Najm, W.G. (2004) Analysis of Fatal Crashes Due to Signal and Stop Sign Violations. Volpe National Transportation Systems Center. National Highway Traffic Safety Administration, Cambridge, MA.
https://doi.org/10.1037/e495452008-001
[12] Savolainen, P.T. and Tarko, A.P. (2005) Safety Impacts at Intersections on Curved Segments. Transportation Research Record: Journal of the Transportation Research Board, 1908, 130-140.
https://doi.org/10.1177/0361198105190800116
[13] Burchett, G.D. and Maze, T.H. (2005) Rural Expressway Intersection Characteristics that Contribute to Reduced Safety Performance. Proceedings of the 2005 Mid-Continent Transportation Research Symposium, Ames, 18-19 August 2005, 71-80.
[14] Khattak, A.J. (2006) Intersection Safety. Mid-America Transportation Center, University of Nebraska-Lincoln, NE.
https://digitalcommons.unl.edu/ndor/55/
[15] Van Maren, P.A. (1977) Correlation of Design and Control Characteristics with Accidents at Rural Multi-Lane Highway Intersections. Joint Highway Research Project, Purdue University, West Lafayette, IN.
https://doi.org/10.5703/1288284313958
https://docs.lib.purdue.edu/cgi/viewcontent.cgi?article=2306&context=jtrp
[16] Barua, U., Azad, A.K. and Tay, R. (2010) Fatality Risk of Intersection Crashes on Rural Undivided Highways in Alberta, Canada. Transportation Research Record: Journal of the Transportation Research Board, 2148, 107-115.
https://doi.org/10.3141/2148-13
[17] Leckrone, S.J., Tarko, A.P. and Anastasopoulos, P.C. (2011) Improving Safety at High-Speed Rural Intersections. 3rd International Conference on Road Safety and Simulation, Indianapolis, IN, 14-16 September 2011, 1-19.
http://onlinepubs.trb.org/onlinepubs/conferences/2011/RSS/3/Leckrone,S.pdf
[18] McCullagh, P. (1980) Regression Models for Ordinal Data. Journal of the Royal Statistical Society: Series B (Methodological), 42, 109-127.
https://doi.org/10.1111/j.2517-6161.1980.tb01109.x

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.