Insights into vehicle conflicts based on traffic flow dynamics

Ding, Shengxuan; Abdel-Aty, Mohamed; Wang, Zijin; Wang, Dongdong

doi:10.1038/s41598-023-50017-3

Download PDF

Article
Open access
Published: 17 January 2024

Insights into vehicle conflicts based on traffic flow dynamics

Shengxuan Ding¹,
Mohamed Abdel-Aty¹,
Zijin Wang¹ &
…
Dongdong Wang¹

Scientific Reports volume 14, Article number: 1536 (2024) Cite this article

987 Accesses
1 Citations
Metrics details

Subjects

Abstract

The utilization of traffic conflict indicators is crucial for assessing traffic safety, especially when the crash data is unavailable. To identify traffic conflicts based on traffic flow characteristics across various traffic states, we propose a framework that utilizes unsupervised learning to automatically establish surrogate safety measures (SSM) thresholds. Different traffic states and corresponding transitions are identified with the three-phase traffic theory using high-resolution trajectory data. Meanwhile, the SSMs are mapped to the corresponding traffic states from the perspectives of time, space, and deceleration. Three models, including k-means, GMM, and Mclust, are investigated and compared to optimize the identification of traffic conflicts. It is observed that Mclust outperforms the others based on the evaluation metrics. According to the results, there is a variation in the distribution of traffic conflicts among different traffic states, wide moving jam (phase J) has the highest conflict risk, followed by synchronous flow (phase S), and free flow (phase F). Meanwhile, the thresholds of traffic conflicts cannot be fully represented by the same value through different traffic states. It reveals that the heterogeneity of thresholds is exhibited across traffic state transitions, which justifies the necessity of dynamic thresholds for traffic conflict analysis.

City-scale holographic traffic flow data based on vehicular trajectory resampling

Article Open access 25 January 2023

Exploring the effects of stationary camera spots on inferences drawn from real-time crash severity models

Article Open access 25 November 2022

Multiple vehicle cooperation and collision avoidance in automated vehicles: survey and an AI-enabled conceptual framework

Article Open access 12 January 2023

Introduction

Understanding the dynamics of traffic safety and traffic flow is essential for developing interventions that can reduce the occurrence of vehicle conflicts. These conflicts, often precursors to actual crashes, provide valuable insights into the conditions that may lead to crashes. However, the existing body of research has largely relied on historical crash data for safety evaluation, which has inherent limitations such as inaccurate data, subjective interpretations, and inadequate risk mitigation strategies¹. Moreover, the complexity of driver behavior, a key factor in crashes, is often oversimplified or overlooked in prediction algorithms². On the other hand, SSMs and microscopic traffic data have been proven to be appealing and widely used for analyzing traffic safety performance³. Hence, the goal of this study is to identify conflicts and link them to traffic flow characteristics using empirical trajectory data. To address these limitations, this study sets out with two primary objectives:

To explore the mechanism of traffic conflicts through the lens of macroscopic traffic states. This objective is predicated on the understanding that dynamic traffic states, with their spatiotemporal characteristics, serve as effective indicators for conflict detection. Macroscopic traffic flow characteristics have a profound impact on safety performance⁴, necessitating a deeper examination of how these characteristics correlate with the incidence of traffic conflicts. The transition between traffic states and the relationship of traffic parameters are critical to the causality of traffic conflicts. We aim to analyze traffic flow at various levels, such as Levels of Service (LOS), three-phase theory, and the fundamental diagram, to establish a connection between traffic flow parameters and conflict causality. This approach challenges the conventional reliance on micro-traffic flow features for conflict forecasts and aims to provide a more comprehensive understanding from a macro perspective⁵.

To utilize empirical trajectory data to identify traffic conflicts and determine Surrogate Safety Measures (SSMs) indicator thresholds. This objective seeks to transcend the limitations of previous studies by focusing on the mechanisms of conflict and the inherent heterogeneity in traffic flow, as revealed by high-resolution trajectory data. The selection of different thresholds of various scenarios can help us better understand the correlation between traffic conflicts and traffic flow parameters. The study intends to develop dynamic thresholds for traffic conflict analysis, which is particularly relevant for the algorithm development of automated vehicles (AVs), providing nuanced assessments of conflict severity in relation to traffic states⁶.

By fulfilling these objectives, this study aims to contribute to the field by enhancing the understanding of the relationship between traffic conflicts and traffic flow characteristics, leveraging high-quality trajectory data. This contribution is crucial, as it has the potential to inform the development of more accurate predictive models and safety interventions, ultimately leading to safer road environments.

The structure of the remaining research is outlined as follows: Section “Literature review” presents a comprehensive literature review. The methodology employed in this study is detailed in Section “Methods”. Section “Results” reports on the findings and discussions of the analysis. Finally, Section “Conclusions” concludes with the implications of the study’s findings and suggestions for future research endeavors.

Literature review

Traffic flow and states

The performance of traffic safety like crash occurrence is heavily influenced by traffic flow and their correlation has been studied by several studies. A link between traffic characteristics and daytime freeway crashes is established to confirm the importance of flow variation in traffic safety⁷. High-resolution trajectory data is applied to evaluate heterogeneous crash mechanisms under different traffic states⁸. However, crashes may not occur in many conditions, where safety evaluation is dependent on traffic flow characteristics and traffic conflicts. Probabilistic neural network (PNN) models were separately developed to identify and predict rear-end collisions in both congested flow and free flow scenarios, using loop detector data⁹. A logit model with random parameters and heterogeneity in means and variances was used to investigate the relationship between conflicts and traffic flow characteristics¹⁰. In previous research on traffic flow and states, a three-phase traffic flow theory is developed based on expressway data¹¹. The three-phase traffic flow theory divided traffic states into three categories: free flow(F), synchronous flow(S), and wide moving jam(J). When in a free flow state, traffic flow is at a low density and high speed without disturbance of other vehicles. Furthermore, high flow and speed distinguish synchronous flow. Compared with free flow, the average speed is slower, and the density is higher. While in a wide moving jam flow, the flow and speed tend to be zero, and the density reaches a maximum. Transition processes exist for free flow and synchronous flow (F—S), free flow and wide moving jam (F—J), and synchronous flow and wide moving jam (S—J). Transitions between these three phases can all be first-order transitions. Among them, the transition from free flow to wide moving jam requires two steps. First, free flow transforms into synchronous flow, which then generates wide moving jams. Based on the three-phase theory, traffic states and variables can be chosen to assess the relationship between traffic flow and safety performance¹². Traffic safety analysis with three-phase traffic flow theory is conducted with aggregated traffic flow data¹³.

Traffic conflicts

To comprehensively examine traffic safety concerns that involve drivers, vehicles, and roads, safety surrogate measures (SSMs) have been widely used to measure different dimensions of conflicts¹⁴. The benefits and drawbacks of various SSMs are summarized¹⁵. The performance of SSMs by six indices to calibrate threshold with naturalistic driving data is assessed¹⁶. Fuzzy Surrogate Safety Metrics are presented to distinguish between safe and unsafe situations for rear-end collision¹⁷. Modified SSMs were used to capture the probability and severity of collisions based on simulation¹⁸. Traffic safety is assessed at signalized intersections by simulator validity from perspectives of traffic and safety parameters¹⁹. A series of machine learning algorithms and statistical learning techniques are generally applied to determine factors of conflict and predict the occurrence of conflict. Different network models such as CNN²⁰, LSTM²¹, and DNN²² were proposed to detect and predict traffic conflicts from traffic variables and SSM. Based on statistical methods, conditional logistic regression²³, stratified sampling²⁴, and multiple logistic regression models²⁵ were used to estimate conflict risk. In addition, the Peak Over Threshold (POT) approach²⁶ and Multivariate Extreme Value models²⁷ were also widely used to identify conflict frequency.

Surrogate threshold values

Identifying traffic conflicts and determining thresholds are key for assessing safety performance. Typically, previous studies define thresholds with one value, disregarding their suitability for their studies. Even within the same context, multiple thresholds were proposed for analyzing conflicts. For example, the range of TTC thresholds varied widely from 0.5 s to 6.0s at signalized intersections for rear-end²⁸. It is observed the same problem with PET thresholds as well²⁹. Given the wide variation in the prescribed surrogate thresholds, some researchers have estimated the thresholds empirically. There are several major approaches for measuring conflict thresholds as shown in Table 1. While some studies determine the threshold of SSM based on real crash data, the selection of thresholds of various scenarios can be significantly different. This work addresses this research gap and uses high-resolution trajectory data to analyze three-phase traffic states by different traffic flow characteristics. The main objective of this work was to propose clustering methods, imbalanced data processing, and unsupervised learning evaluation on conflict identification and threshold selection for future research. Thus, these contributions can be applied in different types of locations at various traffic states. This can help us better understand the correlation between traffic conflicts and traffic flow parameters, which may be applied to investigate the differences and associations between microscopic conflict and macroscopic traffic flow.

Table 1 Methods to measure conflict thresholds.

Full size table

Methods

The methodology of this paper is described in the following subsections: identification of traffic states, calculation of SSM, clustering methods, and evaluation. Initially, vehicle trajectory data is analyzed for freeway segments, and traffic flow variables such as flow rate, density, and average speed are calculated to classify traffic states according to the three-phase theory framework. Subsequently, the SSMs are computed for further study. Finally, unsupervised learning models including k-means, GMM, and Mclust are compared to automatically establish SSM thresholds. Figure 1 illustrates the proposed methods.

Data preparation

This study is conducted with the “Citysim Dataset”⁴², an open-source dataset known for its remarkably high resolution of 4K (4096 × 2160) at 30 frames per second, captured from drone videos. Figure 2 illustrates a schematic diagram and an aerial view of the research area. The study area covers 680 m in length and consists of six lanes. During peak hours, a total of 35 minutes of data is available for the entire sample segment, divided into two periods: (1) 5:20 p.m. to 5:35 p.m., and (2) 5:48 p.m. to 6:07 p.m. To begin, vehicles are separated by lanes because traffic conditions differ between lanes.

Following that, we filter out all lane-changing and cut-in behavior to focus on rear-end conflicts. To reduce noise, every 30 frames (1 second) are aggregated to quantify traffic characteristics and moving average method is applied to smooth the data. To streamline the process of locating the subject vehicle and its adjacent vehicles, the expressway has been segmented in both directions. Starting from each ramp and extending 100 meters in either direction, the expressway has been subdivided into 14 sub-segments. Vehicles in each sub-segment are paired to ensure that the trajectory remains continuous. Furthermore, the first and last vehicles in each video are removed. Following the preprocessing of data, many traffic flow characteristics are calculated to identify traffic states. We select space speed as one of the indices to evaluate macro traffic states, which is calculated by the average speed of all vehicles in the road segment. Density is defined as the number of vehicles divided by the length of the road segment. Furthermore, the flow rate is formulated by the average time headway: $q\left(flow \,rate\right)=1/\overline{h }(time \,headway)$. Figure 3 provides a visual representation of vehicular movement westbound across different frames or time intervals. The horizontal axis displays the sequence of frames, sourced from the video recording. The vertical axis measures the distance each vehicle covers. The form of these lines illustrates the vehicle's motion over time: a straight trajectory implies consistent speed, whereas a bending or oscillating one signifies speed fluctuations. The first plot (lane 0 in Fig. 2) shows a prominent cluster of low-speed trajectories towards the beginning (left side) of the chart. The second plot (lane 1 in Fig. 2) displays more diversity in speeds, with many trajectories hovering in the mid-speed range (greens and yellows). The third plot (lane 2 in Fig. 2) reveals a distinct area where vehicles slow down (red zone) on the left.

Identification of traffic state

The traffic flow is categorized into three distinct phases under the three-phase traffic theory: free flow (F), synchronized flow (S), and wide-moving jam (J). Each of these phases can transition into another through specific mechanisms, such as a sudden increase in vehicle density (F to S), the dissipation of congestion (S to F), an increase in congestion leading to a jam (S to J), or the clearing of a jam (J to S). Free Flow (F) is characterized by high speeds and low vehicle density. Vehicles move freely without significant interactions with other vehicles. Synchronized Flow (S) is marked by medium densities and speeds. There is a synchronization in speed among vehicles, leading to closely spaced vehicles moving at similar speeds. Wide Moving Jam (J) involves high density and very low speeds, often leading to complete halts. It is characterized by the presence of "jams" that remain spatially fixed while moving through traffic.

The objective of this work is to connect conflict and traffic flow features, expanding the conventional conflict risk assessment to encompass the traffic flow condition. Figure 4 provides a visual representation of the criteria for traffic state identification. The traffic phase in a wide-moving jam can be determined by analyzing a time series plot that shows the effects of speed, time headway, and traffic flow interruptions. This analysis is conducted with several criteria, such as the average speed, maximum time headway, correlation coefficient between density and flow rate, and the number of vehicles in the phase.

The free-flow (F) phase is characterized by high speed (>12 m/s) and a strong connection between density and flow rate (> 0.5). In contrast, the correlation between density and flow rate in phase S is weak, with a correlation coefficient of less than 0.2. Meanwhile, the speed of Phase S was defined between 8 and 12 m/s. An abrupt change in speed characterizes the scenario that prevails between transitional phases F, S, and J⁴³. There are three factors used to determine phase J. The low average speed (less than 8 m/s) was the first requirement. The maximum time headway (3s) was the second requirement. The third criterion involved a microcosmic interruption of the flow of traffic within a large moving jam⁴⁴: (I<30 s), which is comparable to the quantity of vehicles in the jam. The calculation of I is shown in equation (1). Here, we set the I threshold as 30 to distinguish phase J.

A commonly used equation for calculating space speed (also known as space mean speed) in a stream of vehicles is given by Eq. 1:

$$\mathrm{Space \,\,Mean \,\,Speed }({\text{SMS}}) =\frac{\mathrm{Total\,\, Time \,\,Taken \,\,by \,\,All \,\,Vehicles }}{\mathrm{Total \,\,Distance \,\,Travelled\,\, by \,\,All \,\,Vehicles}}.$$

This formula can be further detailed as Eq. 2:

$$\mathrm{SMS }=\frac{{\sum }_{i=1}^{n}{d}_{i}}{{\sum }_{i=1}^{n}{t}_{i}}$$

where n = number of vehicles. ${d}_{i}$ = distance travelled by the ${i}{th}$ vehicle. ${t}_{i}$ = travel time of the ${i}{th}$ vehicle.

$$I=\frac{{\tau }_{J}}{{\tau }_{jam}},$$

(1)

where ${\uptau }_{{\text{J}}}$ is the duration of wide moving jam. ${\uptau }_{{\text{jam}}}$ is the the mean time in vehicle to pass the downstream of jam.

Surrogate safety measures

Traffic conflicts are used to predict interaction and the possibility of a crash if vehicles remain in their current state. Surrogate safety measures with a risk threshold can be used to assess conflicts⁴⁵. It should be noted that there is no perfect conflict indicator for evaluating global conflict events. By classifying conflict indicators into three distinct types, a more profound comprehension of the interplay between conflicts and traffic flow can be attained. Rear-end collisions can be assessed using Time to Collision (TTC) as an appropriate temporal proximity indicator, providing insights into crash frequency and severity. To detect small crash probabilities and consider the road surface's friction coefficient in assessing pavement characteristics, the potential index for collision with urgent deceleration (PICUD) serves as a valuable spatial proximity indicator. Additionally, deceleration rate to avoid collision (DRAC), which combines a vehicle's maximum available deceleration rate, has been justified to be a reliable kinematic indicator for predicting rear-end crash risk. Figure 5a shows the speed and distance between two vehicles, which is crucial for calculating the TTC. The equation provided calculates TTC based on the distance between the vehicles and their speed difference. Figure 5b shows the calculation of PICUD, that leading vehicle braking and the following vehicle with a reaction gap, which is the distance that the following vehicle travels in the time it takes for the driver to react. Figure 5c illustrates the calculation of DRAC, which quantifies the deceleration requirement for the following vehicle to present a collision. It considers the speed difference between the following vehicle and the leading one based on the distance between them excluding the length of the following vehicle. As a result, conflict measures must be chosen based on the research context. This research simplifies the computation process as well as identifies conflicts and their thresholds in a reliable manner. It should be noted that we chose the closest bounding box point between two vehicles rather than the center of the trajectory, which is more accurate to compute the SSM.

Time-to-collision (TTC)

Originally, TTC referred to the amount of time left before two vehicles would collide if they continued their current trajectory and maintained their speed difference. TTC is calculated using the equation (2).

$$TTC=\frac{{S}_{i}}{\overrightarrow{|{ V}_{i}}-\overrightarrow{{ V}_{i-1}}|}=\frac{{x}_{i-}{x}_{i-1}}{\overrightarrow{|{ V}_{i}}-\overrightarrow{{ V}_{i-1}}|},$$

(2)

where ${S}_{i}$ The distance between two vehicles, from rear bumper to front bumper. ${S}_{i}=$ ${x}_{i-}{x}_{i-1}$, $x$= the position of vehicles. ${{\text{V}}}_{{\text{i}}}$ The speed of a leading vehicle. ${{\text{V}}}_{{\text{i}}-1}$ The speed of a following vehicle.

Potential index for collision with urgent deceleration (PICUD)

PICUD is a measure that calculates the variation in distance between two consecutive vehicles in instances where the leading vehicle engages its emergency brakes. This calculation of PICUD is represented by assessing the change in the gap between the two vehicles during such emergency braking scenarios⁴⁶.

$${\text{PICUD}}=\frac{{{{\text{V}}}_{{\text{i}}}}^{2}-{{{\text{V}}}_{{\text{i}}-1}}^{2}}{2{\text{a}}}+{S}_{i}-{{\text{V}}}_{{\text{i}}-1}\Delta {\text{t}},$$

(3)

where ${\text{a}}$ The urgent deceleration of a leading vehicle. $\Delta {\text{t}}$ The reaction time of a following vehicle.

Deceleration rate to avoid the crash (DRAC)

DRAC involves dividing the difference in speed between a following vehicle and a leading vehicle by the time interval between them. DRAC represents the rate at which the following vehicle needs to slow down to prevent a collision with the leading vehicle. The calculation is in equation (4):

$${\text{DRAC}}=\frac{{{({\text{V}}}_{{\text{i}}}-{{\text{V}}}_{{\text{i}}-1})}^{2}}{2({S}_{i}-{{\text{L}}}_{{\text{i}}-1})},$$

(4)

where ${{\text{L}}}_{{\text{i}}-1}$ The length of the following vehicle.

Clustering methods and evaluation

In this section, we suggest evaluating the representation of SSM through unsupervised learning using three clustering models: k-means, GMM clustering, and Mclust. Since there are no ground-truth labels for traffic conflicts, internal evaluation methods and external evaluation methods are the two broad categories to evaluate clustering results. The external evaluation method assesses the quality of the clustering results while knowing the true label (ground truth), whereas the internal evaluation method does not rely on external information but only on the clustering results and sample attributes⁴⁷. To assess the clustering outcomes, this research relies on internal metrics such as the Silhouette Coefficient, Calinski-Harabasz Score, and Davies-Bouldin Score.

A smaller ratio of Silhouette Coefficient Index indicates a greater distance between the sample point's cluster structure and the nearest cluster structure, which implies a better clustering result⁴⁸. In addition, as the Calinski-Harabasz (CH) index decreases, the distance between clusters becomes smaller, suggesting a poorer quality of clustering⁴⁹. The possible values of CH index range from 0 to infinity. Lastly, the range of the Davies-Bouldin Index is between [0, +∞). The clustering method performs good when the index is small⁵⁰.fs

Results

Description of traffic state

The distribution of traffic flow variables at each traffic state is displayed in Table 2. To minimize interference, an output SSM sequence describing the interaction between each pair of vehicles was generated with a selected time step of one second (30 frames). To ensure that the simultaneity and variability of traffic states were accurately captured, the algorithm employed a sliding time-window analysis that allowed for the dynamic categorization of traffic states at any given moment. Classifying traffic states every 30 seconds. Summing the duration that each state was identified within the time windows for the entire study period.

Table 2 Description of traffic state.

Full size table

The results show that phase S has the most cars over the longest period, followed by phases J, and then F. This phenomenon occurs because there is little abrupt turbulence throughout these three steady stages. Due to traffic congestion, phase J has the lowest speed and highest density, whereas phase F has the highest speed and lowest density among them. Phase S has the maximum flow rate owing to the large number of cars during this phase. Between phases J and F, phase S has a medium speed and density. In terms of transitional states, they do not last for long, and the levels of traffic flow variables were mild, compared to stable phases (F, S, J), which include fewer vehicles. These states have higher standard deviations than other states because they are undergoing unstable transitions and turbulence of traffic flow.

Identification of traffic conflicts and thresholds

To validate the identification of traffic conflicts with different thresholds across various traffic states, matched traffic flow data of corresponding road segments is captured. The upstream refers to the traffic flow that occurs before a conflict point when considering the direction of traffic movement. To clarify, it starts when we first record information about each new vehicle that is spotted during a set period (which is 30 seconds for our study) as it enters the section of the road we're observing. Downstream refers to the traffic flow that occurs after the conflict point, again considering the direction of traffic. If we fail to locate the vehicle ID after the vehicle has left the road section at the downstream point, then the downstream location will keep a record of the last known details for that vehicle. The time of conflict is recorded by minTTC, which represents the smallest TTC value recorded among two distinct vehicle trajectories. The number of conflicts is represented by minTTC in our work. It signifies the most critical moment of potential collision between individual pairs of vehicles. Non-conflict observations are of utmost significance in studies as they showcase distinct characteristics that help discern conflict-prone situations by clustering methods. In this research, non-conflict observations refer to instances where no conflicts arise during the respective period and the subsequent timestamp (30s). Investigate the average values of SSMs for each identified cluster to establish threshold levels, which is shown in Table 3 with corresponding traffic index. For further study, it can be extended to delineate the perimeters separating clusters by examining the spread of SSMs for each grouping. Scrutinize the space spanned by multiple variables to ascertain the demarcation points for each one. These points typically occur where a cluster’s density begins to fade, potentially equidistant from the central points of adjacent clusters. Employing the clusters’ statistical characteristics, such as specific percentiles of SSMs within a cluster, can aid in setting benchmarks. These benchmarks can categorize varying degrees of traffic incident severity, ranging from high-risk to low-risk, and extending to non-conflict scenarios.

Table 3 Statistic summary of traffic conflicts and corresponding parameters.

Full size table

Three models are compared from different theoretical perspectives to examine the performance of classification by unsupervised clustering. The input considers all traffic conflict variables, including PICUD, TTC, and DRAC. The Mclust model performs better than other clustering techniques, with results shown in Table 4.

Table 4 Clustering performance.

Full size table

Compared with all states with pre-set thresholds, using the method proposed in our work can detect more conflicts. The thresholds of TTC and DRAC are higher than the common value in previous study since it considers the traffic condition of Freeway segment. When considering the specific traffic states, we can identify more details about traffic characteristics across their transition and link the relation between traffic conflicts and traffic flow characteristics (Fig. 6). Within Fig. 6a, the variation in headway—defined as the time gap between vehicles—presents a notable pattern for the observation period. Instances of decreased headway suggest a tightening of the gap between vehicles, which typically is associated with increased traffic density or a decrease in traffic speed, which is consistent with Figs. 6b and c. Such repeated occurrences of reduced headway might signal regular intervals of traffic congestion, whereas prolonged periods of expanded headway are indicative of less congested, free-flowing traffic conditions. Figure 6b reveals that greater vehicle density often leads to a reduction in speed, a trend that does not necessarily equate to a decrease in headway provided that the traffic flow remains consistent. This implies that even in dense traffic conditions, if the flow is steady and uninterrupted, vehicles may maintain a uniform headway. In the case of Fig. 6c, the inverse relationship between vehicle density and speed is depicted, with a scattered distribution of data points suggesting a variability in traffic behavior. At lower densities, vehicle speeds are high and diverse, pointing to a free-flow state. Conversely, as vehicle density rises, the spread of speed narrows, indicating a constrained flow and potential traffic congestion. This transition and narrowing of speed variation may reflect the onset of congested traffic conditions, characterized by reduced and more uniform vehicle speeds. The time-speed relationship (Fig. 6d) is characterized by its variability, with alternating peaks and valleys suggesting fluctuations that could be attributable to common traffic patterns, such as heavy congestion, or other transient influences on traffic velocity.

The thresholds vary significantly among these states. Phase J has the lowest threshold of TTC, and the traffic characteristics exhibit the same tendency of distribution. The average speeds and volumes of upstream and downstream are the smallest among all the states and exhibits lowest speed changes. The difference in volume between upstream and downstream is smaller than in other states, which indicates the lower variablity in traffic flow. The high coefficient of variation for speed is larger during this phase, which indicates that the speed is more spread out in relation to the mean, leading to higher variability and less homogeneity. Since phase F has the greatest threshold of TTC with the same tendency of average speed and flow, which allows drivers to have more time to respond to emerging systems, making it safer than other traffic states with fewer conflicts. The smallest coefficient of variation for speed suggests that speed has less dispersion and is more homogeneous during phase F. When a vehicle is in phase S, it keeps a similar deceleration. Due to large traffic volumes in phase S, vehicles in this phase have large thresholds, which means it requires more distance for a vehicle to avoid conflict. Other than stable phases, the transition between these phases has a larger threshold compared with phase J and S. This is because the vehicle tends not to maintain the same deceleration before a crash at these phases. The turbulence of deceleration will result in changes in the remaining distance between the leading and following vehicles. The larger standard deviation and coefficient of variation of average speed indicates a more extensive spread of the speeds of different states, reflecting greater variability across the transition of traffic states. DRAC aligns with our observations in TTC. Compared with the value (3.5/s^2) selected in most research, the threshold of DRAC is more accurate and sensitive to the changes in the flow and speed of vehicles. To be noticed, the threshold of PICUD is selected as 1 in all the states, which is a comparison of thinking distance and braking distance, expressed as a ratio or percentage.

Based on the PICUD, TTC, and DRAC values obtained from the Citysim Dataset, phase J poses the highest risk of conflict when traffic flow is extremely heavy and congested. Phase S follows with the second-highest conflict risk, which may be due to the high density and small space headway between surrounding vehicles. The maximum space headway between vehicles explains why phase F has fewer conflicts than S and J (Fig. 6). Moreover, due to the various traffic features, the transitional stages between these phases experience more conflicts than phase F. Hazardous situations may arise during the transitional state, as drivers tend to alter their behavior by decelerating in response to stop-and-go waves, which can exacerbate conflicts. The risk of conflict is higher during the S→J and F→S transitional states than in other transitional states. The high flow rate and vehicle speed during the F→S transitional state implies significantly more dangerous situation than phase F.

Conclusions

In this study, the three-phase traffic theory is utilized to establish a link between macroscopic traffic flow states and microscopic traffic conflicts. By analyzing microscopic traffic trajectory data, an unsupervised clustering method is proposed in this research to detect traffic conflicts and establish the SSM thresholds based on the three-phase framework. Initially, traffic states and their transitions are identified using the three-phase theory and traffic characteristics. Conflicts in each state of traffic were then assessed using SSMs including TTC, DRAC, and PICUD. After comparing various clustering methods, the conflicts and thresholds were clustered using Mclust method. The study demonstrates that phase J poses the highest risk of conflicts based on the conflict outcomes, with phase S following closely due to its substantial sample size. On the other hand, phase F exhibits better performance than the other phases. The transitional states exhibit comparable levels of conflict risk, with the S→J and F→S transitions displaying more conflicts than the other transitions. These results suggest that the distribution of traffic conflict varies depending on the traffic state. Meanwhile, the thresholds of traffic conflicts cannot be fully represented by the same value through different traffic states.

Additionally, our method offers a novel approach to traffic conflict studies, which typically rely on predefined thresholds that may not reflect the complexity of traffic states, demonstrating the advantage of diverse thresholds over one. In terms of technology development, the advanced detection of conflicts using bounding boxes rather than centers of vehicles can be integrated into the development of autonomous vehicle (AV) safety systems. These systems would benefit from a more granular understanding of the vehicle’s surroundings, thereby enhancing the AV's ability to respond to potential hazards. By incorporating this method, traffic management systems can dynamically adjust their conflict detection mechanisms based on the prevailing traffic phase, thus enhancing the accuracy and timeliness of safety measures. This is particularly useful for intelligent transportation systems which can integrate these findings in real-time to improve traffic safety and flow. Our approach is also more generalizable and feasible in traffic safety applications since it can be employed to study crash causality without relying on actual crash data, which is particularly valuable when crash data is scarce or noisy. Further research with larger and more diverse datasets could focus on the following research directions. First, more motion features (e.g. converging/diverging trend of traffic flow, microscope vehicle movement behavior patterns) could be factored into the traffic state partitioning approach to help better reveal the hidden information during traffic propagation. Second, several more surrogate safety measure indices can be compared at the boundaries to extend the severity of traffic conflicts in the disaggregate real-time safety analysis.

Data availability

The datasets used during the current study are available on GitHub: https://github.com/UCF-SST-Lab/UCF-SST-CitySim1-Dataset.

References

Islam, Z. & Abdel-Aty, M. Traffic conflict prediction using connected vehicle data. Anal. Methods Accid. Res. 39, 100275. https://doi.org/10.1016/j.amar.2023.100275 (2023).
Article Google Scholar
Islam, Z., Abdel-Aty, M., Goswamy, A., Abdelraouf, A. & Zheng, O. Effect of signal timing on vehicles’ near misses at intersections. Sci. Rep. 13, 9065. https://doi.org/10.1038/s41598-023-36106-3 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Peng, Y., Abdel-Aty, M., Shi, Q. & Yu, R. Assessing the impact of reduced visibility on traffic crash risk using microscopic data and surrogate safety measures. Transport. Res. C Emerg. Technol. 74, 295–305 (2017).
Article Google Scholar
Kuang, Y., Qu, X. & Yan, Y. Will higher traffic flow lead to more traffic conflicts? A crash surrogate metric based analysis. PLoS One 12, e0182458 (2017).
Article PubMed PubMed Central Google Scholar
Yasmin, S., Eluru, N., Wang, L. & Abdel-Aty, M. A. A joint framework for static and real-time crash risk analysis. Anal. Methods Accid. Res. 18, 45–56 (2018).
Google Scholar
Wu, Y., Abdel-Aty, M., Zheng, O., Cai, Q. & Zhang, S. Automated safety diagnosis based on unmanned aerial vehicle video and deep learning algorithm. Transport. Res. Rec. 2674, 350–359 (2020).
Article Google Scholar
Abdel-Aty, M. & Abdalla, M. F. Linking roadway geometrics and real-time traffic characteristics to model daytime freeway crashes: Generalized estimating equations for correlated data. Transport. Res. Rec. 1897, 106–115 (2004).
Article Google Scholar
Wang, L., Zou, L., Abdel-Aty, M. & Ma, W. Expressway rear-end crash risk evolution mechanism analysis under different traffic states. Transportmetr. B Transport Dyn. https://doi.org/10.1080/21680566.2022.2101565 (2022).
Article Google Scholar
Abdel-Aty, M. & Pande, A. Identifying crash propensity using specific traffic speed conditions. J. Saf. Res. 36, 97–108 (2005).
Article Google Scholar
Yuan, C. et al. Using traffic flow characteristics to predict real-time conflict risk: A novel method for trajectory data analysis. Anal. Methods Accid. Res. 35, 100217 (2022).
Google Scholar
Kerner, B. S. Three-phase traffic theory and highway capacity. Phys. A Stat. Mech. Appl. 333, 379–440 (2004).
Article MathSciNet Google Scholar
Liu, T., Li, Z., Liu, P., Xu, C. & Noyce, D. A. Using empirical traffic trajectory data for crash risk evaluation under three-phase traffic theory framework. Accid. Anal. Prev. 157, 106191. https://doi.org/10.1016/j.aap.2021.106191 (2021).
Article PubMed Google Scholar
Xu, C., Liu, P., Wang, W. & Li, Z. Safety performance of traffic phases and phase transitions in three phase traffic theory. Accid. Anal. Prev. 85, 45–57 (2015).
Article PubMed Google Scholar
Sayed, T. & Zein, S. Traffic conflict standards for intersections. Transport. Plan. Technol. 22, 309–323 (1999).
Article Google Scholar
Wang, C., Xie, Y., Huang, H. & Liu, P. A review of surrogate safety measures and their applications in connected and automated vehicles safety modeling. Accid. Anal. Prev. 157, 106157 (2021).
Article PubMed Google Scholar
Lu, C. et al. Performance evaluation of surrogate measures of safety with naturalistic driving data. Accid. Anal. Prev. 162, 106403 (2021).
Article ADS PubMed Google Scholar
Mattas, K. et al. Fuzzy surrogate safety metrics for real-time assessment of rear-end collision risk. A study based on empirical observations. Accid. Anal. Prev. 148, 105794 (2020).
Article PubMed Google Scholar
Ozbay, K., Yang, H., Bartin, B. & Mudigonda, S. Derivation and validation of new simulation-based surrogate safety measure. Transport. Res. Rec. J. Transport. Res. Board 2083, 105–113. https://doi.org/10.3141/2083-12 (2008).
Article Google Scholar
Tarko, A. P. Estimating the expected number of crashes with traffic conflicts and the Lomax Distribution–A theoretical and numerical exploration. Accid. Anal. Prev. 113, 63–73 (2018).
Article PubMed Google Scholar
Abdel-Aty, M., Wu, Y., Zheng, O. & Yuan, J. Using closed-circuit television cameras to analyze traffic safety at intersections based on vehicle key points detection. Accid. Anal. Prev. 176, 106794 (2022).
Article PubMed Google Scholar
Zhao, J., Liu, P., Xu, C. & Bao, J. Understand the impact of traffic states on crash risk in the vicinities of type A weaving segments: A deep learning approach. Accid. Anal. Prev. 159, 106293 (2021).
Article PubMed Google Scholar
Formosa, N., Quddus, M., Ison, S., Abdel-Aty, M. & Yuan, J. Predicting real-time traffic conflicts using deep learning. Accid. Anal. Prev. 136, 105429. https://doi.org/10.1016/j.aap.2019.105429 (2020).
Article PubMed Google Scholar
Abdel-Aty, M., Uddin, N. & Pande, A. Split models for predicting multivehicle crashes during high-speed and low-speed operating conditions on freeways. Transport. Res. Rec. 1908, 51–58 (2005).
Article Google Scholar
Harb, R., Radwan, E., Yan, X., Pande, A. & Abdel-Aty, M. Freeway work-zone crash analysis and risk identification using multiple and conditional logistic regression. J. Transport. Eng. 134, 203–214. https://doi.org/10.1061/(ASCE)0733-947X(2008)134:5(203) (2008).
Article Google Scholar
Yan, X., Radwan, E. & Abdel-Aty, M. Characteristics of rear-end accidents at signalized intersections using multiple logistic regression model. Accid. Anal. Prev. 37, 983–995. https://doi.org/10.1016/j.aap.2005.05.001 (2005).
Article PubMed Google Scholar
Arun, A., Haque, M. M., Washington, S., Sayed, T. & Mannering, F. How many are enough?: Investigating the effectiveness of multiple conflict indicators for crash frequency-by-severity estimation by automated traffic conflict analysis. Transport. Res. C Emerg. Technol. 138, 103653 (2022).
Article Google Scholar
Fu, C. & Sayed, T. Comparison of threshold determination methods for the deceleration rate to avoid a crash (DRAC)-based crash estimation. Accid. Anal. Prev. 153, 106051 (2021).
Article PubMed Google Scholar
Abdel-Aty, M., Wang, Z., Zheng, O. & Abdelraouf, A. Advances and applications of computer vision techniques in vehicle trajectory generation and surrogate traffic safety indicators. Accid. Anal. Prev. 191, 107191. https://doi.org/10.1016/j.aap.2023.107191 (2023).
Article PubMed Google Scholar
Zheng, L., Ismail, K. & Meng, X. Investigating the heterogeneity of postencroachment time thresholds determined by peak over threshold approach. Transport. Res. Rec. J. Transport. Res. Board 2601, 17–23. https://doi.org/10.3141/2601-03 (2016).
Article Google Scholar
Zheng, L., Ismail, K., Sayed, T. & Fatema, T. Bivariate extreme value modeling for road safety estimation. Accid. Anal. Prev. 120, 83–91 (2018).
Article PubMed Google Scholar
Zhao, P. & Lee, C. Assessing rear-end collision risk of cars and heavy vehicles on freeways using a surrogate safety measure. Accid. Anal. Prev. 113, 149–158 (2018).
Article PubMed Google Scholar
Arun, A., Haque, M. M., Bhaskar, A., Washington, S. & Sayed, T. A systematic mapping review of surrogate safety assessment using traffic conflict techniques. Accid. Anal. Prev. 153, 106016 (2021).
Article PubMed Google Scholar
Yang, K., Yu, R., Wang, X., Quddus, M. & Xue, L. How to determine an optimal threshold to classify real-time crash-prone traffic conditions?. Accid. Anal. Prev. 117, 250–261 (2018).
Article PubMed Google Scholar
Wang, X., Khattak, A. J., Liu, J., Masghati-Amoli, G. & Son, S. What is the level of volatility in instantaneous driving decisions?. Transport. Res. C Emerg. Technol. 58, 413–427 (2015).
Article Google Scholar
Pascucci, F., Rinke, N., Schiermeyer, C., Berkhahn, V., & Friedrich, B. A discrete choice model for solving conflict situations between pedestrians and vehicles in shared space. (2017).
Ghanipoor Machiani, S. & Abbas, M. Safety surrogate histograms (SSH): A novel real-time safety assessment of dilemma zone related conflicts at signalized intersections. Accid. Anal. Prev. 96, 361–370. https://doi.org/10.1016/j.aap.2015.04.024 (2016).
Article PubMed Google Scholar
Saunier, N., & Sayed, T. Clustering vehicle trajectories with hidden Markov models application to automated traffic safety analysis. In The 2006 IEEE international joint conference on neural network proceedings. 4132-4138 (IEEE, 2006).
Tageldin, A., Zaki, M. H. & Sayed, T. Examining pedestrian evasive actions as a potential indicator for traffic conflicts. IET Intell. Transport Syst. 11, 282–289. https://doi.org/10.1049/iet-its.2016.0066 (2017).
Article Google Scholar
Chauhan, R., Dhamaniya, A. & Arkatkar, S. Challenges in rear-end conflict-based safety assessment of highly disordered traffic conditions. Transport. Res. Rec. 2677, 624–634. https://doi.org/10.1177/03611981221108156 (2023).
Article Google Scholar
Zheng, L., Sayed, T., & Essa, M. Validating the bivariate extreme value modeling approach for road safety estimation with different traffic conflict indicators. Accid. Anal. Prev. 123, 314–323 (2019).
Article PubMed Google Scholar
Guo, Y., Sayed, T., & Zheng, L. A hierarchical bayesian peak over threshold approach for conflict-based before-after safety evaluation of leading pedestrian intervals. Accid. Anal. Prev. 147, 105772 (2020).
Article PubMed Google Scholar
Zheng, O. et al. CitySim: A drone-based vehicle trajectory dataset for safety oriented research and digital twins. Preprint at https://arXiv.org/arXiv:2208.11036 (2022).
Zhu, Y., Wu, Q. & Xiao, N. Research on highway traffic flow prediction model and decision-making method. Sci. Rep. 12, 19919. https://doi.org/10.1038/s41598-022-24469-y (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Kerner, B. S. Introduction to Modern Traffic Flow Theory and Control: The Long Road to Three-Phase Traffic Theory (Springer Science & Business Media, 2009).
Book Google Scholar
Nie, B. et al. Safety envelope of pedestrians upon motor vehicle conflicts identified via active avoidance behaviour. Sci. Rep. 11, 3996. https://doi.org/10.1038/s41598-021-82331-z (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Uno, N., Iida, Y., Itsubo, S. & Yasuhara, S. In Proc. of the 13th mini-EURO Conference-Handling Uncertainty in the Analysis of Traffic and Transportation Systems, Bari, Italy, 10–13.
Francis, J. et al. Unsupervised feature extraction of aerial images for clustering and understanding hazardous road segments. Sci. Rep. 13, 10922. https://doi.org/10.1038/s41598-023-38100-1 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Aranganayagi, S. & Thangavel, K. In International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007), 13–17 (IEEE, 2007).
Wang, X. & Xu, Y. IOP Conference Series Materials Science and Engineering 052024 (IOP Publishing, 2019).
Google Scholar
Xiao, J., Lu, J. & Li, X. Davies Bouldin index based hierarchical initialization K-means. Intell. Data Anal. 21, 1327–1338 (2017).
Article Google Scholar

Download references

Acknowledgments

Article processing charges were provided in part by the UCF College of Graduate Studies Open Access Publishing Fund.

Author information

Authors and Affiliations

Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL, 32816, USA
Shengxuan Ding, Mohamed Abdel-Aty, Zijin Wang & Dongdong Wang

Authors

Shengxuan Ding
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed Abdel-Aty
View author publications
You can also search for this author in PubMed Google Scholar
Zijin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Dongdong Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study conception and design: S.D., M.A.A., Z.W., D.W.; data collection and processing: S.D., M.A.A.; analysis and interpretation of results: S.D., Z.W., D.W.; draft manuscript preparation S.D., M.A.A., Z.W., D.W. All authors reviewed the results and approved the final version of the manuscript.

Corresponding author

Correspondence to Shengxuan Ding.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ding, S., Abdel-Aty, M., Wang, Z. et al. Insights into vehicle conflicts based on traffic flow dynamics. Sci Rep 14, 1536 (2024). https://doi.org/10.1038/s41598-023-50017-3

Download citation

Received: 30 July 2023
Accepted: 14 December 2023
Published: 17 January 2024
DOI: https://doi.org/10.1038/s41598-023-50017-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

City-scale holographic traffic flow data based on vehicular trajectory resampling

Exploring the effects of stationary camera spots on inferences drawn from real-time crash severity models

Multiple vehicle cooperation and collision avoidance in automated vehicles: survey and an AI-enabled conceptual framework

Introduction

Literature review

Traffic flow and states

Traffic conflicts

Surrogate threshold values

Methods

Data preparation

Identification of traffic state

Surrogate safety measures

Time-to-collision (TTC)

Potential index for collision with urgent deceleration (PICUD)

Deceleration rate to avoid the crash (DRAC)

Clustering methods and evaluation

Results

Description of traffic state

Identification of traffic conflicts and thresholds

Conclusions

Data availability

References

Acknowledgments

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links