Forecasting the evolution of fast-changing transportation networks using machine learning

Lei, Weihua; Alves, Luiz G. A.; Amaral, Luís A. Nunes

doi:10.1038/s41467-022-31911-2

Download PDF

Article
Open access
Published: 22 July 2022

Forecasting the evolution of fast-changing transportation networks using machine learning

Nature Communications volume 13, Article number: 4252 (2022) Cite this article

6626 Accesses
10 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Transportation networks play a critical role in human mobility and the exchange of goods, but they are also the primary vehicles for the worldwide spread of infections, and account for a significant fraction of CO₂ emissions. We investigate the edge removal dynamics of two mature but fast-changing transportation networks: the Brazilian domestic bus transportation network and the U.S. domestic air transportation network. We use machine learning approaches to predict edge removal on a monthly time scale and find that models trained on data for a given month predict edge removals for the same month with high accuracy. For the air transportation network, we also find that models trained for a given month are still accurate for other months even in the presence of external shocks. We take advantage of this approach to forecast the impact of a hypothetical dramatic reduction in the scale of the U.S. air transportation network as a result of policies to reduce CO₂ emissions. Our forecasting approach could be helpful in building scenarios for planning future infrastructure.

Reconstructing commuters network using machine learning and urban indicators

Article Open access 13 August 2019

Modelling epidemic spread in cities using public transportation as a proxy for generalized mobility trends

Article Open access 16 April 2022

Estimation of Regional Economic Development Indicator from Transportation Network Analytics

Article Open access 14 February 2020

Introduction

Transportation networks are critical infrastructures and one of the foundations of modern globalized societies. The air transportation network alone is responsible for the mobility of millions of people every day across the world^1,2. However, transportation networks are also responsible, indirectly, for the propagation of diseases, such as influenza and, recently, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)^3,4,5. In addition to their role in enabling pandemics, transportation is also a significant contributor to greenhouse gas emissions, accounting for about 29% of the total U.S. greenhouse gas emissions and 14% of the total global greenhouse gas emissions^6,7. Among all transportation sectors, air transportation contributes 9% of U.S. greenhouse gas emissions and 10.6% of global greenhouse gas emissions^6,7. Even more concerning, at a time when global greenhouse gas emissions must be reduced, emissions from the transportation sector are on the rise^8,9. However, as the consequences of climate change become inescapable⁹, it is inevitable that dramatic changes in how transportation networks are organized will occur¹⁰. Thus, it is crucial for planners to be able to forecast how transportation networks could evolve in the coming decades.

The study of connection dynamics in networked systems, including transportation networks, has yielded significant insights^{11,12,13,14,15}. However, the study of the temporal dynamics of the edges in transportation networks remains underdeveloped. A significant challenge for transportation networks is that their edge dynamics are the outcome of concurrent actions of businesses motivated by competition and profit, governments motivated by national interests, and historical contingencies.

Recently, machine learning (ML) approaches have been successfully applied in the study of human mobility^16,17, sustainability of transportation infrastructure¹⁸, and the impact of COVID-19 on gasoline demands¹⁹. Here, we take a similar approach to probe the dynamics of the edges in transportation networks. For mature transportation networks, structure changes are primarily due to the addition and removal of edges, the addition and removal of nodes being much less significant. In the past, the addition of edges has been studied mainly in the context of missing link prediction^20,21 and network growth models²², whereas the removal of edges has been studied in contexts, such as network percolation^23,24, attack and error tolerance²⁵, dismantling strategies²⁶, catastrophic failures²⁷, synchronization and phase-transitions²⁸, pruning processes based on removal of underutilized links²⁹, and cascading failures³⁰, to name a few. However, edge removals in real-world temporal networks do not grow unbound as in percolation or dismantling processes and the mechanism determining the removal of edges is not well understood. To address this knowledge gap and because of the practical implications of the problem, we apply machine learning algorithms to the challenge of predicting edge removals on transportation networks.

We investigate the edge dynamics of two large mature but fast-changing transportation networks: the Brazilian inter-cities bus transportation network (Brazil Bus net)^31,32 and the U.S. domestic air transportation network (U.S. Air net)³³. We do not consider here rail transportation networks because they tend to change very slowly. Using ML algorithms to classify edges by their topological properties, we find statistically significant differences between features of edges retained and features of edges removed. Further, we develop an ML model that enables us to forecast removed edges. We also test the robustness of our model to large external shocks, such as COVID-19 travel restrictions. We use this model to simulate the effect of a reduction in the number of connections in the U.S. domestic air transportation network and discuss the implications of our findings on building alternative scenarios for planning future infrastructure.

Results

Transportation networks

We collected data for the Brazilian inter-city bus transportation network (Brazil Bus net) and the United States domestic air transportation network (U.S. Air net) at a monthly temporal resolution. In the Brazil Bus net, the nodes represent cities with bus stops on a bus route. An undirected edge ${e}_{ij}^{m}$ connects nodes i and j if there is at least one bus route connecting them at some point during the month m. The number of buses during the month m is used as the weight for edge ${e}_{ij}^{m}$. We construct both a weighted and an unweighted undirected temporal network {G₁ → G₂ → . . . → G_T}, where G_m represents the network snapshot constructed with data from month m.

In the U.S. Air net, the nodes represent U.S. cities with airports. An edge indicates that at least one airline directly connected the two cities during the monthly observation window (Fig. 1a). In the weighted network, the weight of an edge is the number of flights during the monthly observation window. Figure 1b shows that a significant fraction of existing edges is removed from the network from one snapshot to another.

**Fig. 1: Edge dynamics in two countrywide transportation networks.**

Machine learning prediction

We formulate the question of how to predict which edges will be removed as a supervised classification problem. In a network snapshot G_m, we assign to edges one of three states: ‘added’, ‘retained’, or ‘removed’. Added edges were not present at the beginning of the monthly observation window but are present at the end. Retained edges are present at the beginning and end of the monthly observation window. Removed edges are present at the beginning but not at the end of the monthly observation window. To achieve the goal of predicting removed edges, we only need to consider edges already present in G_m. Those edges can only be ‘retained’ or ‘removed’. Therefore, our problem is reduced to a binary classification where the task is to determine if an edge is retained or removed in a given snapshot. The features of an edge can be extracted from G_m and represented as a feature vector ${{{{{{{{\bf{X}}}}}}}}}_{ij}^{m}$. Thus, we can write the probability of an edge ${e}_{ij}^{m}$ being removed as:

$${{{{{{{\bf{Prob}}}}}}}}\left({e}_{ij}^{m}=removed\right)=f\left({{{{{{{{\bf{X}}}}}}}}}_{ij}^{m}\right).$$

(1)

To test our hypothesis that removed edges are significantly different in their topological features from those of retained edges, we randomly select 70% of retained and removed edges in a selected snapshot for inclusion in the training set. As illustrated in Fig. 2a, if a model is trained with G_m, one can perform two different tests depending on whether the testing edges are selected from the same snapshot as the training edges. In the simultaneous test, we test on edges from the same snapshot as the training edges. In the non-simultaneous test, we test on edges from snapshots that come after the training snapshot. This test evaluates the similarity of removal dynamics for different snapshots.

**Fig. 2: Performance of machine learning models for predicting retained and removed edges in a snapshot of a transportation network.**

Features

Numerous features could be used to characterize an edge in a network. In much of the literature, it is assumed that edge weights are not available²⁰. Thus we separately study the impacts of edge weight and edge topological features on the predictability of edge removals. We consider a subset of possible unweighted topological features used widely in the link prediction literature (Table 1). Most are local properties³⁴. Therefore, one could make predictions even without knowing the structure of the entire network.

Table 1 Considered features: Γ_i refers to the set of neighbors of node i. k_i = ∣Γ_i∣ is the degree of node i⁴⁰.

Full size table

To illustrate the differences between the features of retained and removed edges, we use data from the January 2014 snapshot for both transportation networks and present the distributions of those 11 unweighted topological features and the weight for both retained and removed edges. We compared the feature samples of retained and removed edges using the Kolmogorov-Smirnov statistics, a test for the null hypothesis that two samples are drawn from the same continuous distribution. Because we make multiple comparisons, we used Bonferroni corrections on the significance level, i.e. α = 0.05/12, where 12 is the number of comparisons. We can reject the hypothesis that the features of retained and removed edges come from the same distribution, with p value < 3 × 10⁻⁴ for all cases.

In order to select a classification model, we performed a stratified 10-fold cross-validation on the balanced training set with 27 widely used classification algorithms available in the scikit-learn Python library³⁵ and in the eXtreme Gradient Boost package³⁶. We calculate the balanced accuracy, the F1 score, and the area under the receiver operating characteristic curve (ROC-AUC) to compare the classification performance of the 27 algorithms (see Methods for implementation details and Supplementary Fig. S1 for the performance of all algorithms). For 8 of the 27, we obtain high and stable accuracies (Fig. 2c). The results suggest that those algorithms have consistent and similar prediction accuracies ranging from 0.6 to 0.8. We select a single algorithm with high accuracy and low error variance, XGBClassifier, for all subsequent analyses.

Prediction

We consider four separate models using unweighted topological features, weighted topological features (Table.S1), edge weights, unweighted topological features & edge weights as the feature vectors for the XGBClassifier.

Simultaneous prediction

Considering only unweighted topological features, for the Brazil Bus net, the balanced accuracies using the XGBClassifier in simultaneous tests have an average of 0.65 (Fig. 3a). For the U.S. Air net, XGBClassifier yields an average balanced accuracy of 0.70. These results suggest that with this ML approach we can differentiate the retained edges from the removed edges in a given network snapshot using their topological features.

**Fig. 3: Comparison of different models’ performances against appropriate null models.**

Considering weighted topological features marginally increases the balanced accuracy to 0.69 for the Brazil Bus net and 0.71 for the U.S Air net. In contrast, edge weights alone improve the predictive power of the model by 10% to 0.82 for the U.S. Air net. Including both unweighted topological features and edge weights does not significantly improve the models’ performance compared with the model that only uses edge weights to classify removals.

Nonsimultaneous prediction

A more general and useful test, however, is achieved by using a model trained on a single snapshot to predict edge removals in latter snapshots. Surprisingly, the prediction of the XGBClassifier for the non-simultaneous tests in the Brazil Bus net is no better than random guessing (Fig. 3b). For the U.S. Air net, the model yields an average balanced accuracy of 0.70 using topological features and 0.82 using edge weights, similar to what was observed for simultaneous prediction. See Fig. 3c for confusion matrices.

It is not surprising that weights are the most predictive feature of the model for the US Air net. Connections with more flights are likely to include services from different airlines, and thus unlikely to suddenly drop to zero. Surprisingly, the same argument does not hold for the Brazil Bus net.

Model interpretation

Next, we investigate why our predictions fail for the Brazil Bus net in the non-simultaneous tests. To this end, we use the SHapley Additive exPlanations (SHAP) values^10,37,38. Figure 4a shows the SHAP values summary of the feature importance as well as how their values affect the outputs of the model for the simultaneous test in a particular snapshot considering the model with edge weights and topological features. The SHAP values summary for a particular snapshot reveals that the Adamic-Adar index, edge weights, and the local path index are the most important feature for the bus network. For the U.S. Air net, the most important features for predicting edge removals are edge weights, the hub promoted index and the local path index.

Fig. 4: Edge weight, the hub promoted index and the resource allocation index consistently have the largest predictive power for whether edges will be removed or retained across different snapshots for the U.S. Air net.

Figure 4b shows the same SHAP values summary for the model considering only the unweighted topological features. For the U.S. Air net, the most important unweighted topological features for predicting edge removals are the hub promoted index and the resource allocation index. Specifically, edges with low values of the hub promoted index and the resource allocation index are more likely to be removed. The ranking of feature importances remains quite stable for different time snapshots.

While edge weight does not unveil the dynamics of the network and is on average ranked the most important feature at all snapshots (Fig. 4c), the hub promoted index and the resource allocation index are consistently the most important features in determining which edges are removed (Fig. 4d) in the U.S. Air net. This implies that the U.S. Air net has consistent removal dynamics over time, whereas, the Brazil Bus net shows no such stability.

For the bus network, features such as the local path index are important in some snapshots but not in others. This variability in the ranking of feature importance explains why non-simultaneous predictions fail for the Brazil Bus net: The model over-fits the current snapshot and performs well on the simultaneous test but fails to generalize when predicting edge removals on different snapshots.

For the U.S. Air net, the primacy of edge weights as the most important feature is not surprising. Edges tend to maintain their weights over time, so edges with low weights are less likely to be retained. However, this is tautological because it does not advance our knowledge about how edge weight is determined. The weight of an edge can be predicted using unweighted topological features (Fig. S3). To this end, we look at the unweighted topological features.

To the predictions of edge removals, the large predictive power of the hub promoted index and the resource allocation index can be understood if one considers that they capture the importance to a city of maintaining connections to hubs^34,39,40. The hub promoted index is defined as

$${h}_{p}=\frac{\left|{{{\Gamma }}}_{u}\cap {{{\Gamma }}}_{v}\right|}{\min ({k}_{u},{k}_{v})}.$$

(2)

If k_u < k_v, h_p can be seen as the fraction of node u’s neighbors that are connected to node v. A large value suggests node v is likely a hub (compared with non-hubs, node u shares more common neighbors with hubs) thus a direct connection between u and v is important (Fig. 4e). The resource allocation index is defined as

$${r}_{a}=\mathop{\sum }\limits_{{w}_{n}\in {{{\Gamma }}}_{u}\cap {{{\Gamma }}}_{v}}\frac{1}{{k}_{n}},$$

(3)

where k_n is the degree of the common neighborhood node w_n. As Fig. 4f illustrates, larger k_n means greater redundancy of short paths from u to v. However, not all redundancy is the same. Redundancy through highly connected nodes (r_a is small) would suggest that a direct connection could be replaced by a 2-step path through hubs, but redundancy through poorly connected nodes (r_a is large) would mean u and v are likely hubs and that their connection is important.

Model performance during the COVID-19 pandemic period

The U.S. economy was strongly affected by the COVID-19 pandemic. Coupled with travel restrictions, the economic downturn produced a strong reduction in airline traffic. Between February 2020 and April 2020, the number of monthly passengers on US domestic flights collapsed from 70 million to 2.87 million. This extraordinary situation provides us with a natural experiment with which to test the ability of our approach to continue making accurate predictions in the face of external shocks.

We downloaded the data needed to construct the U.S. air transportation network for the period January 2019 to March 2021. We found that despite the sharp reduction in the number of passengers, the fractions of edges removed monthly from the air transportation network were similar to those observed in the pre-pandemic period (Fig. 5a).

**Fig. 5: Forecasting edge removals in the U.S. air transportation network during a large external shock.**

To test the accuracy of our model in predicting edge removals during the period of travel restrictions, we first considered the simultaneous test. We obtained very similar balanced accuracy results for this period when compared to the period before the travel restrictions, suggesting that the considerations used to make decisions about which connections to remove remained consistent during the later period (Fig. 5b). We also found that the ranking of feature importance was also consistent after the travel restrictions were in place, making our model suitable for predictions even under this exogenous shock (Fig. 5c). Finally, we investigated our model using the non-simultaneous test to compute the prediction accuracy during the travel restriction period (Fig. 5d). Despite the small reduction in the accuracy for the months after the travel restrictions, the balanced accuracies obtained are very similar.

Long-term stability of forecast

Using edge weights as a feature in our classification model yields higher accuracy. Whereas predicting whether edges are going to be removed or not suffices to build sensible scenarios for the future of transportation networks, the higher accuracy of the model using edge weights imposes the challenge of predicting the weights of future snapshots. A shortcoming to such a model is that it needs another model to predict edge weights, increasing the overall complexity of the approach. To test the feasibility of such an approach, we take steps in this direction and check the stability of a model that uses only weights as features for long-term predictions. To compute the edge weights in future snapshots, we first fit a regression model that uses the weights of the current snapshot to predict the weights of the next snapshot, this is, w_ij(t) = f(w_ij(t − 1)). For edges that are added in future snapshots, we input the weights directly from the data so that we do not need to introduce an additional model to predict edge additions.

We simulate long-term forecast for the U.S. Air net for a model considering only unweighted topological features and a model considering only edge weights (Fig. 6a). Starting from a given snapshot network G_m, the model is trained to predict the edges removed in the next snapshot G_m+1. Then, the model uses the prediction G_m+1 to predict G_m+2 and so on and so forth. To evaluate the performance of the predictions at each time step, we compare the structure of the predicted network with the structure of the actual network using the Jaccard similarity

$$J=\frac{\left|{E}_{data}\cap {E}_{predicted}\right|}{\left|{E}_{data}\cup {E}_{predicted}\right|}$$

(4)

where E_data is the edge set from the actual network and E_predicted is the edge set of the predicted network.

**Fig. 6: Forecasting the impacts of a hypothetical reduction of the scale of the U.S. Air net.**

Our analysis suggests that unweighted topological features have more stable predictions independently of the initial conditions of the network, that is, which month was chosen to train the model. In contrast, models considering only the edge weights yield about 13% of trajectories with very large errors, suggesting that this model could become unexpectedly poor for long-term predictions. In fact, at the end of the simulation, models using only edge weights can, at times, have a performance similar to a model where edges are removed randomly from one month to another (Fig. 6a).

Forecasting changes to the U.S. Air net

Encouraged by the ability of our approach to forecast the edge removal dynamics of the U.S. Air net over long periods, we next use the model considering unweighted topological features to simulate the effect of hypothetical air travel restrictions aiming to reduce CO₂ emissions. We use the model trained on a known snapshot to predict the probability that a given edge is removed and remove it according to that probability. We take the December 2018 snapshot of the U.S. Air net as the initial state of the network. In each simulation, we assume that there is a target N_f for the total number of edges in the network and that at each time step we remove a fraction of existing edges

$$\delta {N}_{m}=-\gamma ({N}_{m}-{N}_{f}),$$

(5)

where m is the number of months from the start of the simulation and N_m is the number of edges in the current snapshot.

Figure 6 b shows ensembles, each including 30 simulations starting on December 2018 and removing (R_f = 2/3, 4/5, where R_f = 1 − N_f/N₀) of edges at two different rates (γ = 0.02, 0.04). Based on the predicted edge removals, we project the estimated carbon emissions relative to the emissions in 2018 (see Fig. 6c and the SI for details of the estimations)⁴¹. To better quantify the likelihood of removal, we calculate the average time an edge is retained and rank edges from the shortest survival time to the longest survival time (Fig. 6d). We find that edges connecting hubs (e.g. Chicago, IL, and Boston, MA) are the least likely to be removed. Of practical importance, we find that an edge’s survival time depends on the values of the two parameters, γ and R_f (Fig. 6e).

Daily global CO₂ emissions decreased by 17% during the early stages of the COVID-19 pandemic. A reduction in carbon emissions this dramatic could help many countries achieve the goals of the Paris Climate Agreement¹⁰. An important component of such reduction would be decreasing the number of air connections among cities such as the one we explored above. An important question, thus, is the extent to which our model may be of use to planners.

As Fig. 6f makes clear, many Midwestern cities are likely to lose air connections if there is a large reduction in the size of the U.S. domestic air transportation network. Only a fraction of those cities have or will host new rail connections. For the most part, the existing or proposed rail connections will not be served by high-speed trains. In practical terms, this means that most travelers between those cities will do so by automobile.

The removal of a large fraction of air connections could thus lead to an increase in automotive traffic for trips in the 3-8 hours range. Such an increase could result in increased CO₂ emissions due to the construction, expansion, and maintenance of roads and due to increased miles traveled by car. This might be a missed opportunity. High-speed rail is a scalable, and a more environmentally sound approach to transportation than adding cars to the roads, and building and expanding roadways.

A complicating factor is the time scale for the planning of new U.S. rail lines. Amtrak has advertised its plans under the name “2035 Amtrak”⁴². Much can and will change in the U.S. between now and 2035. For instance, climate change is likely to force migrations from the South and West of the U.S. areas of the Midwest. Thus, our model cannot provide the answer to how to plan a rail system that will anticipate population and air transportation changes. However, our model can help planners build additional scenarios for future changes in the U.S. domestic air transportation network.

Discussion

We show that edge removal processes in transportation networks are not random and that it is possible to make accurate predictions based on local network structures. Even though those features are able to differentiate edges removed and retained, a model trained in a single time snapshot is not able to correctly predict removed edges in different time snapshots for the Brazil Bus net. In contrast, for the U.S. Air net, the non-simultaneous tests show that the features of the edges removed are similar in all snapshots.

For the U.S. Air net, we find that edge weight, the hub promoted index, and the resource allocation index consistently have the largest predictive power for the aggregate network. This finding is not surprising since it simply highlights the fact that direct connections between hubs are very important while connections to a city already connected to a hub are not. Remarkable, when considering individual airlines, we find that the hub promoted index is still very important for accurate prediction, but its importance varies over time and by the airline. That is, predicting the removed edges for the aggregate network is actually simpler than the prediction for an individual airline (see Supplementary Fig. S4 and Fig. S5).

Our study has several limitations. First, the number of topological features tested in our work is limited. The predictive power could be improved by including additional features. However, we did test the impact of some global features such as the edge betweenness centrality, edge current flow betweenness centrality, and demographic features such as the intercity gravitation flow⁴³, but did not see any improvement in predictive power (see Supplementary Fig. S6 and Fig. S7). We also compared our model to a purely physical model that removes edges according to the rank of inter-city gravitation flow (see Supplementary Fig. S8). Our analyses thus suggest that the local features can predict edge removals better than global and demographic factors.

Second, we left the interplay of the multilayer structure of the air transportation networks unexplored⁴⁴. For example, it would be very interesting to use machine learning approaches to explore the interplay between the removal of edges in one network (e.g., air transportation network) and the growth of edges in another transportation network (e.g., high-speed rail systems).

Third, the crucial role of airline strategy and route network optimization is not explicitly included in our approach. Nonetheless, we note that many events concerning airline strategy that is arguably impossible to predict – governmental regulation, airport expansions, mergers and acquisitions, bankruptcy protection, epidemics, natural disasters, and so on – would also affect the decision to drop routes. It is thus remarkable that our approach is able to achieve such accuracy when predicting changes in the aggregate network.

Finally, it is widely accepted that exogenous shocks to the economy or from the environment can dramatically change the topology of the network as airline companies try to adapt to the new market conditions. Surprisingly, the fact that local topological features alone can enable accurate predictions about edge removals in the air transportation network highlights the importance of our results to the literature on transportation networks. Nonetheless, an interesting question for further research would be the investigation of possible market scenarios where removals of airline connections are not only a function of topological features but also of other external factors such as large-scale migration patterns due to climate change.

Methods

Data

The Brazilian inter-cities bus data was collected from the Brazilian National Land Transportation Agency (ANTT)³¹. The dataset contains all inter-cities bus transportation from January 2005 to December 2014 with monthly resolution. The data is represented as a temporal unweighted undirected network, where nodes are individual bus stops (cities) and edges represent bus routes between the two cities within that monthly snapshot. The network has 120 snapshots with about 1734 nodes and 18781 edges on average.

We obtained United States domestic air transportation data from the Bureau of Transportation Statistics (BTS)³³. The data is in the period from January 2004 to December 2018. We later obtained data for the period from January 2019 to March 2021 for the analysis of the impact of the COVID-19 pandemic’s travel restrictions. Using the same approach we used in Brazil inter-cities bus data, each snapshot of the network is constructed from data of the corresponding month. The nodes are airports (cities) and an edge represents that there is at least one airline connecting the two cities within that monthly snapshot. The networks have 192 snapshots with about 819 nodes and 6547 edges on average.

Class balancing

Most machine learning classification algorithms favor the majority class in an imbalanced dataset. In the two transportation networks we study, the number of removed edges is much smaller than the edges that are retained. To mitigate this issue in our highly imbalanced data, we balanced the training data by keeping the same number of the majority class data samples (retained edges) as the minority class data samples (removed edges) using random under-sampling.

Performance metrics

The performance metrics are calculated using Scikit-learn (version: ‘0.21.3’) built-in functions³⁵. For binary classifications, the results fall into true positive/TP (removed edges predicted to be removed), false positive/FN (retained edges predicted to be removed), true negative/TN (retained edges predicted to be retained), and false negative/FN (removed predicted to be retained).

For a binary case, precision is defined as:

$${\mathtt{precision}}=\frac{TP}{TP+FP}$$

(6)

and recall is defined as:

$${\mathtt{recall}}=\frac{TP}{TP+FN}$$

(7)

The balanced accuracy is defined as the arithmetic mean of true positive rate and true negative rate:

$${\mathtt{balanced-accuracy}}=\frac{1}{2}\left({\mathtt{recall}}+\frac{TN}{TN+FP}\right)$$

(8)

The F1 score is defined as the harmonic mean of precision and recall:

$$F1=2\times \frac{({\mathtt{precision}}* {\mathtt{recall}})}{({\mathtt{precision}}+{\mathtt{recall}})},$$

(9)

The area under the receiver operating characteristic curve (AUC-ROC) is a performance measurement for the classification problems at various threshold settings. It measures the area under the curve of the plot of true positive rates vs. false positive rates. The higher the AUC-ROC, the better the model is in predicting the correct classes.

Hyperparameter tuning

For XGBClassifier, we performed a brute force grid search hyperparameter tuning. For the sake of computational time, we tested on a predefined hyperparameter space. That is learning rate 0.01–0.4; gamma 0.0–0.2; maximum tree depth 0–10; number of boosting rounds 0–200. All default hyperparameters are outside the region of overfitting and underfitting (see Supplementary Fig. S2). For all analyses, we report our results using the default hyperparameters.

Those hyperparameters are: learning rate = 0.3; number of boosting rounds = 100; maximum tree depth = 3; objective = binary:logistic; booster = gbtree; gamma = 0; min child Weight = 1; max delta step = 0; subsample = 1; colsample bytree = 1; colsample bylevel = 1; colsample bynode = 1; reg alpha = 0; reg lambda = 1; scale pos Weight = 1; base Score = 0.5.

Null model

To justify that the predictability comes from the non-trivial section of removed edges, we construct a null model to estimate the fraction of correct predictions that XGBClassifier would make if edge features were not correlated with removals. To do so, we shuffle the labels (retained and removed), destroying any correlation between features and labels. Then, we split the data as we did for the original model and train the XGBClassifier algorithm on the training set and test the predictions on the testing set.

If features are not correlated with labels and the null model produces predictions similar to the original model on the non-shuffled data, the predictions observed in non-shuffled data is likely a result of overfitting. If the null model produces predictions that are no better than chance, our ML approach is capturing the functional relationship between edge features and edge removals on the non-shuffled data.

SHAP (SHapley Additive exPlanation) values

The computation of SHAP values is a suitable approach to quantify feature importance⁴⁵. To assess the importance of a feature, one calculates the change in the expected model prediction by withholding that feature. Mathematically, this method retrains the model on all subset of features S ⊂ F, where F is the set of all features. Since multiple subsets satisfied $S\subseteq F\setminus \left\{i\right\}$, the importance of the feature is computed using all possible permutations. Mathematically, the SHAP value for a particular feature i (out of F total features), given a prediction x is:

$${\phi }_{i}(x)=\mathop{\sum}\limits_{S\subseteq F\setminus \left\{i\right\}}\frac{\left|S\right|!\left(\left|F\right|-\left|S\right|-1\right)!}{\left|F\right|!}\left[{f}_{S\cup \left\{i\right\}}\left({x}_{S\cup \left\{i\right\}}\right)-{f}_{S}\left({x}_{S}\right)\right]$$

(10)

where ${f}_{S\cup \left\{i\right\}}$ is a model trained with feature $S\cup \left\{i\right\}$, and f_S is a model trained on S without feature i. Thus, the rank of feature importances are given by the sum of the SHAP value magnitudes ϕ_i over all predictions.

Estimate the C O ₂ emission reduction

To estimate the CO₂ emissions from the U.S. domestic air transportation, we use the average fuel efficiency of U.S. airlines in 2018. The methodology to calculate the CO₂ emission associated with a specific route can be done as follows⁴⁶:

Monthly CO₂ emissions (in tons) = 3.16 × 32.5 (gram fuel per km) × trip distance (in km) × number of flights each month × 10⁻⁶ (tons per gram).

Where 3.16 is the constant representing the number of tonnes of CO₂ produced by burning a tonne of aviation fuel. 32.5 g fuel per km was the average fuel efficiency of U.S. airlines in 2018⁴¹.

Data availability

The Brazilian bus transportation network and U.S. air transportation network are freely available for download at http://antt.gov.br/ and https://www.transtats.bts.gov/TableInfo.asp, respectively.

Code availability

Our code is available for download at https://github.com/amarallab/transportation_network_evolution.

References

Guimera, R., Mossa, S., Turtschi, A. & Amaral, L. A. N. The worldwide air transportation network: Anomalous centrality, community structure, and cities’ global roles. Proc. Natl. Acad. Sci. 102, 7794–7799 (2005).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Barbosa, H. et al. Human mobility: Models and applications. Phys. Rep. 734, 1–74 (2018).
Article ADS MathSciNet MATH Google Scholar
Colizza, V., Barrat, A., Barthélemy, M. & Vespignani, A. The role of the airline transportation network in the prediction and predictability of global epidemics. Proc. Natl. Acad. Sci. 103, 2015–2020 (2006).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Chinazzi, M. et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science 368, 395–400 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Maier, B. F. & Brockmann, D. Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China. Science 368, 742–746 (2020).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
EPA. Fast Facts on Transportation Greenhouse Gas Emissions (2019). https://www.epa.gov/greenvehicles/fast-facts-transportation-greenhouse-gas-emissions.
IPCC. AR5 climate change 2014: Mitigation of climate change - Chapter 8 “Transport" (2014). https://www.ipcc.ch/report/ar5/wg3/.
Ritchie, H. & Roser, M. Co and greenhouse gas emissions. Our World in Data (2020). https://ourworldindata.org/co2-and-other-greenhouse-gas-emissions.
IPCC. Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (2021). https://www.ipcc.ch/report/ar6/wg1/.
Le Quere, C. et al. Temporary reduction in daily global CO₂ emissions during the COVID-19 forced confinement. Nat. Clim. Chan. 10, 647–653 (2020).
Article ADS Google Scholar
Holme, P. & Saramaki, J. Temporal networks. Phys. Rep. 519, 97–125 (2012).
Article Google Scholar
Scholtes, I. et al. Causality-driven slow-down and speed-up of diffusion in non-markovian temporal networks. Nat. Commun. 5, 1–9 (2014).
Article Google Scholar
Pastor-Satorras, R., Castellano, C., Van Mieghem, P. & Vespignani, A. Epidemic processes in complex networks. Rev. Mod. Phys. 87, 925–979 (2015).
Article ADS MathSciNet Google Scholar
Koher, A., Lentz, H. H. K., Gleeson, J. P. & Hövel, P. Contact-based model for epidemic spreading on temporal networks. Phys. Rev. X 9, 031017 (2019).
CAS PubMed Central Google Scholar
Berthier, E., Porter, M. A. & Daniels, K. E. Forecasting failure locations in 2-dimensional disordered lattices. Proc. Natl. Acad. Sci. 116, 16742–16749 (2019).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Bassolas, A. et al. Hierarchical organization of urban mobility and its connection with city livability. Nat. Commun. 10, 4817 (2019).
Article ADS PubMed PubMed Central Google Scholar
Spadon, G., de Carvalho, A. C., Rodrigues-Jr, J. F. & Alves, L. G. Reconstructing commuters network using machine learning and urban indicators. Sci. Rep. 9, 1–13 (2019).
Article ADS CAS Google Scholar
Asensio, O. I. et al. Real-time data from mobile platforms to evaluate sustainable transportation infrastructure. Nat. Sustainability 3, 463–471 (2020).
Article Google Scholar
Ou, S. et al. Machine learning model to project the impact of covid-19 on us motor gasoline demand. Nat. Energy 5, 666–673 (2020).
Article ADS CAS Google Scholar
Lü, L. & Zhou, T. Link prediction in complex networks: A survey. Phys. A: Statistical mechanics and its applications 390, 1150–1170 (2011).
Article ADS Google Scholar
Divakaran, A. & Mohan, A. Temporal link prediction: A survey. New Gener. Comput. 38, 213–258 (2020).
Article Google Scholar
Goldenberg, A., Zheng, A. X., Fienberg, S. E. & Airoldi, E. M. A survey of statistical network models. Found. Trends Mach. Learn. 2, 1–117 (2009).
Article MATH Google Scholar
Callaway, D. S., Newman, M. E. J., Strogatz, S. H. & Watts, D. J. Network robustness and fragility: Percolation on random graphs. Phys. Rev. Lett. 85, 5468–5471 (2000).
Article ADS CAS PubMed Google Scholar
Achlioptas, D., D’Souza, R. M. & Spencer, J. Explosive percolation in random networks. Science 323, 1453–1455 (2009).
Article ADS MathSciNet CAS PubMed MATH Google Scholar
Crucitti, P., Latora, V., Marchiori, M. & Rapisarda, A. Error and attack tolerance of complex networks. Physica A: Statistical mechanics and its applications 340, 388–394 (2004).
Article ADS MathSciNet MATH Google Scholar
Ren, X.-L., Gleinig, N., Helbing, D. & Antulov-Fantulin, N. Generalized network dismantling. Proceedings of the National Academy of Sciences 116, 6554–6559 (2019).
Article ADS MathSciNet CAS MATH Google Scholar
Buldyrev, S. V., Parshani, R., Paul, G., Stanley, H. E. & Havlin, S. Catastrophic cascade of failures in interdependent networks. Nature 464, 1025–1028 (2010).
Article ADS CAS PubMed Google Scholar
D’Souza, R. M., Gómez-Gardenes, J., Nagler, J. & Arenas, A. Explosive phenomena in complex networks. Adv. Phys. 68, 123–223 (2019).
Article ADS Google Scholar
Verma, T., Russmann, F., Araújo, N. A., Nagler, J. & Herrmann, H. J. Emergence of core–peripheries in networks. Nat. Commun. 7, 1–7 (2016).
Article CAS Google Scholar
Sahasrabudhe, S. & Motter, A. E. Rescuing ecosystems from extinction cascades through compensatory perturbations. Nat. Commun. 2, 1–8 (2011).
Article CAS Google Scholar
Agency, N. L. T. National Land Transport Agency-ANTT 2017 Statistics and Road Studies-Operational Data. http://antt.gov.br.
Alves, L. G. A., Aleta, A., Rodrigues, F. A., Moreno, Y. & Amaral, L. A. N. Centrality anomalies in complex networks as a result of model over-simplification. N. J. Phys. 22, 013043 (2020).
Article Google Scholar
BTS. BTS-Transtats (2018). https://www.transtats.bts.gov/TableInfo.asp.
Li, W. & Cai, X. Statistical analysis of airport network of China. Phys. Rev. E 69, 046106 (2004).
Article ADS CAS Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785-794 (ACM, 2016).
Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. In Guyon, I. et al. (eds.) Advances in Neural Information Processing Systems, vol. 30, 4765–4774 (Curran Associates, Inc., 2017).
Lundberg, S. M. et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat. Biomed. Engineer. 2, 749–760 (2018).
Article Google Scholar
Ou, Q., Jin, Y.-D., Zhou, T., Wang, B.-H. & Yin, B.-Q. Power-law strength-degree correlation from resource-allocation dynamics on weighted networks. Phys. Rev. E 75, 021102 (2007).
Article ADS Google Scholar
Zhou, T., Lü, L. & Zhang, Y.-C. Predicting missing links via local information. Eur. Phys. J. B 71, 623–630 (2009).
Article ADS CAS MATH Google Scholar
Xinyi Sola Zheng, B. G. & Rutherford, D. U.S. domestic airline file efficiency ranking, 2017-2018. Tech. Rep. (2019). https://theicct.org/publications/us-domestic-airline-fuel-efficiency-ranking-2017-18.
Amtrak. Amtrak Connects US: A Visionto Grow Rail Service Across America (2021). https://www.amtrakconnectsus.com/.
Barthelemy, M. The statistical physics of cities. Nat. Rev. Phys. 1, 406–415 (2019).
Article Google Scholar
Andrea, A., Latora, V., Nicosia, G. & Nicosia, V. Pareto optimality in multilayer network growth. Phys. Rev. Lett. 121, 128302 (2018).
Article ADS Google Scholar
Lundberg, S. M. et al. From local explanations to global understanding with explainable ai for trees. Nat. Mach. Intellig. 2, 56–67 (2020).
Article Google Scholar
ICAO. ICAO Carbon Emissions Calculator Methodology (2018).
BTS. BTS-Transtats https://data.bts.gov/Research-and-Statistics/Air-Travel-Domestic/em4z-nqt3 (2021).

Download references

Acknowledgements

This research was supported by a gift from John and Leslie McQuown, by the National Science Foundation grants numbers 1956338 and 2033604, and by Air Force Office of Scientific Research grant number FA9550-19-1-0354.

Author information

Authors and Affiliations

Department of Physics and Astronomy, Northwestern University, Evanston, IL, 60208, USA
Weihua Lei & Luís A. Nunes Amaral
Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, 60208, USA
Luiz G. A. Alves & Luís A. Nunes Amaral
Northwestern Institute on Complex Systems (NICO), Northwestern University, Evanston, IL, 60208, USA
Luís A. Nunes Amaral

Authors

Weihua Lei
View author publications
You can also search for this author in PubMed Google Scholar
Luiz G. A. Alves
View author publications
You can also search for this author in PubMed Google Scholar
Luís A. Nunes Amaral
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.L., L.G.A.A., and L.A.N.A. conceived and designed the study. W.L. performed the numerical simulations. W.L., L.G.A.A., and L.A.N.A performed the data analysis. W.L., L.G.A.A., and L.A.N.A created the figures. W.L. wrote the first draft of the paper. W.L., L.G.A.A., and L.A.N.A. wrote, read, and approved the final version of the paper.

Corresponding author

Correspondence to Luís A. Nunes Amaral.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lei, W., Alves, L.G.A. & Amaral, L.A.N. Forecasting the evolution of fast-changing transportation networks using machine learning. Nat Commun 13, 4252 (2022). https://doi.org/10.1038/s41467-022-31911-2

Download citation

Received: 23 February 2021
Accepted: 08 July 2022
Published: 22 July 2022
DOI: https://doi.org/10.1038/s41467-022-31911-2

This article is cited by

Spatiotemporal dynamics of traffic bottlenecks yields an early signal of heavy congestions
- Jinxiao Duan
- Guanwen Zeng
- Shlomo Havlin
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.