Road networks and socio-demographic factors to explore COVID-19 infection during its different waves

Uddin, Shahadat; Khan, Arif; Lu, Haohui; Zhou, Fangyu; Karim, Shakir; Hajati, Farshid; Moni, Mohammad Ali

doi:10.1038/s41598-024-51610-w

Download PDF

Article
Open access
Published: 18 January 2024

Road networks and socio-demographic factors to explore COVID-19 infection during its different waves

Shahadat Uddin¹,
Arif Khan¹,
Haohui Lu¹,
Fangyu Zhou¹,
Shakir Karim¹,
Farshid Hajati² &
…
Mohammad Ali Moni³

Scientific Reports volume 14, Article number: 1551 (2024) Cite this article

638 Accesses
2 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The COVID-19 pandemic triggered an unprecedented level of restrictive measures globally. Most countries resorted to lockdowns at some point to buy the much-needed time for flattening the curve and scaling up vaccination and treatment capacity. Although lockdowns, social distancing and business closures generally slowed the case growth, there is a growing concern about these restrictions' social, economic and psychological impact, especially on the disadvantaged and poorer segments of society. While we are all in this together, these segments often take the heavier toll of the pandemic and face harsher restrictions or get blamed for community transmission. This study proposes a road-network-based networked approach to model mobility patterns between localities during lockdown stages. It utilises a panel regression method to analyse the effects of mobility in transmitting COVID-19 in an Australian context, together with a close look at a suburban population’s characteristics like their age, income and education. Firstly, we attempt to model how the local road networks between the neighbouring suburbs (i.e., neighbourhood measure) and current infection count affect the case growth and how they differ between delta and omicron variants. We use a geographic information system, population and infection data to measure road connections, mobility and transmission probability across the suburbs. We then looked at three socio-demographic variables: age, education and income and explored how they moderate independent and dependent variables (infection rates and neighbourhood measures). The result shows strong model performance to predict infection rate based on neighbourhood road connection. However, apart from age in the delta variant context, the other variables (income and education level) do not seem to moderate the relationship between infection rate and neighbourhood measure. The results indicate that suburbs with a more socio-economically disadvantaged population do not necessarily contribute to more community transmission. The study findings could be potentially helpful for stakeholders in tailoring any health decision for future pandemics.

A meta-analysis on global change drivers and the risk of infectious disease

Article 08 May 2024

Infectious disease in an era of global change

Article 13 October 2021

The WHO estimates of excess mortality associated with the COVID-19 pandemic

Article Open access 14 December 2022

Introduction

The COVID-19 pandemic has caused a significant amount of mortality and clinical burden as well as impacted transport, logistics¹ and economies² globally. Governments put a significant budget and effort into preventing and treating this disease, curbing its growth through public health measures such as mass vaccination, contact tracing, mobility restrictions, lockdowns, etc., and softening the economic loss. In academia, a global effort has been put forward to understand the pathophysical properties of the virus, evaluate public health measures, and model the transmission that could help predict the spread based on historical data.

At the beginning of the pandemic, researchers significantly focused on modelling and predicting the disease transmission and measuring the public health impact. Classical prediction models using the disease spread data have mostly turned out effective. Hernandez-Matamoros et al.³ modelled the COVID-19 spread pattern using the autoregressive integrated moving average (ARIMA) model with data from 145 countries, which showed good prediction potential using variables such as population, culture, climate, humidity, etc. Swaraj et al.⁴ proposed an ARIMA-based model that could capture the linear and non-linear components of the disease spread data by integrating an autoregressive neural network. The hybrid method outperformed the single ARIMA model for daily observed cases. Some variations of the classical SIR (Susceptible-Infected-Recovery) model were also used. For example, Abdy et al.⁵ proposed a new SIR model with fuzzy parameters like infection rate, recovery rate, and death rate due to COVID-19. Liu et al.⁶ extended the current susceptible-exposed-infected-recovery (SEIR) model, which is a variation of the SIR model, by incorporating extra compartments. This model can explain the new features of COVID-19 and fine-tune the new model with a neural network aimed at a higher accuracy prediction.

Machine learning models, notably artificial neural networks (ANNs) and recurrent neural networks (RNNs), were preferred when the datasets were much more complicated with more complex features. Car et al.⁷ proposed the first ANN-based model to predict the COVID-19 spread trend. They trained three distinct models using confirmed, recovered and deceased cases and achieved 0.94 for the coefficient of determination. Melin et al.⁸ presented a multiple-ensemble ANN model using a fuzzy response aggregation for time series data. The ensemble ANN models make it possible to predict various conditions, and fuzzy logic could help aggregate the responses of these neural predictors. Beyond these, the best determination coefficient achieved so far is from the experiments by Pinter et al.⁹, who used ANFIS and MLP-ICA methods to predict the number of infected people and the mortality rates. Their determination coefficient score reached 0.99 when applying the MLP-ICA method. The typical modelling using RNN and the best results among RNN variants are developed from the long short-term memory (LSTM) method. Chimmula and Zhang¹⁰ used an LSTM-based approach to forecast COVID-19 patterns and concluded that the pandemic would come to an end by the end of June 2020. Such a conclusion could be considered quite plausible only for the COVID-19 first wave. Yudistira¹¹ also used LSTM to understand and model the correlation of the COVID-19 growth rate. The optimal structure of the models was determined heuristically. Their experiments concluded that LSTM outperformed RNN when using RMSE value as the comparing metrics.

One fundamental premise for the COVID-19 transmission model is that accelerated human mobility increases disease transmission; therefore, most governments employ some mobility restrictions¹². However, there has been tremendous public debate and concerns for these restrictions' efficacy, reasoning, timeframe and coverage since they significantly impacted different societal groups' quality of life and economic conditions. Although such restrictions have been used during earlier epidemics, the current COVID-19 pandemic is notably different due to its high transmissibility and frequent mutations¹³. A few years after the COVID-19 outbreak, there have been various studies to understand the efficacy of mobility restrictions and business closures and also whether there could be other factors (e.g., income level, economic support, awareness, education, etc.) if improved, could be more effective than mobility restriction to fight the virus. Oh et al.¹⁴ used Google mobility data and regression models and found that mild and moderate mobility restrictions reduced COVID-19 case counts in most countries. However, severe mobility restriction did not give a proportionately significant case count decrease. Bonaccorsi et al.¹⁵ used a graph network approach utilising Italian mobility data from Facebook. They highlighted the social costs of lockdown as the mobility restriction has hugely reduced fiscal revenues and increased poverty. Bharati and Fakir¹⁶ found that stricter rules successfully contained the contagion. However, they also found that restrictions reduced mobility more in relatively less-developed countries. The causal effect of a reduction in mobility on case count was higher in more developed countries. Other similar research^17,18,19 also used variations of regression models and found that mobility restrictions at local and international levels have aided in controlling the initial spread of COVID-19. While these studies generally agree that lockdowns were mostly effective in throttling initial spread at the cost of enormous economic cost that affects different socio-economic groups differently, there is still a gap in the applicability of the data sources and the context of different variants of the COVID-19 virus. For example, many studies rely on mobility data provided by third parties which might have sampling bias or specific to certain user groups and can only be observed after the event has been happened. Also, little research has focused on understanding how socio-economic factors moderate mobility restriction and case count in different phases of the COVID-19 pandemic, i.e., during different variant outbreaks.

In this study, we propose a road-network-based network approach to model mobility patterns between localities during lockdown stages and utilise a panel regression method to analyse the effects of mobility in transmitting COVID-19 in the Australian context, together with a close look at a suburban population’s characteristics like their age, income and education. The suburban road network is planned according to local transport demand and, therefore, in an efficient transport system—road connections represent the mobility pattern of the area’s population and could potentially be utilised in disease modelling^20,21,22. In the context of the infectious nature of COVID-19, this study adopts a network approach to model the virus's spread within geographic areas, emphasising attributes pertinent to direct viral transmission between individuals. Acknowledging the propensity for increased infections in areas already harbouring infected residents or those connected by roads to high-infection locales, we employ two time-series measures, the prior infection count and a composite measure predicated on the suburban road network, to model the infection numbers in the given suburbs or postal areas.

Our approach

As summarised in the Introduction section, researchers have used various attributes to model the number of COVID-19 infections for a geographic area in a given period. Given the highly infectious nature of COVID-19, this study considered features that affect the direct transmission of the virus between individuals. There is a good chance of a higher number of future COVID-19 infections in a suburb if it already has an increased number of infected residents. Similarly, the possibility of the same suburb having more infected patients will increase if it has direct road connections with suburbs with many COVID-19-infected patients. Controlling human mobility is challenging at the inter-suburban level; even strict lockdowns or curfews will be in place^23,24.

Accordingly, this study considered two time-series measures to model the COVID-19 infection number for a given postal area or suburb. The first one is the infection number or count from previous time points. The second is a composite one based on the suburban road network. It is a weighted sum based on the number of road connections to each neighbouring suburb (i.e., the weighting factor) and their respective infection count at the previous time point. The following formula can capture our approach.

$$InfNum_{t} = f\left( {InfNum_{{\left( {t - 1} \right)}} , RNInf_{{\left( {t - 1} \right)}} } \right)$$

(1)

where $InfNum_{t}$ is the number of infected COVID-19 patients in a suburb at time t (i.e., current infection number), $InfNum_{{\left( {t - 1} \right)}}$ is the number of infected COVID-19 patients at time (t − 1) (i.e., previous infection number), and $RNInf_{{\left( {t - 1} \right)}}$ is the road network-based infection measure at (t − 1) (i.e., neighbourhood measure). Mathematically, the following formula represents this measure.

$$RNInf_{{\left( {t - 1} \right)}} = \mathop \sum \limits_{i = 1}^{n} \left( {C^{i} \times NorInf_{{\left( {t - 1} \right)}}^{i} } \right)$$

(2)

where n indicates the number of other suburbs that the underlying suburb has road connections with, $C^{i}$ is the number of road connections the suburb has with the suburb $i$, and $NorInf_{{\left( {t - 1} \right)}}^{i}$ is the normalised infection number of suburb $i$ at $\left( {t - 1} \right)$ time point. This study considers the population sizes of the neighbouring suburbs to normalise their respective infection numbers. Since this measure depends on its connection with neighbouring suburbs and their infection number for a given suburb, this study names it the neighbourhood measure.

Methods and materials

Data source

This study considered the COVID-19 infection data for 100 different suburbs of the Greater Sydney area of New South Wales, Australia²⁵. We considered two distinct periods for the infection statistics of these suburbs: one for the delta variant (4 weeks starting from August 24, 2021) and another for the omicron variant (4 weeks beginning on November 17, 2021). The delta variant also spread during the second period. However, we termed this period ‘omicron’ since the omicron variant had already become prevalent in these suburbs since early November 2021²⁵. Table 1 details the basic statistics of the infection data considered in this study.

Table 1 The basic statistics of the COVID-19 infection data for 100 suburbs considered in this study.

Full size table

To quantify the second independent variable ($RNInf_{{\left( {t - 1} \right)}}$), we first construct the suburban road network. A node in this network represents a suburb. An edge between two nodes indicates at least one road connecting the underlying suburbs represented by those nodes, and the edge weight points to the number of roads connecting the two suburbs of the edge. We took the map data from Google Maps, Australia²⁶. Figure 1 illustrates an example of the suburban road network construction. For a given suburb, we then considered the infection number for each of its neighbouring suburbs. The methodology illustrated in Fig. 1 utilises edge weights to signify the number of roads interlinking suburbs. It is essential to make clear that this method is not saying that the more roads there are between suburbs, like Burwood and Strathfield, the more people will travel between them. Instead, we are using this method as a structured way to estimate possible movement and interactions between different suburbs, serving as a helpful indicator to measure ease of access and connection between areas. These are paramount factors in analysing the potential for virus transmission. By providing a quantitative approximation of interaction and mobility potential, it contributes nuanced insights to understanding the multifaceted dynamics of virus spread. Finally, we used formula (2) to quantify this measure.

This study considered three moderating attributes (i.e., age, education and income) to investigate their impact on the relationship between the dependent and independent variables of this study’s proposed model. The relevant data of these two socio-demographic attributes for different suburbs were collected from the census data provided by the Australian Bureau of Statistics²⁷.

Data analysis design

Since this study repeatedly measured the model variables at four different time points, we followed the panel regression, a powerful tool for modelling time series data²⁸, to explore the proposed model. This study used a 1-week duration for each repeated measure. In total, we considered four 1-week windows for the panel regression modelling. We used fixed effect panel regression for research data analysis since we found a significant correlation between the dependent and the independent variables from the initial data exploration. The dependent ${InfNum}_{t}$ variable is significantly correlated at p < 0.001 with the independent ${InfNum}_{(t-1)}$ variable for delta (rho = 0.952) and omicron (rho = 0.506) variants. It also has a similar statistically significant correlation with ${RNInf}_{\left(t-1\right)}$ at p < 0.001 for the delta (rho = 0.771) and omicron (rho = − 0.161) data. We used Stata to run the fixed effect panel regression²⁹.

This study considered the median population age value, the percentage of residents having a university or tertiary degree, and the median weekly household income to measure the three socio-demographic attributes, age, education and income, respectively, for each suburb. The median values for age, education and income attributes for 100 data instances have split the dataset into two groups. For example, the education = 0 group includes all suburbs with a lower percentage of residents having a university degree than the median value of all data instances of this study, and vice versa. We first created six more independent variables to check their moderating strength by multiplying each with the first two independent variables (i.e., InfNum_{(t − 1)} and RNInf_{(t − 1)}). Then, we reran the panel regressions, including these six newly created independent variables.

Results

Figure 2 illustrates the undirected road network among the 100 suburbs considered in this study. We used Gephi³⁰ and web Mercator³¹ projection to draw this road network. In this network, there are 214 undirected edges among its 100 nodes. The maximum number of roads connecting two suburbs is 16, between 2142 and 2160 postal areas.

Table 2 shows the results from the fixed effect panel regressions. The models for both omicron and delta variants show very high R-squared values. The R-squared value for the delta variant is 0.8566, and for the omicron variant, it is 0.5267. The previous infection number (InfNum_(t−1)) significantly impacts the present infection number for the delta and omicron variants. Neighbourhood measure (RNInf_(t−1)) also significantly impacts the present infection number. It shows a positive impact on the delta variant. However, it shows a negative effect on the omicron variant.

Table 2 Panel regression outcome for delta and omicron variants.

Full size table

To check the moderating effect of three socio-demographic attributes (i.e., age, education and income) on the findings of Table 2, we added six more independent variables to our dataset and repeated the same panel regression. These six composite variables are based on multiplying each socio-demographic attribute with the three independent variables. The corresponding results are presented in Table 3. Since our main concern is to check the moderating effect of the three socio-demographic features, we do not report R-squared values in this table. There are no specific patterns revealed in the significance values of this table. The composite independent variables based on the multiplication of education and each independent variable do not show any significant outcome for delta and omicron variants. Age moderates the relations the present infection number (InfNum_t) has with RNInf_(t−1) and InfNum_(t−1) for only the delta variant. For the omicron variant, age moderates only the relation between InfNum_(t−1) and InfNum_t. Conversely, income mediates the association between InfNum_(t−1) and InfNum_t for both variants.

Table 3 Panel regression outcome for checking the moderating impact of age, education and income.

Full size table

Figure 3 shows the kernel density estimation (KDE) for age, education and income. KDE is a non-parametric way to estimate the probability density function of a random variable³². The median value of each socio-demographic attribute is used to split the dataset into two groups. The density estimations are based on this study's single dependent variable (InfNum_t), divided into two groups by each of the three socio-demographic attributes. This figure reveals that the density functions are closely identical between different groups based on age, education and income, further echoing the findings from Table 3. These three socio-demographic attributes do not reveal specific patterns in moderating the relationship between the model's independent and dependent variables.

Discussion

Human mobility data has been shown to be an effective measure for modelling COVID-19 infection count³³. In the first part of this study, we aimed to capture this mobility through the neighbourhood measure and its effect on the COVID-19 infection count. The neighbourhood measure considered a relatively granular suburb level as a geographical unit and used the number of shared roads to approximate human movement across the suburbs. The research dataset covers two periods of COVID-19 infection for the delta and omicron variants, as shown in Table 1. One exciting perspective to note and explore in this study is that some of the underlying factors changed between these two timeframes. During the delta outbreak, the research areas were under lockdown (with only allowed shopping limit within a 5 km radius for essential items). Some areas of concern even had nighttime curfew during this timeframe. Sydney’s vaccination coverage (double dose) went from approximately 26–43%³⁴. On the other hand, there was no lockdown during the omicron phase of the dataset, although mask mandates, social distancing, and capacity caps in businesses partially remained³⁵. Double-dose vaccination coverage (double dose) rose from 77% to almost 79% during this period. As a result, people's mobility within and across the suburbs was inevitably significantly higher during the Omicron outbreak. The omicron variant itself is more transmissible than the delta variant. Therefore, it would be interesting to see how the neighbourhood measure affected the infection count during delta and omicron outbreaks.

The fixed effect panel regression model shows good prediction performance for the delta variant with an R-squared value of 85.66%. The model performance was relatively weaker for the omicron variant, with a 52.67% R-squared value. The previous infection count significantly positively impacts the present infection count (dependent variable) for both variants. The same goes for the neighbourhood measure on its effect on the present infection count, except that for delta, the effect is positive, and for omicron, it is negative. Together, these results indicate that infection counts for a suburb during the delta variant can be well modelled through past infection count and influx from surrounding suburbs, i.e., neighbourhood measure. While the present infection count should naturally be affected by the previous infection count, the effect of influx from the neighbourhood is more interesting. As we mentioned earlier, especially during the delta outbreak, there was a lockdown in place, and residents were only allowed to go out for essential shopping within a 5 km radius. Suburbs considered in our research are relatively granular in size, and residents could move across the neighbouring suburbs for essential reasons even while staying within a 5 km bubble. Therefore, this prediction model using suburb-level granular data effectively captures macro-movement during the lockdown and utilises it to predict case counts during the delta variant.

The regression model and the neighbourhood measure did not reveal many insights for the omicron variant because the R-square value was not much higher than the delta variant, and the neighbourhood measure showed a significant negative impact on infection count counter-intuitively. Two factors could contribute to this finding. First, there was no lockdown or movement restriction during the omicron variant. Second, omicron is more transmissible than the delta variant³⁶. The high contagiousness and unrestricted movement within the suburb might make the neighbourhood measure less reliable in predicting the case count for omicron.

In the second part of this research, we looked into three socio-economic moderating factors—age, education and income. We intended to see whether suburbs with more residents of higher age brackets, education levels or income differ from suburbs having fewer residents with those factors in terms of case count and neighbourhood measure. This was important in a way that during the delta outbreak, a lockdown was imposed in the areas of concern and a nighttime curfew for some time. These areas of concern were mostly concentrated in western Sydney, where many residents are culturally and linguistically diverse and have a migrant background. These suburbs have more members per household, less income, and education levels on average. Many of the wage earners’ jobs could not be performed from home. Consequently, stay-at-home orders and the lockdown hard hit these suburbs more³⁷. Therefore, we investigated these suburbs with high population and COVID-19 cases and explored whether age, education and income have any moderating effect on the case count and neighbourhood measure.

The results in Table 3 summarily show the moderating effects. Education did not have any moderating impact on any combination. Age and income significantly moderated the relation between delta and omicron variants' previous and present case counts. However, income has a small coefficient value for the moderating effect and thus does not reveal any meaningful insight. Age has a positive coefficient, indicating that suburbs with a higher age bracket tended to have higher case growth. This goes along with the fact that older people are at higher risk of comorbidities and COVID-19³⁸. Age positively moderates the relation between neighbourhood measure and present case count only for the delta variant. This might indicate that suburbs with a relatively higher aged population tend to have more mobility (for work or essential purposes) if they have more options to travel across suburbs through the higher number of available road connections. For the omicron variant, we have seen earlier that the neighbourhood measure does not affect the case count, probably due to the high transmissibility of the variant and increased local movement due to the absence of lockdown. Consequently, none of the socio-economic variables moderated the relation between the neighbourhood measure and case count.

There are studies in the current literature that explore how restrictions mitigate the adverse COVID-19 effects from various perspectives. Like this study, some used network analysis and statistical modelling^15,19. As outlined in Table 4, like our study, any mobility restrictions helped reduce COVID-19 negative impacts in one way or another. Our study successfully developed models to explore future COVID-19 infection rates based on prior data and road network density, indicating its uniqueness and novelty. Instead of exploring the direct effect of mobility restriction, our study showed how infection counts could be better estimated, thus controlled, from road network features and previous infection data.

Table 4 A comparison of this study and other similar studies from the literature.

Full size table

From the methodological viewpoint, our study faces some limitations, which can potentially create future research scopes. First, we did not consider the number of road lanes connecting two suburbs while capturing road networks. A two-lane road or a multi-lane highway could connect two suburbs, thus representing different transport capabilities. Second, although panel regression is a widely used modelling method for longitudinal data, it has several assumptions on the research dataset used for modelling. In future studies, we aim to explore these assumptions and how they impact model performance by adopting other existing methods (e.g., Bayesian structural time series model³⁹) to capture temporal dynamics.

Conclusion

The Greater Sydney area residents endured nearly 4 months of COVID-19 lockdown during the last half of 2021. While the lockdown bought precious time to ramp up vaccination rollout and prepare healthcare facilities, it left a lasting economic and psychological impact. This study analysed the mobility and prevalence data in two distinct timeframes to model and predict the COVID-19 case count during late 2021. The timeframes represented delta and omicron outbreaks, respectively, and for the former outbreak, there was a lockdown in place and a nighttime curfew for some period. The road network between the neighbouring suburbs was used to approximate the influx and corresponding risk of case growth from adjacent areas. Therefore, this study helps us explore and compare the effect of mobility and case count during and without a lockdown period. It also provides a comparison between delta and omicron variants. The moderating effect of three socio-economic variables is discussed. The method introduced in this study shows an effective way to utilise geographic information and road connection networks with health data to model COVID-19 transmission. The regression model results show that the road network-based neighbourhood measure significantly predicts the case count for the delta variant. The results also show that the income or education level of the residents does not necessarily have any effect in moderating the case count and mobility. The methodology presented in this study could be replicated for other states or countries to gather similar insights.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author upon reasonable request.

References

Nižetić, S. Impact of coronavirus (COVID-19) pandemic on air transport mobility, energy, and environment: A case study. Int. J. Energy Res. 44(13), 10953–10961 (2020).
Article Google Scholar
Štifanić, D. et al. Impact of COVID-19 on forecasting stock prices: An integration of stationary wavelet transform and bidirectional long short-term memory. Complexity 2020, 1846926 (2020).
Article Google Scholar
Hernandez-Matamoros, A., Fujita, H., Hayashi, T. & Perez-Meana, H. Forecasting of COVID19 per regions using ARIMA models and polynomial functions. Appl. Soft Comput. 96, 106610 (2020).
Article PubMed PubMed Central Google Scholar
Swaraj, A. et al. Implementation of stacking based ARIMA model for prediction of Covid-19 cases in India. J. Biomed. Inform. 121, 103887 (2021).
Article PubMed PubMed Central Google Scholar
Abdy, M., Side, S., Annas, S., Nur, W. & Sanusi, W. An SIR epidemic model for COVID-19 spread with fuzzy parameter: The case of Indonesia. Adv. Differ. Equ. 2021(1), 1–17 (2021).
Article MathSciNet Google Scholar
Liu, X. X., Fong, S. J., Dey, N., Crespo, R. G. & Herrera-Viedma, E. A new SEAIRD pandemic prediction model with clinical and epidemiological data analysis on COVID-19 outbreak. Appl. Intell. 51(7), 4162–4198 (2021).
Article Google Scholar
Car, Z., Baressi Šegota, S., Anđelić, N., Lorencin, I. & Mrzljak, V. Modeling the spread of COVID-19 infection using a multilayer perceptron. Comput. Math. Methods Med. 2020, 5714714 (2020).
Article PubMed PubMed Central Google Scholar
Melin, P., Monica, J. C., Sanchez, D. & Castillo, O. Multiple ensemble neural network models with fuzzy response aggregation for predicting COVID-19 time series: The case of Mexico. Healthcare 8(2), 181 (2020).
Article PubMed PubMed Central Google Scholar
Pinter, G., Felde, I., Mosavi, A., Ghamisi, P. & Gloaguen, R. COVID-19 pandemic prediction for hungary; a hybrid machine learning approach. Mathematics 8(6), 890 (2020).
Article Google Scholar
Chimmula, V. K. R. & Zhang, L. Time series forecasting of COVID-19 transmission in Canada using LSTM networks. Chaos Solitons Fractals 135, 109864–109864 (2020).
Article PubMed PubMed Central Google Scholar
Yudistira, N. COVID-19 growth prediction using multivariate long short term memory. arXiv:2005.04809 (2020).
Varotsos, C. A. & Krapivin, V. F. A new model for the spread of COVID-19 and the improvement of safety. Saf. Sci. 132, 104962 (2020).
Article PubMed PubMed Central Google Scholar
Lotfi, M., Hamblin, M. R. & Rezaei, N. COVID-19: Transmission, prevention, and potential therapeutic opportunities. Clin. Chim. Acta 508, 254–266 (2020).
Article CAS PubMed PubMed Central Google Scholar
Oh, J. et al. Mobility restrictions were associated with reductions in COVID-19 incidence early in the pandemic: Evidence from a real-time evaluation in 34 countries. Sci. Rep. 11(1), 13717 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Bonaccorsi, G. et al. Economic and social consequences of human mobility restrictions under COVID-19. Proc. Natl. Acad. Sci. 117(27), 15530–15535 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Bharati, T. & Fakir, A. M. Pandemic Catch-22: How effective are mobility restrictions in halting the spread of COVID-19 in developing countries. Covid Econ. 26, 107–136 (2020).
Google Scholar
Thombre, A. & Agarwal, A. A paradigm shift in urban mobility: Policy insights from travel before and after COVID-19 to seize the opportunity. Transport Policy 110, 335–353 (2021).
Article PubMed PubMed Central Google Scholar
Sharma, G., Dhulipala, S. & Patil, G. R. Effect of tourism and air travel restrictions on the initial international spread of the COVID-19 pandemic. Tour. Anal. 28(3), 357–370 (2023).
Article Google Scholar
Li, W., Zhao, S.-C., Ji, X.-F. & Ma, J.-W. Impact of traffic exposure and land use patterns on the risk of COVID-19 spread at the community level. China J. Highw. Transport 33(11), 43–54 (2020).
Google Scholar
Eisenberg, J. N. et al. In-roads to the spread of antibiotic resistance: Regional patterns of microbial transmission in northern coastal Ecuador. J. R. Soc. Interface 9(70), 1029–1039 (2012).
Article PubMed Google Scholar
Numminen, E. & Laine, A.-L. The spread of a wild plant pathogen is driven by the road network. PLoS Comput. Biol. 16(3), e1007703 (2020).
Article PubMed PubMed Central Google Scholar
Uddin, S., Khan, A., Lu, H., Zhou, F. & Karim, S. Suburban road networks to explore COVID-19 vulnerability and severity. Int. J. Environ. Res. Public Health 19(4), 2039 (2022).
Article CAS PubMed PubMed Central Google Scholar
Al Wahaibi, A. et al. The impact of mobility restriction strategies in the control of the COVID-19 pandemic: Modelling the relation between COVID-19 health and community mobility data. Int. J. Environ. Res. Public Health 18(19), 10560 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhou, Y. et al. Effects of human mobility restrictions on the spread of COVID-19 in Shenzhen, China: A modelling study using mobile phone data. Lancet Digit. Health 2(8), e417–e424 (2020).
Article PubMed PubMed Central Google Scholar
NSW Health. COVID-19 data and statistics. 2022 [cited 2021 December 25]. https://www.nsw.gov.au/covid-19/stay-safe/data-and-statistics.
Google Maps. Google maps, Australia. 2022 [cited 2021 June 15]. www.maps.google.com.au.
Census QuickStats. Australian Bureau of Statistics: 2016 Census QuickStats. 2021 [cited 2021 May 25]. https://quickstats.censusdata.abs.gov.au/census_services/getproduct/census/2016/quickstat/POA2190?opendocument.
Chamberlain, G. Multivariate regression models for panel data. J. Econom. 18(1), 5–46 (1982).
Article MathSciNet Google Scholar
Kohler, U. & Kreuter, F. Data Analysis Using Stata (Stata Press, College Station, 2005).
Google Scholar
Bastian, M., Heymann, S. & Jacomy, M. Gephi: an open source software for exploring and manipulating networks. In Third International AAAI Conference on Weblogs and Social Media 361–362 (San Jose, California, USA, 2009).
Battersby, S. E., Finn, M. P., Usery, E. L. & Yamamoto, K. H. Implications of web Mercator and its use in online mapping. Cartogr. Int. J. Geogr. Inf. Geovisualization 49(2), 85–101 (2014).
Google Scholar
Terrell, G. R. & Scott, D. W. Variable kernel density estimation. Ann. Stat. 20(3), 1236–1265 (1992).
Article MathSciNet Google Scholar
Hou, X. et al. Intracounty modeling of COVID-19 infection with human mobility: Assessing spatial heterogeneity with business traffic, age, and race. Proc. Natl. Acad. Sci. 118(24), e2020524118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Australian Broadcasting Corporation News. Tracking Autralia's COVID vaccine rollout numbers. 2022 [cited 2022 February 26]. https://www.abc.net.au/news/2021-03-02/charting-australias-covid-vaccine-rollout/13197518.
Reuters News. Freedom Day': Sydney reopens as Australia looks to live with COVID-19. 2022 [cited 2022 February 26]. https://www.reuters.com/world/asia-pacific/long-100-days-sydney-reopens-australia-looks-live-with-covid-19-2021-10-10/.
Cameroni, E. et al. Broadly neutralizing antibodies overcome SARS-CoV-2 Omicron antigenic shift. Nature 602, 1–9 (2021).
Google Scholar
Australian Broadcasting Corporation News. How Sydney's COVID-19 lockdown is dividing the city (2022). https://www.abc.net.au/news/2021-08-22/sydney-covid-19-lockdown-is-creating-growing-inequality/100391922.
Monod, M. et al. Age groups that sustain resurging COVID-19 epidemics in the United States. Science 371(6536), eabe8372 (2021).
Article CAS PubMed Google Scholar
Brodersen, K. H., Gallusser, F., Koehler, J., Remy, N. & Scott, S. L. Inferring causal impact using Bayesian structural time-series models. Ann. Appl. Stat. 9(1), 247–274 (2015).
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

School of Project Management, Faculty of Engineering, The University of Sydney, Forest Lodge, NSW, 2037, Australia
Shahadat Uddin, Arif Khan, Haohui Lu, Fangyu Zhou & Shakir Karim
School of Science and Technology, University of New England, Armidale, NSW, 2350, Australia
Farshid Hajati
Artificial Intelligence and Cyber Futures Institute, Charles Sturt University, Bathurst, NSW, 2795, Australia
Mohammad Ali Moni

Authors

Shahadat Uddin
View author publications
You can also search for this author in PubMed Google Scholar
Arif Khan
View author publications
You can also search for this author in PubMed Google Scholar
Haohui Lu
View author publications
You can also search for this author in PubMed Google Scholar
Fangyu Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Shakir Karim
View author publications
You can also search for this author in PubMed Google Scholar
Farshid Hajati
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Ali Moni
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.U.: conceptualisation, data collection, data analysis and writing. A.K., H.L., F.Z., S.K.: data collection, data analysis and writing. F.H. and M.A.M.: data analysis and writing.

Corresponding author

Correspondence to Shahadat Uddin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Uddin, S., Khan, A., Lu, H. et al. Road networks and socio-demographic factors to explore COVID-19 infection during its different waves. Sci Rep 14, 1551 (2024). https://doi.org/10.1038/s41598-024-51610-w

Download citation

Received: 28 September 2023
Accepted: 07 January 2024
Published: 18 January 2024
DOI: https://doi.org/10.1038/s41598-024-51610-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.