Rural–urban scaling of age, mortality, crime and property reveals a loss of expected self-similar behaviour

Sutton, Jack; Shahtahmassebi, Golnaz; Ribeiro, Haroldo V.; Hanley, Quentin S.

doi:10.1038/s41598-020-74015-x

Download PDF

Article
Open access
Published: 08 October 2020

Rural–urban scaling of age, mortality, crime and property reveals a loss of expected self-similar behaviour

Jack Sutton¹,
Golnaz Shahtahmassebi¹,
Haroldo V. Ribeiro² &
…
Quentin S. Hanley¹

Scientific Reports volume 10, Article number: 16863 (2020) Cite this article

2479 Accesses
5 Citations
3 Altmetric
Metrics details

Subjects

Abstract

The urban scaling hypothesis has improved our understanding of cities; however, rural areas have been neglected. We investigated rural–urban population density scaling in England and Wales using 67 indicators of crime, mortality, property, and age. Most indicators exhibited segmented scaling about a median critical density of 27 people per hectare. Above the critical density, urban regions preferentially attract young adults (25–40 years) and lose older people (> 45 years). Density scale adjusted metrics (DSAMs) were analysed using hierarchical clustering, networks, and self-organizing maps (SOMs) revealing regional differences and an inverse relationship between excess value of property transactions and a range of preventable mortality (e.g. diabetes, suicide, lung cancer). The most striking finding is that age demographics break the expected self-similarity underlying the urban scaling hypothesis. Urban dynamism is fuelled by preferential attraction of young adults and not a fundamental property of total urban population.

Socio-economic, built environment, and mobility conditions associated with crime: a study of multiple cities

Article Open access 17 August 2020

Urban form and structure explain variability in spatial inequality of property flood risk among US counties

Article Open access 01 April 2024

Exploring urban housing disadvantages and economic struggles in Seoul, South Korea

Article Open access 02 April 2024

Introduction

Cities are important drivers of economic and creative human activities^1,2,3,4 and this behavior has long been linked to population⁵. These studies have shown super-linear scaling in urban performance indicators such as patents, GDP, and R&D employment^1,2,3. Other less desirable features follow similar scaling such as homicide^4,6, AIDS cases¹, and general crime^7,8. Conversely, there are important economies of scale found in cities in such indicators as road surface and petrol stations¹. Underpinning this work is the notion of self-similarity leading to behavior which is well approximated by power laws⁹. Modeling this behavior remains an active area of research. These studies have shown that per capita measures are deeply and fundamentally biased in all but the rare metrics which show linear scaling^2,10,11. This important paradigm shift has not been as widely appreciated beyond the urban scaling community.

Despite the improved understanding of power law scaling in urban regions, linear per capita models remain a cornerstone of many aspects of policy and resource allocation. For example, in the UK regional distribution of health care resources is done via clinical commissioning groups (CCGs). This begins with a per capita allocation which is adjusted for mortality, market forces, and a range of other factors based on nutrition, obesity, smoking, drugs etc^12,13,14. The use of scaled metrics provides an opportunity to better understand the taxonomy of health and well-being as well as a host of other metrics. Regional considerations also appear in discussions of economic and social issues in the UK as a north south divide^15,16,17. The distribution of population explains some of this, but analysis of regional behaviour relative to scaling law expectations can provide a more definitive view of regional characteristics.

The urban scaling literature has an inherent bias by studying cities and neglecting rural regions. Although the urban population currently exceeds the rural population worldwide¹⁸, urban areas cover a relatively small amount of the world’s land area and very few studies have looked to see whether cities are fundamentally different from rural regions. Definitions of rural vary. Example definitions of rural areas include: areas which are not urban¹⁹, areas of low population density and other indicators of rural life²⁰, or based on surface urban heat islands²¹. We consider rural and urban to be extremes of a continuum of human environments with population density providing a quantitative metric of position along that continuum. In previous work using data from England and Wales, we found that some metrics follow a single law while others undergo transitions at critical population densities⁷. Metrics undergoing transitions exhibit a range of behaviors: acceleration (e.g. robbery), inhibition (e.g. shoplifting), and collapse (detached housing transactions). The statistical mechanics underlying this behavior remains an unsolved problem, but the existence of critical population densities allows an empirical division between rural and urban. The neglect of rural regions almost certainly neglects their importance assuring the food and material security of heavily urbanized regions.

A consequence of the improved understanding of the effects of scale in indicators is the development of scale adjusted and density scale adjusted metrics^2,7,11. These were initially developed as indicators of the uniqueness of a particular urban region and used to develop a taxonomy of similar types of cities². The methodology has since been adapted to density scaling of both urban and rural regions where it was used to understand the inter-relationships between crime and property⁸.

Here, we investigated a range of indicators of mortality, crime, property and age throughout England and Wales to determine if mortality behaves similarly to the previous work on crime and property.

Theory

Population density scaling

Density scale adjusted metrics^7,8 are an area normalized approach to scale adjusted metrics^2,11. Urban scaling uses total population to predict a range of indicator metrics using power laws.

$$Y={Y}_{0}{n}^{\beta }$$

(1)

where, Y is an indicator such as crime or GDP, Y₀ is a pre-exponential factor; n is the population density of the region and β is a scaling exponent. When looking at both rural and urban regions, density metrics (y = Y/A) and population density (d = n/A) have been found to better predict overall behavior^7,8 where A is the area of a given region.

$$y={y}_{0}{d}^{\beta }$$

(2)

Similarly to urban scaling, when β < 1, the scaling is sub-linear; when β = 1, the scaling is linear; and when β > 1, the scaling is super-linear. Data is usually fitted to log transformed data to obtain parameters.

$$\mathrm{log}y={\mathrm{log}y}_{0}+\beta \mathrm{log}d$$

(3)

Empirically, transitions appear at a critical population density, d^*, for some metrics in the range of 10–70 people per hectare⁷. To account for this, Eq. (3) can be adjusted to allow a segmented fit⁷ at the critical density.

$$\mathrm{log}y=\left\{\begin{array}{c}{\mathrm{log}y}_{0}+{\beta }_{L}\mathrm{log}d\\ {\mathrm{log}y}_{1}+{\beta }_{H}\mathrm{log}d\end{array} \begin{array}{c}d<{d}^{*}\\ d \ge {d}^{*}\end{array}\right.$$

(4)

In this, β_L and y₀ are the exponent and pre-exponential factor below the transition; β_H and y₁ are the exponent and pre-exponential factor above the threshold. For purposes of modelling, the transition point is held to be continuous (e.g. $\mathrm{log}{y}_{1}=\mathrm{log}{y}_{0}+\left({\beta }_{L}-{\beta }_{H}\right)\mathrm{log}{d}^{*}$).

Density scale adjusted metrics (DSAMs)⁸, z_i, are the residuals in the fits obtained from the models defined by Eqs. (3) and (4).

$${z}_{i}=\mathrm{log}{y}_{i}-\mathrm{log}y$$

(5)

A number of issues have been noted when fitting power laws to urban scaling data sets²² and particularly when data sets have null values or zeros²³. In the data considered here, this issue is occasionally severe. Although progress has been made on these problems we note the following: (1) The analysis of scale adjusted metrics^2,8,10 assumes that the power law fits are an incomplete explanation of the data. Specifically, the approach^2,8,10 assumes the residuals around a power law fit contain explainable variance and are not random relative to other residuals. (2) Power variance models (e.g. Taylor’s law^22,24) are good models of the noise in some instances and across limited scales. However, segmented fluctuation scaling occurs at least in the case of crime²⁴. (3) Alternatives to power law models have been presented^22,25 but the extent to which the problems driving their development apply to density scales is unknown. In this context, power law models and the segmented modifications used here remain useful for understanding scale in human systems despite their limitations.

If two arrays of DSAMs corresponding to indicators (X, Y) over a set of n regions are represented by $X=({x}_{1}, {x}_{2}, \dots , {x}_{n})$ and $Y=({y}_{1}, {y}_{2}, \dots , {y}_{n})$, a range of similarity measures (sm) can be computed. A region in this context is a defined land area of some size. Here, it represents administrative areas in the UK (unitary authorities, non-metropolitan districts, metropolitan boroughs, and London boroughs) but could be any defined region for which indicator data is available. We considered 6 similarity measures: Pearson correlation (r(X, Y)), Spearman correlation ((S(${rg}_{X},{rg}_{Y}$)), Kendall correlation ((K(X,Y)), cosine similarity (c(X, Y)), and Jaccard similarities (J(X, Y)) to investigate the inter-relationships between the DSAMs.

The matrix of similarity measures (sm_ij) generated for each pair of indicator DSAMs (e.g. mortality, property, crime and age) were analyzed by hierarchical clustering based on a distance, ${\delta }_{ij}=\sqrt{2(1-s{m}_{ij})}$.

Results and discussion

Overview of regions

England and Wales consist of 348 regions including unitary authorities, non-metropolitan districts, metropolitan boroughs, and London boroughs. The regions ranged in area from 289 ha (City of London, England) up to 518,037 ha (Powys, Wales). Regional populations were from 2158 (Isles of Scilly, England) to 1,070,912 (Birmingham, England) while population density ranged from 0.25 people per hectare (Eden, Cumbria, England) up to 139 p/ha (Islington, England). This covers a range of environments and regions from very rural to highly urban.

Rural–urban scaling

The density scaling model gave reasonable fits to power laws (e.g. Figs. 1, S1 to S4). Regions did not stand out relative to the scaling laws with the notable exception of the City of London. This region was an obvious outlier in 23 separate metrics and was so extreme that it merits special attention (e.g. Fig. 1). The City of London is a small 289 hectare region within the greater London metropolitan area with a small resident population (7355) and a much larger (> 350,000) daytime population. Scaling laws have been shown to change depending on whether resident or floating population is considered²⁶. In our work, many crime indicators gave positive deviations consistent with daytime population. However, dementia mortality and to a lesser extent lung cancer exhibited extreme negative deviation. The generally reduced incidence of dementia in the high population density portion of the scaling plot is intriguing. The trend can be partly explained by a lower proportion of older people. However, the exponents for age and dementia are incommensurate. Dementia mortality decreases to a greater extent than the reduction in older people. This makes the City of London which is nearly a factor of 10 below expectations even more remarkable and future studies of dementia risk should consider a more detailed look at this group of people.

The density scaling exponents (Figs. 2a,b, S1, S2, S3, and S4; Tables S1 and S2) for crime and property were similar to those observed previously⁷ when parliamentary constituencies were used to define areas. Approximately half of crime metrics followed simple power laws: ASB, Burglary, Vehicle Crime, Violent Crime, Other Crime, Bike Theft, Weapons and Order. The remainder exhibited segmented scaling. Drugs, Other Theft, Theft from the Person and Robbery accelerated while Shoplifting and CD&A were inhibited in high density regions. This heterogeneity of behaviors is a challenge to crime opportunity theory^27,28 and situational action theories^29,30. A simple power law suggests uniformly increasing opportunities or criminogenic settings, but critical densities with both acceleration and inhibition require a clearer picture of what these opportunities and criminogenic settings represent. Similarly, the observation of a single relationship defining burglary across all scales challenges the notion of designed environments³¹ for reducing this and the other crime types showing single exponential behavior. The behavior of the eight single power-law crime types is remarkably robust over the entire land area of England and Wales.

Examination of mortality (Figs. 2c, S3, and Table S2) revealed that in rural regions except for 5 types of cancer (liver, stomach, lung, larynx and uterine cancer) and homicide, all mortality indicators exhibited sub-linear to linear scaling. In high density regions, all mortality except homicide was strongly sublinear. The dramatic improvement in mortality can be understood by examining the scaling of age groups.

Population density (Figs. 2d, S4; Table S2) had a profound influence on age demographics. High density regions attract young adults aged 25–39 and people age 45 and over preferentially leave. Although density exponents are not directly comparable to conventional ones³², the strength of the super-linear attraction for young people (β_H = 1.46 for the 30–34 age group) may be sufficient to explain almost all reported super-linear economic indicators^1,2,33. This can entirely explain the acceleration of Robbery in high density areas (Figs. 3, S1, and Table S2). Age has a strong influence on the exponent for mortality indicators. For example, kidney cancer and dementia show sublinear scaling in high density regions for the general population (Fig. S5, Table S3). When the two oldest age groups are considered, the protective effect of high density remains but is less pronounced. The data is suggestive for homicide having a single scaling exponent when considered using only the 30–34 age group, however, the data is too sparse at high density to reach a robust conclusion with this data set. If this observation holds beyond the UK, it is probably an important underlying mechanism for many effects observed in the urban scaling literature. As a minimum, age groups break the universal self-similarity of the urban scaling hypothesis. Scaling is not constant across age groups.

From a policy perspective, these findings are important. Mortality and health are primarily understood in per capita terms. As noted above, UK National Health Service funding is provided through clinical commissioning groups using a formula based primarily on a constant per capita cost^12,13,14. This per capita model may significantly under-estimate the economies of scale in high population density regions and the additional cost associated with delivering effectively to people in rural environments. The extent of the economies of scale are striking and there is a clear rural–urban divide in terms of mortality. The scaling phenomenon explains persistent northern excess mortality in the UK¹⁶. The regions north of the “north–south” divide have a lower population density and DSAM metrics make clear that the excess mortality is per capita and is commensurate with rural metrics.

Critical densities

Fifty-one out of sixty-seven indicators (6 crime, 8 property, 21 mortality and 16 age) exhibited a critical density (Fig. 4) distributed around a median of 27 p/h. This is similar to the average value of 30 p/h for 19 indicators in our previous work⁷. Although a bimodal density histogram is observed (Fig. 4b), a single distribution dominates. This is remarkable considering they arise from a wide range of indicators including crime, property, mortality and age. The exceptions to the rule include four age groups (aged 5–9, aged 10–14, aged 40–44, aged 45–49). The 40–49 age range is the boundary between the young adults who are super-linearly attracted to high density urban regions and the elderly who preferentially leave. It is likely that were the age ranges defined differently no critical point would be observed and the change in exponent around the critical values for all four is relatively small. Without these transitional age groups, only two exceptions remain. For the 45 indicators with critical densities in the same distribution, there is currently no explanation. There is no explanation for why mortality, crime, and property scaling pivots around a critical density. Age group behavior is important, but there is no explanation for the preferential attraction of young people to regions above a critical density. The critical density appears robustly near 27 p/h, but the reason it appears at that scale is unclear. The physics of percolation transitions^34,35,36 may offer solutions, but a unifying statistical mechanics remains to be found which predicts a transition in human behavior (crime), health (mortality), economics (property transaction values), and age demographics at a critical density remains to be found.

Correlation and hierarchical clustering of DSAMs by category

Correlation analysis and hierarchical clustering of DSAMs (Fig. 5) showed a tendency for crime, property, and mortality to be positively correlated and cluster together with other members of their class. As examples, most mortality types were positively correlated and clustered with all other mortality types (Fig. S6) with many correlations above 0.5 and values reaching 0.72 (e.g. Fig. S7: Lymphoid Cancer vs. Prostate Cancer). The exceptions were bone cancer, larynx cancer, and homicide. All crime types were positively correlated (Fig. S8) and clustered (Fig. 5) with all other types of crime. All property types were positively correlated (Fig. S9) and clustered (Fig. 5) with all other property types with some very strong correlations (e.g. Freehold vs. Old properties had Pearson correlation = 0.95 (Fig. S10)).

Age did not follow this pattern. Age groups were highly stratified with children and old people anti-correlated (Fig. S11). Different age ranges clustered with different indicator classes. Young people (15–34) clustered with crime. Older people (55 and above) clustered with mortality except for bone cancer. The very young (0–14) and middle-aged people (35–54) clustered with property. Most correlations in the first two clusters were positive, while most property indicators were anti-correlated with children aged 0–14.

The most striking finding is the division of the bulk of mortality indicators into two groups. One group clustered with the elderly and tended to have positive correlation with certain types of property DSAMs. The other group, nearly all of which are to some degree preventable (Accidents, Liver Cancer, Diabetes, Lung Cancer, Stomach Cancer, Oesophagus Cancer, Kidney cancer, Uterus Cancer, Homicide, Suicide and Larynx Cancer) had nearly universal anti-correlation with property DSAMs (c.f. Fig. S12, Flat vs. Lung Cancer DSAMs). The extent to which the magnitude of property transaction value exceeds scaling expectations protects against a wide range of mortality from preventable conditions ranging from homicide to uterine cancer. These conclusions were generally reinforced by all correlation measures (Figs. S13 and S14). The similarity measures were less informative (Figs. S15 and S16).

A limitation of the heatmap and clustering (Fig. 5) is the pairwise structure which does not display the significance of the correlations. A network accounting for this was created by bootstrapping the Pearson correlation with 2000 replications for every pair of metrics to identify correlations significant at 99% confidence. The resulting network (Fig. 6) has 66 nodes including all metrics except bone cancer which had no statistically significant correlations. There were 784 significant connections out of 2211 possible and the optimal modularity score (0.472) partitioned the network into 3 communities very similar to the clusters in the indicator heatmap (Fig. 4).

Specifically, the network analysis found three modules containing: the elderly and mortality; children, middle-aged people and property; and young adults and crime. There were only two exceptions to this pattern, suicide and cancer of the larynx, which clustered with young adults and crime. These two were also most closely related to each other in the clustering analysis. Cancer of the larynx has long been associated with alcohol³⁷ and smoking³⁸ and preventative measures beyond cessation are limited. The association with suicide as well as the positive correlations of cancer of the larynx and suicide with ASB, CD&A, violence, accidents, diabetes, liver and lung cancers suggests health care delivery focusing on mental health^39,40, alcohol⁴¹, and community safety may be beneficial for this group. Considering these types of mortality as long term responses to violence, stress, and mental illness could lead to more efficient prevention strategies.

Analysis of DSAMs by region

To understand regional behavior the clustering and correlation analysis was repeated on the transpose of the matrix of DSAMs such that it was presented by region rather than indicator (Fig. 7). Although heterogeneity is seen, broadly two clusters appear with universal anti-correlation at the extreme ends. The two extreme ends (e.g. Stoke-on-Trent vs. Bromley) live in nearly opposite worlds. If crime and mortality are above expectation in one it is below in the other. A geomap of the two clusters (Fig. S17) divided North England, Wales and the Midlands from Southern England with some exceptions.

Self-organizing maps

The simple geomap (Fig. S17) did not provide sufficient understanding of regional heterogeneity apparent in the cluster analysis. Regions are also affected by age demographics and their importance needs to be understood better. To explore regional behavior, the 348 regions were distributed onto an 8 by 8 hexagonal self-organizing map (Fig. S18). After 350 iterations convergence was reached (Fig. S19) with 4 clusters containing 2, 95, 190 and 61 regions which were colored orange, red, blue and green, respectively (Fig. 8).

The four clusters represent: (i) 61 mostly coastal areas (green) with a few more urban inland regions consisting of St. Helens, Stoke-on-Trent, Wyre Forest, Malvern Hills, Strafford-on-Avon, Dacorum, Ipswich, Kensington and Chelsea, Hammersmith and Fulham, City of Westminster, Islington and Camden); (ii) 2 regions (orange) including the City of London and St. Edmundsbury; (iii) 95 regions (red) mostly within the south of England (exceptions are: Richmondshire, Leeds, Bradford, Preston, Chorley, Blackburn with Darwen, Rosendale, Trafford, Manchester); and (iv) 190 more rural regions (blue) primarily in the North of England and Wales.

A key feature of the primarily coastal grouping was excess mortality and a more elderly demographic (people aged 60 +). The City of London and St. Edmundsbury were exceptional in comparison to the other clusters. They exhibit extremely low DSAMs for nearly all mortality types and high property and crime DSAMs driven by the City of London and to a lesser extent by St. Edmundsbury. Characterizing this cluster is aided by a plot of St. Edmundsbury Vs. City of London (Fig. S21) which shows the similarity to be related to high crime and property DSAMs and low mortality. It is important to note that the SOM classification is not based on correlation. Thus, the large group of more neutral indicators form part of the overall picture. The cluster primarily in the South of England is characterized by low mortality, a younger age demographic, and high property DSAMs. The remaining cluster (blue) represents most of the area of England and Wales. These are generally average for age, crime, and mortality with below expectation property DSAMs.

Conclusion

This study represents an advance in our understanding of scaling behavior while challenging the urban scaling hypothesis. It supports the general concept of scaling by making clear the problems of per capita models when applied to health outcomes. However, incommensurate scaling in different age demographics is a challenge. The scaling hypothesis considers all people as equal participants in the acceleration of life in cities. The data here shows that much of that acceleration depends on the ability of urban regions to attract young adults. Observed urban scaling is a consequence of separate scale related processes that define the behavior of specific age demographics around a critical transition in human behavior at the rural and urban boundary.

The consequences of this are great. There have now been many studies making clear that linear per capita measures are biased^1,4,7,10,11. The current study is the first to extend this to mortality from non-transmissible diseases and age demographics. Epidemiologists studying excess death need to understand the bias of per capita models. For example, the observed northern excess mortality in the UK¹⁶ reflects mortality at low population density rather than north–south division. The north mostly falls into a single category (the blue region of Fig. 7) and this region does not have exceptional mortality for the population densities. Policy makers need to understand the limitations of linear per capita models. In terms of mortality outcomes, there are large cumulative economies of scale between the most rural and the highest density urban regions. This is a consequence of scale related changes in age groups, to scaling behavior across all population densities, and conditions where high density areas provide protection (e.g. dementia). Within this context, health care resourcing is skewed in favor of population dense regions.

The robust rural–urban division near 25–30 p/h makes clear that ignoring rural regions is a missed opportunity for researchers studying urban systems. The existence of a rural–urban boundary justifies the study of cities while providing a clearer comparison against which claims about urban areas can be made. The lack of a clear explanation for critical densities and why they appear in such a consistent place is an important unsolved problem.

The success of DSAMs and related methodologies^2,8,10,11 makes clear that any set of scaling laws provides an incomplete picture of both rural and urban landscapes. Although they may appear to be, the residuals are not randomly distributed around the scaling law whatever the model. They are extensively correlated and reveal persistent structure and regional variation.

Materials and methods

Data sets

Data on mortality and age were provided by NOMIS (https://www.nomisweb.co.uk) a database service for labour market statistics run by the University of Durham on behalf of the UK Office of National Statistics. To anonymise the mortality data, NOMIS sets values ≤ 2 to 0 and values of 3 and 4 to 5 causing some distortion of low values and rare events. The age demographic data is model adjusted for a particular year based on the most recent census. Population, land area, crime and property information were obtained from the UK Home Office and Land Registry via UKCrimeStats (https://www.ukcrimestats.com) which provides alignment of public data sets using geographic shape files obtained from the Ordnance Survey Boundary Line dataset. Data covering the period from 2013–2017 were captured on 20/03/2019. A total of 67 indicators were obtained (Table 1) and are available as S1 Dataset.

Table 1 Comprehensive list of indicators studied. Sixty-seven indicators were studied: 14 indicators of crime, 9 indicators of property, 26 indicators of mortality and 18 indicators of age.

Full size table

Networks

Previous studies of crime and property found that DSAMs show extensive correlations and form modular networks⁸. The network representation $N=(V, E)$ consists of nodes $V=\left\{{v}_{1}, {v}_{2}, \dots , {v}_{n}\right\}$ and edges $E=\left\{{e}_{1}, {e}_{2}, \dots , {e}_{m}\right\}$. Here, nodes are indicators (e.g. Burglary, Suicide, etc.) and edges between them indicate two indicator metrics (i and j) with significant positive Pearson’s correlation, ${\rho }_{i,j}$ , between their corresponding DSAMs (edges weighted by ${\rho }_{i,j}$). The Pearson correlation was selected based on our previous work⁷. Indicators in networks were clustered by modularity optimization to detect community structure^42,43,44,45.

Self-organizing maps

A self-organizing map is an iterative approach to representing high dimensional datasets in a low dimensional space^46,47 using a pre-defined array of nodes, m, arranged in a “grid-like” structure. We selected an 8 × 8 hexagonal array of nodes initialized to a random weight ${w}_{ij}$ in the interval [0, 1]. This array was the largest n × n array without empty nodes⁴⁸. The nodes were then updated after introducing each regional DSAMs input vector ${x}_{1}, \dots , {x}_{n}$ at iteration $t$. The distance, $D\left(j\right),$ was obtained by calculating the Euclidean distance between the input vector and weight vector for all units such that:

$$D\left(j\right)=\sum_{i=1}^{n}\sum_{j=1}^{m}{\left({x}_{i}-{w}_{ij}\right)}^{2}$$

(6)

The input vector (regional DSAMs) was assigned to the unit index $j$ that has the minimum Euclidean distance. The weight vector ${w}_{ij}$ is updated on the “winning” unit $j$ after each iteration such that:

$${w}_{ij}\left(t+1\right)={w}_{ij}\left(t\right)+\alpha \left(x(t)-{w}_{ij}(t)\right)$$

(7)

where $x\left(t\right)$ is the input vector’s instance at iteration t, ${w}_{ij}\left(t\right)$ is the old weight, ${w}_{ij}\left(t+1\right)$ is the new weight and $\alpha$ is the learning rate in the interval [0, 1], which decreases with $t$, to ensure the network converges. After the learning phase, all observations (i.e. regions) are positioned into a node within the map. If two or more observations are positioned within the same node this shows similarity.

The nodes were clustered by the standardized gap statistic⁴⁹,

$${Gap}_{n}\left(k\right)={E}_{n}^{*}\left\{\mathrm{log}\left({W}_{k}\right)\right\}-\mathrm{log}\left({W}_{k}\right)$$

(8)

where $k$ is the number of clusters, ${W}_{k}$ is the pooled within-cluster sum of squares around the cluster means and ${E}_{n}^{*}$ denotes expectation under a sample of size n from the reference distribution⁴⁹ usually a uniform distribution (i.e. a distribution with no obvious clustering). An estimate of ${E}_{n}^{*}\left\{\mathrm{log}\left({W}_{k}\right)\right\}$, is obtained by simulating B samples of $\mathrm{log}\left({w}_{k}^{*}\right)$ each of size n generated from a Monte Carlo sample ${X}_{1}^{*}, . . . , {X}_{n}^{*}$ drawn from the reference distribution. In each case, ${E}_{n}^{*}\left\{\mathrm{log}\left({W}_{k}\right)\right\}$ is an average of B samples of $\mathrm{log}\left({w}_{k}^{*}\right)$. Therefore, assuming the reference distribution is a uniform distribution, a large gap statistic means that the clustering structure does not resemble uniformly distributed observations. Thus, the optimal number of clusters k occurs when $Gap\left(k\right)\ge Gap\left(k+1\right)-{s}_{k+1}$. Here, ${s}_{k}$ is the simulation error in ${E}_{n}^{*}\left\{\mathrm{log}\left({W}_{k}\right)\right\}$.

Data analysis

The data were analyzed using the statistical software R version (3.6.2)⁵⁰ with the Segmented (1.1–0)^51,52,53,54, proxy (0.4–2.4)⁵⁵, boot (1.3–2.4)^56,57, kohonen (3.0.1)^58,59, and factoextra (1.0.6)⁶⁰, moments (0.14)⁶¹, gplots (3.0.3)⁶², ggplot2 (3.3.1)⁶³, car (3.0–8)⁶⁴, nortest (1.0–4)⁶⁵, RColorbrewer (1.1–2)⁶⁶, NbClust (3.0)⁶⁷, tidyverse (1.3.0)⁶⁸, cowplot (1.0.0)⁶⁹, psych (1.9.12.31)⁷⁰, sf (0.8–1)⁷¹, raster (3.0–12)⁷², dplyr (0.8.3)⁷³, spData (0.3.3)⁷⁴, tmap (2.3–2)⁷⁵, leaflet (2.0.3)⁷⁶, mapview (2.7.0)⁷⁷, shiny (1.4.0.2)⁷⁸, and png (0.1–7)⁷⁹ packages. The data were log transformed and analyzed by piecewise regression. The Davies test was used to test the significance of any changes of slope with a 99% confidence level set for inclusion of a second segment. The Davies test and Akaike (AIC) and Bayesian (BIC) information criteria were used to select single and double exponential models. The residuals from the selected model were computed and used directly as DSAMs. Correlation and similarity measures were investigated including Pearson, Spearman and Kendall correlation, cosine similarity, and Jacquard distance using the proxy package computed in a pairwise manner for all indicator metrics and regions. The Pearson correlation and uncertainties were bootstrapped using the boot package to find significant connections at 95% confidence. The obtained connections for both the indicators and the regions are used to form positive and negative networks. The networks were constructed using Gephi version (0.9.2)⁸⁰. The self-organizing maps (SOM) were constructed using the kohonen package to investigate regional characteristics. A range of clustering methods were deployed on the SOM using the package factoextra to find an optimal number of clusters. These clusters are represented in the regional maps.

Data availability

All data generated or analysed during this study are included in this published article (and its supplementary information files). This data was compiled from a range of publicly available sources as noted in the manuscript. These are provided as the Following files: S1_data_raw.csv, S1_data_densities.csv, S1_data_cluster_means.csv, and S1_data_residuals.csv.

Code availability

We have also provided a set of R-scripts as supplementary information. This has been provided as S1_maincode_rev1.R.

References

Bettencourt, L. M. A., Lobo, J., Helbing, D., Kühnert, C. & West, G. B. Growth, innovation, scaling, and the pace of life in cities. Proc. Natl. Acad. Sci. 104, 7301–7306 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Bettencourt, L. M. A., Lobo, J., Strumsky, D. & West, G. B. Urban scaling and its deviations: Revealing the structure of wealth, innovation and crime across cities. PLoS ONE 5, e13541 (2010).
Article ADS PubMed PubMed Central CAS Google Scholar
van Raan, A. F. J., van der Meulen, G. & Goedhart, W. Urban Scaling of Cities in the Netherlands. arXiv Prepr. arXiv1503.04795 (2015).
Alves, L. G. A., Ribeiro, H. V. & Mendes, R. S. Scaling laws in the dynamics of crime growth rate. Phys. A 392, 2672–2679 (2013).
Article Google Scholar
Society, S., Annaler, G. & Geography, H. Urban Allometric Growth Author (s): Stig Nordbeck Published by : Wiley on behalf of the Swedish Society for Anthropology and Geography Stable URL : https://www.jstor.org/stable/490887dy/Iydx. 53, 54–67 (2016).
Gomez-Lievano, A., Youn, H. J. & Bettencourt, L. M. A. The statistics of urban scaling and their connection to Zipf’s law. PLoS ONE 7, e40393 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Hanley, Q. S., Lewis, D. & Ribeiro, H. V. Rural to urban population density scaling of crime and property transactions in english and welsh parliamentary constituencies. PLoS ONE 11, e0149546 (2016).
Article PubMed PubMed Central CAS Google Scholar
Ribeiro, H. V., Hanley, Q. S. & Lewis, D. Unveiling relationships between crime and property in England and Wales via density scale-adjusted metrics and network tools. PLoS ONE 13, e0192931 (2018).
Article PubMed PubMed Central CAS Google Scholar
Bettencourt, L., Lobo, J. & Youn, H. The hypothesis of urban scaling: formalization, implications and challenges. arXiv Prepr. arXiv1301.5919 (2013).
Alves, L. G. A., Ribeiro, H. V., Lenzi, E. K. & Mendes, R. S. Distance to the scaling law: A useful approach for unveiling relationships between crime and urban metrics. PLoS ONE 8, e69580 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Alves, L. G. A., Mendes, R. S., Lenzi, E. K. & Ribeiro, H. V. Scale-adjusted metrics for predicting the evolution of urban indicators and quantifying the performance of cities. PLoS ONE 10, e0134862 (2015).
Article PubMed PubMed Central CAS Google Scholar
Advisory Committee Allocation Resource. Public health grant : Exposition book for proposed formula for 2016–17 target allocations – Technical Guide. (2016).
NHS England Analytical Services (Finance). Technical Guide to Allocation Formulae and Pace of Change. (2016).
Anonymous. Fair Shares: A guide to NHS Allocations. (2018).
Green, A. E. The north–south divide in great Britain: An examination of the evidence. Trans. Inst. Br. Geogr. 13, 179 (1988).
Article Google Scholar
Hacking, J. M., Muller, S. & Buchan, I. E. Trends in mortality from 1965 to 2008 across the English north-south divide: Comparative observational study. BMJ 342, 1–9 (2011).
Article Google Scholar
Keeble, D. & Bryson, J. Small-firm creation and growth, regional development and the North-South divide in Britain. Environ. Plan. A 28, 909–934 (1996).
Article Google Scholar
United Nations. World Urbanisation Prospects: The 2014 Revision. ST/ESA/SER.A/366, (2015).
Salvatore, M., Pozzi, F., Ataman, E., Huddleston, B. & Bloise, M. Mapping global urban and rural population distributions. (2005).
Swiecki-Sikora, A. L., Henry, K. A. & Kepka, D. HPV vaccination coverage among us teens across the rural–urban continuum. J. Rural Heal. 35, 506–517 (2019).
Article Google Scholar
Li, K., Chen, Y., Wang, M. & Gong, A. Spatial–temporal variations of surface urban heat island intensity induced by different definitions of rural extents in China. Sci. Total Environ. 669, 229–247 (2019).
Article ADS CAS PubMed Google Scholar
Leitão, J. C., Miotto, J. M., Gerlach, M. & Altmann, E. G. Is this scaling nonlinear? Subject category: Subject areas. R Soc. Open Sci. 3, 150649 (2016).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Finance, O. & Cottineau, C. Are the absent always wrong? Dealing with zero values in urban scaling. Environ. Plan. B Urban Anal. City Sci. 2, 1–15 (2018).
Google Scholar
Hanley, Q. S., Khatun, S., Yosef, A. & Dyer, R. M. Fluctuation scaling, Taylor’s law, and crime. PLoS ONE 9, 2 (2014).
Article CAS Google Scholar
Yang, V. C., Papachristos, A. V. & Abrams, D. M. The origin of urban productivity scaling laws: mathematical model and new empirical evidence. arXiv1712.00476, 1–9 (2017).
Caminha, C. et al. Human mobility in large cities as a proxy for crime. PLoS ONE 12, 1–13 (2017).
Article CAS Google Scholar
Grasmick, H. G., Tittle, C. R., Bursik, R. J. & Arneklev, B. J. Testing the core empirical implications of gottfredson and hirschi’s general theory of crime. J. Res. Crime Delinq. 30, 5–29 (1993).
Article Google Scholar
Pratt, T. C. & Cullen, F. T. The empirical status of Gottfredson and Hirchi’s general theory of crime: A meta-analysis. Criminology 38, 931–964 (2000).
Article Google Scholar
Wikström, P. O. H. Why Crime Happens: A Situational Action Theory. In Analytical Sociology: Actions and Networks (ed. Manzo, G.) 74–94 (Wiley, New York, 2014).
Google Scholar
Wikström, P. O. H. Crime as alternative: Towards a cross-level situational action theory of crime causation. In Beyond Empiricism: Institutions and Intentions in the Study of Crime (ed. McCord, J.) 1–38 (Transaction Publishers, Abingdon, 2004).
Google Scholar
Kinney, J. B., Mann, E. & Winterdyk, J. A. Crime Prevention. Crime Prevention: International Perspectives, Issues, and Trends (CRC Press, Boca Raton, 2017). https://doi.org/10.1201/9781315314211.
Book Google Scholar
Cottineau, C., Finance, O., Hatna, E., Arcaute, E. & Batty, M. Defining urban clusters to detect agglomeration economies. Environ. Plan. B Urban Anal. City Sci. 46, 1611–1626 (2019).
Article Google Scholar
Bettencourt, L. M. A. The origins of scaling in cities. Science (80-). 340, 1438–1441 (2013).
Article ADS MathSciNet CAS MATH Google Scholar
Lee, D., Cho, Y. S., Goh, K.-I., Lee, D.-S. & Kahng, B. Recent advances of percolation theory in complex networks. J. Korean Phys. Soc. 73, 152–164 (2018).
Article ADS Google Scholar
Arcaute, E. et al. Cities and regions in Britain through hierarchical percolation. R. Soc. Open Sci. 3, 1–11 (2016).
Article MathSciNet Google Scholar
Alves, L. G. A., Andrade, J. S., Hanley, Q. S. & Ribeiro, H. V. The hidden traits of endemic illiteracy in cities. Phys. A 515, 566–574 (2019).
Article Google Scholar
Wynder, E., Covey, L., Mabuchi, K. & Mushininski, M. Environmental factors in cancer of the larynx. A second Look. Cancer 38, 1591–1601 (1976).
Article CAS PubMed Google Scholar
South, A. P. et al. Mutation signature analysis identifies increased mutation caused by tobacco smoke associated DNA adducts in larynx squamous cell carcinoma compared with oral cavity and oropharynx. Sci. Rep. 9, 1–9 (2019).
Article ADS CAS Google Scholar
Barnard-Kelly, K. D. et al. Suicide and self-inflicted injury in diabetes: A balancing act. J. Diabetes Sci. Technol. https://doi.org/10.1177/1932296819891136 (2019).
Article PubMed PubMed Central Google Scholar
Amiri, S. & Behnezhad, S. Cancer diagnosis and suicide mortality: A systematic review and meta-analysis. Arch. Suicide Res. 2, 1–19 (2019).
Google Scholar
Alattas, M., Ross, C. S., Henehan, E. R. & Naimi, T. S. Alcohol policies and alcohol-attributable cancer mortality in U.S. States. Chem. Biol. Interact. 315, 108885 (2020).
Article CAS PubMed Google Scholar
Girvan, M. & Newman, M. E. J. Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA. 99, 7821–7826 (2002).
Article ADS MathSciNet CAS PubMed MATH PubMed Central Google Scholar
Newman, M. E. J. & Girvan, M. Finding and evaluating community structure in networks. Phys. Rev. E 69, 026113 (2004).
Article ADS CAS Google Scholar
Newman, M. E. J. Modularity and community structure in networks. Proc. Natl. Acad. Sci. USA. 103, 8577–8582 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Blondel, V. D., Guillaume, J. L., Lambiotte, R. & Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 10, 1–12 (2008).
MATH Google Scholar
Kohonen, T. Self-organizing maps (Springer, Berlin, 2001).
Book MATH Google Scholar
Kohonen, T. The self-organizing map. Proc. IEEE 78, 1464–1480 (1990).
Article Google Scholar
Chang, M. Artificial Intelligence for Drug Development, Precision Medicine, and Healthcare (Chapman and Hall/CRC, Boca Raton, 2020).
Book Google Scholar
Tibshirani, R., Walther, G. & Hastie, T. Estimating the number of clusters in a data set via the gap statistic. J. R. Stat. Soc. Ser. B Stat. Methodol. https://doi.org/10.1111/1467-9868.00293 (2001).
Article MathSciNet MATH Google Scholar
Team, R. C. R: A language and environment for statistical computing. (2019).
Muggeo, V. M. R. Segmented: An R package to fit regression models with broken-line relationships. R News 8, 20–25 (2008).
Google Scholar
Muggeo, V. M. R. Estimating regression models with unknown break-points. Stat. Med. 22, 3055–3071 (2003).
Article PubMed Google Scholar
Muggeo, V. M. R. Testing with a nuisance parameter present only under the alternative: A score-based approach with application to segmented modelling. J. Stat. Comput. Simul. 86, 3059–3067 (2016).
Article MathSciNet MATH Google Scholar
Muggeo, V. M. R. Interval estimation for the breakpoint in segmented regression: A smoothed score-based approach. Aust. N. Z. J. Stat. 59, 311–322 (2017).
Article MathSciNet MATH Google Scholar
Meyer, D. & Buchta, C. Proxy: Distance and similarity measures. (2019).
Canty, A. & Ripley, B. D. boot: Bootstrap R (S-Plus) Functions. (2019).
Davison, A. C. & Hinkley, D. V. Bootstrap Methods and Their Applications (Cambridge University Press, Cambridge, 1997).
Book MATH Google Scholar
Wehrens, R. & Kruisselbrink, J. Flexible self-organizing maps in kohonen 3.0. J. Stat. Softw. 87, 1–18 (2018).
Article Google Scholar
Wehrens, R. & Buydens, L. M. C. Self- and super-organizing maps in R: The kohonen package. J. Stat. Softw. 21, 1–19 (2007).
Article Google Scholar
Kassambara, A. & Mundt, F. factoextra: Extract and visualize the results of multivariate data analyses. (2019).
Novomestky, L. K. and F. moments: Moments, cumulants, skewness, kurtosis and related tests. R package version 0.14. (2015).
Gregory R. Warnes, Ben Bolker, Lodewijk Bonebakker, Robert Gentleman, Wolfgang Huber Andy Liaw, Thomas Lumley, M. & Maechler, Arni Magnusson, Steffen Moeller, M. S. and B. V. gplots: Various R Programming Tools for Plotting Data. R package version 3.0.1.1. (2019).
Wickham, H. ggplot2: Elegant graphics for data analysis (Springer, New York, 2016).
Book MATH Google Scholar
Fox, J. & Weisberg, S. An {R} Companion to Applied Regression. (Sage, 2019).
Gross, J. & Ligges, U. nortest: Tests for normality. (2015).
Neuwirth, E. RColorBrewer: ColorBrewer Palettes. (2014).
Charrad, M., Ghazzali, N., Boiteau, V. & Niknafs, A. NbClust}: An {R package for determining the relevant number of clusters in a data set. J. Stat. Softw. 61, 1–36 (2014).
Article Google Scholar
Wickham, H. et al. Welcome to the {tidyverse}. J. Open Source Softw. 4, 1686 (2019).
Article ADS Google Scholar
Wilke, C. O. cowplot: Streamlined Plot Theme and Plot Annotations for ‘ggplot2’. (2019).
Revelle, W. psych: Procedures for psychological, psychometric, and personality research. (2019).
Pebesma, E. Simple features for R: Standardized support for spatial vector data. R J. 10, 439–446 (2018).
Article Google Scholar
Hijmans, R. J. raster: Geographic data analysis and modeling. (2020).
Wickham, H., François, R., Henry, L. & Müller, K. dplyr: A Grammar of Data Manipulation, R package version 0.8.3. (2019).
Bivand, R., Nowosad, J. & Lovelace, R. spData: Datasets for Spatial Analysis. (2020).
Tennekes, M. {tmap}: Thematic maps in {R}. J. Stat. Softw. 84, 1–39 (2018).
Article Google Scholar
Cheng, J., Karambelkar, B. & Xie, Y. leaflet: Create interactive web maps with the javascript ‘Leaflet’ library. (2019).
Appelhans, T., Detsch, F., Reudenbach, C. & Woellauer, S. mapview: Interactive Viewing of Spatial Data in R. (2019).
Chang, W., Cheng, J., Allaire, J. J., Xie, Y. & McPherson, J. shiny: Web Application Framework for R. (2020).
Urbanek, S. png: Read and write PNG images. (2013).
Bastian, M., Heymann, S. & Jacomy, M. Gephi: an open source software for exploring and manipulating networks. in Third international AAAI conference on weblogs and social media (2009).

Download references

Acknowledgements

The authors are grateful to the Office of National Statistics and the UK Home Office for making these data publicly available.

Author information

Authors and Affiliations

School of Science and Technology, Nottingham Trent University, Nottingham, NG11 8NS, UK
Jack Sutton, Golnaz Shahtahmassebi & Quentin S. Hanley
Departamento de Física, Universidade Estadual de Maringá, Maringá, PR, 87020-900, Brazil
Haroldo V. Ribeiro

Authors

Jack Sutton
View author publications
You can also search for this author in PubMed Google Scholar
Golnaz Shahtahmassebi
View author publications
You can also search for this author in PubMed Google Scholar
Haroldo V. Ribeiro
View author publications
You can also search for this author in PubMed Google Scholar
Quentin S. Hanley
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.H., G.S., J.S., and H.R. designed the study and wrote and edited the manuscript text. Q.H. and J.S. assembled the data set. J.S. prepared the figures and did the analysis.

Corresponding author

Correspondence to Quentin S. Hanley.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary file1

Supplementary file2

Supplementary file3

Supplementary file4

Supplementary file5

Supplementary file6

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sutton, J., Shahtahmassebi, G., Ribeiro, H.V. et al. Rural–urban scaling of age, mortality, crime and property reveals a loss of expected self-similar behaviour. Sci Rep 10, 16863 (2020). https://doi.org/10.1038/s41598-020-74015-x

Download citation

Received: 29 May 2020
Accepted: 24 September 2020
Published: 08 October 2020
DOI: https://doi.org/10.1038/s41598-020-74015-x

This article is cited by

ELIMINATE: a PCR record-based macroelimination project for systematic recall of HCV-RNA-positive persons in Austria
- Caroline Schwarz
- David Bauer
- Thomas Reiberger
Wiener klinische Wochenschrift (2024)
Distance to highway and factory density related to lung cancer death and associated spatial heterogeneity in effects in Jiading District, Shanghai
- Na Zhang
- Yingjian Wang
- Yibiao Zhou
Environmental Science and Pollution Research (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Theory

Population density scaling

Results and discussion

Overview of regions

Rural–urban scaling

Critical densities

Correlation and hierarchical clustering of DSAMs by category

Analysis of DSAMs by region

Self-organizing maps

Conclusion

Materials and methods

Data sets

Networks

Self-organizing maps

Data analysis

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links