Abstract
Urban climate is determined by a variety of factors, whose knowledge can help to attenuate heat stress in the context of ongoing urbanization and climate change. We study the influence of city size and urban form on the Urban Heat Island (UHI) phenomenon in Europe and find a complex interplay between UHI intensity and city size, fractality, and anisometry. Due to correlations among these urban factors, interactions in the multilinear regression need to be taken into account. We find that among the largest 5,000 cities, the UHI intensity increases with the logarithm of the city size and with the fractal dimension, but decreases with the logarithm of the anisometry. Typically, the size has the strongest influence, followed by the compactness, and the smallest is the influence of the degree to which the cities stretch. Accordingly, from the point of view of UHI alleviation, small, disperse, and stretched cities are preferable. However, such recommendations need to be balanced against e.g. positive agglomeration effects of large cities. Therefore, tradeoffs must be made regarding local and global aims.
Introduction
Urban Heat Island (UHI) is a commonly observed phenomenon worldwide, describing an elevated temperature of urban areas compared to their surroundings. Understanding UHI is of great relevance in the current discussion on sustainable urban design. In particular, heat waves have been observed more persistent and more frequent in the last decades^{1, 2}, and are projected to intensify in the future^{3}. Furthermore, heat waves are shown to pose an added stress on cities^{4}, raising serious concerns regarding general wellbeing and potential threats to human health, which in turn demands effective adaptation measures to alleviate the UHI.
The UHI effect arises from the anthropogenic modification of natural landscapes and the consequent atmospheric and thermophysical changes in the urban boundary layer^{5}. The formation of UHI can be mainly ascribed to an increased absorption and trapping of solar radiation in builtup urban fabrics associated with high thermal admittance of construction materials and the urban canyon structure. Anthropogenic heat release from transport and buildings in the purpose of heating and air conditioning further exacerbate the UHI. Other factors, such as population density, builtup density, and vegetation fractions can also directly or indirectly contribute to the formation of UHI. However, studies based on different spatial scales, more precisely, the vertical scales (urban screen, canopy, boundary levels) and the horizontal scales (mirco, local, regional scales) may lead to varying results on the individual contributions of each factor^{6}.
UHI studies can be roughly categorized in two domains regarding the number of investigated cities. On the one hand, case study work focuses on one or a few cities and assesses the UHI characteristics with a high level of detail. On the other hand, ensemble or crosssectional studies investigate a large number of cities aiming at achieving an understanding of common characteristics or fundamental differences arising among them. The availability of remote sensing surface skin temperature with global coverage has given rise to a number of systematic empirical studies of the latter type. In the following we focus on surface skin temperature and only mention the type explicitly (i.e. surface or 2 m) when necessary.
A global UHI study across more than 400 big cities^{7} revealed that the average annual intensity during daytime is higher than during nighttime and that the daytime intensity correlates negatively with the difference of vegetation cover and activity between urban and suburban areas. Similar diurnal patterns were found in an analysis of 32 Chinese cities^{8}. A followup work^{9} based on the same Chinese cities suggested an exponential decay of the UHI along urbanrural gradients, the rate and extent exhibit sitespecific diurnal and seasonal variations. In Europe, the UHI intensity of urban agglomerations exhibits a size dependency, and can typically reach a maximum of approx. 3 °C in summer and 0.5 °C in winter^{10}.
A study^{11} based on 65 cities in North America found that the annual mean daytime and nighttime UHI are positively correlated with the precipitation and the logarithm of population, respectively. It was suggested that the enhanced aerodynamic roughness of densely vegetated rural areas in the humid climate zone (with abundant precipitation) leads to less efficient convection, which hampered the heat transfer from urban to rural areas and resulted in an intensified UHI. This outcome at first glance seems to differ from previous studies^{12, 13} based on air temperature, stating that a deficit of precipitation in the summer leads to stronger rural warming than in urban areas, i.e. a diminished UHI. However, there are substantial differences between these studies besides the data type. The positive correlation in ref. 11 is regressed out of annual mean data (UHI intensity against precipitation) among scores of cities (crosssectional), whereas the studies^{12, 13} are based on data of individual case study cities across time (temporal).
Another global study comprehensively assessed the dependence of UHI on various urban intrinsic factors, regardless of geographic and climatic factors^{14}. Night light, urban area and vegetation are, inter alia, dominant ones accounting for the UHI or urban heat sinks, whereas population and urban structure were found to be of less relevance.
These studies have in common that land cover data is combined with remotely sensed surface skin temperature, i.e. urban land cover is used to define the physical extent of urban areas enabling to systematically extract the temperatures inside the cities and in their rural surroundings. To date, this methodology is an established standard protocol for robustly benchmarking the thermal stress across cities, and for deciphering statistical features of the UHI associated with biophysical and socioeconomical indicators. These merits can scarcely be promised by the conventional case study work.
In order to gain an understanding of the UHI phenomenon and its relevance in terms of urban design, insights about the influencing factors are necessary. On the one hand, the UHI intensity of a city is subject to the empirical metrics and indicators used for quantifying the phenomenon^{15}. On the other hand, while analyzing its physical essence, it is determined by a variety of factors which can roughly be categorized into (i) external and (ii) intrinsic ones^{16}. External factors include location (lat./lon.)^{17}, background climate (in particular wind)^{10, 18}, proximity to water courses (associated with sea or lakebreeze circulation), etc., whereas intrinsic ones depict cityspecific features (e.g. city size, land cover fractions, anthropogenic heat releases) which, despite being outcomes of longrun urbanization, can be regulated and reshaped.
How to alleviate the UHI effect is another issue of considerable interest. Local interventions (e.g. parks of various sizes, green and cool roofs) are shown to have a limited influence on local climate. The cooling distance, i.e. the maximum distance within which the cooling effect of such green spaces can still be detected, ranges from tens to hundreds meters^{19, 20}. Possibilities to influence intrinsic properties – including the overall urban form – are very limited in cities of developed countries due to small growth rates or even negative ones. In contrast, dramatic urbanization is taking place in developing countries, so that insights about how the urban form affects UHI intensities could provide guidance for the large scale planning of cities, where there is a great demand of new infrastructure.
Thus, we search for traceable signatures between features of urban form and UHI intensity. We consider three features of urban form which break down the spatial shape of the urban extent into single values. First, city size, since it has been shown previously that larger cities tend to have higher UHI intensities. Second, the fractal dimension which represents an established measure to characterize the compactness of a city. Third, anisometry which we revealed as an important measure of city shape, quantifies to which extent a city’s length is greater than its width. Examples include cities extending along valleys, rivers, country borders, etc. As we show below, interactions among the three indicators need to be taken into account which implies that the influence of each of them on the UHI intensity cannot be separated.
Results
Following the methodology employed in previous studies^{10, 21}, we combine land cover data with remote sensing temperature data and define the surface UHI intensity ΔT as the difference between the average temperature within the considered urban cluster and the average temperature within an equal area belt around it (see Methods Section for details). In contrast to ref. 10, here we consider the 5,000 largest urban clusters in Europe and average the summer months (June, July, August) daytime observations from 2006 to 2013. In the following we investigate how the UHI intensity ΔT depends on (i) the size, (ii) the fractality, and (iii) the anisometry of the city clusters. Therefore, we need to measure the three quantities for all considered city clusters.

(i)
The city size S _{ C } is simply given by the number of cells constituting the city clusters multiplied by the area of each cell, 6.25 × 10^{−2} km^{2}. Due to Zipf’s law for cities^{22,23,24,25,26} there are many small cities and few large ones so that we use the logarithm of city size, ln S _{ C }, in order to reduce the skewness.

(ii)
We compute the fractal dimension using the box counting method, assuming $n\sim {r}^{{D}_{\mathrm{f}}}$, where n is the number of (square) boxes of side length r necessary to cover the structure, see Methods Section. In Fig. 1(a–c) we show 3 examples of city clusters differing in size and fractality. The corresponding boxcounting results for varying r are shown in Fig. 1(d–f) and linear regressions in the loglog scale provide the slopes which are an estimate of the fractal dimensions D _{f}. The fractal dimension D _{f} can be considered as a measure of compactness, i.e. compact cities have usually large values of D _{f}.

(iii)
The anisometry A of a city cluster is defined as the eccentricity of the equivalent ellipse of the city cluster, i.e. the ratio of major axis to its minor axis, see Methods Section. It is a measure for the extent to which the city deviates from an approximate circular shape (A → 1), i.e. to which extent it’s length is greater than its width (A > 1). Figure 1(a–c) also illustrates the anisometry by means of ellipses. As can be seen, the stretched shape of Belgrade is reflected in a higher value of A. Again, we use the logarithm, i.e. ln A.
Figure 2 consists of scatterplots where the daytime UHI intensity is plotted separately vs. the three quantities – binned values and regressions are included for illustrative purposes. In Fig. 2(a), ΔT is displayed as a function of the city size. As expected and consistent with previous work^{10, 11, 14, 16, 18, 27}, the UHI intensity increases with city size and doubling the city size leads to approximately 0.4 °C additional UHI intensity. Studies of UHI intensities in relation to population size go back to Oke^{28}, who reported both a logarithmic and a powerlaw (exponent ≈ 1/4) relation between UHI intensity and population. In Fig. 2 we also include quantile regressions and find that there is heteroscedasticity in the form of stronger spreading of ΔT among large cities.
In Fig. 2(b) the influence of the fractal dimension on the UHI intensity is shown. In the range where most cities are found, ΔT typically increases by roughly 2 °C with increasing D _{f}. This finding suggests that more compact cities have more pronounced UHI intensities. The literature on UHI intensity and fractality is very limited. The fractal analysis of surface skin temperature related to vegetation abundance^{29} cannot be easily compared with our results since here we study the urban cluster which leads to the UHI. In a more general sense, the influence of urban form has been studied for the example of Beijing metropolitan area^{30}, and it has been reported that, compared to a compact city, a dispersed one is efficient in reducing mean urban heat island intensity, but affects the thermal feedback at the regional scale. Last, in Fig. 2(c) ΔT is plotted as a function of the anisometry. As one would expect from intuition, the UHI intensity decreases with increasing anisometry, by approximately 1.5 °C in the shown range. Thus, more circular cities seem to exhibit elevated UHI intensities. The above mentioned heteroscedasticity is also observed in Fig. 2(b) and (c), and the spreading of ΔT is wider among cities with larger D _{f} and smaller ln A.
Correlations among the three quantities ln S _{ C }, D _{f}, and ln A require a more complex analysis. While anisometry and cluster size are essentially uncorrelated with a Pearson correlation coefficient ρ of −0.05 [see supplementary Fig. S2(a)], fractal dimension and anisometry (ρ = −0.61) as well as cluster size and fractal dimension (ρ = 0.30) exhibit moderate correlations, as shown in supplementary Fig. S2(b) and (c), respectively. On the one hand, cities with lower fractal dimension tend to exhibit higher anisometry, i.e. more compact cities also tend to be more circular. This correlation is the strongest among the three variables considered. On the other hand, larger cities tend to exhibit higher fractal dimensions, i.e. they are more compact. It has been reported previously^{31,32,33} that city size and fractal dimension are positively correlated, i.e. larger cities in terms of population or urbanized area have higher fractal dimensions.
Thus, we employ multilinear regression in order to characterize the complex interplay between the UHI intensity and the three factors. Linearity, however, still represents an approximation – but a reasonable one – as will be discussed with the following example. The correlations between UHI intensity and city size S _{ C } have been fitted according to a loglogistic function^{10}
where the parameters a, b, c determine the saturation value, the inflection point, and the steepness, respectively. However, the sigmoidshape of Equation (1) takes only effect far from the inflection point, i.e. ${S}_{\mathrm{C}}\ll b$ or $b\ll {S}_{\mathrm{C}}$. (i) For small clusters [$\mathrm{ln}({S}_{\mathrm{C}})\to \phantom{\rule{.25em}{0ex}}\infty $] the accuracy of ΔT is limited by the resolution of land surface temperature data (≈1 × 1 km^{2}). In order to have a reasonable estimate, both cluster and belt temperature should be based at least on a few gridded values. (ii) Due to Zipf’s law for cities (see above), for large clusters [$\mathrm{ln}({S}_{\mathrm{C}})\to \infty $] the sample of cities reduces considerably. As a consequence, there are simply too few data points carrying information on whether or not ΔT(S _{ C }) saturates. Thus, it is justified to expand Equation (1) in the midrange. Since around the inflection point the logistic function $F(x)=\mathrm{1/(1}+\mathrm{exp}(\phantom{\rule{.25em}{0ex}}\phantom{\rule{.25em}{0ex}}x))$ can be approximated^{34} by $F(x)\approx 1/2+1/4x$, Equation (1) can be approximated by the logarithmic function
which corresponds to a linear polynomial of $\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}$.
After having motivated the linear approximation, we finally apply the multilinear regression. In the absence of correlations among the intrinsic urban factors a simple linear combination according to ∆T = a + b $\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}+c{D}_{\mathrm{f}}+d\phantom{\rule{.25em}{0ex}}\mathrm{ln}\phantom{\rule{.10em}{0ex}}A$, where a, …, d are parameters, would be sufficient. Due to the correlations, all interaction terms need to be taken into account. By interaction we mean the statistical correlations between two independent variables when multilinearity occurs. We performed a stepwise linear regression with interactions (see Methods Section) to all 5,000 considered city clusters and obtained
with R ^{2} = 0.34, all fitting parameters carry the unit °C. According to the analysis, only six out of eight terms contribute statistically to ΔT. These are the offset, the three urban factors, and the interaction terms between fractal dimension and size as well as between fractal dimension and anisometry. Consistent with the absence of correlations between size and anisometry (see supplementary Fig. S2a), the corresponding interaction term is missing. Similarly, the threepointcorrelation term ${D}_{\mathrm{f}}\phantom{\rule{.25em}{0ex}}\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}\phantom{\rule{.25em}{0ex}}\mathrm{ln}\phantom{\rule{.10em}{0ex}}A$ statistically does not add information.
As a consequence of the remaining interaction terms, the (linear) dependence of ΔT on e.g. D _{f} has a varying slope depending on the considered values of ln S _{ C } and ln A. For fixed values, e.g. $\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}=5$ and $\mathrm{ln}\phantom{\rule{.10em}{0ex}}A=0.5$, Equation (3) simplifies to $\mathrm{\Delta}T({D}_{\mathrm{f}})=\phantom{\rule{.25em}{0ex}}5.38+4.67\phantom{\rule{.25em}{0ex}}{D}_{\mathrm{f}}$. However, for other values of $\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}$ and ln A both, slope and intercept, are different. A similar effect occurs for $\mathrm{\Delta}T(\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}})$ and $\mathrm{\Delta}T(\mathrm{ln}\phantom{\rule{.10em}{0ex}}A)$. Due to this complex interplay, it can hardly be visualized in two dimensions how the UHI intensity depends on all of the three intrinsic urban factors.
Following the above example, Fig. 3 illustrates the linear dependencies of the UHI intensity on one urban factor when the other two are kept constant. Therefore, we fix two of the factors, simplify Equation (3) to a linear form depending only on the third factor, and extract slope and intercept. Then we rasterize the two fixed factors, repeat the procedure, and display the slope and intercept as shown in Fig. 3.
In Fig. 3(a) we observe that ΔT(D _{f}) is steepest for large cities with small anisometry and less steep for small cities with large anisometry. The diagonal stripes are due to the interactions of D _{f} with ln S _{C} and ln A. In Fig. 3(b), $\mathrm{\Delta}T(\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}})$ has its largest slope for compact cities, i.e. large D _{f}, which only occurs in combination with small anisometry. In this case, the slope only changes along D _{f} (horizontal stripes) – interactions with ln A have not been found. Lastly, the slope of ΔT(ln A) is mostly negative [Fig. 3(c)], with the steepest negative slopes observed for cities with a large fractal dimension. The vertical stripes illustrate the interactions with D _{f}, i.e. there would be no stripes in the absence of interactions.
At this point we still do not know which of the three factors has the strongest influence. The reason is that due to different ranges (e.g. D _{f} is roughly within 1.2 and 1.8, while $\mathrm{ln}\phantom{\rule{.10em}{0ex}}A$ is roughly in the range between 0 and 2.5), the parameters obtained in Equation (3) are not comparable. Thus, we repeat the multilinear regression, but normalize the data previously to zero mean and unit standard deviation, e.g. ${D}_{\mathrm{f}}^{\u204e}=({D}_{\mathrm{f}}\u3008{D}_{\mathrm{f}}\u3009)/{\sigma}_{{D}_{\mathrm{f}}}$ where $\u3008{D}_{\mathrm{f}}\u3009$ is the mean and ${\sigma}_{{D}_{\mathrm{f}}}$ the standard deviation. Then we obtain
whereas the 95% confidence interval of the parameters is ±0.03 or smaller. Now we can insert the average values $\u3008\phantom{\rule{.25em}{0ex}}\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}^{\u204e}\u3009$ and $\u3008\phantom{\rule{.25em}{0ex}}\mathrm{ln}\phantom{\rule{.10em}{0ex}}{A}^{\u204e}\u3009$ as typical values (which are both zero due to normalization) and obtain $\mathrm{\Delta}T=0.71+0.23\phantom{\rule{.1em}{0ex}}{D}_{\mathrm{f}}^{\u204e}$. Accordingly, from analogous considerations for the other factors, we see that city size has the strongest influence ($0.33\phantom{\rule{.1em}{0ex}}\phantom{\rule{.25em}{0ex}}\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}^{\u204e}$), followed by fractality ($0.23\phantom{\rule{.1em}{0ex}}{D}_{\mathrm{f}}^{\u204e}$), and smallest is the influence of anisometry ($0.10\phantom{\rule{.25em}{0ex}}\phantom{\rule{.1em}{0ex}}\mathrm{ln}\phantom{\rule{.10em}{0ex}}{A}^{\u204e}$). Consistent with Fig. 2, ΔT increases with $\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}^{\u204e}$ as well as with ${D}_{\mathrm{f}}^{\u204e}$ and decreases with $\mathrm{ln}\phantom{\rule{.10em}{0ex}}{A}^{\u204e}$. However, due to the above discussed interaction, the ranking is only valid for typical cities in our sample and including further small cities could affect the overall outcome. Moreover, we perform a rather basic normalization and we cannot exclude that the skewed distributions could affect the resulting parameters.
Since it has been argued that the UHI intensity based on daytime LST are overestimated^{35}, we also included an analysis based on nighttime LST in the Supplementary Information. Due to overall weaker intensities, the dependencies on the city size, fractality, and anisometry are less pronounced. Nevertheless, the relative contributions are consistent with the daytime results.
Certainly, the regression Equations (3) and (4) can hardly capture the huge variations of urban form and heat island intensities for entire Europe. For instance, Berlin (${S}_{\mathrm{C}}=854.69$ km^{2}, ${D}_{\mathrm{f}}=1.66$ and $\mathrm{ln}\phantom{\rule{.10em}{0ex}}A=0.4$) is the largest city among the ones shown in Fig. 1, the measured and predicted temperatures are 3.12 °C and 3.34 °C. For Belgrade (${S}_{\mathrm{C}}=227.56$ km^{2}, ${D}_{\mathrm{f}}=1.54$ and $\mathrm{ln}\phantom{\rule{.10em}{0ex}}A=0.99$) and Birmingham (${S}_{\mathrm{C}}=606.38$ km^{2}, ${D}_{\mathrm{f}}=1.75$ and $\mathrm{ln}\phantom{\rule{.10em}{0ex}}A=0.25$), the measured ΔT are 1.39 °C and 3.75 °C; the predicted ΔT are 1.82 °C and 3.82 °C, respectively. The three examples suggest that the predictive power of the global regression model, i.e. based on the full sample of cities, is rather limited, which could be due to regional inhomogeneities.
Therefore, we adopted two different sampling strategies to assess the robustness of the results against the regional inhomogeneities. We first divided the study area into 9 zones of similar number of cities and applied the multilinear regression in Equation (4), independently. As shown in Fig. 4(a), city size dominates in most cases, followed by fractal dimension, whereas in South Europe anisometry has a larger impact on the UHI. Second, we created subsamples of varying number of cities with and without replacement and applied stepwise regression to the subsamples. Figure 4(b) and (c) reveal that as the sample size increases, our model in Equation (4) tends to appear more frequently. We conclude that our model has a good global performance, while at local scale the model should be used with certain precaution.
Discussion
In summary, we explore the recently established methodology, which systematically combines urban land cover and remote sensing surface skin temperature, in order to characterize the UHI intensities of a vast number of cities. Studying the largest 5,000 European urban agglomerations, we find a complex interplay among the correlations with intrinsic urban factors. Among the three considered large scale urban features, typically city size has the strongest influence, followed by the fractality – and the anisometry presents the weakest influence. That is, in general, the larger, the more compact (high fractal dimension), and the less stretched (small anisometry) the cities are, the stronger their UHI intensity tends to be.
Our empirical findings on the dependence of the UHI intensity on the city size and form could be attributed to the scale effect of convection^{36}. As derived in supplementary, by adopting an idealized urban configuration, the UHI intensity is approximately proportional to ${S}_{\mathrm{C}}^{(1a{D}_{\mathrm{f}})(1m)}$ with a ≈ 0.43 and $1m>0$. For a fixed fractal dimension, as the urban area increases, the heat convection (quantified by the convection heat transfer coefficient h) diminishes, resulting in a higher surface temperature. Analogously, for a fixed surface area S _{ C }, an increasing fractal dimension D _{f} weakens the convection and leads to a higher surface temperature.
Our results can be relevant for urban policy and planning in the context of global warming and local UHI adaptation.
1. Avoid large cities.
How the UHI intensity depends on city size is in particular relevant in world regions of ongoing urbanization. Policies could be developed for incentives to also populate medium size and small cities, i.e. thereby to control the exponent of Zipf’s law for cities^{22,23,24,25,26}, which relates the relative frequency of large and small cities.
2. Avoid compact cities.
More compact urban clusters have larger fractal dimensions^{37}. Qualitatively, it is comprehensible that urban sprawl and polycentric form lead to smaller fractal dimensions. Urban planning can influence these features of urban form.
3. Avoid rotund cities (i.e. approximately rotational invariance).
It is plausible that stretched cities have lower UHI intensities since the distances to the city border are shorter, in favor of enhanced atmospheric convection. Thus, from an UHI alleviation perspective, cities extending along natural or artificial topographic lines are preferable over those developing mostly around their center.
Certainly, such recommendations need to be opposed to other advantages and disadvantages. In particular, keeping cities small and the consequent ameliorated urban climate should be balanced against positive agglomeration effects of large cities such as shorter trip lengths^{38}. Scattered and anisometric cities come along with more traffic, which has negative side effects, including increased anthropogenic heat and CO_{2} emissions. Thus, tradeoffs on the local scale need to be made, when implementing urban factors. Moreover, from a global point of view it has been argued that compact cities are preferred because of their potentials in reducing energy consumption and CO_{2} emissions^{39}. However, as mentioned above such recommendations should also be adjusted according to regional specificities (see Fig. 4).
Our work adds to previously gained understanding on how compact urban form increases the UHI intensity and on the problems of transferring such insights into spatial planning^{40, 41}. Thus, our results also contribute to the ongoing discussion on the effectiveness of urban forms – in particular, singlecentric (compact city) vs. polycentric city (dispersed city) – as a means for alleviating heat islands as a negative impact of urbanization^{30}.
This study is also an example on how concepts from fractal geometry are of use in city science. For three decades, it has been argued that cities are fractal in form^{42,43,44}, and the relation between fractal structures and urban areas has received widespread attention^{32}. The fractal dimension of urban agglomerations is a measure of their compactness. Thus, in this study we contribute to the view on cities from a fractals perspective and postulate that the correlations between cluster size and fractal dimension are a manifestation of multifractality at the regional scale.
Last but not least, our work opens a perspective for future studies in various directions. First, since here we solely investigate surface skin temperature, an apparent question to be raised is to what extent similar correlations of UHI intensities with urban form also appear considering air temperature. Due to data limitations this can hardly be verified empirically, so that numerical modeling^{45} could represent an alternative. Second, we focus on large scale features of urban form, i.e. intrinsic factors. It could be interesting to test whether the consideration of external factors, foremost wind, would improve the characterization of the influence of intrinsic factors on the UHI intensities. Third, we study ensemble data, i.e. quantify correlations among the sample of cities, and do not consider temporal dynamics. It is important to verify if our findings also hold for an individual city under growth scenarios reflecting the features of urban form^{30}. Lastly, can numerical models reproduce our findings or lead at all to comparable results^{21}?
Methods
Datasets
CORINE urban morphological zones (UMZ) 2006 data at 250 m spatial resolution are used for delineating urban areas in Europe. For countries where raster UMZ data are not available, e.g. UK and Switzerland, we generated the UMZ data based on the CORINE Land Cover 2006 data following the method described in ref. 46. The processed UMZ data, containing binary urban/nonurban information for 38 European countries, are projected to the sinusoidal coordinate system which is consistent with that used in the LST data.
We used the MODIS Aqua 8day composite (MYD11A2, Version 5) LST products for the summer months (JuneJulyAugust, JJA) from 2006 to 2013, in total 104 observations across 8 years. The data are at 926.6 m ≈ 1 km spatial resolution, and are measured at 13:30 (daytime) and 01:30 (nighttime) local solar time. In this study, we focus on the daytime LST, because the daytime surface UHI is more pronounced than that of the nighttime. Complementary results based on nighttime LST can be found in supplementary. According to the pixelwise LST error flag inherent in MYD11A2, we disregard pixels with LST error >2 K. We also omitted pixels with view zenith angle >35° to minimize the anisotropy bias caused by the view angle, while guaranteeing a sufficient data quantity for further analyses^{47}. Based on the processed LST data, multiannual summer mean LST is calculated.
Urban heat island (UHI) intensity
We followed the methodology employed in previous studies^{7, 10, 21} to calculate the surface UHI intensity. Cities are defined via the City Clustering Algorithm (CCA)^{24, 26, 48} based on the UMZ data, with a clustering parameter l = 250 m, being in accordance with the spatial resolution. According to CCA, any pair of urban cells with a distance no larger than l are assigned to the same urban cluster. We defined an equalarea belt region around an identified city cluster as its rural or suburban reference, devoid of water courses and urban pixels of other clusters. The surface UHI intensity of an urban cluster is defined as the difference between average urban and rural temperature^{10, 21}, i.e. $\mathrm{\Delta}T={T}_{\mathrm{C}}{T}_{\mathrm{B}}$. In contrast to previous studies^{10, 21}, here we base our analysis on temporally aggregated temperature data, namely multiannual summer mean LST. Moreover, in contrast to ref. 10, here we disregard small city clusters and consider only the largest 5,000 clusters corresponding to cluster areas S _{ C } > 6.1 km^{2}.
Fractal Dimension
We used the boxcounting algorithm to compute the fractal dimension D _{f} for each urban cluster^{49}. Therefore, we count the number of boxes N of size r × r necessary to fully cover the considered urban cluster. Assuming $N(r)\sim {r}^{{D}_{\mathrm{f}}}$, the linear regression to $\mathrm{ln}(N)$ vs. ln(r) provided the slope which corresponds to the fractal dimension D _{f}. The conventional method is to initialize r to the minimum cell size and stepwise double it until $N(r\mathrm{)=1}$. It turned out that this 2based exponential sampling method led to a discreteness artefact and denser sampling was more robust. Thus, we adopted a denser sampling strategy by incrementing r by 1 and omitting any point [r, N(x)] if the count $N(r)=N(r\mathrm{1)}$. Sampling can be seen in Fig. 1(d–f).
Anisometry
We computed the anisometry (A) of city clusters similar to the method^{50}. We defined the anisometry of a city cluster as the ratio of the city cluster’s maximum Feret’s diameter to its minimum Feret’s diameter. The Feret’s diameter is the distance between two parallels tangent to an object along a certain direction. In order to illustrate the relative stretch of clusters, we drew the equivalent ellipse of a city cluster by assigning the maximum and minimum Feret’s diameters to the axes of the ellipse (see. Figure 1(a–c). The ellipse centers at the centroid of a city cluster. Analog to the cluster size, we use the logarithms of anisometry (ln A) throughout the study to reduce the skewness.
Quantile regression
Quantile regression^{51, 52} is a method for estimating the impact of observed covariates on quantiles of the response variable. In contrast to ordinary least squares regression, quantile regression is particular applicable for the model with heterogeneous variance, e.g. in the presence of heteroscedasticity, where the former approach usually misestimates the real relationship or fails to detect the nonzero changes^{53}. Quantile regression finds wide application in disciplines, where data are seldom normally distributed, e.g. ecology^{53}, climatology^{54, 55}, etc. Assuming a regression function $Y=\beta X+\epsilon $. The estimators for the quantile τ, i.e. ${\beta}_{\tau}$ are obtained by minimizing the sum of asymmetrically weighted absolute residuals. The weights are given by the function ${\rho}_{\tau}$ ^{52}.
Multilinear regression
We employed the general multilinear model to quantify the relation between the UHI intensity ΔT and predictive variables – the logarithm of city size $\mathrm{ln}\phantom{\rule{.10em}{0ex}}{S}_{\mathrm{C}}$, fractal dimension D _{f}, and the logarithm of anisometry $\mathrm{ln}\phantom{\rule{.10em}{0ex}}A$. We use the general ansatz
where a, …., h are eight parameters, and e.g. D _{f} ln S _{ C } is the interaction between fractal dimension and city size. We used the forward and backward stepwise regression to determine the variables in the multilinear model. The Bayesian Information Criterion was used to add and remove terms in the model, and to avoid dataoverfitting.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
 1.
Lettenmaier, D., Mishra, V., Ganguly, A. & Nijssen, B. Observed Climate Extremes in Global Urban Areas. Environ. Res. Lett. 16, 14787 (2014).
 2.
Coumou, D. & Rahmstorf, S. A decade of weather extremes. Nat. Clim. Chang. 2, 1–6 (2012).
 3.
Meehl, Ga & Tebaldi, C. More intense, more frequent, and longer lasting heat waves in the 21st century. Science 305, 994–7 (2004).
 4.
Li, D. & BouZeid, E. Synergistic interactions between urban heat islands and heat waves: The impact in cities is larger than the sum of its parts. J. Appl. Meteorol. Climatol. 52, 2051–2064 (2013).
 5.
Oke, T. R. Boundary Layer Climates 2nd edn, (Methuen, London, 1987).
 6.
Arnfield, A. J. Two decades of urban climate research: A review of turbulence, exchanges of energy and water, and the urban heat island. Int. J. Climatol. 23, 1–26 (2003).
 7.
Peng, S. et al. Surface urban heat island across 419 global big cities. Environ. Sci. Technol. 46, 696–703 (2012).
 8.
Zhou, D., Zhao, S., Liu, S., Zhang, L. & Zhu, C. Surface urban heat island in China’s 32 major cities: Spatial patterns and drivers. Remote Sens. Environ. 152, 51–61 (2014).
 9.
Zhou, D., Zhao, S., Zhang, L., Sun, G. & Liu, Y. The footprint of urban heat island effect in China. Sci. Rep. 5, srep11160 (2015).
 10.
Zhou, B., Rybski, D. & Kropp, J. P. On the statistics of urban heat island intensity. Geophys. Res. Lett. 40, 5486–5491 (2013).
 11.
Zhao, L., Lee, X., Smith, R. B. & Oleson, K. Strong contributions of local background climate to urban heat islands. Nature 216–219 (2014).
 12.
Lee, S. H. & Baik, J. J. Statistical and dynamical characteristics of the urban heat island intensity in Seoul. Theor. Appl. Climatol. 100, 227–237 (2010).
 13.
Lemonsu, A., KounkouArnaud, R., Desplat, J., Salagnac, J.L. L. & Masson, V. Evolution of the Parisian urban climate under a global changing climate. Clim. Change 116, 679–692 (2013).
 14.
Clinton, N. & Gong, P. MODIS detected surface urban heat islands and sinks: Global locations and controls. Remote Sens. Environ. 134, 294–304 (2013).
 15.
Schwarz, N., Lautenbach, S. & Seppelt, R. Exploring indicators for quantifying surface urban heat islands of european cities with MODIS land surface temperatures. Remote Sens. Environ. 115, 3175–3186 (2011).
 16.
Oke, T. R. The energetic basis of the urban heat island. Q. J. R. Meteorol. Soc. 108, 1–24 (1982).
 17.
Wienert, U. & Kuttler, W. The dependence of the urban heat island intensity on latitude – A statistical approach. Meteorol. Zeitschrift 14, 677–686 (2005).
 18.
Imhoff, M. L., Zhang, P., Wolfe, R. E. & Bounoua, L. Remote sensing of the urban heat island effect across biomes in the continental USA. Remote Sens. Environ. 114, 504–513 (2010).
 19.
Zupancic, T., Westmacott, C. & Bulthuis, M. The impact of green space on heat and air pollution in urban communities: A metanarrative systematic review. Tech. Rep. March (2015).
 20.
Feyisa, G. L., Dons, K. & Meilby, H. Efficiency of parks in mitigating urban heat island effect: An example from Addis Ababa. Landsc. Urban Plan. 123, 87–95 (2014).
 21.
Zhou, B. et al. Assessing seasonality in the surface urban heat island of London. J. Appl. Meteorol. Clim. 55, 493–505 (2016).
 22.
Auerbach, F. Das Gesetz der Bevölkerungskonzentration. Petermanns Geogr. Mitteilungen 59, 73–76 (1913).
 23.
Zipf, G. K. Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology (Reprint of 1949 Edition) (Martino Publishing, Manfield Centre, CT, 2012).
 24.
Rozenfeld, H. D., Rybski, D., Gabaix, X. & Makse, H. A. The area and population of cities: New insights from a different perspective on cities. Am. Econ. Rev. 101, 2205–2225 (2011).
 25.
Rybski, D. Auerbach’s legacy. Environ. Plan. A 45, 1266–1268 (2013).
 26.
Fluschnik, T. et al. The size distribution, scaling properties and spatial organization of urban clusters: a global and regional percolation perspective. Int. J. GeoInformation 5, 110 (2016).
 27.
Park, H.S. Features of the heat island in Seoul and its surrounding cities. Atmos. Environ. 20, 1859–1866 (1986).
 28.
Oke, T. R. City size and the urban heat island. Atmos. Environ. 7, 769–779 (1973).
 29.
Weng, Q., Lu, D. & Schubring, J. Estimation of land surface temperature – vegetation abundance relationship for urban heat island studies. Remote Sens. Environ. 89, 467–483 (2004).
 30.
Yang, L. et al. Contrasting impacts of urban forms on the future thermal environment: example of Beijing metropolitan area. Environ. Res. Lett. 11, 034018 (2016).
 31.
Shen, G. Fractal dimension and fractal growth of urbanized areas. Int. J. Geogr. Inf. Sci. 16, 419–437 (2002).
 32.
Encarnação, S., Gaudiano, M., Santos, F. C., Tenedório, J. A. & Pacheco, J. M. Fractal cartography of urban areas. Sci. Rep. 2 (2012).
 33.
Rybski, D., Ros, A. G. C. & Kropp, J. P. Distance weighted city growth. Phys. Rev. E 87, 042114 (2013).
 34.
Weisstein, E. W. Sigmoid function. MathWorld–A Wolfram Web Resource. http://mathworld.wolfram.com/SigmoidFunction.html (accessed: 2016 May 7th).
 35.
Mohan, M. & Kandya, A. Impact of urbanization and landuse/landcover change on diurnal temperature range: A case study of tropical urban airshed of India using remote sensing data. Sci. Total Environ. 506–507, 453–465, doi:10.1016/j.scitotenv.2014.11.006 (2015).
 36.
Sakai, S., Iizawa, I., Onishi, M., Nakamura, M. & Kobayashi, K. Fractal Geometry of the Ground Surface and Urban Heat Island. In Int. Conf. Urban Clim., July, 1–4 (2009).
 37.
Makse, H. A., Andrade, J. S., Batty, M., Havlin, S. & Stanley, H. E. Modeling urban growth patterns with correlated percolation. Phys. Rev. E 58, 7054–7062 (1998).
 38.
Louf, R. & Barthelemy, M. How congestion shapes cities: From mobility patterns to scaling. Sci. Rep. 4, srep05561 (2014).
 39.
Martilli, A. An idealized study of city structure, urban climate, energy consumption, and air quality. Urban Clim. 10, 430–446 (2014).
 40.
Schwarz, N. & Manceur, A. M. Analyzing the influence of urban forms on surface urban heat islands in Europe. J. Urban Plan. Dev. 141, A4014003 (2015).
 41.
Debbage, N. & Shepherd, J. M. The urban heat island effect and city contiguity. Comput. Environ. Urban Syst. 54, 181–194 (2015).
 42.
Batty, M. & Longley, P. A. Urban shapes as fractals. Area 19, 215–221 http://www.jstor.org/stable/20002475 (1987).
 43.
Batty, M. & Longley, P. A. Fractalbased description of urban form. Environ. Plann. B 14, 123–134 (1987).
 44.
Batty, M. & Longley, P. Fractal Cities: A Geometry of Form and Function (Academic Press Inc, San Diego, CA and London, 1994).
 45.
De Ridder, K., Lauwaet, D. & Maiheu, B. UrbClim– A fast urban boundary layer climate model. Urban Clim. 12, 21–48 (2015).
 46.
Simon, A., Fons, J., Milego, R. & Georgi, B. Urban Morphological Zones version F2v0: Definition and procedural steps. Tech. Rep., ETC/LUSI, EEA (2010). Available at: http://www.eea.europa.eu/dataandmaps/data/urbanmorphologicalzones20061. (Accessed: 11th May 2016).
 47.
Hu, L., Brunsell, N. A., Monaghan, A. J., Barlage, M. & Wilhelmi, O. V. How can we use MODIS land surface temperature to validate longterm urban model simulations? J. Geophys. Res. Atmos. 119, 3185–3201 (2014).
 48.
Rozenfeld, H. D. et al. Laws of population growth. Proc. Nat. Acad. Sci. USA 105, 18702–18707 (2008).
 49.
Bunde, A. & Havlin, S. E. Fractals and Disordered Systems (Springer, Berlin, 1996).
 50.
Medalia, A. & Hornik, G. Pattern recognition problems in the study of carbon black. Pattern Recognit. 4, 155–172 (1972).
 51.
Koenker, R. & Bassett, G. Jr. Regression quantiles. Econometrica 46, 33–50 http://www.jstor.org/stable/1913643 (1978).
 52.
Koenker, R. & Hallock, K. F. Quantile Regression. J. Econ. Perspect. 15, 143–156 (2001).
 53.
Cade, B. S. & Noon, B. R. A gentle introduction to quantile regression for ecologists. Front. Ecol. Environ. 1, 412–420 (2003).
 54.
Donner, R. V. et al. Spatial patterns of linear and nonparametric longterm trends in Baltic sealevel variability. Nonlinear Proc. Geoph. 19, 95–111 (2012).
 55.
Mueller, B. & Seneviratne, S. I. Hot days induced by precipitation deficits at the global scale. Proc. Natl. Acad. Sci. USA 109, 12398–403 (2012).
Acknowledgements
The research leading to these results has received funding from the European Community’s Seventh Framework Programme under Grant Agreement 308497 (Project RAMSES). Author B.Z. thanks ClimateKIC, the climate innovation initiative of the EU’s European Institute of Innovation and Technology (EIT), for award of a Ph.D. scholarship. We acknowledge the EEA for making available the CORINE land cover data and NASA LP DAAC for the MODIS LST data.
Author information
Affiliations
Potsdam Institute for Climate Impact Research (PIK), Member of the Leibniz Association, P.O. Box 60 12 03, Potsdam, D14412, Germany
 Bin Zhou
 , Diego Rybski
 & Jürgen P. Kropp
Department of Geo and Environmental Sciences, University of Potsdam, Potsdam, Germany
 Jürgen P. Kropp
Authors
Search for Bin Zhou in:
Search for Diego Rybski in:
Search for Jürgen P. Kropp in:
Contributions
D.R. and B.Z. designed the experiments; B.Z. carried out the data analysis; B.Z. prepared the figures; all authors wrote and reviewed the manuscript.
Competing Interests
The authors declare that they have no competing interests.
Corresponding author
Correspondence to Diego Rybski.
Electronic supplementary material
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.