Introduction

Much debate exists about the possible large-scale land-cover change impacts on catchment hydrology across temporal and spatial scales1,2. Afforestation is suggested to have many co-benefits such as improving biodiversity, air and water quality, providing timber, enhancing human well-being and reducing flood risk3,4. Widespread afforestation is often suggested as a solution for achieving Net Zero (balancing carbon emissions with removal) by numerous groups including the UK government5,6. However, there is a limited understanding of how effective woodland is for Natural Flood Management in modifying water fluxes and stores over large catchments1,7. Impacts of land cover on hydrology are particularly important to understand due to changing streamflow regimes both in the UK and globally8,9,10,11.

Land cover exerts strong localised controls on the water balance and streamflow timings within a catchment that can be difficult to detect over larger scales2,12. Numerous studies have investigated the influence of afforestation on hydrology, including plot-scale studies to understand infiltration rates and groundwater levels, and small catchment studies evaluating the impact of land-cover change on streamflow and water quality13,14,15. However, these studies are often smaller than 1000 km2 in size, and it is unknown whether the impacts of afforestation may scale up over larger catchments, given its complex influence on streamflow16,17,18. Furthermore, few systematic evaluations exist of how afforestation location and extent may influence catchment hydrology across a wide range of climatic and physiographic conditions19,20.

This study is a theoretical assessment of the extent to which afforestation extent and location in catchments predominantly over 1000 km2 in size may increase or decrease streamflow across temperate catchments in the British Isles. We hypothesise that: (i) increasing afforestation extent should proportionally reduce streamflow at all exceedances16,21; (ii) catchments may be more hydrologically responsive to certain afforestation locations such as upland compared to lowland regions1,2; (iii) catchment properties such as climate and soil types may differentiate a catchment’s hydrological response to afforestation22. Our work is the first, to our knowledge, to employ a high-resolution 1 km2 physics-based land-surface model to quantify the impact of afforestation on streamflow across multiple catchments using a large ensemble of land-cover scenarios. This approach allows us to isolate drivers influencing catchment sensitivity, determine individual hydrological processes altered by widespread afforestation, elucidate the uncertainty generated by afforestation location and most importantly find the impact of afforestation over large spatial extents on streamflow. Beyond the British Isles, our results describe the influence of afforestation on temperate catchment hydrology.

We focus on twelve diverse catchments (areas covering 511–9931 km2) which capture multiple hydrological regimes, drainage patterns, soil, and land-cover types to understand catchment properties influencing streamflow response to afforestation (Fig. 1 and Supplementary Table S1). Using the Joint UK Land Environment Simulator (JULES) run at a 30 minute timestep, we simulate the hydrological implications of a large ensemble of broadleaf afforestation scenarios23,24 (“Methods”). JULES simulates carbon, water and energy fluxes at the land surface when driven with a time series of meteorological data. Further details about the model setup can be found in the “Methods”.

Systematic criteria were used to generate up to 288 broadleaf afforestation scenarios per catchment for planting within grasslands (“Methods”), in line with afforestation scenarios that show improved pastures and rough grasslands will be the most likely initial afforestation locations in the UK25. However, we note this approach does not consider other factors such as the socio-economic implications of afforestation location. Initially, we identify separate planting locations in each catchment using either stream order26,27 (relative size of stream in a catchment) or propensity for saturation28 (how likely a location is to be wet after rainfall) (“Methods”). Stream order varies from one to seven (small to large stream) and propensity for saturation from one to five (low-to-high saturation level). We create watersheds (catchment boundaries) that are both within, and outside, each of the defined catchment planting locations (Supplementary Fig. S1). Broadleaf woodland planting was then either random, with ~25 and 50% randomly distributed within the chosen locations, or around existing land cover at 25 and 50 m buffers. Discussion exists over where to plant woodland in relation to existing land cover. To investigate these planting locations, we systematically add woodland around watercourses29,30, urban areas31,32 and existing broadleaf woodland4,33. Afforestation according to these planting criteria generated between 234 and 288 scenarios and between 0 and c. 40 percentage point increase in broadleaf woodland per catchment (Supplementary Fig. S2). However, owing to the available space and catchment area the same extent of maximum afforestation could not be achieved in all catchments; nor do we account for the possible effect of forest stand age on catchment hydrology13. A large ensemble of scenarios provides evidence to ascertain whether a hydrological change in catchments is primarily driven by planting location or extent, and to quantify the variability, or uncertainty, induced by afforestation across catchments.

To characterise afforestation influence on streamflow we use eight hydrologic metrics34,35. For analysing extreme and average streamflow, we calculate the top 1% (very high flow), 5% (high flow), 50% (median flow), 90% (low flow) and 95% (very low flow) quantiles of daily flows for the period 2000–2010. This period allows us to observe the impact of land-cover change in a flood-rich period36,37 and avoid uncertainty in catchment hydrology when comparing scenarios to original land cover that would substantially change over longer periods. We compute the slope of the flow duration curve, to quantify flow variability38,39; median streamflow elasticity, to measure catchment streamflow response to yearly changes in precipitation40,41; and runoff ratio to calculate water balance changes related to streamflow and evaporation42. To determine catchment hydrological responsiveness to afforestation, we calculate the median regression slope of changes in streamflow metrics for every percentage point increase in afforestation over the study period, for each catchment and watershed afforestation area43. This regression slope is a proxy for catchment responsiveness to afforestation and is used to infer the median change in a hydrologic metric due to afforestation in a catchment. For each catchment, Spearman’s rank correlation is calculated between the quantified catchment sensitivity to afforestation and catchment attributes44. To establish the impact and significance of different planting locations on hydrologic metrics according to catchment and land-cover location we undertake a one-way analysis of variance (ANOVA) test.

Results and discussion

Afforestation extent influence on streamflow

Afforestation on average reduces streamflow linearly at all streamflow quantiles within JULES (Fig. 1). At the highest flows, we find a small reduction of streamflow with increases in broadleaf woodland (median reduction of 1.3% ± 0.6 (one standard deviation) and 1.4% ± 0.6 for the top 1% and 5% flows respectively, per ten-percentage point increase in woodland) (refer to Supplementary Table S2 for percentage and absolute values). These percentage changes in the high flows are relatively small compared to flood peak magnitudes, meaning a substantial area of the catchment would need to be afforested before an appreciable reduction in flow is observed. The correlation strength is moderate (ρ = −0.46 ± 0.17 across individual catchments (Supplementary Table S3)) suggesting afforestation location and other factors can affect streamflow reduction at the highest flows (Fig. 1a–c). There appears to be a weaker correlation between afforestation extent and high-flow reductions in small catchments compared to the largest (e.g., Tamar, ρ = −0.35; Severn-HB, ρ = −0.77 (Supplementary Table S3)). This finding suggests the impact of afforestation location is important in smaller catchments but in the larger catchments, the locational impact is diluted, making it more difficult to disentangle afforestation location45,46. Targeted afforestation for flood management may therefore be detectable in smaller catchments, but become less impactful as the broadleaf woodland extent is scaled up over larger catchments1,7. Greater uncertainty exists in how afforestation extent and location will impact streamflow in smaller catchments, with a wide range of possible responses. However, there is a more predictable hydrological response to the land-cover change in the largest catchments, regardless of the location change.

Fig. 1: Effect of afforestation on high, median and low flows across Great Britain.
figure 1

Three scatter graphs illustrating the percentage reduction in flow quantile levels for each percentage point of woodland planted in the different catchments. Reduction in (a) the top 1% of flows, (b) the median flows and (c) the 95% quantile of daily flows. Each data point represents an afforestation scenario in a single catchment. Dashed red lines represent the median reductions in the different flow quantiles for all catchments. Dark blue represents the six wetter catchments (mean precipitation 3.11–4.16 mm day−1) and lighter blue, the six drier catchments (1.99–2.55 mm day−1). Bars show the reduction in each flow level per ten-percentage point increase in broadleaf woodland for the different groups as well as the median reduction for all catchments in red. Catchment locations are shown on the right and the lettering code can be found in Supplementary Table S1.

Afforestation impact on catchment hydrology varies seasonally, with large decreases in runoff in the winter, spring, and autumn months and no decrease or minor increases in summer months within JULES (Supplementary Fig. S3). An increase in high flows for some catchments coincides with the timing of the highest flows during the summer months. Our simulations suggest that this increase in high flows is due to an increase in topsoil moisture leading to greater levels of saturation-excess overland flow following periods of high-intensity rainfall. In all periods, there is increased evaporation from intercepted water stored in the canopy, but this increase in topsoil moisture is primarily driven by a decrease in soil evaporation (Supplementary Fig. S4). Particularly in the winter and spring periods, an increase in soil evaporation results from the loss of leaves from the broadleaf woodland, followed by regrowth. In the model, reduced canopy cover increases the exposure of the soil surface to incoming short-wave radiation and reduces aerodynamic resistance, leading to an increase in potential evaporation. With broadleaf planting on grasslands, there is increased soil infiltration, soil and canopy evaporation and transpiration rates, and so we therefore find a commensurate reduction in both the surface and subsurface runoff. In contrast, in the summer months a decrease in modelled soil evaporation increases topsoil moisture (because the increase in canopy foliage reduces soil exposure to short-wave radiation and surface wind speed). As broadleaf woodland can achieve a higher leaf area index than the grasslands, the increase in topsoil moisture enhances saturation levels at rainfall interception (at the soil) and thus surface runoff. It should also be noted the increase in topsoil moisture may be due to differences in the root structures of broadleaf woodland and grasslands represented in JULES. While broadleaf woodland is modelled to have a root depth of 3 m, grasslands have root depths of 0.5 m, resulting in a potential overdrying of the topsoil.

Afforestation extent reduces simulated median and low flows the most, with −2.8% ± 1.0 and −4.3% ± 3.1 respectively per ten-percentage points of afforestation (Supplementary Table S2). Strong correlations between median and low-flow reduction and afforestation suggest extent has a stronger influence on reducing streamflow than location, and this finding is supported by previous studies (ρ = −0.83 ± 0.12, ρ = −0.75 ± 0.15 (Supplementary Table S3))12,39,47,48,49. Each percentage point of afforestation in a catchment reduces the median flow by 1.0 mm yr−1 ± 0.21. Reduction of the median and low flows is primarily due to modelled increases in evaporation (Supplementary Fig. S4), which is either from the woodland canopy or the soil surface when deciduous trees shed their leaves. Small increases in the woodland area increase the lowest flows on average the most for the smallest catchments, due to the increased summer topsoil moisture, which is not balanced by the increased evaporation in subsequent months. As the woodland area increases, there is a greater reduction in overall flows and more water leaves the catchment model domain by evaporation, reducing the lowest flow quantiles. Downstream of small afforestation locations there would not be any major disturbances to streamflow, and it may provide slight buffering against drought conditions.

Catchment response to afforestation extent is also clear using the flow-regime metrics (runoff ratio, slope of the flow duration curve and catchment elasticity). We find a reduced contribution of rainfall in streamflow with a decrease in the runoff ratio (−1.9% ± 0.6 per ten-percentage points of afforestation) as reported in previous studies14,50. Modelled flow regimes become more variable with strong decreases in low and lesser decreases in high-flow quantiles, increasing the slope of the flow duration curve (1.1% ± 0.7 per ten-percentage points of afforestation). The greater variance between flow-regime variability and afforestation suggests planting location more strongly controls streamflow regimes than extent (ρ = −0.46 ± 0.23 (Supplementary Table S3)). Catchment discharge response to yearly changes in rainfall also decreases with broadleaf afforestation by modulating rainfall input to a catchment (median reduction in catchment elasticity of −3.5% ± 1.2 per ten-percentage points of afforestation). Changes in a catchment’s rainfall regime would therefore be less detectable in streamflow observations. These findings highlight the fact that simulations of future streamflow using physics-based models depend not only on climatic changes but also land-cover changes. Without the inclusion of land-cover changes, considerable uncertainty will be introduced into projections of future flooding, drought and water management51.

Afforestation location influence on streamflow

ANOVA reveals highly significant differences (P < 0.01) in simulated flow reductions for all flow quantile levels when planting across the different stream orders, apart from the very highest flows (F (1,670) = 5.3, P = 0.022) (Fig. 2a). This highly significant difference in streamflow reduction across stream orders for the hydrologic metrics is due to differences in the largest and smaller streams, suggesting woodland acreage matters more than planting location. To emphasise, when the same area of broadleaf is planted for each stream order there is no highly significant difference between orders (P > 0.01). Runoff reduction according to stream order is also highly significant as seen in the runoff ratio and catchment elasticity (F (1,670) = 17.93, P < 0.01; F (1,670) = 14.01, P < 0.01). Again, these differences are due to planting between the largest and the smaller streams. Flow-regime variability differences by stream order location were not as significantly different (F (1,670) = 5.11, P = 0.0241) suggesting catchment planting area does not consistently lead to differences in flow-regime variability.

Fig. 2: Percentage streamflow change for each percentage point of afforestation.
figure 2

Sensitivity of catchment hydrology to afforestation in relation to (a) stream order, (b) propensity for saturation, (c) planting area. “None” in planting area indicates the absence of preferential planting location, i.e. planting randomly at 25 and 50% within a catchment area. Left three metrics (grey shading) reflect flow regimes, whereas those on the right represent five flow quantiles. The vertical axis indicates the median quantile regression slope for the percentage change in the hydrologic indicator per percentage point increase in broadleaf woodland. For example, a quantile coefficient of −1 would represent a median 1% decrease for each percentage point increase in woodland. Each bar represents the mean quantile coefficient for all catchments and the error bars represent the standard error.

Planting according to different saturation areas led to significantly distinct reductions in streamflow for both the median and low flows in JULES when planting extent is not constant (F (1,478) = 4.89, P < 0.01; F (1,478) =  11.11, P < 0.01] (Fig. 2b). However, the difference between saturation areas is less significant for all other flow quantiles and when planting extent becomes fixed in each area, the differences between streamflow reductions are not significant (P > 0.01). This means selecting afforestation sites within areas that are more or less likely to become saturated may not necessarily lead to different hydrological responses downstream, if planting the same acreage. However, afforestation effects differ the most between the least and most likely areas of saturation when planting acreage is not fixed (P = 0.011), with greater reductions in the modelled runoff ratio in the least saturated areas. Benefits observed at small afforestation-plot scales may not size up to larger catchments and could have a negligible impact on reducing the highest flows.

Planting location, around urban, broadleaf, watercourses, or randomly, leads to different simulated streamflow dynamics (Fig. 2c) even when the area planted is constant. Not planting according to existing land cover greatly reduces streamflow at all flow quantiles compared with planting around the three existing land-cover types. The difference in planting area location is greatest for low-flow quantiles (ANOVA P < 0.001) compared with high-flow quantiles (P < 0.01). Overall, no single best planting location could be identified around existing land covers if trying to provide Natural Flood Management benefits52. We find consistent, minimal differences in streamflow when planting around urban and watercourses, across all flow quantiles. Planting location also significantly reduces catchment runoff (P < 0.001) for both runoff ratio and catchment elasticity; the largest differences occur when planting is not according to existing land cover but when it was random. However, location effect on flow-regime variability is more nuanced, with notable differences between planting around urban, watercourse locations and randomly.

The difference in streamflow reduction is not significantly different when planting woodland inside and outside of the various drainage basin locations (Supplementary Fig. S5). Often it is debated whether interventions to reduce maximum flows are more effective in the headwaters or the valley floor downstream1. This modelling study finds no compelling difference in streamflow, including extreme flood flow changes, when similar acreage is planted in the headwaters or the valley floor. However, in our study only one tree type is planted, so location may matter more for other tree types. In addition, our model does not simulate the effects of forest management techniques such as ditching and coppicing that might also have an influence on streamflow49,53.

Catchment properties altering streamflow sensitivity to afforestation

Catchment climate exerts a strong influence on the hydrological response to afforestation in different types of catchments when comparing model results to their attributes. There are strong associations between climate properties and median flow reduction (Table 1). As a catchment’s average rainfall (and runoff ratio) increase, afforestation has a smaller effect on streamflow reduction in JULES (Fig. 1a–c). We find planting broadleaves have less of an impact on low-flow conditions and droughts in wetter locations, and greater response in drier areas. Drier catchments are more likely to show decreases in runoff from afforestation, due to increased rates of potential evapotranspiration and water usage, as seen in other studies21,49,50 (Fig. 1). A negative association with catchment aridity emphasises this point. In situations of high rainfall and thus high runoff events, the same runoff generation processes will be present in a catchment with and without afforestation. In these situations, land cover will only have a minimal impact on flow reduction. Here, the strong correlation with catchment elevation is likely influenced by differences in average rainfall (ρ = 0.8, P = 0.0032). Few of the catchment properties shown in Table 1 exhibit significant correlations with catchment response to afforestation at the high and low flows. This could be due to differences in planting location with catchment attributes not capturing the temporal and spatial elements of runoff generation mechanisms influenced by afforestation or the greater variability in extremes. The catchment area has a strong negative association between high-flow reduction and afforestation potentially due to the greater relative reduction in the specific discharge of larger compared to smaller catchments.

Table 1 Sensitivity to afforestation with catchment properties.

There are not many significant associations between simulated catchment hydrology changes and catchment soil attributes (Table 1). This suggests that catchment soil properties may have a minimal role in altering catchment hydrology response to afforestation, at least in the JULES model. However, the slight significant negative association between soil depth to bedrock and changes in streamflow regimes with afforestation suggests afforestation may alter longer-term hydrological fluxes. In catchments with deeper soils, where subsurface flow dominates, afforestation is likely to reduce the subsurface component of streamflow, decreasing runoff ratio and catchment elasticity whilst increasing flow-regime variability. A similar effect can also be seen in the low flows, albeit slight. This effect may be overemphasised in JULES owing to its uniform soil depth and so could be due to runoff generation mechanisms using topographic slope.

Conclusions

This study has quantified potential afforestation impacts on temperate catchment hydrology in Great Britain using a high-resolution land-surface model. We find afforestation reduces streamflow at all flow quantiles with a clearer impact observed at median and low flows than high flows in twelve catchments over 500 km2. When attributing changes in streamflow, afforestation location in the catchment can influence the highest and lowest streamflow extremes, particularly in smaller catchments. We find the extent of afforestation is more important than its location, particularly for the median flows. Simulated catchments with low rainfall and deep soils are more hydrologically responsive to afforestation than others in our chosen catchments. We show the effects of widespread afforestation on streamflow over multiple spatial scales within catchments can be significant and thus are important to include when establishing projections of hydrological change. These results provide quantitative insight into key water management decisions regarding the extent and location of afforestation in temperate regions and suggest caution is required when advocating widespread tree planting to mitigate future hydro-climatic change or to attempt to control flooding in large catchments.

Methods

Catchment locations and input data

To determine the impact of afforestation on catchment hydrology we select twelve varied catchments from across the British Isles (Supplementary Material-S1 and Fig. 1). These catchments capture a range of hydrological regimes, drainage patterns and catchment soil and land-cover properties to determine how such factors may influence catchment response to afforestation. Being predominantly >1000 km2 in area (ranging from 511 to 9931 km2 in size), they are adequately represented in a hydrological model to integrate processes at a 1 km2 spatial resolution54,55. Two catchments are nested within larger ones, the Ure within the Ouse, and the Severn at Bewdley (Severn-B) within the Severn at Haw Bridge (Severn-HB) (Fig. 1).

The period 2000–2010, a flood-rich period for the British Isles36,37, is chosen to assess afforestation influence on streamflow as it allows us to avoid the uncertainty that would be associated with land-cover changes over a longer period when comparing to baseline results. This length of the simulation period also reduces the computational demand with a large ensemble of land-cover scenarios. Accordingly, the CEH land-cover map for the year 200056, in the form of the CHESS-land dataset57, is used to provide configurational datasets specifying soil hydraulic and thermal properties, vegetation characteristics, and orography, for the model at a 1 km2 spatial resolution for the unaltered land-cover scenarios. This dataset has successfully been used in other studies55,58. The 25 m rasterised land-cover map is reclassified into eight different land-cover types (Supplementary Tables S4 and S5) and used to derive afforestation scenarios related to land cover before being converted to a percentage land-cover fraction at a 1 km2 spatial resolution. To provide the required meteorological driving data, we use the CHESS-met dataset59 which includes long-wave and short-wave radiation, air temperature, specific humidity and pressure. The 50 m CEH Integrated Hydrological Digital Terrain Model elevation data is used to derive topographical and catchment attributes as well as catchment boundaries and river networks60. Soil hydraulic information comes from the Harmonised World Soil Database and was made uniform across each grid cell61.

The model

The Joint UK Land Environment Simulator (JULES) is a physically based land-surface model that simulates the fluxes of carbon, water and energy at the land surface when driven by a time series of the atmospheric data23,24. Multiple studies have used JULES before including the investigation into evapotranspiration drivers across Great Britain58,62, atmospheric river formation over Europe63, the impact of solar dimming and carbon dioxide on runoff64 and developing river routing algorithms with a Regional Climate Model65. JULES is routinely used at the Met Office, where it is coupled with several other models to understand future changes globally and across the UK, by bridging the atmosphere, land surface and ocean66. This study is predominantly a theoretical, scenario-based modelling study designed to draw out general principles and to quantify the relation between afforestation and hydrological response, and as such the results are not intended to provide detailed guidance for specific practical actions.

The use of a process-based model enables us to investigate physical explanations for the hydrological impacts of changes in land cover and the explicit representation of vegetation that will influence fluxes, partitioning and storages within the realm of epistemic uncertainty for other conceptual and hydrological models where vegetation is not included. JULES models both plant phenology and canopy storages23,24. When changing the plant functional type in JULES, both the properties of the above-ground vegetation (such as canopy height and leaf area index) and the soil infiltration factor and the root depth are altered23. However, there are several caveats that must be considered with this approach. First, the model configuration used in the present study is uncoupled from the atmosphere and so large-scale land-cover changes cannot alter nearby weather67. Second, each grid cell is hydrologically separated from adjacent cells, with streamflow and runoff hydrologically uncoupled from the rest of the system. Soil water also does not flow between grid cells. Third, soil thermal and hydraulic properties are uniform across a grid cell. This reduces the impact of hydrological pathways within a cell and the interaction of vegetation with these varying soil types that could have ramifications at multiple temporal and spatial scales. For example, within the cell there may be vegetation that is water-stressed (e.g. valley sides) compared with vegetation where water is not limited (e.g. floodplain) which would change how much transpiration is possible and thus runoff68.

Precipitation in the model is partitioned by vegetation and when it reaches the soil surface it is portioned into either infiltration excess overland flow, at a rate controlled by the hydraulic conductivity of the soil, or saturation-excess overland flow as determined by the Probability Distributed Model (PDM)69,70. Throughfall (TF) through the canopy is dependent on the rainfall and the existing water in the canopy:

$${T}_{F}=P\left(1-\frac{C}{{C}_{m}}\right)\exp \left(-\frac{{\varepsilon }_{r}{C}_{m}}{P\varDelta t}\right)+P\frac{C}{{C}_{m}}$$
(1)

where P is the rainfall rate (kg m−2 s−1), C is the amount of water in the canopy (mm), Cm is the maximum water storage of the canopy (mm) and εr is the fraction of the grid cell occupied by convective precipitation. The maximum amount of canopy water storage is a function of the leaf area index (L):

$${C}_{m}={A}_{m}+{B}_{m}L$$
(2)

where Am is the ponding of water on the soil surface and interception by leafless vegetation (mm) and Bm is the rate of change of water holding capacity with leaf area index. At each timestep (n) the canopy storage is updated thus:

$${C}^{(n+1)}={C}^{(n)}+(P-{T}_{F})\Delta t$$
(3)

Based on the surface energy balance, the fraction of the proportion of water stored in the canopy compared with the maximum canopy capacity of that plant type is used to calculate the effective surface resistance to determine tile evapotranspiration.

Surface runoff is generated by two processes in JULES: infiltration excess, where the water flux at the surface is greater than the infiltration rate of the soil, and saturated excess overland flow where the water flux at the surface is converted to runoff when the soil is completely saturated. To calculate the saturation-excess overland flow, the PDM69 is used to determine the fraction of the model grid cell that will be saturated (fsat) which is used as a multiplier to convert any excess water reaching the surface to runoff:

$${f}_{{{{{{\mathrm{sat}}}}}}}=1-{\left[\frac{{\max }(0,S-{S}_{0})}{{S}_{{\max }}-{S}_{0}}\right]}^{\frac{b}{b-1}}$$
(4)

where S is the fraction of the grid cell soil water storage, S0 is the minimum storage at and below which there is no surface saturation (mm), Smax is the maximum grid cell storage (mm) and b is the Clapp and Hornberger71 soil exponent. We use the topography-derived parameterisation for the b and S0/Smax parameters to reduce individual calibration with the following relationship55:

$$\left\{\begin{array}{c}b=2.0\hfill\\ {S}_{0}/{S}_{{\max }}=\,{\max }\,(1-\frac{s}{{s}_{{\max }}},\,0.0)\end{array}\right.$$
(5)

where s is the grid cell slope (°) and smax is the maximum grid cell storage (mm). Once interception and surface runoff have been calculated, the remaining water enters the soil. This water is allocated to the different soil layers within the soil column by using the Darcy–Richards equation:

$$W=k\left(\frac{{{{{{\mathrm{d}}}}}}\varphi }{{{{{{\mathrm{d}}}}}}z}+1\right)$$
(6)

where W is the vertical flux of water through the soil (kg m−2 s−1), k is soil conductivity (kg m−2 s−1), φ is suction (m) and z is the vertical flux of water through the soil (m). To calculate suction and soil conductivity, we use the van Genuchten72 scheme:

$$\left(\frac{\theta }{{\theta }_{s}}\right)=\frac{1}{{[1+{(\alpha \varphi )}^{(\frac{1}{1-m})}]}^{m}}$$
(7)

where θ is the average volumetric soil moisture (m3 m−3), θs is the soil moisture at saturation (m3 m−3), α and m are van Genuchten parameters dependent on soil type. The hydraulic conductivity is calculated thus:

$${K}_{h}={K}_{hs}{S}^{\varepsilon }{\left[1-{\left(1-{S}^{\frac{1}{m}}\right)}^{m}\right]}^{2}$$
(8)

where Kh is the hydraulic conductivity (m s−1) and Khs is the hydraulic conductivity for saturated soil (m s−1). ε is an empirical value set at 0.5 in JULES and S is found by:

$$S=\frac{(\theta -{\theta }_{r})}{({\theta }_{s}-{\theta }_{r})}$$
(9)

where θr is the residual soil moisture (m3 m−3). Vegetation can access water from each level in the soil column as a function of the root density where the fraction of roots (r) in each soil layer (l) from depth zl-1 to zl is:

$${r}_{l}=\frac{{e}^{-\frac{2{z}_{l-1}}{{d}_{r}}}-{e}^{-\frac{2{z}_{l}}{{d}_{r}}}}{1-{e}^{-\frac{2{z}_{t}}{{d}_{r}}}}$$
(10)

where zl is the depth of the lth soil layer, dr is the root depth (m) and zt is the total depth of the soil column (m). The water flux extracted from a soil layer is elE where E is transpiration (kg m−2 s−1) and el can be found by:

$${e}_{l}=\frac{{r}_{l}{\beta }_{l}}{{\sum }_{l}{r}_{l}{\beta }_{l}}$$
(11)

and βl is defined by:

$${\beta }_{l}=\left\{\begin{array}{cc}1 \hfill& {\theta }_{l}\ge {\theta }_{c}\hfill\\ ({\theta }_{l}-{\theta }_{w})/({\theta }_{c}-{\theta }_{w}) & {\theta }_{w}\, < \,{\theta }_{l} \, < \, {\theta }_{c}\hfill\\ 0 \hfill& {\theta }_{l}\le {\theta }_{w}\hfill\end{array}\right.$$
(12)

where θc and θw are the volumetric soil moisture critical and wilting points respectively (m3 m−3) and θl is the unfrozen soil moisture at that soil layer (m3 m−3). In this configuration of JULES, when a soil layer becomes saturated, the excess water is routed to lower layers. When the bottom layer becomes fully saturated any excess water is added to the subsurface runoff. Both the surface and subsurface runoff are then passed to the River Flow Model65,73 which routes the flows according to a flow direction grid74.

This study uses a combination of calibrated model parameters from the previous work of Robinson et al.59 and Martinez-de la Torre et al.55 (Rose suites u-bi090 and u-au394, respectively, which can be found using the Rose/Cylc suite control system: https://metomi.github.io/rose/doc/html/index.html). We compare observed streamflow from the NRFA database75 with model output for the years 2000–2010 using the base land and CHESS-met datasets. The model is spun-up for the years 1990–2000 to ensure soil moisture content has been equilibrised. To quantify the accuracy of the model, we use a range of standard error metrics. These include the Nash–Sutcliffe efficiency76 measure:

$${{{{{\mathrm{NSE}}}}}}=1-\frac{{\sum }_{i=1}^{n}{({Q}_{{{{{{\mathrm{sim}}}}}}}-{Q}_{{{{{{\mathrm{obs}}}}}}})}^{2}}{{\sum }_{i=1}^{n}{({Q}_{{{{{{\mathrm{obs}}}}}}}-{\bar{Q}}_{{{{{{\mathrm{obs}}}}}}})}^{2}}$$
(13)

Kling–Gupta efficiency77:

$${{{{{\mathrm{KGE}}}}}}=1-\sqrt{{(r-1)}^{2}+{\left(\frac{{\sigma }_{{{{{{\mathrm{sim}}}}}}}}{{\sigma }_{{{{{{\mathrm{obs}}}}}}}}-1\right)}^{2}+{\left(\frac{{\mu }_{{{{{{\mathrm{sim}}}}}}}}{{\mu }_{{{{{{\mathrm{obs}}}}}}}}-1\right)}^{2}}$$
(14)

Root-mean-squared error:

$${{{{{\mathrm{RMSE}}}}}}=\sqrt{\mathop{\sum }\limits_{i=1}^{n}{({Q}_{{{{{{\mathrm{sim}}}}}}}-{Q}_{{{{{{\mathrm{obs}}}}}}})}^{2}}$$
(15)

Mean absolute error:

$${{{{{\mathrm{MAE}}}}}}=\frac{{\sum }_{i=1}^{n}|{Q}_{{{{{{\mathrm{sim}}}}}}}-{Q}_{{{{{{\mathrm{obs}}}}}}}|}{n}$$
(16)

where Qsim is the simulated discharge, Qobs is the observed discharge, r is the linear correlation between observation and simulations, σsim|obs is the standard deviation of discharge, μsim|obs is the mean of discharge and n is the number of observations. We also use NSE(log(Q)) and KGE(1/Q) to understand how well the model can reproduce low flows. Using these measures, we find that JULES performs satisfactorily apart from the Avon Catchment which may be due to fast subsurface flows generated by its geology55 (Supplementary Table S7). With process-based models, it is difficult to both accurately reproduce physical processes and make the output faithful to reality due to epistemic uncertainties78. Even though model performance is not the same as achieved with calibrated conceptual or empirical models, it allows us to determine the effects of vegetation changes on the hydrological cycle.

JULES’ ability to faithfully represent hydrological land-surface processes in Great Britain has been evaluated in several studies58,79,80 and the plant functional type parameters it uses at global scales81,82. To validate the ability of our configuration of JULES to represent soil moisture and potential evapotranspiration rates, we compare the model output with observations from twelve COSMOS-UK sites within our catchments covering grasslands, croplands, coniferous and broadleaf woodland83 (Supplementary Fig. S8). We evaluate model performance from the start of the COSMOS-UK station records until January 1, 2018 so that we use the same forcing data as our experiments. Station start dates vary from October 2013 to August 2017. We compare COSMOS-UK observed soil moisture to the first 0.1 m of the soil column in JULES and evaporation to the sum of the soil evaporation and plant transpiration. We find a median KGE score of 0.44 for the topsoil moisture and 0.53 for potential evaporation (Supplementary Tables S9 and S10). Low error metrics observed for topsoil moisture are due to systematic undercalculation by JULES80. At our broadleaf sites, Alice Holt and Wytham Woods, we find both systematic over and underprediction of the topsoil moisture respectively. In line with other studies, we find that there is a slight overestimation of evaporation in JULES58,84. As illustrated by the median coefficients of determination between the COSMOS-UK and JULES data of 0.62 and 0.60 for the topsoil moisture and evaporation respectively, JULES broadly represents changes in these variables over time.

Land-cover scenarios

Modelling the influence of afforestation on catchment hydrology has been attempted before but usually only at the scale of a single catchment for a limited range of scenarios. In this study, we focus on the theoretical effect of widespread planting of broadleaf trees to examine whether planting location is a stronger control on hydrological response than afforestation extent by using a large ensemble of up to 288 land-cover change scenarios. We choose to focus just on broadleaf woodland for several reasons. First, we are trying to replicate a landscape that could be considered a natural climatic climax community that might occur if it had not been for human intervention during the Holocene. Second, broadleaf woodland has the potential to absorb and store carbon in soils for longer time periods. Finally, to reduce computational cost and the issue of potentially expanding the errors induced by potentially spurious parameters of needleleaf woodland in this version of JULES85. Although potential woodland planting locations have been suggested by the Environment Agency and authorities in Scotland and Wales86,87,88, the differences in planting criteria means it is not possible to systematically compare hydrological changes across our chosen catchments. Here we attempt to create afforestation scenarios related to both catchment river network structure and land use that are directly comparable across a range of catchments. Afforestation was in grassland areas to reduce the complexity of the decisions made and enable an understanding behind catchment sensitivity to land-cover changes related to soil and catchment structure.

Three metrics were selected to discretise the catchment into distinct areas for afforestation: the Topographic Wetness Index (TWI)28, Strahler27 and Shreve orders26. These metrics capture different parts of the catchment such as propensity for saturation, drainage network location and relative contributing areas. TWI is calculated by:

$${{{{{\mathrm{TWI}}}}}}=\,{{{{\mathrm{ln}}}}}\,\frac{a}{\tan \,\gamma }$$
(17)

where a is the upslope area draining through a point, per unit contour length, and γ is the local surface topographic slope in radians. All three metrics were calculated using the 50 m IHDTM60, thresholding stream formation at an accumulation of ten pixels using the D8 flow direction algorithm within ArcGIS 10.6.189. Strahler order ranges from one (headwaters) to seven (lowlands). Due to the continuous nature of TWI (0.05–31.49) and the large ordinal range of Shreve order (1–9523) calculated for the entire British Isles, we group TWI orders into five quantiles and seven quantiles for Shreve. Increasing TWI order in this case indicates increasing propensity for saturation, or potential maximum saturation level, and increasing Shreve order indicates increasing contributing area. Catchments were broken down to watersheds from the downstream point of the Shreve and Strahler orders. Due to the nature of the data, this led to some first order Strahler catchments being incorrectly generated for some catchments (Supplementary Table S7). Using these generated catchment areas, we plant both inside and outside of these watersheds to understand the hydrological difference between opposing planting locations. In each of the catchment areas, two different levels of afforestation were tested of ~25 and 50% of the possible planting area. Planted area was assigned at random in the catchment and was produced by calculating the area available for afforestation and randomly producing points that covered the area required using the Create Random Points tool in ArcGIS 10.6.1.

Discussions exist about where to plant woodland in relation to existing land cover, to provide ecosystem services, including around watercourses29,30, urban areas31,32 and woodland4,33. Therefore, in this study we try to understand how these potential planting scenarios will affect hydrology in general. Using the CEH 2000 land-cover map56 buffers of broadleaf land cover were created at 25 and 50 m around these three land uses (Supplementary Fig. S7). These were then discretised according to the catchment areas. As an example, one scenario would be afforesting up to 50 m around existing broadleaf woodland inside the Shreve order one catchment area, whilst another would be randomly afforesting within 25% of the available area outside of TWI order five areas.

Afforestation according to different catchment areas and land-cover uses between 234 and 288 scenarios for each catchment and between 0 and c. 40 percentage point increase in broadleaf woodland (Supplementary Fig. S2 and Table S8). Due to the structure and size of the different catchments, and thus differences in Strahler and Shreve orders, not all catchments had a comparable number of higher orders. Produced scenarios were converted to the 1-km2 grid scale by altering the fraction of land-cover types within each grid cell. It should be noted that this work only considers the impact of mature broadleaf woodland and neglects the influence of the initial planting and growing of the woodland that would likely have its own impact on catchment hydrology as frequently reported13,49. Furthermore, it does not include the period when there would be the highest amount of carbon sequestration. This study seeks to understand the theoretical impact of woodland on catchment hydrology when fully developed to understand the long-term implications of management decisions.

Hydrological signatures and analysis

Several hydrologic indices can be used to characterise the influence of afforestation on streamflow regime34,35. To analyse average streamflow and extremes, we look at the top 1% (very high flow), 5% (high flow), 50% (median flow), 90% (low flow) and 95% (very low flow) quantiles of daily streamflow. To quantify flow variability, we use the slope of the flow duration curve38,40 calculated thus:

$${{{{{\mathrm{FDC}}}}}}=\frac{{{{{\mathrm{ln}}}}}({Q}_{33 \% })-\,{{{{\mathrm{ln}}}}}({Q}_{66 \% })}{(0.66-0.33)}$$
(18)

where Q33% is the 33rd flow exceedance quantile and Q66% is the 66th flow exceedance quantile. To ascertain catchment responsiveness to climatic forcing, we use median streamflow elasticity40,41:

$${{{{{\mathrm{MSE}}}}}}={{{{{\mathrm{median}}}}}}\left(\frac{{{{{{\mathrm{d}}}}}}Q}{{{{{{\mathrm{d}}}}}}P}\frac{P}{Q}\right)$$
(19)

where dQ and dP are the annual changes in yearly discharge and precipitation, respectively. Finally, we use the runoff ratio to quantify water balance changes related to streamflow and evapotranspiration42:

$${{{{{\mathrm{RR}}}}}}=\frac{{\mu }_{Q}}{{\mu }_{P}}$$
(20)

where µQ and µP are the average yearly discharge and precipitation using daily values, respectively. We also qualitatively assess the largest peak flow daily event in the 10-year record used in this study to determine the impact of afforestation on the highest possible flows in each catchment.

To determine how afforestation influences streamflow metrics, percentage changes in flow metrics are plotted as a function of percentage point increases in afforestation (calculated using the difference between original and afforested scenario). Quantile regression is applied to determine the median regression slope of the trend for the entire period43. The benefit of using quantile regression is that it identifies the median response of the input variable (in this case the level of afforestation in both percentage and absolute terms) without being influenced by extreme outliers. In this way, we can estimate the proportional streamflow response to afforestation over the period. We use the regression slope coefficient as a proxy of catchment sensitivity to afforestation for each streamflow metric. The slope coefficient is then correlated to catchment attributes, as stated in the CAMELS-GB dataset44, using Spearman’s rank correlation. This allows us to determine the direction and significance of the catchment property influences on the sensitivity of catchments to afforestation for the different hydrologic signatures. To determine the impact of different planting locations according to catchment and land-cover location a one-way analysis of variance (ANOVA) test is undertaken using R90.