A big data approach to macrofaunal baseline assessment, monitoring and sustainable exploitation of the seabed

Cooper, K. M.; Barry, J.

doi:10.1038/s41598-017-11377-9

Download PDF

Article
Open access
Published: 29 September 2017

A big data approach to macrofaunal baseline assessment, monitoring and sustainable exploitation of the seabed

K. M. Cooper¹ &
J. Barry¹

Scientific Reports volume 7, Article number: 12431 (2017) Cite this article

6188 Accesses
18 Citations
60 Altmetric
Metrics details

Subjects

Abstract

In this study we produce a standardised dataset for benthic macrofauna and sediments through integration of data (33,198 samples) from 777 grab surveys. The resulting dataset is used to identify spatial and temporal patterns in faunal distribution around the UK, and the role of sediment composition and other explanatory variables in determining such patterns. We show how insight into natural variability afforded by the dataset can be used to improve the sustainability of activities which affect sediment composition, by identifying conditions which should remain favourable for faunal recolonisation. Other big data applications and uses of the dataset are discussed.

A national macroinvertebrate dataset collected for the biomonitoring of Ireland’s river network, 2007–2018

Article Open access 25 August 2020

Geospatial data on the sediments of Lake Balaton

Article Open access 18 January 2024

FISHGLOB_data: an integrated dataset of fish biodiversity sampled with scientific bottom-trawl surveys

Article Open access 04 January 2024

Introduction

In common with many parts of the world, the UK’s seas are increasingly subject to pressure from a range of anthropogenic activities^1,2. These activities include, inter alia, fishing, oil and gas operations, aggregate dredging, renewable energy generation, dredging of ports and harbours, and installation of cables and pipelines. To ensure maintenance of a ‘healthy’ marine environment³, it is essential that these activities are, as far as possible, carried out in an environmentally sustainable way. Whilst the focus of this study is on marine aggregate dredging and faunal-sediment relationships, findings may have relevance for other activities affecting seabed sediment composition.

The UK marine aggregate dredging industry produces sand and gravel (aggregate) from licensed extraction areas located around the coast of England and Wales (Fig. 1a), with material used for construction, fill and coastal defence⁴. The process of aggregate dredging can create some localised environmental impacts (e.g. changes in seabed topography^5,6, alterations to sediment composition^{6,7,8,9,10,11,12,13} and loss of benthic fauna^6,14,15), although these vary considerably^16,17. In recognition of the likely impacts, the aggregate industry’s operations are subject to Environmental Impact Assessment (EIA), and, where activities are subsequently licensed, to environmental monitoring over the course of the dredging permission.

Typically, this environmental monitoring has focused on assessing impacts of ongoing dredging, with considerable attention given to the invertebrate seabed assemblages or ‘benthos’. Whilst of interest, impacts to the benthos (e.g. reductions in faunal diversity and abundance) are increasingly predictable as the number of scientific studies increases¹⁷. What the monitoring has not explained to date, however, is what is likely to happen to impacted sites after cessation of dredging¹¹? For example, will it be possible for the original faunal assemblage type to return? Whilst this issue of recoverability is surely the most important question for sustainability, the number of recovery studies remains limited^{10,12,13,18,19,20,21,22} due to the time-consuming and expensive nature of the work.

Expert judgements are made regarding the faunal recovery potential of aggregate extraction sites. However, predictions are difficult given the large number of factors involved (e.g. sensitivity of faunal communities to disturbance and changes in sediment composition¹¹, nature of the local environment and its capacity to recover from physical disturbance^13,23). Where recovery does not proceed in line with expectations then, theoretically, options for active seabed restoration can be considered²⁴. However, whilst technically feasible, such options are likely to be expensive²⁵ and outcomes uncertain^26,27. For this reason, the existence of unacceptable residual impacts should, arguably, signify a failure of the monitoring and management process^17,25.

In recognition of these challenges, an alternative approach to monitoring has recently been suggested^28,29. This new approach seeks to ensure the return of the pre-impacted faunal assemblage type, thus preserving the ecosystem functioning of the seabed³⁰. This is achieved by maintaining the associated habitat (sediment composition) within certain limits. These habitat limits are determined by the range of conditions seen for the particular assemblage in the wider environment. Following the successful testing of this approach at site specific²⁸ and regional²⁹ scales, a decision was taken by regulators, industry and other stakeholders to adopt it across all regions of aggregate dredging in the UK (see Fig. 1a). The newly termed ‘Regional Seabed Monitoring Programme’ (RSMP)³¹ is designed to improve environmental protection by making it clear when unacceptable changes in sediment composition are occurring, allowing for early management intervention. It is also expected to significantly reduce the costs of monitoring for the aggregates industry.

To allow for implementation of the RSMP, it is necessary to: (i) produce a baseline assessment of the UK’s macrobenthic infauna, with a particular focus around sites and regions of marine aggregate dredging, (ii) identify the range of sediment composition found in association with the different baseline faunal assemblages, and (iii) develop a method for assessing the likely ecological significance of anthropogenically-induced changes in sediment composition. These objectives are achieved using a dataset comprising of new and existing data belonging to government and industry sources.

Results

Faunal data analysis

Univariate indices

Maps of taxon (family) richness (hereafter referred to as taxon richness) and total abundance reveal a complicated picture, although some patterns are discernible (Fig. 2). For instance, within the North Sea there is an underlying trend of increasing values for both measures with increasing latitude. A similar trend is observed from the Celtic Shelf to the southern Irish Sea. Within the English Channel values are generally high, although there is a clear transition to lower values at the eastern end. Patterns within the Irish Sea and along the west coast of Scotland are less clear because of the lower numbers of samples in these areas. Hotspots of diversity (in terms of taxon richness) can be found in several areas. Many of these hotspots are consistent with underlying trends (e.g. mid English Channel and mid Irish Sea), whilst others appear in marked contrast (e.g. Humber, Outer Thames). Within the Humber hotspot in particular, samples with very high values of both measures lie adjacent to those with very low values.

Community analysis

The elbow plot relating to the faunal data did not suggest an obvious number of groups for k-means clustering (Fig. 3a). We chose a clustering solution based on 12 groups, as this number coincided with a slight levelling out of the plot and explained >70% of the inherent variability. Opting for a lower rather than a higher number of cluster groups also served to increase the number of sample replicates for sediment analysis (see below).

The dendrogram reveals four broad groupings in the data, within which there are further subdivisions (Fig. 3a). Clear differences can be seen in the spatial distribution of faunal assemblages at a UK level (Fig. 4). For instance, some are spatially restricted (e.g. A1, A2a, B1a and B1b), whilst others are more widely distributed (e.g. C1a, C1b, D2a, D2b, D2c and D2d). Assemblage A1 is found almost exclusively off the Humber. A2a is predominately found in the southern North Sea, whilst being almost entirely absent from the northern North Sea and Celtic Sea. The main areas for assemblage A2b occur in the mid English Channel and mid Irish Sea, although it is also found in the outer Thames estuary and in other isolated patches. Assemblages B1a and B1b occur mainly in the eastern English Channel, with isolated occurrences elsewhere. C1a is found from the Humber region round to Lyme Bay, and also in the Bristol Channel and mid Irish Sea; isolated patches occur elsewhere. C1b has a widespread distribution, with notable patches in the southern North sea and coastal areas of the eastern English Channel. D1 has a largely coastal distribution throughout UK waters, although it is also found in some offshore areas of the southern North Sea. D2a has a widespread distribution, although it is notably absent from the more sandy areas of the Severn Estuary. Assemblage D2b is common in the deeper waters of the northern North Sea, the Celtic Sea and off the west coast of Scotland; with isolated occurrences elsewhere. Assemblage D2c is commonly found off the east coast, eastern end of the English Channel, Celtic Sea and Bristol Channel/Severn Estuary. D2d is found in offshore areas of the southern North Sea and on the Dogger Bank. Elsewhere, D2d can be found in coastal areas along the west coast of England and Wales, off the North East of England, in the eastern English Channel and in other isolated patches.

Results of a SIMPER analysis showed clear differences in the number of characterising taxa between the faunal assemblage groups (Table 1). For example, groups D2a, D2b, D2c and D2d were characterised by low numbers of taxa, whilst groups A1, A2a, A2b and B1a, had relatively higher numbers of characterising taxa. Polychaetes were the dominant faunal group across all assemblages, with bivalve molluscs being a feature of groups A1, A2a, B1b, D1, D2b and D2d. Epifaunal taxa such as bryozoans were characteristic of groups A1, A2b and B1a. Several taxa (e.g. Spionidae) characterised multiple assemblage groups, whilst others (e.g. the bivalve mollusc, Glycymerididae) were characteristic of only one assemblage, B1b. The highest and lowest mean numbers of taxa and abundance were associated with faunal groups A1 and D2c respectively.

Table 1 Biological characteristics of the macrofaunal assemblages identified through a k-means clustering of macrofaunal data (colonials included, forth-root transformation).

Full size table

Temporal assessment

Plots of faunal cluster identity by year (Fig. 5a) and season (Fig. 5b) showed broadly consistent spatial patterns. For example, assemblages A1/C1a, D2b, D2c, D2d and are frequently found, respectively, off the Humber estuary, in the deeper waters of the northern North Sea, off East Anglia, and on the Dogger Bank. This suggests that faunal assemblages, at the level of family, are largely stable though time. Within the eastern English Channel dredging region, the apparent switch from assemblage B1a to B1b (Fig. 5a) can be explained by the omission of colonial taxa from surveys undertaken between 2008 and 2013.

Explaining patterns in faunal distribution

Results of the best analysis identified 3 variables: AvCur, Mud and Sand as ‘best explaining’ the patterns in the macrofaunal data (Table 2). Despite the moderate correlation between the underlying resemblance matrices (ρ = 0.4), results from adonis showed that these predictors only accounted for a total of 13.1% of the total variability (AvCur = 6.1%, Mud = 3.0% and Sand = 4.0%); all were statistically significant (p < 0.01).

Table 2 Results of a best analysis identifying the subset of environmental variables which are most correlated with the macrofaunal data.

Full size table

When considered together, the dbRDA ordination (Fig. 6) and heat maps for individual environmental variables (Supplementary Fig. S1) provide some insight into the spatial distribution of faunal assemblages (Fig. 4b). For example, groups D1 and D2b are found in areas of high mud content with weaker currents; D2c and D2d dominate in areas of high sand content; A1, A2b, B1a, B1b and C1a and are all found in areas with high gravel content and strong currents; C1b, A2a and D2a are found in areas of more mixed sediment.

Faunal distribution within areas of aggregate industry interest

Maps showing the faunal cluster identity of stations within aggregate extraction sites, their zones of potential secondary effect, and the wider region (Fig. 7) fulfil the first objective of this study: to produce a baseline assessment of benthic macrofauna, with a particular focus around sites and regions of marine aggregate dredging. As seen in these plots, the nature of faunal assemblages differs between extraction areas, with some supporting only a single assemblage (e.g. Figure 7c,f), whilst others support multiple groups (e.g. Figure 7b). With the exception of assemblage D2b, all groups are represented within extraction areas. In addition, taxon richness also shows wide variation across extraction sites, both within and between regions. For example, some sites support very spare assemblages (Fig. 7c,f), whilst others support much richer assemblages (Fig. 7a,d,e).

Faunal-sediment relationships

In this section we examine the composition of sediments found in association with different faunal cluster groups - first using all the data (i.e. irrespective of sample location) and then within different physical cluster regions to control for the influence of other variables. We also explore whether reductions in the proportion of gravel could lead to declines in taxon richness and total abundance.

Identification of physical cluster groups

Based on the output of an elbow plot (Fig. 3b), k-means clustering was used to partition samples into 10 environmental cluster groups (Fig. 8a). Box and whisker plots for environmental variable by physical cluster group (Fig. 8b) provide insight into the different environmental conditions (not including sediments) associated with each group. For example, sites belonging to cluster group 6, located around the Isle of Wight, are associated with strong average currents and high bed stress.

Sediment composition by faunal and faunal-physical cluster groups

Examination of cumulative sediment distribution plots reveals some clear differences in the mean sediment composition of samples from the different faunal cluster groups (Fig. 9a). The most obvious separation is between samples with a significant proportion of gravel (groups A, B and C), versus those mainly dominated by sand (group D). Sand-dominated groups are split between those with higher (D1 and D2b) and lower (D2a, D2c and D2d) percentages of mud. Variability in sediment composition, as revealed by values of MVDISP, was highest for samples belonging to D2c and lowest for samples belonging to B1a and B1b (Table 3).

Table 3 Mean percentage composition of sediments by Wentworth size class for each faunal cluster group.

Full size table

Cumulative sediment distribution plots for each faunal-physical cluster group show a high degree of similarity, but also some differences (Fig. 9b). In most cases the more extreme values can be explained by a low number of replicates (e.g. A1_9, A2a_4, A2a_10, D1_6). Details of the mean sediment composition for each faunal-physical cluster group are shown in Supplementary Table S2.

Relationship between gravel and taxon richness/total abundance

Using all available data, a plot of percentage gravel versus taxon richness shows a moderate positive relationship between the two variables (Fig. 10a). However, when the data are plotted by faunal cluster group the same relationship no longer applies (Fig. 10b), with the possible exception of faunal group D1. Whilst the slopes of some groups are significantly different from zero, this is a reflection of the large number of samples rather than a meaningful trend. This suggests the positive correlation seen in Fig. 10a results from the regression across cluster groups. Similar results were obtained for percentage gravel versus total abundance.

Assessing the ecological significance of sediment change

The present study indicated that test sample ‘EEC2010_Site 102’ had a baseline faunal-physical cluster group identity of B1a_4. The sediment means and covariance matrix for this group are shown in Table 4. The test sample failed the Mahalanobis distance test (p < 0.05), and comparison of the test values with the means of the cluster group suggested the failure resulted principally from an excess of fine sand (fS, Table 4). Further testing of the approach at a former aggregate extraction site in the southern North Sea¹³ can be found in the Supplementary Note 1.

Table 4 Summary of a Mahalanobis distance test for the sediment composition of sample ‘EEC2010_Site 102’.

Full size table

Discussion

This study had 3 main objectives: (i) to produce a baseline assessment of the UK’s macrobenthic infauna, with a particular focus around sites and regions of marine aggregate dredging, (ii) to identify the range of sediment composition found in association with the different baseline faunal assemblages, and (iii) to develop a method for assessing the likely ecological significance of sediment change. Achieving these objectives is an important step in the development of the RSMP approach to monitoring^23,31. This approach aims to improve sustainability by ensuring the sediment habitat, within the footprint of dredging effect, is able to support the return of the original faunal assemblage type after cessation of activities.

We identified 12 faunal assemblages and their distribution around the UK. Some of these assemblages were geographically isolated, whilst others were more widespread. With the exception of D2b, all assemblage types were represented, in at least one location, within areas of aggregate dredging interest. Spatial patterns in faunal assemblages were broadly consistent over time (by year and season), and were largely driven, as far as could be explained, by sediment composition and hydrodynamics. Notable differences in the composition of sediments were observed between samples from different faunal assemblage groups. Within these groups, some small differences in sediment composition were evident between samples from the different physical cluster regions (Fig. 9b), thus supporting the assessment of sediment change based on samples from the same faunal and physical cluster region.

The approach taken to assessing whether sediments remained within an acceptable condition (i.e. Mahalanobis distance test) was successful in identifying known problems at an existing monitoring site in the eastern English Channel dredging region³², and at a former aggregate extraction site in the southern North Sea¹³ (see Supplementary Information). Furthermore, in both examples a comparison of the sediment composition of the test samples with that of the wider cluster group(s) correctly identified the sediment fractions known to be responsible for the problem. A comparison of taxon richness by percentage gravel for individual faunal groups (Fig. 10) suggests that a reduction in the proportion of gravel would not necessarily lead to a decline in richness, so long as sediments remained within the limits defined by the Mahalanobis distance test.

Findings from this study are broadly consistent with those of other workers in terms of broadscale patterns in univariate faunal metrics³³, the number and nature of faunal assemblages^34,35,36, the consistent temporal patterns in faunal assemblages^34,37, and the existence of the same faunal assemblage types on both the east and west sides of the UK³⁶. In addition, other studies highlighted a similar set of environmental variables as influencing the distribution of faunal communities^{31,37,38,39,40}, and identified a need to account for hydrodynamic factors where assessing animal-sediment relationships³⁸. Finally, in contrast to earlier work^28,29, we did not set out predefined sediment limits by faunal cluster group. We contend that the setting of sediment limits, as defined by the full sediment envelope, is statistically questionable given that individual sediment fractions are not independent. The approach taken in the present study also allows for the continual addition of new data, meaning decisions will always be based on the most comprehensive dataset available.

This study is important for a number of reasons. Firstly, it highlights the considerable quantity of benthic data which exists across government and industry. Through integration, standardisation and analysis of the combined dataset, this study has produced a faunal baseline, reducing the need for reliance on modelled data⁴¹. This work has been possible due to improvements in access to data, and a willingness among data owners to share information. Secondly, the study represents a significant step forward in our understanding of how sediment change is likely to affect benthic faunal assemblages. Whilst it has long been recognised that faunal assemblages differ in their sensitivity to changes in sediment composition¹¹, and that a faunal recovery is often predicated on a physical recovery¹³, it has hitherto not been possible to quantify such relationships. Both the close relationship between sediments and the benthos, reported here and in the literature³⁸, and the consistent temporal patterns in faunal assemblage distribution support the development of the RSMP approach.

The consistent spatio-temporal patterns in macrofauna are, perhaps, unsurprising given the key role of sediments in structuring faunal assemblages, and the fact that sediment distribution is moderated by hydrodynamic forces³⁷. In addition, our analyses were undertaken at the family level, and this could obscure any changes at the lower taxonomic levels of genus and species. Within faunal cluster groups, the lack of a strong relationship between taxon richness/total abundance and percentage gravel requires some explanation. One possibility, at least within the mobile sandy areas, is that taxa are unable to colonise any gravel effectively due to the abrasive nature of sand in suspension.

It is important to acknowledge limitations associated with the present study. For instance, the acceptability of changes in sediment composition is fundamentally linked to the number and identity of faunal assemblages. In the present study there was no obvious clustering solution, and we justify our selection of 12 groups as a balance between capturing biological complexity, whilst ensuring an adequate number of replicates for analysis of sediment composition. With 12 groups we were able to capture approximately 70% of the inherent variability (Fig. 3a) and whilst the number of replicates was generally high, for some faunal-physical groups numbers were still low (see Supplementary Table S2). In future, as more data become available, the analyses could be repeated with a greater number of cluster groups. Whilst a more objective means of identifying cluster groups is available (i.e. SIMPROF)⁴², this approach is not suitable for use with k-means. Further, we contend that statistical significance is less important than deciding on an operationally good number for the purpose in question. The lack of an obvious faunal clustering solution supports the idea of there being a continuum rather than wholly distinct faunal assemblages. This is presumably the result of taxa which occur across multiple groups (e.g. Spionidae, Nemertea, Capitellidae). We also recognise that the faunal baseline presented in this study is based not on a one-off snapshot survey, but rather the accumulated picture from 48 years of macrobenthic surveys. Whilst this could be considered a weakness, the majority of samples (96%) were acquired post 2000, and the consistent spatio-temporal patterns identified in this study provide confidence in the results. Furthermore, by incorporating an element of temporal variation, the baseline is, arguably, more robust for detecting anthropogenic change against a background of natural variability. The analysis of data at family level might also be criticised. However, there is precedence for working at this level⁴³, and we considered it to be the only pragmatic way to address the inevitable differences in taxonomic resolution between surveys and thus to create a standardised dataset. That this was necessary highlights the need for greater adoption of quality control measures for macrobenthic sample processing. There are also issues regarding the comparability of sediment data generated through a combination of sieving and laser sizing versus sieving only, and this requires further investigation.

This study provides a faunal baseline and methodology for assessing the ecological significance of sediment change. The approach will be now be used by the marine aggregates industry to assess the status of the seabed as part of their new Regional Seabed Monitoring Programme (RSMP)^23,31. Where sediment samples fail the Mahalanobis distance test this should lead to further investigation to determine the most likely cause (e.g. natural variability, sampling issues, other human activities or aggregate dredging). Whilst it was developed for the UK marine aggregate industry, this approach could also be relevant for other offshore activities which can affect the composition of seabed sediments (e.g. dredge material disposal, offshore windfarms, pipelines, drilling for oil and gas). As recently highlighted³¹, the harmonisation and integration of offshore monitoring programmes could deliver significant benefits for industry and government. More widely, big data applications can help identify the likely significance of human-induced changes in the environment through an improved understanding of natural variability. This will lead to better management and improved sustainability.

It remains a hypothesis that faunal recovery will occur after dredging assuming that sediments are left within an acceptable condition, as assessed using a Mahalanobis distance test. As such, there is a need to test this hypothesis as and when opportunities arise. Additional work would also be useful to address the issue of comparability between sediment data generated through sieving and a combination of laser sizing and sieving (see above). Work is also required to understand the implications of the failure of sediment data to meet the assumption of multivariate normality. It is recommended that new data continue to be added to the existing dataset so that future analyses are always based on the best available evidence.

Conclusion

A big data approach offers valuable insights into the natural variability inherent within different ecosystems. This understanding increases our ability to identify which human induced impacts are likely to have long-term ecological significance. This leads to more effective management, innovative and cheaper monitoring solutions, and ultimately, better environmental sustainability.

Methods

The dataset

The dataset compiled for this study comprises of 33,198 macrofaunal samples (83% with associated data on sediment particle size composition) covering large parts of the UK continental shelf (Fig. 1b). Whilst the majority of samples come from existing datasets, also included are 2,500 new samples collected specifically for the purpose of this study (Fig. 11). These new samples were acquired during 2014–2016 from the main English aggregate dredging regions (Humber, Anglian, Thames, Eastern English Channel and South Coast) and at four individual, isolated extraction sites where the RSMP methodology is also being adopted (e.g. Area 457, North-West dredging region; Area 392, North-West dredging region; Area 376, Bristol Channel dredging region; Goodwin Sands, English Channel). This work was funded by the developers, and carried out by contractors on their behalf. Samples were collected in accordance with a detailed protocols document⁴⁴ which included control measures to ensure the quality of faunal and sediment sample processing. Additional samples were acquired to fill in gaps in spatial coverage and to provide a contemporary baseline for sediment composition.

Sources of existing data include both government and industry, with contributions from the marine aggregate dredging, offshore wind, oil and gas, nuclear and port & harbour sectors. Samples have been collected over a period of 48 years from 1969 to 2016, although the vast majority (96%) were acquired since 2000. Samples have been collected during every month of the year, although there is a clear peak during summer months when weather conditions are generally more favourable for fieldwork. A variety of gear types have been used for sample collection including grabs (0.1 m² Hamon, 0.2 m² Hamon, 0.1 m² Day, 0.1 m² Van Veen and 0.1 m² Smith McIntrye) and cores. Of these various devices, 93% of samples were acquired using either a 0.1 m² Hamon grab or a 0.1 m² Day grab. Sieve sizes used in sample processing include 1mm and 0.5mm, reflecting the conventional preference for 1mm offshore and 0.5mm inshore (see Fig. 11). Of the samples collected using either a 0.1 m² Hamon grab or a 0.1 m² Day grab, 88% were processed using a 1mm sieve.

Taxon names were standardised according to the WoRMS (World Register of Marine Species) list using the Taxon Match Tool (http://www.marinespecies.org/aphia.php?p=match). Of the initial 13,449 taxon names, only 4,248 remained after correction. The output from this tool also provides taxonomic aggregation information, allowing data to be analysed at different taxonomic levels - from species to phyla. Macrofaunal data were collated using the Primer 6® software package⁴⁵, which allows for merging of individual datasets to produce a single taxon/sample matrix. Metadata pertaining to each sample were stored in a related ‘factors’ sheet within the Primer workspace. Due to the size of the dataset, it was split into multiple Primer workbooks to allow for ease of working. The final dataset comprises of a single sheet comma-separated values (.csv) file. Colonials accounted for less than 20% of the total number of taxa and, where present, were given a value of 1 in the dataset. This component of the fauna was missing from 325 out of the 777 surveys, reflecting either a true absence, or simply that colonial taxa were ignored by the analyst. Sediment particle size data were provided as percentage weight by sieve mesh size, with the dataset including 99 different sieve sizes. Sediment samples have been processed using sieve, and a combination of sieve and laser diffraction techniques. Key metadata fields include: Sample coordinates (Latitude & Longitude), Survey Name, Gear, Date, Grab Sample Volume (litres) and Water Depth (m). A number of additional explanatory variables (Table 5, variables 1–8) were acquired through interrogation of raster data layers using the over function in the statisical package R⁴⁶. In total, the dataset dimensions are 33,198 rows (samples) × 13,588 columns (variables/factors), yielding a matrix of 451,094,424 individual data values. The dataset and associated files used in this study are available from the Cefas Data Hub (https://doi.org/10.14466/CefasDataHub.34), with the R script provided in the Supplementary Information.

Table 5 Explanatory variables used in the study.

Full size table

In seeking to understand the relationship between fauna and sediments, it was important to identify and exclude any samples taken in previously impacted areas where faunal composition may not reflect a ‘natural’ state. For example, those sites which have been subject to aggregate dredging where the period of time between sample collection and last dredging is insufficient for recovery. To do this, sample locations were overlaid onto GIS layers (polygons) detailing the location of dredging in each year (available data from 1993 to 2015). An R script was used to calculate the number of years between sample collection and the time of last dredging. These values were then compared to the known faunal recovery times for different seabed landscapes²². For stations which had been subject to dredging, and where the time between last dredging and sampling was less than the predicted recovery time, the sample was flagged as ‘impacted’. Other samples known to have been taken within differently impacted areas (e.g. dredge material disposal sites) were similarly flagged. In this study we made no attempt to account for the effects of demersal fishing due to the widespread nature of the pressure, and a current lack of understanding of how it affects macroinfaunal assemblages.