This paper describes benthic coral reef community composition point-based field data sets derived from georeferenced photoquadrats using machine learning. Annually over a 17 year period (2002–2018), data were collected using downward-looking photoquadrats that capture an approximately 1 m2 footprint along 100 m–1500 m transect surveys distributed along the reef slope and across the reef flat of Heron Reef (28 km2), Southern Great Barrier Reef, Australia. Benthic community composition for the photoquadrats was automatically interpreted through deep learning, following initial manual calibration of the algorithm. The resulting data sets support understanding of coral reef biology, ecology, mapping and dynamics. Similar methods to derive the benthic data have been published for seagrass habitats, however here we have adapted the methods for application to coral reef habitats, with the integration of automatic photoquadrat analysis. The approach presented is globally applicable for various submerged and benthic community ecological applications, and provides the basis for further studies at this site, regional to global comparative studies, and for the design of similar monitoring programs elsewhere.
|Measurement(s)||marine benthic feature|
|Technology Type(s)||photoquadrat transect surveys|
|Factor Type(s)||benthic composition|
|Sample Characteristic - Organism||benthic communities|
|Sample Characteristic - Environment||coral reef • marine coral reef flat zone • marine coral reef crest • marine coral reef back reef • marine coral reef fore reef|
|Sample Characteristic - Location||Heron Island Reef, 23–052|
Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.14034320
Background & Summary
This study describes a unique point-based data set for coral reef environments, collected using a photoquadrat survey method published for seagrass environments1. The data set describes the spatial and temporal distribution of benthic community abundance and composition for Heron Reef, a 28 km2 shallow platform reef located in the Capricorn Bunker Group, Southern Great Barrier Reef (GBR), Australia. On average, 3,600 coral reef data points were collected annually over the period 2002 to 2018. Annual data sets were acquired for independent research projects, but the collection methods were consistent. The initial field data collection design was planned to acquire detailed field data to describe the spatial distribution and variability of benthic composition across the study site to assist with calibration and validation of earth observation-based mapping products.
To create a map based on earth observation imagery, it is common to use training or calibration data to transform the imagery into a map of surface properties using a supervised algorithm (e.g. multivariate statistical clustering, random forest)2. To report on the accuracy measures of the maps, reference or validation data are contrasted with the output maps3. Hence for calibration and validation purposes, georeferenced field data must be representative of all the features to be mapped and collection should ideally coincide with satellite image acquisition. Many earth observation approaches have been implemented for mapping the benthic communities of Heron Reef4,5,6,7,8,9,10,11,12 and several of these maps are now accessible online6,13,14.
Several studies have utilised time series benthic data to analyse changes in benthic community and coral type trends, supporting broad ecological knowledge of coral reef ecosystems such as the Caribbean reef degradation15 and coral cover decline on the GBR16. Similarly, benthic community and coral cover data sets have been identified as important indicators of coral reef health providing the backbone for monitoring and management initiatives around the world17,18.
Articles and data sets have been published that describe the benthic community properties of Heron Reef, however, their spatial coverage, number of georeferenced data points, and revisit times are limited19. The time series photoquadrat data sets presented in this paper could be used for further understanding of benthic community distribution, including statistical analysis of trends in coral cover, analysis of changes in benthic community and coral type, or used for testing of other earth observation-based mapping and modelling approaches. Additionally, as our methodology describes machine annotation of the field photoquadrats, it would be possible to reanalyse the photoquadrats with new categories not previously considered important from a biological perspective (e.g. unknown disease or impact, or a specific benthic community type), or for other features (e.g. the counting of sea cucumbers (Holothuroidea sp.)).
Detailed analyses of our complete data set may permit a greater understanding of the persistence and/or dynamics of the benthic community at Heron Reef. As such, our ongoing analyses include evaluation of changes in community composition following major impacts such as cyclones, coral bleaching, crown of thorns predation, etc., and additionally, statistical analyses of coral recovery after such impacts. To this degree, these benthic community data sets are invaluable.
The photoquadrat-based data in this study was collected for Heron Reef, Southern Great Barrier Reef, Australia (Fig. 1). Here we provide a short overview of the collection methods, however a detailed description can be found in11. These methods are applicable to any habitat. Photoquadrats were analysed for substrate and/or benthic community types known to be present on the reef (Fig. 1). The benthic community classes included in the analysis are shown in Table 1.
Georeferenced photoquadrat data collection
Detailed information on benthic community composition was gathered at Heron Reef on the reef flat (0–2 m depth) and at the 5 m contour on the reef slope using a repeatable and fine spatial scale (sampling every 2–4 m) technique for surveying benthic cover11. The technique required a snorkeler or diver manually capture georeferenced photoquadrats along defined transect surveys using a standard digital camera in a waterproof housing (e.g. Sony Cyber shot, Canon AA540, Lumix, or Olympus T4). A plumb-line attached to the camera, ensured that the footprint of each photoquadrat approximated 1 m2 of the benthos.
From 2002–2004, a 100 m transect tape was deployed at each defined survey start site at a maximum depth of 3 m, or on scuba at 5 m depth. From 2005 onwards, instead of deploying a tape, the surveyor towed a standard handheld GPS (e.g. Garmin eTrex, Garmin 72) at the surface in a waterproof bag for all surveys. This enabled accurate registration of the location of the acquisition of each photoquadrat, which was subsequently assigned via time synchronization, with the track log from the towed GPS. Once this method was established transect survey lengths were extended to distances of 500 m–1500 m. The start and end point of each transect was defined by GPS waypoints, permitting accurate revisits in subsequent years. The distance between successive photoquadrats was estimated by the surveyor’s kick cycle. However this was not considered a problem as the exact location of each photograph was known through the GPS synchronisation.
All surveys were performed during the day, and derivation of sunlight and sun angle can be ascertained through the timestamp of each photoquadrat and its corresponding GPS location. Reef Flat surveys were collected at high tide to provide sufficient water depth for the snorkeler to safely traverse the reef. Reef Slope surveys were collected at low tide. No water quality information was recorded.
The locations of the transect surveys were chosen to ensure they traversed gradients or edge features to detect any change in benthic cover over these features. This was done initially through visual assessment of existing satellite imagery in combination with expert knowledge of the study area. The aim was to produce data that provided an adequate representation of the variation in benthic community cover across Heron Reef. Limited transect surveys were located within the deep lagoonal area of the reef, as this area is hard to access by boat due to tidal range restrictions permitting short working times in the lagoon. Transect surveys were revisited in subsequent years, and additional transect surveys were included on subsequent trips based on increased knowledge of the environment. The benthic data sets and photoquadrat images are available at20.
Automated photoquadrat analysis for benthic community composition
Percentage cover of the benthic communities for each photoquadrat was determined through a machine-learning (ML) approach which assessed benthic community composition. A previously devised category scheme consisting of 63 class codes that differentiated all major GBR-specific coral morphologies and other bottom types was used21 which, following machine annotation, were collapsed first into broad groups and subsequently into six simplified groups for validation purposes (Table 1).
Initial training of the ML platform was achieved via manual annotation of approximately 5% of the total number of photoquadrats (equivalent to 108,700 annotated points; based on21), to achieve a machine annotation accuracy of >70% as determined by the classifier21. A unique source was created for each camera used. To give a default and uniform image annotation area, boundaries of 5% were used for the top and left sides of the photoquadrat, whilst a boundary of 95% was used for the right and bottom sides of the photoquadrat. Annotation points (50) were generated randomly over the entire annotation area per photoquadrat. For manual annotation of photoquadrat sets, the level of confidence was set to 100%. A further approximately 2.5% of photoquadrats were manually annotated in an identical manner to provide a validation data set to calculate the accuracy of the machine annotation. Automated annotation of the remaining 92.5% of the photoquadrats was achieved subsequently22.
Detailed information regarding the output benthic cover percentages and the number of benthic photoquadrats acquired for each field campaign are documented in Table 2. The benthic data sets and photoquadrat images are available at20, with the photoquadrats and benthic cover analysis for individual survey years accessible online through the campaign specific DOIs listed in the table, from where the data can be downloaded directly.
To understand the validation technique applied to these data sets, it is important to reiterate the purpose of collecting the data set itself, which was a fast field method to gather benthic community information over a large spatial extent, whilst accurately representing variability. Validation of the data set was conducted on various levels, and included: standardisation of photoquadrat capture method and conditions, and a quantitative accuracy assessment.
Standardisation of photoquadrat image capture
To standardise photoquadrat image capture, the camera and lens setup used was calibrated prior to annual survey, so as to capture a footprint that covered the same extent of the benthos. This was accomplished by attaching a plumb-line to the camera system such that when it touched the bottom, the captured photoquadrats represented ~1 m2 of the benthos. To do this standardisation, the camera was moved vertically over a marked 1 m2 until the field of view enveloped the area, and the plumb-line was fixed. During the survey the operator used the plumb-line to determine the camera height above the ground. When held vertically with the weight touching the substrate this permitted reproducible capture of photoquadrats that covered the same area for all surveys. Light conditions were generally the same for each expedition, the data collected over a consecutive 4–5 day period, with stable weather, water clarity conditions and tidal range. Ideally light conditions would have been standardised using a strobe, however this would slow down the speed of the transect surveys.
Quantitative accuracy assessment
To determine the accuracy of the machine annotation we constructed a confusion matrix that compared, for a select set of validation photoquadrats, the benthic composition output from the machine learning annotation (modelled data), with the equivalent manual annotations (reference data). Using the confusion matrix we calculated the overall accuracy and the individual benthic label user and producer accuracy following a well-documented method3. All cameras demonstrated an overall accuracy of between 74% and 82% (Table 3;3). To provide a validation data set, ~2.5% of photoquadrats were manually annotated in an identical manner to the training data (36,950 annotated points; see Methods Section).
Roelfsema, C. M., Kovacs, E. M. & Phinn, S. R. Field data sets for seagrass biophysical properties for the Eastern Banks, Moreton Bay, Australia, 2004–2014. Scientific Data 2, 150040, https://doi.org/10.1038/sdata.2015.40 (2015).
Lyons, M. et al. Mapping the world’s coral reefs using a global multiscale earth observation framework. Remote Sensing in Ecology and Conservation 6, https://doi.org/10.1002/rse2.157 (2020).
Congalton, R. G. & Green, K. Assessing the accuracy of remotely sensed data: Principles and practices. Mapping Science. 2 edn, 200 (CRC Press, 2008).
Andréfouët, S. et al. Multi-site evaluation of IKONOS data for classification of tropical coral reef environments. Remote Sensing of Environment 88, 128–143, https://doi.org/10.1016/j.rse.2003.04.005 (2003).
González-Rivero, M. et al. Scaling up Ecological Measurements of Coral Reefs Using Semi-Automated Field Image Collection and Analysis. Remote Sensing 8, 30, https://doi.org/10.3390/rs8010030 (2016).
Hedley, J., Roelfsema, C. M. & Phinn, S. R. Supplement to: Hedley, J. et al. (2009): Efficient radiative transfer model inversion for remote sensing applications. Remote Sensing of Environment 113(11), 2527–2532, https://doi.org/10.1016/j.rse.2009.07.008 (2012).
Joyce, K., Phinn, S., Roelfsema, C., Neil, D. & Dennison, W. Combining Landsat ETM+ and Reef Check classifications for mapping coral reefs: a critical assessment from the southern Great Barrier Reef, Australia. Coral Reefs 23, 21–25, https://doi.org/10.1007/s00338-003-0357-7 (2004).
Joyce, K. E., Phinn, S. R. & Roelfsema, C. M. Live coral cover index testing and application with hyperspectral airborne image data. Remote Sensing 5, 6116–6137, https://doi.org/10.3390/rs5116116 (2013).
Ortiz, J. C. et al. The effect of wave exposure and competition for space on the community composition of coral reefs. Coral Reefs (in press).
Phinn, S. R., Roelfsema, C. M. & Mumby, P. J. Multi-scale image segmentation for mapping coral reef geomorphic and benthic community zone. International Journal of Remote Sensing 33, 3768–3797, https://doi.org/10.1080/01431161.2011.633122 (2012).
Roelfsema, C. & Phinn, S. Integrating field data with high spatial resolution multispectral satellite imagery for calibration and validation of coral reef benthic community maps. J Appl Remote Sens 4, 043527-043527–043528, https://doi.org/10.1117/1.3430107 (2010).
Purkis, S. & Roelfsema, C. M. In Remote Sensing of Wetlands: Applications and Advances (pp. (eds Tiner, R. W., Lang, M. W. & Klemas, V. V.) 223–242 (CRC Press, 2015).
Borrego-Acevedo, R., Roelfsema, C. M., Phinn, S. R. & Grinham, A. In Supplement to: Borrego-Acevedo, R. et al. (2014): Predicting distribution of microphytobenthos abundance on a reef platform by combining in-situ underwater spectrometry and pigment analysis. Limnology and Oceanography, 5(5), 461–470, https://doi.org/10.1080/2150704X.2014.922723 (2013).
Phinn, S. R., Roelfsema, C. M. & Mumby, P. J. In Supplement to: Phinn, S. R. et al. (2012): Multi-scale, object-based image analysis for mapping geomorphic and ecological zones on coral reefs. International Journal of Remote Sensing, 33(12), 3768–3797, https://doi.org/10.1080/01431161.2011.633122 (2012).
Hughes, T. P. Catastrophes, Phase Shifts, and Large-Scale Degradation of a Caribbean Coral Reef. Science 265, 1547–1551, https://doi.org/10.1126/science.265.5178.1547 (1994).
De’ath, G., Fabricius, K. E., Sweatman, H. & Puotinen, M. The 27–year decline of coral cover on the Great Barrier Reef and its causes. Proceedings of the National Academy of Sciences 109, 17995–17999, https://doi.org/10.1073/pnas.1208909109 (2012).
Obura, D. O. et al. Coral Reef Monitoring, Reef Assessment Technologies, and Ecosystem-Based Management. Frontiers in Marine Science 6, https://doi.org/10.3389/fmars.2019.00580 (2019).
Flower, J. et al. Interpreting coral reef monitoring data: A guide for improved management decisions. Ecological Indicators 72, 848–869, https://doi.org/10.1016/j.ecolind.2016.09.003 (2017).
Connell, J. H. Disturbance and recovery of coral assemblages. Coral Reefs 16, S101–S113, https://doi.org/10.1007/s003380050246 (1997).
Roelfsema, C. M., Kovacs, E., Stetner, D. & Phinn, S. R. Georeferenced benthic photoquadrats captured annually from 2002-2017, distributed over Heron Reef flat and slope areas. PANGAEA https://doi.org/10.1594/PANGAEA.894801 (2018).
González-Rivero, M. et al. The Catlin Seaview Survey – kilometre-scale seascape assessment, and monitoring of coral reef ecosystems. Aquatic Conservation: Marine and Freshwater Ecosystems 24, 184–198, https://doi.org/10.1002/aqc.2505 (2014).
González-Rivero, M. et al. Monitoring of Coral Reefs Using Artificial Intelligence: A Feasible and Cost-Effective Approach. Remote Sensing 12, https://doi.org/10.3390/rs12030489 (2020).
Funding provided by: University of Queensland; CSIRO; Cooperative Research Centre Coastal Zone, Estuaries and Waterways Management; ARC Linkage Grant to Prof. S Phinn; and World Bank Global Environment Facility Coral Reef Remote Sensing, ARC linkage innovative Coral Reef Monitoring. Fieldwork support was provided by: Coral and Reef Check Volunteers, Staff and students at University of Queensland, Heron Island Research Station. Field assistance: Rodney Borrego, Ian Leiper, Douglas Stetner, Josh Passenger, Megan Saunders, Robert Canto, Peran Bray, Emma Kennedy.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Roelfsema, C., Kovacs, E.M., Markey, K. et al. Benthic and coral reef community field data for Heron Reef, Southern Great Barrier Reef, Australia, 2002–2018. Sci Data 8, 84 (2021). https://doi.org/10.1038/s41597-021-00871-5