Benthic and coral reef community field data for Heron Reef, Southern Great Barrier Reef, Australia, 2002–2018


This paper describes point-based field data sets of benthic coral reef community composition, derived from georeferenced photoquadrats using machine learning. Annually over a 17-year period (2002–2018), data were collected using downward-looking photoquadrats that capture an approximately 1 m² footprint along 100 m–1500 m transect surveys distributed along the reef slope and across the reef flat of Heron Reef (28 km²), Southern Great Barrier Reef, Australia. Benthic community composition for the photoquadrats was automatically interpreted through deep learning, following initial manual calibration of the algorithm. The resulting data sets support understanding of coral reef biology, ecology, mapping and dynamics. Similar methods to derive benthic data have been published for seagrass habitats; here we have adapted the methods for application to coral reef habitats, with the integration of automatic photoquadrat analysis. The approach presented is globally applicable to various submerged benthic community ecological applications, and provides the basis for further studies at this site, regional to global comparative studies, and the design of similar monitoring programs elsewhere.

Measurement(s): marine benthic feature
Technology Type(s): photoquadrat transect surveys
Factor Type(s): benthic composition
Sample Characteristic - Organism: benthic communities
Sample Characteristic - Environment: coral reef • marine coral reef flat zone • marine coral reef crest • marine coral reef back reef • marine coral reef fore reef
Sample Characteristic - Location: Heron Island Reef, 23–052

Machine-accessible metadata file describing the reported data:

Background & Summary

This study describes a unique point-based data set for coral reef environments, collected using a photoquadrat survey method published for seagrass environments [1]. The data set describes the spatial and temporal distribution of benthic community abundance and composition for Heron Reef, a 28 km² shallow platform reef located in the Capricorn Bunker Group, Southern Great Barrier Reef (GBR), Australia. On average, 3,600 coral reef data points were collected annually over the period 2002 to 2018. Annual data sets were acquired for independent research projects, but the collection methods were consistent. The initial field campaign was designed to acquire detailed data describing the spatial distribution and variability of benthic composition across the study site, to assist with calibration and validation of earth observation-based mapping products.

To create a map based on earth observation imagery, it is common to use training or calibration data to transform the imagery into a map of surface properties using a supervised algorithm (e.g. multivariate statistical clustering, random forest) [2]. To report the accuracy of the maps, reference or validation data are compared with the output maps [3]. Hence, for calibration and validation purposes, georeferenced field data must be representative of all the features to be mapped, and collection should ideally coincide with satellite image acquisition. Many earth observation approaches have been implemented for mapping the benthic communities of Heron Reef [4–12], and several of these maps are now accessible online [6,13,14].

Several studies have utilised time series benthic data to analyse trends in benthic community and coral type, supporting broad ecological knowledge of coral reef ecosystems, such as Caribbean reef degradation [15] and coral cover decline on the GBR [16]. Similarly, benthic community and coral cover data sets have been identified as important indicators of coral reef health, providing the backbone for monitoring and management initiatives around the world [17,18].

Articles and data sets have been published that describe the benthic community properties of Heron Reef; however, their spatial coverage, number of georeferenced data points, and revisit times are limited [19]. The time series photoquadrat data sets presented in this paper could be used to further understand benthic community distribution, including statistical analysis of trends in coral cover and analysis of changes in benthic community and coral type, or to test other earth observation-based mapping and modelling approaches. Additionally, as our methodology describes machine annotation of the field photoquadrats, it would be possible to reanalyse the photoquadrats with new categories not previously considered important from a biological perspective (e.g. unknown disease or impact, or a specific benthic community type), or for other features (e.g. the counting of sea cucumbers (Holothuroidea sp.)).

Detailed analyses of our complete data set may permit a greater understanding of the persistence and/or dynamics of the benthic community at Heron Reef. Our ongoing analyses therefore include evaluation of changes in community composition following major impacts such as cyclones, coral bleaching and crown of thorns predation, as well as statistical analyses of coral recovery after such impacts. In this respect, these benthic community data sets are invaluable.


Methods

The photoquadrat-based data in this study were collected for Heron Reef, Southern Great Barrier Reef, Australia (Fig. 1). Here we provide a short overview of the collection methods; a detailed description can be found in ref. [11]. These methods are applicable to any habitat. Photoquadrats were analysed for substrate and/or benthic community types known to be present on the reef (Fig. 1). The benthic community classes included in the analysis are shown in Table 1.

Fig. 1

Heron Reef, southern Great Barrier Reef, Australia. (a) Location of photoquadrat transect surveys on Heron Reef collected over a period of 17 years, (b) example of the individual photoquadrat locations along the transect survey where each individual point represents a photoquadrat, and (c) conceptualisation of snorkeler-based georeferenced photoquadrat transect surveys.

Table 1 Benthic community and coral type descriptions and their class codes used for photoquadrat annotation.

Georeferenced photoquadrat data collection

Detailed information on benthic community composition was gathered at Heron Reef on the reef flat (0–2 m depth) and at the 5 m contour on the reef slope using a repeatable, fine spatial scale (sampling every 2–4 m) technique for surveying benthic cover [11]. The technique required a snorkeler or diver to manually capture georeferenced photoquadrats along defined transect surveys using a standard digital camera in a waterproof housing (e.g. Sony Cyber-shot, Canon AA540, Lumix, or Olympus T4). A plumb-line attached to the camera ensured that the footprint of each photoquadrat approximated 1 m² of the benthos.

From 2002 to 2004, a 100 m transect tape was deployed at each defined survey start site at a maximum depth of 3 m, or on scuba at 5 m depth. From 2005 onwards, instead of deploying a tape, the surveyor towed a standard handheld GPS (e.g. Garmin eTrex, Garmin 72) at the surface in a waterproof bag for all surveys. This enabled accurate registration of the location at which each photoquadrat was acquired: locations were subsequently assigned by synchronising the timestamp of each photograph with the track log from the towed GPS. Once this method was established, transect survey lengths were extended to distances of 500 m–1500 m. The start and end point of each transect was defined by GPS waypoints, permitting accurate revisits in subsequent years. The distance between successive photoquadrats was estimated by the surveyor’s kick cycle and was therefore approximate. However, this was not considered a problem, as the exact location of each photograph was known through the GPS synchronisation.
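The time synchronisation described above can be illustrated with a minimal sketch, assuming the track log is available as a time-sorted list of (time, latitude, longitude) fixes and that the position between two fixes can be linearly interpolated. The function name `locate_photo`, the `camera_offset` parameter and the example coordinates are hypothetical and are not taken from the published workflow.

```python
from bisect import bisect_left
from datetime import datetime, timedelta, timezone


def locate_photo(photo_time, track, camera_offset=timedelta(0)):
    """Interpolate a (lat, lon) for one photo from a time-sorted GPS track.

    track is a list of (datetime, lat, lon) tuples. camera_offset corrects
    a known difference between the camera clock and GPS time (assumption).
    """
    t = photo_time + camera_offset
    times = [fix[0] for fix in track]
    if not times or t < times[0] or t > times[-1]:
        return None  # photo taken outside the logged track
    i = bisect_left(times, t)
    if times[i] == t:
        return track[i][1], track[i][2]
    (t0, lat0, lon0), (t1, lat1, lon1) = track[i - 1], track[i]
    w = (t - t0) / (t1 - t0)  # fraction of the way between the two fixes
    return lat0 + w * (lat1 - lat0), lon0 + w * (lon1 - lon0)


# Hypothetical track fixes and photo time, for illustration only
start = datetime(2010, 11, 1, 0, 0, 0, tzinfo=timezone.utc)
track = [
    (start, -23.4400, 151.9100),
    (start + timedelta(seconds=10), -23.4400, 151.9102),
]
position = locate_photo(start + timedelta(seconds=5), track)
```

Linear interpolation is only one possible choice; snapping each photograph to the nearest fix in time would be an equally simple alternative when the track log interval is short.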

All surveys were performed during the day; sunlight conditions and sun angle can be derived from the timestamp of each photoquadrat and its corresponding GPS location. Reef flat surveys were conducted at high tide to provide sufficient water depth for the snorkeler to safely traverse the reef. Reef slope surveys were conducted at low tide. No water quality information was recorded.
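As an illustration of how a sun angle might be derived from a photoquadrat timestamp and GPS position, the sketch below uses a common textbook approximation: a cosine model of solar declination combined with local mean solar time. It ignores the equation of time and atmospheric refraction, so results may be off by a few degrees. The function name and the example coordinates (an approximate position for Heron Reef) are assumptions, not part of the published method.

```python
import math
from datetime import datetime, timezone


def solar_elevation_deg(when_utc, lat_deg, lon_deg):
    """Approximate solar elevation in degrees for a UTC time and position."""
    day_of_year = when_utc.timetuple().tm_yday
    # Simple cosine approximation of solar declination
    declination = math.radians(-23.44) * math.cos(
        2 * math.pi * (day_of_year + 10) / 365
    )
    utc_hours = when_utc.hour + when_utc.minute / 60 + when_utc.second / 3600
    solar_hours = (utc_hours + lon_deg / 15) % 24  # local mean solar time
    hour_angle = math.radians(15 * (solar_hours - 12))
    lat = math.radians(lat_deg)
    sin_elev = (
        math.sin(lat) * math.sin(declination)
        + math.cos(lat) * math.cos(declination) * math.cos(hour_angle)
    )
    sin_elev = max(-1.0, min(1.0, sin_elev))  # guard against rounding error
    return math.degrees(math.asin(sin_elev))


# Approximate Heron Reef position (assumption), near local solar noon
elevation = solar_elevation_deg(
    datetime(2010, 12, 21, 1, 52, tzinfo=timezone.utc), -23.44, 151.91
)
```

For analyses that are sensitive to illumination geometry, a dedicated solar position library would be a more accurate choice than this approximation.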

The locations of the transect surveys were chosen to ensure they traversed gradients or edge features, so that any change in benthic cover over these features could be detected. This was done initially through visual assessment of existing satellite imagery in combination with expert knowledge of the study area. The aim was to produce data that provided an adequate representation of the variation in benthic community cover across Heron Reef. Few transect surveys were located within the deep lagoonal area of the reef, because tidal range restricts boat access to this area and permits only short working times in the lagoon. Transect surveys were revisited in subsequent years, and additional transect surveys were included on subsequent trips based on increased knowledge of the environment. The benthic data sets and photoquadrat images are available at ref. [20].

Automated photoquadrat analysis for benthic community composition

Percentage cover of the benthic communities for each photoquadrat was determined through a machine-learning (ML) approach which assessed benthic community composition. A previously devised category scheme consisting of 63 class codes that differentiated all major GBR-specific coral morphologies and other bottom types was used [21]. Following machine annotation, the class codes were collapsed first into broad groups and subsequently into six simplified groups for validation purposes (Table 1).
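A minimal sketch of how per-point annotations could be collapsed into groups and converted to percentage cover for one photoquadrat is given below. The class codes and group names in `CODE_TO_GROUP` are hypothetical placeholders; the actual scheme is the one listed in Table 1.

```python
from collections import Counter

# Hypothetical class codes and groups; the real scheme is given in Table 1
CODE_TO_GROUP = {
    "ACR_BRA": "coral",
    "POR_MAS": "coral",
    "TURF": "algae",
    "SAND": "sand",
}


def percent_cover(point_codes, code_to_group=CODE_TO_GROUP):
    """Return {group: percentage of annotation points} for one photoquadrat."""
    groups = Counter(code_to_group.get(code, "other") for code in point_codes)
    total = sum(groups.values())
    if total == 0:
        return {}  # no annotation points supplied
    return {group: 100 * n / total for group, n in groups.items()}


# Fifty illustrative point labels for a single photoquadrat
points = ["ACR_BRA"] * 20 + ["POR_MAS"] * 5 + ["TURF"] * 15 + ["SAND"] * 10
cover = percent_cover(points)
```

Keeping the code-to-group mapping as data rather than logic means the same point annotations can be re-collapsed into a different grouping without re-annotating the images.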

Initial training of the ML platform was achieved via manual annotation of approximately 5% of the total number of photoquadrats (equivalent to 108,700 annotated points; based on ref. [21]), to achieve a machine annotation accuracy of >70% as determined by the classifier [21]. A unique source was created for each camera used. To give a default and uniform image annotation area, boundaries of 5% were used for the top and left sides of the photoquadrat, whilst a boundary of 95% was used for the right and bottom sides of the photoquadrat. Fifty annotation points were generated randomly over the entire annotation area of each photoquadrat. For manual annotation of photoquadrat sets, the level of confidence was set to 100%. A further approximately 2.5% of photoquadrats were manually annotated in an identical manner to provide a validation data set with which to calculate the accuracy of the machine annotation. Automated annotation of the remaining 92.5% of the photoquadrats was achieved subsequently [22].
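The annotation area and random point placement described above can be sketched as follows, assuming the 5% and 95% boundaries are fractions of the image width and height and that points are drawn uniformly at random. The function name, the image dimensions and the seed are hypothetical; the annotation platform used by the authors may place points differently.

```python
import random


def annotation_points(width, height, n_points=50, margin=0.05, seed=None):
    """Draw random (x, y) pixel positions inside the central annotation area.

    The area runs from margin to (1 - margin) of each image dimension,
    i.e. from the 5% to the 95% boundary with the default margin.
    """
    rng = random.Random(seed)
    x_min, x_max = margin * width, (1 - margin) * width
    y_min, y_max = margin * height, (1 - margin) * height
    return [
        (rng.uniform(x_min, x_max), rng.uniform(y_min, y_max))
        for _ in range(n_points)
    ]


# Illustrative 4000 x 3000 pixel image; the seed makes the draw repeatable
pts = annotation_points(4000, 3000, seed=1)
```

Excluding the outer 5% of the frame avoids placing points on the image edges, where lens distortion and vignetting tend to be strongest.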

Data Records

Detailed information regarding the output benthic cover percentages and the number of benthic photoquadrats acquired for each field campaign is documented in Table 2. The benthic data sets and photoquadrat images are available at ref. [20], with the photoquadrats and benthic cover analysis for individual survey years accessible online through the campaign-specific DOIs listed in the table, from where the data can be downloaded directly.

Table 2 Overview of the data files that represent the 58,941 georeferenced photoquadrats captured during the field campaigns, in addition to links to the percentage benthic cover data sets generated via machine learning for each year.

Technical Validation

To understand the validation technique applied to these data sets, it is important to reiterate the purpose of collecting the data, which was to provide a fast field method for gathering benthic community information over a large spatial extent whilst accurately representing variability. Validation of the data set was conducted on two levels: standardisation of the photoquadrat capture method and conditions, and a quantitative accuracy assessment.

Standardisation of photoquadrat image capture

To standardise photoquadrat image capture, the camera and lens setup was calibrated prior to each annual survey, so as to capture a footprint that covered the same extent of the benthos. This was accomplished by attaching a plumb-line to the camera system such that when it touched the bottom, the captured photoquadrats represented ~1 m² of the benthos. To set the plumb-line length, the camera was moved vertically over a marked 1 m² area until the field of view enveloped the area, and the plumb-line was then fixed at that length. During the survey, the operator used the plumb-line to determine the camera height above the substrate. When the camera was held vertically with the weight touching the substrate, this permitted reproducible capture of photoquadrats that covered the same area for all surveys. Light conditions were generally the same for each expedition, with the data collected over a consecutive 4–5 day period of stable weather, water clarity and tidal range. Ideally, light conditions would have been standardised using a strobe; however, this would have slowed the transect surveys.

Quantitative accuracy assessment

To determine the accuracy of the machine annotation, we constructed a confusion matrix that compared, for a select set of validation photoquadrats, the benthic composition output by the machine learning annotation (modelled data) with the equivalent manual annotations (reference data). Using the confusion matrix, we calculated the overall accuracy and the user's and producer's accuracies for each benthic label, following a well-documented method [3]. All cameras demonstrated an overall accuracy of between 74% and 82% (Table 3) [3]. To provide the validation data set, ~2.5% of photoquadrats were manually annotated in an identical manner to the training data (36,950 annotated points; see Methods section).
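The accuracy measures named above can be sketched from paired point labels as follows. Overall accuracy is the share of points where the machine and manual labels agree; user's accuracy divides the agreeing points by all points the machine assigned to a label; producer's accuracy divides them by all points manually assigned to that label. The function name and the example labels are hypothetical and do not reproduce the values in Table 3.

```python
def accuracy_measures(reference, predicted):
    """Return (overall, user's, producer's) accuracy from paired point labels."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("reference and predicted must be equal-length and non-empty")
    labels = sorted(set(reference) | set(predicted))
    # matrix[p][r] counts points machine-labelled p whose manual label is r
    matrix = {p: {r: 0 for r in labels} for p in labels}
    for r, p in zip(reference, predicted):
        matrix[p][r] += 1
    overall = sum(matrix[l][l] for l in labels) / len(reference)
    users, producers = {}, {}
    for l in labels:
        predicted_total = sum(matrix[l].values())
        reference_total = sum(matrix[p][l] for p in labels)
        users[l] = matrix[l][l] / predicted_total if predicted_total else None
        producers[l] = matrix[l][l] / reference_total if reference_total else None
    return overall, users, producers


# Illustrative labels for 20 validation points (not real data)
reference = ["coral"] * 8 + ["algae"] * 6 + ["sand"] * 6
predicted = ["coral"] * 6 + ["algae"] * 2 + ["algae"] * 5 + ["sand"] * 1 + ["sand"] * 6
overall, users, producers = accuracy_measures(reference, predicted)
```

In this illustration, coral has a user's accuracy of 1.0 but a producer's accuracy of 0.75: every point the machine called coral was coral, yet a quarter of the manually labelled coral points were missed.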

Table 3 Quantitative assessment of the machine annotation via construction of a confusion matrix.


References

1. Roelfsema, C. M., Kovacs, E. M. & Phinn, S. R. Field data sets for seagrass biophysical properties for the Eastern Banks, Moreton Bay, Australia, 2004–2014. Scientific Data 2, 150040 (2015).

2. Lyons, M. et al. Mapping the world’s coral reefs using a global multiscale earth observation framework. Remote Sensing in Ecology and Conservation 6 (2020).

3. Congalton, R. G. & Green, K. Assessing the accuracy of remotely sensed data: Principles and practices. Mapping Science. 2 edn, 200 (CRC Press, 2008).

4. Andréfouët, S. et al. Multi-site evaluation of IKONOS data for classification of tropical coral reef environments. Remote Sensing of Environment 88, 128–143 (2003).

5. González-Rivero, M. et al. Scaling up Ecological Measurements of Coral Reefs Using Semi-Automated Field Image Collection and Analysis. Remote Sensing 8, 30 (2016).

6. Hedley, J., Roelfsema, C. M. & Phinn, S. R. Supplement to: Hedley, J. et al. (2009): Efficient radiative transfer model inversion for remote sensing applications. Remote Sensing of Environment 113(11), 2527–2532 (2012).

7. Joyce, K., Phinn, S., Roelfsema, C., Neil, D. & Dennison, W. Combining Landsat ETM+ and Reef Check classifications for mapping coral reefs: a critical assessment from the southern Great Barrier Reef, Australia. Coral Reefs 23, 21–25 (2004).

8. Joyce, K. E., Phinn, S. R. & Roelfsema, C. M. Live coral cover index testing and application with hyperspectral airborne image data. Remote Sensing 5, 6116–6137 (2013).

9. Ortiz, J. C. et al. The effect of wave exposure and competition for space on the community composition of coral reefs. Coral Reefs (in press).

10. Phinn, S. R., Roelfsema, C. M. & Mumby, P. J. Multi-scale image segmentation for mapping coral reef geomorphic and benthic community zone. International Journal of Remote Sensing 33, 3768–3797 (2012).

11. Roelfsema, C. & Phinn, S. Integrating field data with high spatial resolution multispectral satellite imagery for calibration and validation of coral reef benthic community maps. J Appl Remote Sens 4, 043527-043527–043528 (2010).

12. Purkis, S. & Roelfsema, C. M. In Remote Sensing of Wetlands: Applications and Advances (eds Tiner, R. W., Lang, M. W. & Klemas, V. V.) 223–242 (CRC Press, 2015).

13. Borrego-Acevedo, R., Roelfsema, C. M., Phinn, S. R. & Grinham, A. In Supplement to: Borrego-Acevedo, R. et al. (2014): Predicting distribution of microphytobenthos abundance on a reef platform by combining in-situ underwater spectrometry and pigment analysis. Limnology and Oceanography, 5(5), 461–470 (2013).

14. Phinn, S. R., Roelfsema, C. M. & Mumby, P. J. In Supplement to: Phinn, S. R. et al. (2012): Multi-scale, object-based image analysis for mapping geomorphic and ecological zones on coral reefs. International Journal of Remote Sensing, 33(12), 3768–3797 (2012).

15. Hughes, T. P. Catastrophes, Phase Shifts, and Large-Scale Degradation of a Caribbean Coral Reef. Science 265, 1547–1551 (1994).

16. De’ath, G., Fabricius, K. E., Sweatman, H. & Puotinen, M. The 27–year decline of coral cover on the Great Barrier Reef and its causes. Proceedings of the National Academy of Sciences 109, 17995–17999 (2012).

17. Obura, D. O. et al. Coral Reef Monitoring, Reef Assessment Technologies, and Ecosystem-Based Management. Frontiers in Marine Science 6 (2019).

18. Flower, J. et al. Interpreting coral reef monitoring data: A guide for improved management decisions. Ecological Indicators 72, 848–869 (2017).

19. Connell, J. H. Disturbance and recovery of coral assemblages. Coral Reefs 16, S101–S113 (1997).

20. Roelfsema, C. M., Kovacs, E., Stetner, D. & Phinn, S. R. Georeferenced benthic photoquadrats captured annually from 2002-2017, distributed over Heron Reef flat and slope areas. PANGAEA (2018).

21. González-Rivero, M. et al. The Catlin Seaview Survey – kilometre-scale seascape assessment, and monitoring of coral reef ecosystems. Aquatic Conservation: Marine and Freshwater Ecosystems 24, 184–198 (2014).

22. González-Rivero, M. et al. Monitoring of Coral Reefs Using Artificial Intelligence: A Feasible and Cost-Effective Approach. Remote Sensing 12 (2020).


Funding provided by: University of Queensland; CSIRO; Cooperative Research Centre Coastal Zone, Estuaries and Waterways Management; ARC Linkage Grant to Prof. S Phinn; and World Bank Global Environment Facility Coral Reef Remote Sensing, ARC linkage innovative Coral Reef Monitoring. Fieldwork support was provided by: Coral and Reef Check Volunteers, Staff and students at University of Queensland, Heron Island Research Station. Field assistance: Rodney Borrego, Ian Leiper, Douglas Stetner, Josh Passenger, Megan Saunders, Robert Canto, Peran Bray, Emma Kennedy.

Author information




Chris M. Roelfsema, design (50%), methods (55%), collection (55%), analysis (15%), writing (30%). Eva M. Kovacs, design (25%), methods (25%), collection (30%), analysis (20%), writing (30%). Kathryn Markey, design (0%), methods (5%), collection (0%), analysis (25%), writing (4%). Julie Vercelloni, design (5%), methods (5%), collection (0%), analysis (10%), writing (10%). Alberto Rodriguez-Ramirez, design (0%), methods (0%), collection (0%), analysis (10%), writing (4%). Sebastian Lopez-Marcano, design (0%), methods (0%), collection (0%), analysis (5%), writing (5%). Manuel Gonzalez-Rivero, design (0%), methods (5%), collection (0%), analysis (5%), writing (5%). Ove Hoegh-Guldberg, design (0%), methods (0%), collection (0%), analysis (2%), writing (4%). Stuart R. Phinn, design (20%), methods (10%), collection (15%), analysis (0%), writing (5%).

Corresponding author

Correspondence to Chris Roelfsema.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit

The Creative Commons Public Domain Dedication waiver applies to the metadata files associated with this article.

About this article

Cite this article

Roelfsema, C., Kovacs, E.M., Markey, K. et al. Benthic and coral reef community field data for Heron Reef, Southern Great Barrier Reef, Australia, 2002–2018. Sci Data 8, 84 (2021).
