A contemporary baseline record of the world’s coral reefs

Rodriguez-Ramirez, Alberto; González-Rivero, Manuel; Beijbom, Oscar; Bailhache, Christophe; Bongaerts, Pim; Brown, Kristen T.; Bryant, Dominic E. P.; Dalton, Peter; Dove, Sophie; Ganase, Anjani; Kennedy, Emma V.; Kim, Catherine J. S.; Lopez-Marcano, Sebastian; Neal, Benjamin P.; Radice, Veronica Z.; Vercelloni, Julie; Beyer, Hawthorne L.; Hoegh-Guldberg, Ove

doi:10.1038/s41597-020-00698-6

Download PDF

Data Descriptor
Open access
Published: 20 October 2020

A contemporary baseline record of the world’s coral reefs

Scientific Data volume 7, Article number: 355 (2020) Cite this article

7012 Accesses
8 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Addressing the global decline of coral reefs requires effective actions from managers, policymakers and society as a whole. Coral reef scientists are therefore challenged with the task of providing prompt and relevant inputs for science-based decision-making. Here, we provide a baseline dataset, covering 1300 km of tropical coral reef habitats globally, and comprised of over one million geo-referenced, high-resolution photo-quadrats analysed using artificial intelligence to automatically estimate the proportional cover of benthic components. The dataset contains information on five major reef regions, and spans 2012–2018, including surveys before and after the 2016 global bleaching event. The taxonomic resolution attained by image analysis, as well as the spatially explicit nature of the images, allow for multi-scale spatial analyses, temporal assessments (decline and recovery), and serve for supporting image recognition developments. This standardised dataset across broad geographies offers a significant contribution towards a sound baseline for advancing our understanding of coral reef ecology and thereby taking collective and informed actions to mitigate catastrophic losses in coral reefs worldwide.

Measurement(s)	ecosystem • coral reef • composition
Technology Type(s)	automated image annotation • machine learning
Factor Type(s)	year of data collection • geographic location
Sample Characteristic - Organism	Anthozoa • Algae • Porifera
Sample Characteristic - Environment	marine coral reef biome • marine coral reef fore reef
Sample Characteristic - Location	Atlantic Ocean • Eastern Australia • Indian Ocean • Southeast Asia • Pacific Ocean • Great Barrier Reef

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.13007516

A doubling of stony coral cover on shallow forereefs at Carrie Bow Cay, Belize from 2014 to 2019

Article Open access 28 September 2021

Coral community data Heron Island Great Barrier Reef 1962–2016

Article Open access 12 October 2022

Reef Cover, a coral reef classification for global habitat mapping from remote sensing

Article Open access 02 August 2021

Background & Summary

The escalating deterioration of coral reefs over the past half-century^1,2 has imposed urgent challenges for coral reef science^3,4, especially at the spatial and temporal scales⁵ required to support science-based management and conservation at global scales^6,7. Novel research and management approaches that consider current ecological and socio-cultural paradigms of coral reefs are therefore pivotal for addressing such challenges^{4,7,8,9,10,11,12,13}. Many of these strategies, however, face major challenges in terms of data acquisition, analysis and implementation¹⁴. In addition, the reluctance to share data among scientists¹⁵ may often frustrate progress in reef science, management and conservation¹⁶. Clearly this increasing demand of scientific outputs needs to be pursued collectively as a top priority¹⁶ in order to take up the challenges of managing Anthropocene reefs effectively in a rapidly changing world^4,5,6,17.

There have been a large number of technological innovations that are revolutionising the collection and analysis of critical data^{18,19,20,21,22,23}. Yet, the lack of adequate metrics for monitoring³, the paucity of large-scale assemblage datasets⁷ and, limited availability of key long-term ecological data²⁴ have generated a lag in the task of incorporating scientific findings into management and policy actions. Large-scale and standardised field data have been recognised as essential information for tackling the scale of problems and potential solutions^3,6,7,25,26. Worldwide data collection and analysis have constrained our ability to detect changes at global scales^27,28, partly due to our limited understanding of future reef responses and management options at global-scales. Regional and sub-regional analyses have provided the most significant insights of contemporary coral reef dynamics and the implicated drivers. These studies, however, had to compile, filter and aggregate multiple small-scale data sets based on different methods (e.g. transects lines, chain, photo-quadrats)^{29,30,31,32,33}. These meta-analyses are indispensable, but the time-intensive approach can be minimised in the future.

Here, we present a standardised and contemporary archive of baseline images and data³⁴ of the world’s tropical coral reefs between 2012 to 2018. The dataset overcomes limitations and satisfies requirements of some aforementioned data needs in terms of scale, extent, and standardisation for advancing reef science and management. The imagery and ecological data were produced within the framework of the XL Catlin Seaview Survey Project³⁵. The dataset includes over one million standardised and geo-referenced photo-quadrat images, each one covering approximately 1 m² of substrate. These images were taken at 860 reef locations within the Western Atlantic (Caribbean, Lucayan Archipelago, and Bermuda), the Central Indian Ocean (Maldives and Chagos Archipelagos), the Central Pacific Ocean (Hawaii), the Coral Triangle and Southeast Asia (Indonesia, Philippines, Timor-Leste, Solomon Islands, and Taiwan), and Eastern Australia (Great Barrier Reef). The locations were surveyed using an underwater propulsion vehicle customised with high-definition cameras along 1.5 to 2.0 km transects at a standard depth of 10 m (±2 m), covering unparalleled areas of coral reefs (over 1300 km of fore-reef slopes). Multi-temporal images exist for some reef locations before and after the 2016 global mass bleaching event in the Great Barrier Reef, Indonesia, Maldives, Philippines, Hawaii and Taiwan.

The dataset provides estimates of the areal extent of main benthic groups (e.g., “hard coral”, “algae”, etc.) at each reef location and for more specific categories for each photo-quadrat. These estimations were obtained from the application of an automated image annotation method (Deep Learning Convolutional Neural Networks)²³. Validation of this approach showed an unbiased 97% agreement between human and automated annotations, with errors of the automated estimations ranging between <2% and 7%²³.

The dataset offers exceptional opportunities for analysis across multiple spatial scales from local to global, enabling the potential detection of change while providing a baseline by which to investigate the contemporary decline and recovery of coral reefs, and addressing a range of broad and specific ecological questions. Furthermore, the dataset offers human-classified annotations and images that can be used to train and validate automated image classifiers. The dataset and imagery have contributed to a substantial number of publications^{23,25,35,36,37,38,39}. The ultimate goal of this record is to expedite scientific investigation and evidence-based management. Given the remoteness of some surveyed locations, broad geographic representation as well as detailed spatial and taxonomic resolution, the data provide a unique resource to further advance our understanding of ecological responses across a range of changing pressures and build a stronger base knowledge that supports conservation actions.

Methods

A comprehensive description of the methodological aspects used during the field surveys and image analysis have been published in González-Rivero et al.^23,25,35. Therefore, here we include a synopsis of how this dataset was generated and made available to the wider community.

Our approach involved the rapid acquisition of high-resolution imagery over large extent of reefs and efficient image analysis to provide key information about the state of coral reef benthic habitat across multiple spatial scales²³. The data generation and processing involved three main components: (1) photographic surveys, (2) post-processing of images and (3) image analysis, which are described and summarised below in Fig. 1.

Photographic surveys

An underwater propulsion vehicle customised with a camera system (“SVII”, Supplementary Fig. 1), consisting of three synchronised DSLR (Digital Single-Lens Reflex) cameras (Cannon 5D-MkII cameras and Nikon Fisheye Nikkor lens with 10.5 mm focal length), was used to survey the fore-reef (reef slope) habitats from five major coral reef regions: Central Pacific Ocean, Western Atlantic Ocean, Central Indian Ocean, Southeast Asia and Eastern Australia in 23 countries or territories (Table 1, Supplementary Fig. 2). Within each region, multiple reef locations were surveyed aiming to capture the variability and status of fore-reefs environments across regions and within each region. Sampling design varied according to particular environmental and socioeconomic factors potentially influencing the distribution and structure of coral reef assemblages at each region and/or country. Overall, prior to field expeditions, reef localities were selected considering factors such as wave exposure, reef zones (i.e. fore-reefs), local anthropogenic stressors (e.g. coastal development), fishing pressures, levels of management (e.g. marine park, protected areas), and presence of monitoring sites.

Table 1 Summary of the photographic surveys conducted between 2012 and 2018.

Full size table

Underwater images were collected in each reef location once every three seconds, approximately every 2 m apart, following a transect along the seascape at a standard depth of 10 m (±2 m). Although overlap between consecutive images is possible, the process for extracting standardised photo-quadrats from an image ensures that the photo-quadrats are non-overlapping between and within images (see further details next section). Each transect averaged 1.8 km in length, hereafter referred to as a “survey”. See Supplementary Fig. 3 for an explanation of the hierarchical structure of the photographic surveys. No artificial illumination was used during image capture, but light exposure was manually adjusted by modifying the ISO during the dive, using an on-board tablet computer encased in an underwater housing (Supplementary Fig. 1). This computer enabled the diver to control camera settings (exposure and shutter speed) according to light conditions. Images were geo-referenced using a surface GPS unit tethered to the diver (Supplementary Fig. 1). Altitude and depth of the camera relative to the reef substrate and surface were logged at half-second intervals using a Micron Tritech transponder (altitude, Supplementary Fig. 1) and pressure sensor (depth) in order to select the imagery within a particular depth and to scale and crop the images during the post-processing stage. Further details about the photographic surveys are provided in González-Rivero et al.^25,35.

Post-processing of images for manual and automated annotation

The post-processing pipeline produced images with features required for manual and automated annotation in terms of size and appearance. The process involved several steps that transformed the raw images from the downward facing camera into photo-quadrats of 1 m², hereafter referred to as a “quadrat” (Fig. 1). As imagery was collected without artificial light using a fisheye lens, each image was processed prior to annotation in order to balance colour and to correct the non-linear distortion introduced by the fisheye lens²³ (Fig. 1). Initially, colour balance and lens distortion correction were manually applied on the raw images using Photoshop (Adobe Systems, California, USA). Later, in order to optimise the manual post-processing time of thousands of images, an automatic batch processing was conducted on compressed images²³ (jpeg format) using Photoshop and ImageMagick, the latter an open-source software for image processing (https://imagemagick.org/index.php). In addition, using the geometry of the lens and altitude values, images were cropped to a standardised area of approximately 1 m² of substrate^23,35 (Fig. 1). Thus, the number of nonoverlapping quadrats extracted from one single raw image varied depending on the distance between the camera and the reef surface. Figure 1 illustrates a situation where the altitude of the camera allowed for the extraction of two quadrats from one raw image. Further details about colour balance and lens distortion correction and cropping are provided in González-Rivero et al.^23,35.

Image analysis: manual and automated annotation for estimating covers of benthic categories

Manual annotation of the benthic components by a human expert took at least 10 minutes per quadrat, creating a bottleneck between image post-processing and the required data-product. To address this issue, we developed an automated image analysis to identify and estimate the relative abundance of benthic components such as particular types of corals, algae, and other organisms as well as non-living components. To do this, automated image annotation based on deep learning methods (Deep Learning Convolutional Neural Networks)²³ were applied to automatically identify benthic categories from images based on training using human annotators (manual annotation). The process for implementing a Convolutional Neural Network (hereafter “network”) and classify coral reef images implied three main stages: (i) label-set (benthic categories) definition, (ii) training and fine-tuning of the network, and (iii) automated image annotation and data processing.

Label-set definition

As a part of the manual and automated annotation processes to extract benthic cover estimates, label-sets of benthic categories were established based on their functional relevance to coral reef ecosystems and their features to be reliably identified from images by human annotators²⁵. The labels were derived, modified and/or simplified from existing classification schemes^40,41, and were grouped according to the main benthic groups of coral reefs including hard coral, soft coral, other invertebrates, algae, and other. Since coral reef assemblages vary in species composition at global and regional scales, and surveys were conducted at different times between 2012 and 2018 across the regions, nine label-sets accounted for such biogeographical and temporal disparity. In general, a label-set was developed after each main survey expedition to a specific region. The label-sets varied in complexity (from 23 to 61 labels), considering the differential capacity to visually recognise (in photographs) corals to the lowest possible taxon between the regions. While label-sets for the Atlantic and Central Pacific (Hawaii) included categories with coral genus and species, for the Indian Ocean (Maldives, Chagos Archipelago), Southeast Asia (Indonesia, Philippines, Timor-Leste, Solomon Islands, and Taiwan), and Eastern Australia, corals comprised labels based on a combination of taxonomy (e.g., family and genus) and colony morphology (e.g., branching, massive, encrusting, foliose, tabular).

The other main benthic groups were generally characterised by labels reflecting morphology and/or functional groups across the regions. “Soft Corals” were classified into three groups: 1) Alcyoniidae (soft corals), the dominant genera; 2) Sea fans and plumes from the family Gorgoniidae; and 3) Other soft corals. “Algae” groups were categorised according to their functional relevance: 1) Crustose coralline algae; 2) Macroalgae; and 3) Epilithic Algal Matrix. The latter is a multi-specific algal assemblage smothering the reef surface of up to 1 cm in height (dominated by algal turfs). “Other Invertebrates” consisted of labels to classify sessile invertebrates different to soft corals (e.g., Millepora, bryozoans, clams, tunicates, soft hexacorrallia, hydroids) and some mobile invertebrates observed in the images (mostly echinoderms). The remaining group, “Other”, consisted of sand, sediments, and occasional organisms or objects detected in the images such as fish, human debris (e.g., plastic, rope, etc.), and transect hardware. The exception within these main groups were the “Sponges”, which were classified and represented by multiple labels only in the Atlantic (given their abundance and diversity in the Caribbean), including categories with sponge genus and species, and major growth forms (rope, tube, encrusting, massive).

Training and fine-tuning of the network

The deep learning approach used relies on a convolutional neural network architecture named VGG-D 16⁴². Details on the initialisation and utilisation of this network are provided in González-Rivero et al.²³. A total of nine networks were used, one for each country within the regions, except for the Western Atlantic Ocean, where the network was trained using data from several countries, and the Philippines and Indonesia, where the network was trained using data from those two countries. (Table 2). The first step in implementing a network was to randomly select a subset of images from the whole regional set to be classified, which were then divided into training and testing sets (Fig. 1). Human experts manually annotated both sets using the corresponding label-set under CoralNet⁴³, an online platform designed for image analysis of coral reef related materials (https://coralnet.ucsd.edu/). The number of images and points manually annotated per network is presented in Table 2 (generally 100 points per image for training sets and 40 or 50 points per image for testing sets).

Table 2 Summary of the images, manual point annotations, and test transects used during the train and test processes of each network.

Full size table

Each training and testing data set were exported from CoralNet⁴³ and used along with the associated quadrats to support an independent training and fine-tuning process aimed to find the network configuration that produced the best outcomes. Initially, each quadrat used from the training and testing sets was converted to a set of patches cropped out around each annotation point location. The patch area to crop around each annotation point was set to 224 × 224 pixels to align with the pre-defined image input size of the VGG-D architecture. The fine-tuning exercise ran in general for 40 K iterations to establish the best combination of model parameters or weights that minimised the cross-entropy loss while the overall accuracy increased. An independent 20% subset from the original set of quadrats was used to assess the performance of the final classification (% of accuracy). In addition, parameters of learning rate and image scale were independently optimised for each network by running an experiment using different values for such parameters in order to select the values that derived the smallest errors per label. Further details of the model parametrisation for each network are provided in González-Rivero et al.²³ (see Supplementary Material).

Automated image annotation and data processing

Once optimised, a network was used to automatically annotate the corresponding set of non-annotated quadrats. The quadrats were processed through the network, where for each quadrat, 50 points (input patches) were classified using the associated labels. Upon completion of automated image annotation for a specific region/country, the annotation outputs containing locations of 50 pixels (i.e., their x and y coordinates) with their associated labels per quadrat (a csv file per quadrat) were incorporated and collated into a MySQL database along with information about the field surveys. In addition to the manual and automated annotations tables (raw data), we provide two levels of aggregation for the benthic data. First, the relative abundance (cover) for each of the benthic labels per quadrat, which was calculated as the ratio between the numbers of points classified for a given label by the total number of points evaluated in a quadrat. Second, the relative abundance for each of the main benthic groups (hard coral, soft coral, other invertebrates, algae, and other) per survey, which involved three calculations: 1) summarise the quadrat covers by image averaging all the quadrats from one single image per label, 2) summarise image covers by survey averaging all the images across one survey per label, and 3) merge survey data by main benthic groups summing the covers of all labels belonging to the same group across one survey.

Data Records

The dataset presented here has been made freely available through The University of Queensland “eSpace” repository³⁴, and released under a Creative Commons Attribution license (CC BY 3.0; https://creativecommons.org/licenses/by/3.0/deed.en_US). For attribution, we expect users to cite this paper when using the content. It includes three core components: 1) a series of relational tables (csv format) with the manual and automated annotations (raw data), 2) a series of relational tables (csv format) with benthic covers per survey and quadrat (processed data), dataset IDs, and label-set descriptions, and 3) the imagery associated with the dataset (jpeg format). We are providing cover data from more than one million of quadrats (over 55 million of associated data records) based on automated annotation, and over 859 K points (image features) classified by humans within selected quadrats through the CoralNet⁴³ web interface that can be used to implement approaches of automated image analyses.

A unique identification number (ID) was assigned to every survey (5 digits), image (9 digits), and quadrat (11 digits) which are the basis for the database relational links among tables and files. For example, in quadrat ID 17001644602, the first five digits correspond to the survey ID (17001), the survey ID and the next four digits (6446, a number automatically assigned by the camera between 0001 and 9999) to the image ID (170016446), and the image ID along with the last two digits (02, quadrat number within an image automatically assigned during the cropping process) to the quadrat ID. The core table “seasurvey_quadrat.csv” contains all the survey, image and quadrat IDs (over 1.1 million records) and is zipped within the “tabular-data.zip” file. See Supplementary Fig. 3 for an explanation of the hierarchical structure of the dataset.

Manual and automated annotations

The structure of the tables containing the data records from manual (testing and training datasets) and automated annotations is presented in Table 3. Each annotation record is identified by the quadrat ID and includes the coordinates of the pixel that has been annotated (x and y), and its corresponding label. Both types of annotation records are zipped within the “tabular-data.zip” file. While there is a unique file (“seaviewsurvey_annotations.csv”) for the automated annotations with more than 55.2 million data records, there are nine files with the manual annotations, which correspond to the regions/countries conforming the training sets (Table 2). They were named “annotations_” with the region/country code (Table 3) appended (e.g., annotations_PAC_AUS.csv).

Table 3 List of tables/files (and their structure) included in the repository.

Full size table

The table “seaviewsurvey_labelsets.csv” provides the description of the label-sets used for each region. The structure of this table is presented in Table 3, where each record corresponds to a label within a region. Refer to the Usage Notes section for instructions on accessing visual examples of the labels.

Benthic cover

The structure of the tables containing the data records from benthic covers per survey and quadrat is presented in Table 3. Both types of cover records are zipped within the “tabular-data.zip” file. Each data record in the surveys file (“seaviewsurvey_surveys.csv”) corresponds to a unique survey identified by the survey ID and contains the cover of the five main benthic groups (hard coral, soft coral, other invertebrates, algae, and other) along with metadata of the survey (e.g., ocean, country, latitude, longitude, etc.). Cover records per quadrat were grouped in five files and named “seaviewsurvey_reefcover_” with the ocean and/or region appended (e.g., seaviewsurvey_reefcover_pacificaustralia.csv) to facilitate the data retrieval by the main regions surveyed.

Imagery

The imagery is grouped under three main folders within the repository³⁴: (1) photo-quadrats, (2) annotated-images, and (3) survey-previews. The images files (jpeg format) associated with the quadrats cover records are stored in the “photo-quadrats” folder. To facilitate the image retrieval, quadrats are grouped by surveys in 860 zip files (.zip). Each zip folder survey was named with a code using the combination of four information components separated by underscores: the ocean in which the survey occurs (ATL = Atlantic Ocean, IND = Indian Ocean, PAC = Pacific Ocean), the three letter country code of the Exclusive Economic Zone within which the transect occurs (e.g., AUS = Australia), the unique five digit survey ID number, and the year and month of the survey formatted as YYYYMM (e.g., PAC_AUS_15007_201212.zip). Such naming convention is used in the “folder_name” data field within the “seaviewsurvey_surveys.csv” table.

The images files (jpeg format) associated with the manual annotations are stored in the “annotated-images” folder. There, images were grouped into nine zip folders according to the nine training sets of the manual annotations (Table 2). Therefore, folders were named using the corresponding annotations file name but excluding the initial part (e.g., for the “annotations_PAC_AUS.csv” file, the images folder was named as “PAC_AUS.zip”).

Low-resolution previews of all photo-quadrats per each survey (tiled into one jpeg image per survey) were zipped into archives grouped by ocean and country (e.g., “PAC_AUS” refers to the Pacific Ocean and Australia) under the “survey-previews” folder. These preview images provide a coarse overview of the content of the photo-quadrats that may help users to identify which surveys they are most interested in downloading.

Technical Validation

The methodological approach of automated image annotation used to generate this dataset is described and validated in Gonzalez-Rivero et al.²³. In order to evaluate the reliability of the estimations of benthic covers per each network we used the set of images and manual annotations defined above as testing datasets. Each testing dataset was constructed from contiguous images within standard spatial units (hereafter called “test transects”) with an extent of 30 m in length, concomitant with most coral reef monitoring programs^31,44,45,46 and best represents the spatial heterogeneity within a site²⁵. Thus, the aggregation of images within 30 m test transects allowed the evaluation of the performance of automated estimations within a scale that is consistent with monitoring sampling units, accounting for the variability in benthic abundance estimation among images. Test transects were selected at random while ensuring that images used for training the networks were not included. A total of 5,747 images, within 517 test transects (Table 2), were annotated (identical annotations points) by trained human observers, hereafter called “observer”, and the networks, hereafter called “machine”. The benthic composition within these test transects was averaged across images and contrasted between the two groups annotated: observer vs. machine. Specifically, we compared the error between machine and observer at the level of benthic categories and the consistency between machine and observer estimations from a community perspective.

We used the Absolute Error (E) to estimate the variability in the machine estimates when compared against observer estimations of abundance of a given benthic category. The absolute error (hereafter called “error”) for each category (i) was calculated as the absolute difference between the abundance estimated by the machine (m) and the observer (o; Eq. (1)). The error was calculated and compared at two aggregation levels: a) main benthic groups and b) label-set, sensu González-Rivero et al.²⁵:

$${{\rm{E}}}_{i}=\left|{m}_{i}-{o}_{i}\right|$$

(1)

Such errors indicated that the trained networks (machine) were able to produce cover estimates comparable to those generated by observers, and therefore suitable for spatial and temporal analysis and monitoring²³. While the errors are variable among regions and benthic categories (Fig. 2), they are within the range of previously reported inter- and intra-observer variability for established monitoring programs⁴⁷ (e.g., 2–5%, Long Term Monitoring Program, AIMS, Australia). Among main benthic groups (e.g., hard coral, algae), “Algae” showed the highest errors among regions, ranging between 2% (Pacific, Hawaii) and 5% (Southeast Asia, Fig. 2). The abundance of “Hard Coral” and “Soft Coral” estimated by the machine showed errors between 1% (Atlantic) and 4% (Australia), and 0.5% (Indian Ocean) and 3% (Australia), respectively. The remaining groups (Other and Other Invertebrates) showed a consistent error below 2% (Fig. 2).

The error of machine estimates was more consistent within main groups, with the only exception of the “Algae” group (Fig. 3). “Epilithic Algal Matrix”, as a functional group, comprised of a diverse number of algae groups (e.g., turf, cyanobacteria) was the most variable label (5%–7% error). The error of estimations, within the “Hard Coral” group, remained below 2% among regions, while “Soft Coral”, in particular “Other soft corals”, showed an error of up to 3%. This label is comprised of a large diversity of genera and growth forms, while more taxonomically defined labels showed an error below 2%. The remaining labels within the “Other” group, comprised mainly by substrate categories (e.g., sand, terrigenous sediment), and “Other Invertebrates”, comprise of benthic invertebrates other than hard and soft corals, showed a consistently low error (below 1%–2%; Fig. 3).

In addition, the overall performance of automated image annotation was evaluated by 1) correlating the estimates of abundance (cover) produced by the machine against those produced by the observer, and 2) evaluating the overall agreement between machine and observer estimations using the Bland-Altman plot⁴⁸, also called difference plots. Correlation was evaluated using the coefficient of determination (R²) from a linear regression model, which also evaluated the significance of this correlation. The R² provides an indication of the intensity of the correlation by evaluating the co-variance between the observer and machine estimations. The Bland-Altman plot determines differences between the two estimations against the observer estimations, or reference sensu Krouwer⁴⁹. We used the Bland-Altman plot analysis⁵⁰ to evaluate: (1) the mean of the difference or bias of machine estimations, (2) the homogeneity of the difference between machine and observer across the mean (i.e., over-dispersion) and, (3) the critical difference or agreement limits. The latter refers to the range, within the 95% Confidence Interval, of the difference between the two methods, and can be used as a reference to define where the measurements fall out of the range of the agreement (i.e., precision of the agreement). Bias refers to the difference between the two methods and the Bland-Altman plot can help visualise whether this bias changes across the mean of values evaluated, and therefore a measurement of the consistency of the bias⁵⁰.

Expectedly, network estimations (machine) of benthic cover were highly correlated with observer estimations for all five global regions (R² = 0.97, P < 0.001, Fig. 4). Most importantly, the differences between the machine and observer were unbiased across the spectrum of benthic cover (mean ~ 0), and the variability around the mean difference was estimated at 4% (Critical Difference or 95% Confidence Interval of the difference) for all labels across the surveyed regions (Fig. 4).

Lastly, to evaluate the machine performance for estimating community structure, pair-wise comparisons of manual and automated estimations of benthic composition within each test transect were executed using the Bray-Curtis similarity index. This index is sensitive to misrepresentation in the automated estimation of abundance for specific labels or benthic groups when compared against manual observations. Therefore, index values of 100% will represent a complete resemblance between machine and observer estimations for community composition. While the Absolute Error already provides a metric for label-specific performance of automated annotations, the community-wide analysis lays out a synthesis analysis to understand how closely represented is the automated estimation of benthic composition against manual observations across the range of communities within a region.

The comparison of communities within regions using Bray-Curtis similarity index showed the estimations of benthic composition were consistent between observer and machine (i.e., between 84% and 94% of similarity) among and within regions (Fig. 5). Across regions, Australia and South East Asia exhibited the lowest values of similarities, 84% and 88% respectively. Indian Ocean and Atlantic showed similarities around 90%, while Central Pacific Ocean presented up to 94% of similarity between automated estimations and manual observations. Irrespective of the differences in community structure among and within regions, this comparison corroborates the reliability of the machine estimations at both label and community level.

Machine estimations were generally better for well-defined and large organisms in quadrats, such as corals and soft corals, showing the lowest error (<1%–2%). The window size (224 × 224 pixels), however, penalises the estimation of smaller, patchy or less defined organisms, such as algal categories (error 3%–6%). Additional caveats and discussions in terms of the errors and accuracy of data derived from automated classifiers can be found in Beijbom et al.^18,19, González-Rivero et al.^23,25 and Williams et al.²².

In summary, the validation exercise demonstrates that the trained networks generated cover estimates unbiased compared to those derived from observers, with an overall error of 4%. Thus, our data set represents a useful source of information to support spatio-temporal analysis of coral reefs ranging from small- (presence/absence within quadrats 1 m²) to broad-scales (regional/global analysis).

Usage Notes

Users can identify transects of relevance using a map or file naming convention present on the data repository³⁴. The survey ID (“surveyid” in tables) is the unique identifier of each reef location in space and time that users will require as a key cross-reference to navigate within the repository and link the different cover tables, as well as cover records with the imagery folders.

Our relational dataset contains some tables with thousands to millions of rows (e.g., cover, quadrat and annotation tables), hence it is advised to use a software that can handle this structure, such as R, PostGRES, Microsoft Access, etc. Manipulating these data with Microsoft Excel is not advised. For users interested in working with the whole dataset, the best approach will be to create a database, for instance in the MySQL system, and import all the csv files provided within the “tabular-data.zip”. Then the user can relate the tables using the “surveyid”, “imageid” and “quadratid” fields, which are cross-referenced between tables (see Data Records section and Table 3).

Accessing benthic covers

We provide guidelines to access cover records from the Indian Ocean Region (hereafter called “IOR”) per survey and quadrat as a transferable example for retrieving any cover record. The first step is to download the “tabular-data.zip” file from the repository³⁴. As soon as the file is unzipped, 18 csv files (i.e., tables) are extracted including six with cover records, one file for each of the five regions and one summary file. Within the latter, the “seaviewsurvey_surveys.csv” file, the user can directly access the percent cover of main benthic groups (hard coral, algae, soft coral, other invertebrate, and other) summarised per survey (“surveyid” column). Filtering this table by the “ocean” column, users can select the surveys associated only with IOR, in this case 92 surveys. In addition, users can filter the surveys per country (“country” column), link each survey to the zip folder with the images (“folder_name” column) and the geographic location (columns with latitude and longitude at the start and end of the survey).

If users require percentage covers of detailed benthic categories per quadrat they must work with the file “seaviewsurvey_reefcover_indianocean.csv” (if the user is interested in other region/country, please refer to Table 1 to find the appropriate file). Users can relate cover records at the level of surveys (seaviewsurvey_surveys.csv file) with records at the level of quadrats (seaviewsurvey_reefcover_indianocean.csv) using the “surveyid” field.

To summarise percentage covers at the level of quadrats, users must sum the values of the benthic categories of interest. In our example with the IOR, by selecting and merging from the 45 field names of the columns in the “seaviewsurvey_reefcover_indianocean.csv” file. For this purpose, we recommend using the label-set table “seaviewsurvey_labelsets.csv” to understand the labels of different benthic categories, review the descriptions and examples, and guide your merging scheme. Filtering the label-set table by the “region” column, in this case selecting Indian Ocean, users will obtain the 45 categories (under the “label” column) associated with this region. Please note we also provide a reduced set of categories (under “merged_label” and “merged_name” columns) used in the technical validation, however users can merge the categories based on their criteria. A good consistency check to use after merging categories is that the cover fields in each row should always sum to 1.0 (these values are proportions, and 1.0 represents 100%).

To create summaries of the cover data for spatial analyses (at different scales) users will need to consider the cases where from a single georeferenced image there is more than one 1 m² photo-quadrat extracted (see Post-processing of images section). This implies all the quadrats originated from the same image have the same georeference. For this situation, we recommend to aggregate the cover table (in this case seaviewsurvey_reefcover_indianocean.csv) by the “imageid” field, taking the mean cover value among all records (quadrats) for an image. From this point, users can spatially aggregate the images according with their own criteria and/or purposes. For instance, we applied hierarchical clustering to obtain the 30 m test transects for the technical validation. Further details about this method are provided in González-Rivero et al.²³ (see SM4 in Supplementary Material) and Vercelloni et al.³⁷. When aggregating, users can relate cover records using the “surveyid”, “imageid” and “quadratid” fields. It is important to remind here that the dataset has a unique ID for each survey (five digits), image (nine digits), and quadrat (eleven digits) that link the different tables and files. Thus, in the ID 36001004202 from the IOR cover dataset, the first five digits correspond to the survey ID (36001), this survey ID and the next four digits to the image ID (360010042), and the image ID along with the last two digits to the quadrat ID (36001004202, 02 implies there are at least two quadrats available for the 360010042 image).

Accessing temporal data

Table 1 identifies which regions/countries have temporal data. Using the “seaviewsurvey_surveys.csv” table users can identify which reef locations have been surveyed multiple times by filtering by the “transectid” field. A transect ID will appear more than once in this field if the transect has been surveyed temporally in more than one year. For instance, filtering the table by Indian Ocean/Maldives, and then filtering the first transect ID, 37001, you will find there are two surveys ID associated with this transect, 37001 and 46001, which were done in 2015 and 2017 respectively as recorded in the “surveydate” field. This means this reef location has available images and cover data for these years. Once the user has identified and selected reef locations with temporal data, we recommend plotting the georeferenced data of the images of the surveys of interest in order to visually explore the spatial overlap between them over the different years and design appropriate temporal comparisons. Further details about temporal assessments are provided in González-Rivero et al.²³ (see SM4 in Supplementary Material), Kennedy et al.³⁹ and Vercelloni et al.³⁷.

Accessing the imagery associated with benthic covers

To inspect the image quadrats associated with the data in detail, users only need the survey ID(s) and to download the set of quadrats required from the “photo-quadrats” directory at the repository³⁴. Within this directory, the different zip folders of each survey can be identified with a code (e.g., IND_MDV_37001_201503) that includes the five digits of the survey ID in the middle of the code (i.e., 37001). Alternatively, users can find the zip folder’s name associated with any survey using the “seaviewsurvey_surveys.csv” table and looking at the “folder_name” column. Once a survey has been downloaded and unzipped, all the associated quadrats (jpeg format) are identified with a unique ID, the “quadratid” in which the first five digits correspond to the survey ID. Users can always related each photo-quadrat to the cover records from the data tables using the “quadratid”.

The sets of three raw photographs (downward, left and right facing), in CR2 format (Canon Raw version 2 format) that constitute each survey, including the downward facing files used to generate the quadrats of this dataset will be also accessible through an open-access data repository in the future.

Accessing annotations data and their images

Extracting the “tabular-data.zip” file users can access nine files with the records of manual annotations used for training and validating each network (one per network). Table 2 lists the networks and the name of the corresponding csv file. For instance using the “annotations_IND_MDV.csv” table and filtering by the “data_set” field users can differentiate if the record was used for either training (“train”) or validation (“test”). The image quadrats specifically associated with the annotations can be accessed from the “annotated-images” directory on the repository³⁴. Within this directory, the different zip folders of the networks are identified with the corresponding annotations file name but excluding the “annotations” part. Thus, for the “annotations_IND_MDV.csv” set of annotations, the matching images folder is “IND_MDV.zip”. By associating annotations data and their images with the labelset information (seaviewsurvey_labelsets.csv file), users can extract specific visual examples of the labels used during the manual annotations for the different regions.

Additionally, if users are interested in visual examples of the labels, the Supplementary Table 1 provides the URLs to view multiple examples of images annotated at CoralNet (https://coralnet.ucsd.edu/) for each label. Users need to search for the region and label of interest within the table, click on the relevant URL or also copy the URL and paste it into any browser address bar to access the images directly.

In addition to the manual annotations, within the “tabular-data.zip” file users can have access to one file with all raw records of the automated annotations (“seaviewsurvey_annotations.csv”). The records of this table can be related to the cover records (processed data of covers files) by the “quadratid” fields.

Code availability

The documented code and examples to implement the automated image analyses method from this study can be found in the following repository: https://github.com/mgonzalezrivero/reef_learning.

References

Hoegh-Guldberg, O. et al. Coral Reefs Under Rapid Climate Change and Ocean Acidification. Science 318, 1737-1742, 10.1126/science.1152509%J (2007).
Pandolfi, J. M. et al. Global Trajectories of the Long-Term Decline of Coral Reef Ecosystems. Science 301, 955-958, 10.1126/science.1085706%J (2003).
Hughes, T. P., Graham, N. A., Jackson, J. B., Mumby, P. J. & Steneck, R. S. Rising to the challenge of sustaining coral reef resilience. Trends in ecology & evolution 25, 633–642, https://doi.org/10.1016/j.tree.2010.07.011 (2010).
Article Google Scholar
Norström, A. V. et al. Guiding coral reef futures in the Anthropocene. Frontiers in Ecology and the Environment 14, 490–498, https://doi.org/10.1002/fee.1427 (2016).
Article Google Scholar
Williams, G. J. et al. Coral reef ecology in the Anthropocene. Funct Ecol 33, 1014–1022, https://doi.org/10.1111/1365-2435.13290 (2019).
Article Google Scholar
Bellwood, D. R. et al. Coral reef conservation in the Anthropocene: Confronting spatial mismatches and prioritizing functions. Biological Conservation 236, 604–615, https://doi.org/10.1016/j.biocon.2019.05.056 (2019).
Article Google Scholar
Darling, E. S. et al. Social–environmental drivers inform strategic management of coral reefs in the Anthropocene. Nature Ecology & Evolution 3, 1341–1350, https://doi.org/10.1038/s41559-019-0953-8 (2019).
Article Google Scholar
Hoegh-Guldberg, O., Kennedy, E. V., Beyer, H. L., McClennen, C. & Possingham, H. P. Securing a Long-term Future for Coral Reefs. Trends in Ecology & Evolution 33, 936–944, https://doi.org/10.1016/j.tree.2018.09.006 (2018).
Article Google Scholar
McCook, L. J. et al. Adaptive management of the Great Barrier Reef: a globally significant demonstration of the benefits of networks of marine reserves. Proceedings of the National Academy of Sciences 107, 18278–18285, https://doi.org/10.1073/pnas.0909335107 (2010).
Article ADS Google Scholar
Barnes, M. L. et al. Social-ecological alignment and ecological conditions in coral reefs. Nature Communications 10, 2039, https://doi.org/10.1038/s41467-019-09994-1 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Beyer, H. L. et al. Risk-sensitive planning for conserving coral reefs under rapid climate change. Conservation Letters 11, e12587, https://doi.org/10.1111/conl.12587 (2018).
Article Google Scholar
McLeod, E. et al. The future of resilience-based management in coral reef ecosystems. Journal of Environmental Management 233, 291–301, https://doi.org/10.1016/j.jenvman.2018.11.034 (2019).
Article PubMed Google Scholar
van Oppen, M. J. H. et al. Shifting paradigms in restoration of the world’s coral reefs. Global Change Biology 23, 3437–3448, https://doi.org/10.1111/gcb.13647 (2017).
Article ADS PubMed Google Scholar
Pendleton, L. H. et al. Disrupting data sharing for a healthier ocean. ICES Journal of Marine Science 76, 1415–1423, 10.1093/icesjms/fsz068%J (2019).
Costello, M. J., Michener, W. K., Gahegan, M., Zhang, Z.-Q. & Bourne, P. E. Biodiversity data should be published, cited, and peer reviewed. Trends in Ecology & Evolution 28, 454–461, https://doi.org/10.1016/j.tree.2013.05.002 (2013).
Article Google Scholar
Obura, D. O. et al. Coral Reef Monitoring, Reef Assessment Technologies, and Ecosystem-Based Management. 6, https://doi.org/10.3389/fmars.2019.00580 (2019).
Hughes, T. P. et al. Coral reefs in the Anthropocene. Nature 546, 82–90, https://doi.org/10.1038/nature22901 (2017).
Article ADS CAS PubMed Google Scholar
Beijbom, O. et al. Towards Automated Annotation of Benthic Survey Images: Variability of Human Experts and Operational Modes of Automation. PloS one 10, e0130312, https://doi.org/10.1371/journal.pone.0130312 (2015).
Article CAS PubMed PubMed Central Google Scholar
Beijbom, O. et al. Improving automated annotation of benthic survey images using wide-band fluorescence. Scientific reports 6, 23166, https://doi.org/10.1038/srep23166 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Hedley, D. J. et al. Remote Sensing of Coral Reefs for Monitoring and Management: A Review. Remote Sensing 8, https://doi.org/10.3390/rs8020118 (2016).
Madin, E. M. P., Darling, E. S. & Hardt, M. J. Emerging Technologies and Coral Reef Conservation: Opportunities, Challenges, and Moving Forward. Frontiers in Marine Science 6, 727 (2019).
Article Google Scholar
Williams, I. D. et al. Leveraging Automated Image Analysis Tools to Transform Our Capacity to Assess Status and Trends of Coral Reefs. Frontiers in Marine Science 6, https://doi.org/10.3389/fmars.2019.00222 (2019).
González-Rivero, M. et al. Monitoring of Coral Reefs Using Artificial Intelligence: A Feasible and Cost-Effective Approach. Remote Sensing 12, 489, https://doi.org/10.3390/rs12030489 (2020).
Article ADS Google Scholar
Gouezo, M. et al. Drivers of recovery and reassembly of coral reef communities. Proc Biol Sci 286, 20182908, https://doi.org/10.1098/rspb.2018.2908 (2019).
Article PubMed PubMed Central Google Scholar
González-Rivero, M. et al. Scaling up Ecological Measurements of Coral Reefs Using Semi-Automated Field Image Collection and Analysis. Remote Sensing 8, 30, https://doi.org/10.3390/rs8010030 (2016).
Article ADS Google Scholar
Obura, D. O. et al. Coral Reef Monitoring, Reef Assessment Technologies, and Ecosystem-Based Management. Frontiers in Marine Science 6, 580 (2019).
Article Google Scholar
Bruno, J. F., Sweatman, H., Precht, W. F., Selig, E. R. & Schutte, V. G. W. Assessing evidence of phase shifts from coral to macroalgal dominance on coral reefs. Ecology 90, 1478–1484, https://doi.org/10.1890/08-1781.1 (2009).
Article PubMed Google Scholar
Selig, E. R. & Bruno, J. F. A Global Analysis of the Effectiveness of Marine Protected Areas in Preventing Coral Loss. PLOS ONE 5, e9278, https://doi.org/10.1371/journal.pone.0009278 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Bruno, J. F. & Selig, E. R. Regional Decline of Coral Cover in the Indo-Pacific: Timing, Extent, and Subregional Comparisons. PLOS ONE 2, e711, https://doi.org/10.1371/journal.pone.0000711 (2007).
Article ADS PubMed PubMed Central Google Scholar
Côté, I. M., Gill, J. A., Gardner, T. A. & Watkinson, A. R. Measuring coral reef decline through meta-analyses. Philos Trans R Soc Lond B Biol Sci 360, 385–395, https://doi.org/10.1098/rstb.2004.1591 (2005).
Article PubMed PubMed Central Google Scholar
Jackson, J., Donovan, M., Cramer, K. & Lam, V. Status and trends of Caribbean coral reefs: 1970–2012. (Global Coral Reef Monitoring Network, 2014).
Moritz, C. et al. Status and Trends of Coral Reefs of the Pacific. (Global Coral Monitoring Network, 2018).
Obura, D. O. et al. Coral Reef Status Report for the Western Indian Ocean. (Global Coral Reef Monitoring Network, 2017).
González-Rivero, M. et al. Seaview Survey Photo-quadrat and Image Classification Dataset University of Queensland Library https://doi.org/10.14264/uql.2019.930 (2019).
González-Rivero, M. et al. The Catlin Seaview Survey - kilometre-scale seascape assessment, and monitoring of coral reef ecosystems. Aquatic Conservation: Marine and Freshwater Ecosystems 24, 184–198, https://doi.org/10.1002/aqc.2505 (2014).
Article Google Scholar
Peterson, E. E. et al. Monitoring through many eyes: Integrating scientific and crowd-sourced datasets to improve monitoring of the Great Barrier Reef. arXiv preprint arXiv:1808.05298 (2018).
Vercelloni, J. et al. Forecasting intensifying disturbance effects on coral reefs. Global Change Biology, https://doi.org/10.1111/gcb.15059 (2020).
Bryant, D. E. P. et al. Comparison of two photographic methodologies for collecting and analyzing the condition of coral reef ecosystems. Ecosphere 8, e01971, https://doi.org/10.1002/ecs2.1971 (2017).
Article Google Scholar
Kennedy, E. V. et al. Coral Reef Community Changes in Karimunjawa National Park, Indonesia: Assessing the Efficacy of Management in the Face of Local and Global Stressors. Journal of Marine Science and Engineering 8(10), 760 (2020).
Article Google Scholar
Althaus, F. et al. A Standardised Vocabulary for Identifying Benthic Biota and Substrata from Underwater Imagery: The CATAMI Classification Scheme. PLOS ONE 10, e0141039, https://doi.org/10.1371/journal.pone.0141039 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wallace, C. C. Staghorn corals of the world: a revision of the coral genus Acropora (Scleractinia; Astrocoeniina; Acroporidae) worldwide, with emphasis on morphology, phylogeny and biogeography / Carden C. Wallace. (CSIRO Publishing, 1999).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
Beijbom, O., Edmunds, P. J., Kline, D. I., Mitchell, B. G. & Kriegman, D. In 2012 IEEE Conference on Computer Vision and Pattern Recognition. 1170–1177 (IEEE).
Brown, E. K. et al. Development of benthic sampling methods for the Coral Reef Assessment and Monitoring Program (CRAMP) in Hawai’i. Pacific Science 58, 145–158 (2004).
Article Google Scholar
Murdoch, T. J. Status and Trends of Bermuda Reefs and Fishes: 2015 Report Card. (Bermuda Zoological Society, Flatts, Bermuda, 2017).
Sweatman, H. H. et al. Long-term monitoring of the Great Barrier Reef. Status Report Number 7. (Australian Institute of Marine Science, 2005).
Ninio, R., Delean, J., Osborne, K. & Sweatman, H. Estimating cover of benthic organisms from underwater video images: variability associated with multiple observers. Marine Ecology-Progress Series 265, 107–116, https://doi.org/10.3354/meps265107 (2003).
Article ADS Google Scholar
Bland, J. M. & Altman, D. G. Measuring agreement in method comparison studies. Statistical Methods in Medical Research 8, 135–160, https://doi.org/10.1177/096228029900800204 (1999).
Article CAS PubMed Google Scholar
Krouwer, J. S. Why Bland–Altman plots should use X, not (Y+X)/2 when X is a reference method. Statistics in Medicine 27, 778–780, https://doi.org/10.1002/sim.3086 (2008).
Article MathSciNet PubMed Google Scholar
Giavarina, D. Understanding Bland Altman analysis. Biochemia Medica 25, 141–151, https://doi.org/10.11613/BM.2015.015 (2015).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge the support of the following people (alphabetical by surname): Sori Arinta, Allen Chen, Illiana Chollett, Norbert Englebert, Patrick Gartrell, Yeray Gonzalez-Marrero, Susie Green, Selene Gutierrez, David Harris, Craig Heatherington, Kyra Hay, Ana Teresa Herrera-Reveles, Tadzio Holtrop, Nizam Ibrahim, Russell Kelley, Kathryn Markey, Angela Marulanda, Morana Mihaljevic, Christopher Mooney, Sara Naylor, Lorna Parry, Catalina Reyes-Nivia, Tyrone Ridgeway, Eugenia Sampayo, John Schleyer, Ulricke Seebeck, Ricardo Sepulveda, Kristin Stolberg, Amy Stringer, Abbie Taylor, Linda Tonk, Mark JA Vermeij, and Richard Vevers. Further, we would like to thank these organisations (alphabetical): Anguilla Fisheries Department, Australian Centre of Field Robotics, Australian Department for Energy and Environment, Australian Institute of Marine Science (AIMS), Biodiversity Research Centre / Academia Sinica (Taiwan), Bermuda Institute of Ocean Sciences (BIOS), Natural History Museum (Bermuda), Chagos Conservation Trust, Comisión Nacional para el Conocimiento y Uso de la Biodiversidad (Mexico), Conservation International, Diponogoro University, Dutch Caribbean Nature Alliance, Global Change Institute at The University of Queensland, Google Taiwan, Hawaiian Islands Humpback Whale National Marine Sanctuary, Hawaii Institute of Marine Biology (USA), Hawaii Pacific University (USA), Healthy Reefs for Healthy People Initiative, Indonesian Institute of Sciences, International Union for the Conservation of Nature (IUCN), James Cook University (Australia), Japan Agency for Marine Science and Technology (Japan), Joy Family Foundation, Living Oceans Foundation, Maldives Marine Research Institute, Marine Research Centre (Maldives), Maunalei Ahupua’a Community Marine & Terrestrial Management Area, Museum of Tropical Queensland, National Oceanic and Atmospheric Administration (NOAA), Netherlands Centre for Biodiversity (Netherlands), Old Dominion University (USA), Oregon State University (OSU USA), Queensland Government (Australia), Queensland University of Technology (Australia), PANEDIA, Réserve Naturelle Nationale de Saint-Martin, Sam Ratulangi University, Scripps Institution of Oceanography / University of California San Diego (USA), Smithsonian Institute (Florida), STINAPA (Bonaire), Tel Aviv University (Israel), Timor-Leste Ministry of Agriculture and Fisheries, Tubbataha Management Office, Tubbataha Reef Marine Park,Underwater Earth, Universidad Central de Venezuela (Venezuela), Universidad Nacional Autonoma de Mexico (UNAM Mexico), Université Antilles Guyane (Guadeloupe), University of Amsterdam (Netherlands) / CARMABI (Curacao), University of Bremen (Germany), University of California Berkeley (USA), University of Maine (USA), University of Sydney (Australia), University of the French West Indies and Guiana, University of the Ryukyus (Japan), University of Vienna (Austria), Vulcan Inc, Waitt Foundation, Wildlife Conservation Society (Belize), World Wildlife Fund, XL Catlin Ltd. (now AXA XL), XL Catlin Seaview Survey and XL Catlin Global Reef Record for their support. Coral reef photo-quadrats and associated image classification data were developed by The University of Queensland using survey photographs sourced from the XL Catlin Seaview Suvey and XL Catlin Global Reef Record, undertaken by the University of Queensland and Underwater Earth, with additional funding from Vulcan Inc., Academica Sinica, and the Australian government. Funding for this project was also provided by the Australian Research Council (ARC Laureate and ARC Centre for Excellence for Coral Reef Studies to O.H.-G.). This project also received support from QRIScloud, the Queensland Cyber Infrastructure Foundation (QCIF), and the Nectar Research Cloud supported by the National Collaborative Research Infrastructure Strategy (NCRIS).

Author information

These authors contributed equally: Alberto Rodriguez-Ramirez, Manuel González-Rivero, Ove Hoegh-Guldberg.

Authors and Affiliations

Global Change Institute, The University of Queensland, St Lucia, QLD, 4072, Australia
Alberto Rodriguez-Ramirez, Manuel González-Rivero, Oscar Beijbom, Pim Bongaerts, Peter Dalton, Emma V. Kennedy, Sebastian Lopez-Marcano, Benjamin P. Neal, Julie Vercelloni & Ove Hoegh-Guldberg
School of Biological Sciences, The University of Queensland, St Lucia, QLD, 4072, Australia
Alberto Rodriguez-Ramirez, Kristen T. Brown, Sophie Dove, Hawthorne L. Beyer & Ove Hoegh-Guldberg
Australian Institute of Marine Science, Cape Cleveland, QLD, 4810, Australia
Manuel González-Rivero
Australian Research Council (ARC) Centre for Excellence for Coral Reef Studies at The University of Queensland, St Lucia, QLD, 4072, Australia
Manuel González-Rivero, Kristen T. Brown, Dominic E. P. Bryant, Sophie Dove, Anjani Ganase, Catherine J. S. Kim, Veronica Z. Radice, Julie Vercelloni, Hawthorne L. Beyer & Ove Hoegh-Guldberg
Berkeley Artificial Intelligence Research, University of California, Berkeley, CA, 94720, USA
Oscar Beijbom
Underwater Earth, Sydney, NSW, Australia
Christophe Bailhache
California Academy of Sciences, San Francisco, CA, 94118, USA
Pim Bongaerts
Remote Sensing Research Centre, School of Earth and Environmental Sciences, The University of Queensland, St Lucia, QLD, 4072, Australia
Emma V. Kennedy
Australian Rivers Institute, Griffith University, Gold Coast, QLD, 4222, Australia
Sebastian Lopez-Marcano
Colby College, Environmental Studies Department, Waterville, Maine, ME, 04901, USA
Benjamin P. Neal
Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, 04544, USA
Benjamin P. Neal
ARC Centre of Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, QLD, 4000, Australia
Julie Vercelloni
School of Mathematical Sciences, Science and Engineering Faculty, Queensland University of Technology, Brisbane, QLD, 4000, Australia
Julie Vercelloni

Authors

Alberto Rodriguez-Ramirez
View author publications
You can also search for this author in PubMed Google Scholar
Manuel González-Rivero
View author publications
You can also search for this author in PubMed Google Scholar
Oscar Beijbom
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Bailhache
View author publications
You can also search for this author in PubMed Google Scholar
Pim Bongaerts
View author publications
You can also search for this author in PubMed Google Scholar
Kristen T. Brown
View author publications
You can also search for this author in PubMed Google Scholar
Dominic E. P. Bryant
View author publications
You can also search for this author in PubMed Google Scholar
Peter Dalton
View author publications
You can also search for this author in PubMed Google Scholar
Sophie Dove
View author publications
You can also search for this author in PubMed Google Scholar
Anjani Ganase
View author publications
You can also search for this author in PubMed Google Scholar
Emma V. Kennedy
View author publications
You can also search for this author in PubMed Google Scholar
Catherine J. S. Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Lopez-Marcano
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin P. Neal
View author publications
You can also search for this author in PubMed Google Scholar
Veronica Z. Radice
View author publications
You can also search for this author in PubMed Google Scholar
Julie Vercelloni
View author publications
You can also search for this author in PubMed Google Scholar
Hawthorne L. Beyer
View author publications
You can also search for this author in PubMed Google Scholar
Ove Hoegh-Guldberg
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.R.-R., M.G.-R. and O.H.-G. conceived the idea of this data descriptor. A.R.-R., M.G.-R. H.L.B. and O.H.-G. wrote the manuscript. M.G.-R., P.D., B.P.N., E.V.K., D.E.P.B. and J.V. led expeditions. M.G.-R., C.B., K.T.B., D.E.P.B., P.D., A.G., E.V.K., C.J.S.K., B.P.N., V.Z.R., A.R.-R. and J.V. collected the imagery. M.G.-R, C.B., P.B. and O.H.-G. contributed to develop field methods. M.G.-R., A.R.-R. and O.B. developed and implemented the image analysis. M.G.-R., A.R.-R. and S.L.-M. processed the data. M.G.-R., A.R.-R., D.E.P.B., A.G. and C.J.S.K. contributed to train the automated classifiers. C.B., O.B., S.D., P.B., E.V.K. and J.V. provided further technical and scientific advice to the project. A.R.-R. and H.L.B. designed and implemented the dataset in the repository. O.H.-G. and M.G.-R. conceived and led the project. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Alberto Rodriguez-Ramirez, Manuel González-Rivero or Ove Hoegh-Guldberg.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary File 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Rodriguez-Ramirez, A., González-Rivero, M., Beijbom, O. et al. A contemporary baseline record of the world’s coral reefs. Sci Data 7, 355 (2020). https://doi.org/10.1038/s41597-020-00698-6

Download citation

Received: 11 June 2020
Accepted: 02 October 2020
Published: 20 October 2020
DOI: https://doi.org/10.1038/s41597-020-00698-6

This article is cited by

Rapid seagrass meadow expansion in an Indian Ocean bright spot
- Matthew Floyd
- Holly K. East
- Andrew J. Suggitt
Scientific Reports (2024)
A global database on coral recovery following marine heatwaves
- Robert van Woesik
- Chelsey Kratochwill
Scientific Data (2024)
Decadal stability in coral cover could mask hidden changes on reefs in the East Asian Seas
- Y. K. S. Chan
- Y. A. Affendi
- D. Huang
Communications Biology (2023)
Estimating rates of coral carbonate production from aerial and archive imagery by applying colony scale conversion metrics
- E. Husband
- C. T. Perry
- I. D. Lange
Coral Reefs (2022)

Subjects

Abstract

Similar content being viewed by others

Background & Summary

Methods

Photographic surveys

Post-processing of images for manual and automated annotation

Image analysis: manual and automated annotation for estimating covers of benthic categories

Label-set definition

Training and fine-tuning of the network

Automated image annotation and data processing

Data Records

Manual and automated annotations

Benthic cover

Imagery

Technical Validation

Usage Notes

Accessing benthic covers

Accessing temporal data

Accessing the imagery associated with benthic covers

Accessing annotations data and their images

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links