Introduction

The geographic extent of deep-sea corals (DSC) is far from being fully understood1. Throughout the world’s oceans, elucidation of the processes that determine their spatial patterns and abundances will improve our understanding of the structure of continental margin communities. At each new location explored, many new DSCs species have been revealed and notable differences in the distribution and habitat preferences of different coral taxa observed2,3,4.

With advancements in the use of technology in offshore areas, considerable progress in DSC research has occurred5. Ecological surveys typically state the depth range of recorded species and the most and least frequent and/ or abundant taxa found. Data on species composition and abundances may give insight into many aspects of the community, including population dynamics and ecological interactions like competition and predation6. The establishment of these DSC sites requires specific physical, chemical, and geological conditions, including parameters such as temperature, salinity, dissolved oxygen concentration, current intensity, and substratum7,8,9. Depth ranges vary for each species depending on regional variables, such as water mass structure and bathymetric gradients in productivity10,11. Thus, understanding the requirements for establishing DSC habitats and the variability in depth ranges among species, we can gain a comprehensive understanding of the dynamics and interactions within the studied communities.

Model outputs have indicated that continental slopes are one of the main suitable coral sites worldwide8. The bathyal (~200–3000 m depth) region12 has large protruding topographical features, interfering with bottom current flow and creating colonizable hard substrates at multiple depths; this enhances seabed erosion and resuspends particulates, which provides an elevated supply of suspended food particles for corals13. Along the Brazilian margin, five main water masses with distinct intensity and transport directions influence the continental slope14, which may contribute to regional species diversity pools.

Species distribution models presented by ref. 15 indicated that Santos basin in the Southeastern Brazilian Margin is a suitable region for the presence of DSC. Video recorded by remotely operated vehicles (ROV) during Petrobras surveys in this region detected an abundance of corals. Analyzing these video recordings in the northern region of Santos Basin, this study found that most of the coral sites were coral mounds that were distributed on the middle slope, while coral reef sites dominated the upper slope. The richness of DSC species was characterized, and we examined the following hypotheses that; (1) Vertical distance among samples at two depth strata (upper and middle slope) affects coral communities (species richness and assemblage structure) on a local scale, more than geographic horizontal distance (three different areas at the same depth); and (2) spatial distribution patterns of DSC species occur as a function of environmental factors (physicochemical properties). Although some species have broad distributions, most species occurred more frequently on the middle slope; richness was high in areas D and E (at the same depth). The possible influence of physical water properties on DSC richness and assemblage structure was then examined, finding a relation between assemblages and water mass differences.

Results

Water mass characteristics

The historical collection of CTD data in the vicinity of areas A–E recorded an average temperature of 14.40; 9.56; 4.25; 4.41 and 4.60 °C, respectively (Fig. 1). In 50 years, the highest temperature range was approximately four degrees and occurred around area A, with a minimum temperature of 12.38 and maximum of 16.61 °C. Area B presented temperatures ranging between 7.99 and 11.07 °C, and areas C, D and E showed almost no variability over the years (3.79–4.74; 4–4.78; 4.13–4.99 °C, respectively).

Fig. 1: Profiles of salinity and temperature.
figure 1

Historical (a, b) practical salinity and (c, d) potential temperature profiles within a radius of ~20 km of positions A, B, C, D, E. Positions A, B, C, D, E depths are 245, 494, 886, 822, and 853 m, respectively. The number of CTD casts with valid data within the site bottom depth (horizontal colored lines) ±5 m are indicated by ‘n’. Boxplots of (e, f) salinity and (g, h) temperature were calculated by selecting valid data within the above-mentioned radius and vertical interval range. In the boxplots, the central value represents the median value, the box represent quartiles, while the whiskers show the quartile ±1.5× interquartile range. The values of the horizontal axes of the top panels correspond to the ranges in the bottom panels.

Salinity historical records in the vicinity of areas A–E recorded an average of 35.58; 34.93; 34.51; 34.49 and 34.50, respectively (Fig. 1). Salinity did not vary notably around each sampled area among years, with a maximum observed range of 0.63 (Area A). The lowest and highest salinity values recorded for each area were: 35.27–35.90, area A; 34.70–35.12, area B; 34.40–34.64, area C; 34.45–34.54, area D; and 34.40–34.53, area E. Because historical data were studied around each sampled area, the temperature and salinity profiles are grouped according to their spatial position (Fig. 1).

The T–S diagrams indicated the presence of two main water masses influencing the sampling areas: SACW with neutral densities between 26.35 and 27.10 kg m−3; AAIW between 27.10 and 27.60 kg m−3; (Fig. 2). Temperature and salinity values associated with area A are majorly within the density range of SACW, while values associated with area B lie in a transition density range (~27.10 kg m−3) between SACW and AAIW. In the deepest levels (areas C, D and E), the dominant water mass is the AAIW.

Fig. 2: T–S diagram of historical temperature and salinity.
figure 2

T–S diagram of historical temperature (in situ) and salinity (practical salinity scale) data within a radius of approximately 20 km around individual coral areas (a) A, B, C, (b) D and E. The blue dotted contours depict the neutral density and serve as water mass density limits in the southwestern South Atlantic51. Colored markers indicate the selected T–S pairs within the vertical range that includes a depth range of the bottom depth ±5 m. Gray dots represent pairs at other depths. Panel (b) shows the E points superimposed over the D points, as both areas were sampled at similar depths and within similar density ranges.

Coral sites

The 69 studied deep-sea coral sites comprise three distinct morphological features: living coral reefs, live coral reefs on the top of deep-sea coral carbonate mounds, and extinct coral carbonate mounds where the framework building corals are no longer alive and growing.

Considering their biological and geological characteristics, the upper slope coral formations (area A) are reef sites. Colonies of D. pertusum (formerly Lophelia pertusa) composed of living polyps at their extremities and a large volume of dead coral structure underneath (a volume that exceeds the living portion), characterize dense reef sites around 245 m deep, with few associated DSC (Fig. 3). This is the beginning of coral mound formation, but there has not been sufficient time for coral skeleton to accumulate to the extent where it shrouds the underlying substrata.

Fig. 3: Coral sites.
figure 3

Three distinct morphological features of coral sites in the study areas. Area A - living coral reefs; Area B – live coral reefs on top of extinct deep-sea coral carbonate mounds; and areas C, D, and E - extinct coral carbonate mounds.

At a depth of 494 m on the upper slope, area B was identified as a coral carbonate mound. Although we could not extract cores from the specific mounds examined in this study to verify whether they contain many years of alternation of coral skeletons with sediment, every coral formation exhibiting the surface characteristics described in ref. 11 was considered a deep-sea coral carbonate mound.

In area B, the studied coral carbonate mound was found in association with live D. pertusum forming reef structures, with several associated DSC (Fig. 3). In contrast, no live D. pertusum forming reef structures were observed at the studied sites on the middle slope. At greater depths, which correspond to the reflective targets studied between 785 and 963 m, the coral formations on the middle slope (areas C, D, and E) consisted of extinct coral carbonate mounds that were colonized by solitary scleractinian corals, individual colonies of scleractinian corals, octocorals, and black corals (Antipatharia).

Deep-sea coral assemblage characterization

The species accumulation curves indicated a more comprehensive sampling effort when all coral sites in the northern Santos Basin were combined (Supplementary Fig. 1). The species accumulation curves for the upper slope did not reach an asymptote, suggesting that additional coral species could be found with increased sampling effort, whereas the sampling effort in the middle slope was relatively sufficient (Supplementary Fig. 1). Regarding the study areas, although site A recorded lower richness of DSC, the sampling effort was adequate to produce a representative collection of DSC from the study sites, even though the curve did not reach the asymptote (Supplementary Fig. 1). As for location B, even though there was only one sampling unit (indicating a need for increased sampling effort), it is noteworthy that the survey time, distance covered by the ROV, and the number of frames analyzed in area B revealed an extensive coral site in that region and provided important background information on the deep-sea coral richness recorded (Supplementary Table 1).

A total of 2054 live DSC belonging to 63 taxa were recorded across 69 coral sites (Supplementary Fig. 2). Live scleractinians were the most frequently observed taxa, followed by the Octocorallia group, representing 98% and 90% occurrence, respectively, at the studied coral sites. Antipatharians and Anthoathecata were the least frequently observed taxa (found at 47% and 42% of coral sites, respectively).

The highest richness values were recorded for the taxa Octocorallia, Antipatharia, Scleractinia, and Anthoathecata, in that order. A total of 42 taxa belonging to eight families of Octocorallia were recorded. Although Octocorallia were found in all areas of the study site, among the most common taxa, the species Paramuricea sp. 1, Narella sp. 1, Pseudoanthomastus sp. 1, Trachythela rudis sp. 1, and Keratoisididae sp. 3 were more frequent in areas C, D, and E, and absent in A (Supplementary Data 1). Notably, two Octocorallia taxa, Narella sp. 1 and Paragorgia sp. 2, appear to be new to science (publication intended shortly).

Regarding scleractinia, three families (Caryophylliidae, Dendrophylliidae, and Oculinidae) and seven taxa were recorded (Supplementary Data 1). Among them, Solenosmilia variabilis and Desmophyllum pertusum were the most frequent, representing 85% and 51%, respectively. However, D. pertusum was more common in areas A and B, while S. variabilis occurred exclusively in areas C, D, and E (Supplementary Data 1). Eleven antipatharians from the Leiopathidae and Schizopathidae families were found, with Bathypathes sp. 1 being the most frequent (33%) among black corals. Lastly, the Anthoathecata order had the lowest taxonomic richness, with Stylasteridae sp. 1 (35%) being the most frequent (Supplementary Data 1).

In terms of distribution range, the majority of DSC (60%) were limited to the middle slope (785–963 m depth), whereas only 7.3% of the species occurred exclusively on the upper slope (241–502 m depth) (Supplementary Data 1). Among the groups, most Octocorallian species were exclusively found on the mid-slope (30 species or 71%). Scleractinian species had a broad distribution along upper and middle slope regions (4 species or 57%), except for S. variabilis and Enallopsammia rostrata, which were restricted to the middle slope, and Pourtalosmilia sp. 1 on the upper slope (Supplementary Data 1). Antipatharians were also well-represented across all depth strata, while Anthoathecata species were more restricted to the deeper regions (middle slope) (Supplementary Data 1).

Spatial distribution

Vertical distribution - upper and middle slope

Deep-sea coral assemblage frequency (PERMANOVA: Pseudo-F = 64.419, DF = 1, p = 0.001) and richness (PERMANOVA: Pseudo-F = 7.345, DF = 1, p = 0.008) showed significant differences between assemblages on the upper slope (241–494 m) and middle slope (827–939 m depth strata) (Supplementary Table 2).

The DSC frequency SOM training resulted in twenty-five cells arranged in a 6 × 4 rectangular grid (quantization error = 0.01; topographic error = 0.08). Two distinct assemblage groups were identified, indicating strong gradients in the depth distribution (Fig. 4a, b). The deep coral reef assemblage found on the upper slope, between 241 and 494 m depth, could be clearly distinguished in the cells on the right side of the neural map (cluster 2, gray) (Fig. 4a, b). The second group (cluster 1, light gray) is mainly composed of the assembly registered on the middle slope, between 800 and 1000 m, represented by cells on the left side of the neural map (Fig. 4a, b).

Fig. 4: Vertical distributions - Areas A, B and C.
figure 4

a Hierarchical clustering of the trained self-organizing map (SOM) by the Depth; (b) Distribution of sampling depths on the SOM, with grayscales zones corresponding to the indicators in the cluster diagram, and (c) Distribution patterns of the best match units taxa in the trained SOM according to the frequency of occurrence of DSC. Each map corresponds to one taxa; these can be superimposed onto the map representing the cluster distribution of depths to show at which depths they are more frequently observed.

According to BMU (Best Matching Units), among species of DSC that had the highest correlations with the vertical spatial distribution, D. pertusum, P. conferta, Leiopathes sp. 1 and Madrepora oculata had high occurrences on the upper slope (cluster 2; gray) (Fig. 4b, c). S. variabilis, Pseudoanthomastus sp 1, Narella sp. 1, Keratoisidinae sp. 7, Victogorgia sp. 1, Primnoidae sp. 2 and Enallopsammia rostrata were the main taxa accounting for similarities among middle slope assemblages (cluster 1; light gray) (Fig. 4a, b, c).

The Mantel tests indicated that geographical distances between deep-sea coral sites (areas A, B and C - stratified profile) were not correlated with the DSC assemblage frequency (r = −0.079, P = 0.885) and richness (r = −0.146, P = 0.958).

Horizontal distribution

Assemblage (PERMANOVA: Pseudo-F = 5.518, DF = 2, p < 0.001), richness (PERMANOVA: Pseudo-F = 23.526, DF = 2, p < 0.001), and density (PERMANOVA: Pseudo-F = 18.426, DF = 2, p < 0.001) of DSC differed significantly at area C (Fig. 5a). The average richness and density were significantly higher at areas D and E, and lowest in C (Fig. 5a). Octocorallia and Scleractinia also had significantly higher richness and density at areas D and E (Fig. 5a and Supplementary Table 3).

Fig. 5: Horizontal distribution - Areas C, D and E.
figure 5

a Individuals m-1 and richness of deep-sea corals, Octocorallia, Scleractinia, Antipatharia, and Anthoathecata respectively at each coral site. The boxes show the 25–75% quartile, while circles indicate outliers and whiskers non-outliers. The central line represents the median and rectangular the mean; (b) Hierarchical clustering of the trained self-organizing map (SOM) by the coral sites; (c) Distribution of sampling coral sites on the SOM, with grayscale zones corresponding to the indicators in the cluster diagram; and (d) Distribution patterns of taxa in the trained SOM according to the DSC density. Each map corresponds to one taxa; these can be superimposed onto the map representing the cluster distribution of coral sites, showing at which area they are more frequently observed.

The DSC abundance SOM training resulted in forty cells arranged in a 6 × 6 rectangular grid (quantization error = 0.01; topographic error = 0.2) and pointed to two distinct groups, indicating different DSC assemblages between coral sites (Fig. 5b, c). Assemblage structure at area C at the bottom of the spatial map (cluster 1; light gray) clearly differed from areas D and E, at the middle and top of the map (cluster 2; gray) (Fig. 5b, c).

Of the species with the highest correlations with the horizontal spatial distribution pattern, according to the BMU (Best Matching Units), S. variabilis, Paramuricea sp. 1, Narella sp. 1, Desmophyllum sp. 1, Trachythela rudis, Stolonifera sp. 1 and Errina gracilis, occurred almost exclusively in areas D and E, with their maximum abundance at the middle and upper of the map (cluster 2; gray) (Fig. 5d). In contrast, taxa such as D. pertusum, Pseudoanthomastus sp. 1, Keratoisididae sp. 3 and Keratoisis sp. 7 occur throughout the studied area (Fig. 5d), while Priminoidae 1 has a higher abundance in area C, seen on the bottom side of the map (cluster 1; light gray) (Fig. 5d).

The mantel test indicated that geographical distances between coral sites (ares C, D and E) were not correlated with the DSC assemblage density (individuals m−1) (r = 0.004, P = 0.577) and richness (r = 0.085, P = 0.052).

Relationship between environmental variables and coral assemblages

The principal coordinates analysis (PCO) indicated the relative importance of water masses (physical-chemical environmental parameters) to the distribution and abundance of taxa, with axis 1 explaining 23% and axis 2 explaining 13% of the total variance. The taxa D. pertusum, Caryophylliidae sp. 1, Pourtalosmilia sp. 1 and Madrepora oculata were more associated with SAWC (Fig. 6). In contrast, most DSC taxa (Desmophyllum sp. 1, Paramuricea sp. 1, Narella sp. 1, Trachytella rudis, Keratoisididae sp. 7, Solenosmilis variabilis, Pseudoanthomastus sp. 1, Stolonifera sp. 1 and Enallopsammia rostrata) were correlated with AAIW (Fig. 6).

Fig. 6: PCO plots of deep-sea coral taxa and water mass associations.
figure 6

Principal coordinates analysis (PCO) plots based on deep-sea coral taxa (Spearman correlation coefficient (|rSP|) ≥0.2) and its association with South Atlantic Central Water (SACW) and Antarctic Intermediate Water (AAIW). Labels: □ Upper slope; Middle slope.

GAM models indicated that temperature and salinity were significant predictors of DSC richness, explaining approximately 30% of the variation (Table 1). Results indicated that DSC taxa showed an optimal range of occurrence from 4 to 6 °C, and salinity values between 34.4 and 34.6 (Fig. 7).

Table 1 Results of the GAM fitted model for deep-sea coral richness and environmental variables.
Fig. 7: Relationships between deep-sea corals and environmental variables.
figure 7

GAM analysis illustrating significant relationships of deep-sea coral richness and (a) temperature (°C), (b) salinity and (c) density seawater. Lines represent the estimates of the best models; dashed lines are confidence intervals (95%); circles indicate data sampling points.

Discussion

This study unveiled previously undescribed deep-sea coral sites and taxonomic richness in the northern Santos Basin, Brazil, at depths ranging from 241 to 963 m and latitudes between approximately 23 °S and 24 °S. Dense deep-sea coral reef sites were observed on the upper slope coral sites under the influence of the South Atlantic Central Water (SACW), while the carbonate coral mounds sites were found predominantly on the middle slope under the influence of the Antarctic Intermediate Water (AAIW). The vertical and horizontal geographic distances between deep coral sites (measured as linear distance) showed no correlation with the assemblage structure and richness. However, even for the sites located at the same local scale, coral assemblage structure and richness varied according to depth and also among coral carbonate mounds at similar depths.

Despite individual colonies of the framework-forming Desmophyllum pertusum occurring in all sampled areas and at 51% of the total studied coral sites, this species only built dense reef-like structures on the upper slope under influence of SACW. It is known that seawater temperature can determine the distribution of DSC16, taking into account the possibility of local species adaptation17. While populations in the temperate North Atlantic are known to be affected by temperatures above 15 °C18, in the tropical South Atlantic, D. pertusum has developed under a temperature variation of 12.38 to 16.61 °C, based on the last 50 years of available environmental data.

Considering that the dense D. pertusum reef structures on the upper slope of Santos Basin developed within their natural thermal range17, and taking into account variation in pre-existing metabolic stores or underlying genetic variation19, the growth of these colonies was likely primarily controlled by food availability. This can be determined by a combination of surface productivity and hydrodynamic processes20 that can complement food sources around this area. Examples of active hydrodynamic processes in the region are the poleward-flowing Brazil Current, and the equatorward Intermediate Western Boundary Current, which respectively influence the top 500 m of the water column, and depths below 500 m21.

Is it known that reef habitats are the beginning of coral mound formation22. The studied upper slope coral site registered at 494 m depth is a coral carbonate mound with D. pertusum colonies building a relatively large reef-like structure on the top (Fig. 4). Nevertheless, the main D. pertusum colonial growth forms (morphotypes) in area B look like columnar structures, differing from the “bush-shaped” colonies in area A23. It suggests a stronger influence of bottom current velocities and water mass characteristics, since this particular site is registered in a transition density range between SACW and AAIW. The columnar architecture represent a growth strategy in response to low flow speeds and/or low food concentrations23, as opposed to bushy forms, that represent a growth strategy in response to a high energy local environment23.

Water masses have diverse environmental characteristics, such as distinctive currents, temperature, salinity, nutrients and pH. As depth changes, distinct hydrodynamic regimes and sedimentary processes act on sites24,25. On the middle slope, the framework-forming Solenosmilia variabilis was the main live colonial taxa on the surface of the coral carbonate mounds, with no records for upper slope sites. This species is widely distributed along the Brazilian coast26. The nearest reports that also had a high frequency of occurrence of S. variabilis are from the adjacent Campos Basin, with the majority of records between 700 and 1200 m depth11.

S. variabilis are recorded under a temperature variation of 3.79–4.99 °C and salinity variation of 34.49–35.52, corroborating the temperature and salinity in other records and in global predictive models8,27. Depth, salinity and particulate organic carbon (POC) were the main factors predicting the distribution of S. variabilis off the Brazilian margin15.

The characteristics of AAIW correspond to the zone with the highest taxonomic richness in the present study. Octocorallia is the group responsible for the highest taxonomic richness in the region, accounting for approximately 66% of the registered taxa (n = 42). This is not unexpected, as they are known to harbor the highest diversity among coral taxa, with over 3000 described species worldwide28. Thirty-two percent of the Octocorallia taxa had records from a coral site under SACW influence, at ~9.56 °C (Area B). However, most of this group is registered on the middle slope, at ~4–5 °C.

Although the studied deeper coral sites are under the influence of the same water mass (AAIW), DSC assemblages, abundance, and richness varied among these areas, with a significant increase from C to E. Species such as S. variabilis, Paramuricea sp. 1, Narella sp. 1, Desmophyllum sp. 1, Trachythela rudis, Stolonifera sp. 1 and Errina gracilis, occurred almost exclusively in areas D and E. At the northern end of the basin, areas D and E are the closest sites to two physical mechanisms that occur along Cabo Frio, quasi-seasonal coastal upwelling and eddy-induced upwelling29,30. The penetration and surfacing of SACW in the inner shelf region influence ocean productivity30, where mass and energy are transferred through trophic webs31. High abundances of marine snow are commonly observed in benthic systems of the Cabo Frio upwelling region32. Thus, high food availability may explain the highest deep-sea coral richness and abundance in the vicinity.

Although upwelling phenomena are important in the structuring of biological communities, further studies will help elucidate some of these deep-sea coral pattern differences among coral carbonate mounds. The lack of correlation found in the mantel test with geographic distance may be due to the analyzed areas being on the same spatial scale (local scale). Thus, distances are not sufficient to cause differences in the assemblages. The coral mounds located in area C are linked to features identified as “pockmarks”33, which can modify the characteristics of the sediment34, change the current flow35, and harbor diverse and highly specialized fauna36,37. However, it was observed that the richness of DSC is lower in area C, as illustrated in Fig. 7. Therefore, additional investigations are necessary, including the determination of the age/time of formation of the extinct carbonate mounds, the presence of erosive grooves associated with the action of bottom currents, and other relevant features.

The Brazilian slope was identified as a favorable region for deep-sea coral habitats15, and the number of recorded taxa in the SW Atlantic has been increasing in recently explored areas11,24,25,26,34,38,39,40,41. However, access to deep-sea science data in unexplored regions is extremely limited, primarily due to high costs42. Therefore, industry-science partnerships have provided valuable databases to enhance and expedite scientific discovery in these areas43,44.

The southeastern slope of Brazil has accumulated a substantial amount of environmental and biological data, collected using a diverse array of underwater technologies by the PETROBRAS company. This deep-sea coral study conducted with an ROV in the Santos Basin, which identified morphospecies, is expected to improve the assessment of deep-sea coral richness in the area. ROV data is an underutilized scientific resource with substantial intrinsic value and global coverage42, and its benefits outweigh potential limitations associated with industry-provided data analysis.

Despite the concentration of ROV exploration efforts in areas of industry interest and the lack of standardized methodology in offshore operations, which may compromise the analysis of qualitative and quantitative data, the provided database has enabled the present study and is crucial for establishing basic knowledge about DSC in Brazil. DSC taxonomy is complex, and morphological descriptions sometimes lack sufficient clarity, which could result in biodiversity underestimation. Video footage resolution may also limit DSC identification. The image quality obtained from an ROV, particularly older models, can sometimes preclude accurate species-level identification45. Therefore, it is likely that the known inventory will grow with increased sampling effort as well as with the use of molecular tools, essential for assisting classical taxonomy46.

Deep coral-reef systems are clearly complex, where a number of overlapping ecological and physical processes may simultaneously influence DSC assemblage structure. The results of the present work make an important contribution to understanding the distribution and ecology of DSC on the continental slope. DSC distribution and composition were defined in previously unexplored regions in the northern Santos basin in the Southwestern Atlantic. The most frequently occurring stony coral was Solenosmilia variabilis. D. pertusum primarily inhabited areas with the presence of SACW whereas S. variabilis occurred predominantly at depths with the presence of AAIW. The differences in DSC abundance and richness occurred among studied sites with the influence of the temperature and salinity. However, as with all models that seek to explain the structure of assemblages, a substantial proportion of the variation was not explained, indicating the influence of other factors. Here, we propose that water mass may be an important predictor for determining DSC distribution patterns of the most dominant reef-forming species.

Methods

Study area

The study region, Santos Basin off the coast of Brazil, is located in the southwest (SW) Atlantic Ocean, with an area ~350,000 km247. Data were collected from the upper (241–251 m depth, area A; 494 m depth, area B) and middle slope (827–945 m depth, area C; 815–963 m depth, area D; 785–876 m, area E;) in the northern Santos Basin (Fig. 1), where the isobaths are narrow and the shelf break occurs between 120 and 180 m depth48.

A regional bathymetric map derived from a mosaic of single beam echosounder data from the Brazilian Navy, nautical charts, depth data from the Brazilian Continental Shelf Survey Plan, 3D seismic data (especially of the continental slope and São Paulo Plateau) and multibeam surveys, which was provided by PCR-BS (Projeto de Caracterização Regional da Bacia de Santos)49, illustrates that the study areas are on the upper and middle slope, between 241 and 963 m depth (Fig. 8). A slope gradient map derived from a bathymetric grid highlights the seafloor morphology around the studied areas. The areas are located on a smooth slope, with gradients varying from 0.65 to 2 ° (Fig. 8).

Fig. 8: Study area map.
figure 8

Map of the study areas (A; B; C; D; E) showing the location of the northern Santos Basin in southeastern Brazil; SW Atlantic.

The water column in the studied region (up to 1000 m depth) is composed of three water masses: Tropical Water (TW; 0–150 m), South Atlantic Central Water (SACW; 150–500 m) and Antarctic Intermediate Water (AAIW; 500–1000 m)14,50. TW is oligotrophic, warm (>20 °C) and saline (>36)14,50, within which phytoplankton blooms occur that supply food to the seafloor. Below the TW is the SACW, pycnocline water formed by subsidence at the Subtropical Convergence50, with temperatures varying from 6 to 18 °C and salinities from 34.6 to 36, tending to dominate the mid to outer-shelf. Underneath it, the AAIW is characterized by a water column salinity minimum and being oxygen-rich5,14.

Water mass characteristics

To identify the water properties (such as temperature and salinity) and its variability around the deep coralline studied areas, a collection of CTD casts over the continental shelf and slope was used. It consists of measurements made by historical oceanographic surveys and argo floats, spanning from November 1972 to March 2022. Selected hydrographic databases were obtained from NOAA’s National Oceanographic Data Center, the Brazilian Navy, Argo program and from Instituto Oceanográfico at the University of São Paulo. The collection and organization are part of efforts of the PCR-BS Project (Santos Basin Environmental Characterization). All casts were binned to a vertical grid with a 1 m resolution.

Water mass identification was based on neutral density ranges (Supplementary Table 4) identified by ref. 51. Neutral density52 was computed using the original ‘gamma-n’ fortran package (v3.1) wrapped in python. The practical salinity scale and in situ temperature were used to calculate neutral density.

Data collection

Prior to ROV imaging, data from seafloor geophysical surveys previously carried out by Petrobras were analyzed to increase the chances of finding DSC sites (the same approach was applied in the adjacent Basin, see ref. 11). These data were collected using a multibeam echosounder, indicating a large number of acoustically strongly-reflective targets between 200 and 1000 m depth. These targets typically correspond to hard substrata, thus indicating sites of interest for further exploration by ROV11.

Selecting some sites with strongly-reflective targets (identified as coral sites), a deep-sea coral image database was generated using ROV videos from five areas, during seven research cruises (Fig.8). Videos were recorded within the study sites between 2009 and 2012, resulting in approximately 179 h of video at depths ranging from 241 to 963 m (Supplementary Table 1).

The videos were collected by ROV survey ships contracted by Petrobras through the SENSIMAR Project (Sensitive Marine Environments of Southeast Brazil) to perform environmental surveys as well as routine oil and gas industry inspections. They were equipped with multiple video cameras that varied in resolution. Laser scales were not used and high definition video was available only in more recent operations. The ROV traversed slowly, close to the bottom. Occasionally the ROV paths were interrupted for sampling and detailed camera zooms. Images were obtained from the continuously recorded video for each coral site.

Image data processing and analysis

Prior to image analysis, one still “.JPG” frame was generated for every five seconds of video recording using ffmpeg software (version 4.2.2) at each coral site, and each live coral colony in the field of view was identified from its closest-range images. Close range images facilitated species recognition by allowing clear visualization of small colonies or other coral species. Low-quality images (e.g., unfocused, with high turbidity, or low illumination, among others) were discarded.

Because identification to the lowest taxonomic level was not always possible through images, distinguishing morphological characteristics were used to assign putative species or “morphospecies”53. Species level identification was performed according to gross morphology, branching pattern, polyp size, and type53,54,55,56,57,58.

Data pre-treatment

Camera arrangement, camera type, and logging methods differed between surveys. Therefore, to test the null hypothesis that DSC assemblage composition did not differ among years, an Analysis of Similarity (ANOSIM) was carried out using PRIMER v6 with the PERMANOVA + software package. Since no difference was found (Global R = 0.061, p = 0.417), the different years were grouped into a single data set for further analyses. This improved the power of the tests related to the study of spatial distribution.

Deep-sea corals assemblage characterization

The sampling effort was evaluated using a species accumulation curve, with a non-parametric bootstrap59, for three different situations: a) the studied coral sites in the Santos Basin, b) the studied coral sites on the upper slope (areas A and B) and middle slope (areas C, D, and E); and c) coral sites located in each studied area (A, B, C, D, and E). This method assumes that all species occur randomly, without taking species abundance into account. The index estimates were obtained through the analytical equation of ref. 60 using the EstimateS software61.

Deep-sea coral assemblages were characterized by the frequency of occurrence (FO), which represents the percentage of each taxon´s occurrence compared to the total number of coral sites analyzed in each of the areas A, B, C, D, and E (Supplementary Table 1). Abundance, on the other hand, refers to the actual number of individuals of a specific species present in a given area or community. In this study, density of live DSC, which is the abundance per meter of track, was calculated solely for coral sites in areas C, D and E. Richness and density estimates were based on DSC from the coral site ROV video record in relation to the total track of each site (individuals/richness per meter of track). Species richness was recorded for each coral site in each area: A, B, C, D, and E.

Spatial distributions

A permutational multivariate analysis of variance (PERMANOVA)62 was performed twice, examining both vertical and horizontal distribution of the coral species. When overall significant differences were detected, pairwise tests with Bonferroni correction for multiple tests (sites C, D and E) were performed using the ‘pairwise.adonis’ function in the R Package ‘pairwiseAdonis’63.

An unsupervised machine learning technique, a self-organizing map (SOM) training algorithm64,65, was then used to characterize DSC assemblage patterns for several depths (vertical distribution) and areas (horizontal distribution). The SOM was generated using the software “An Interactive Machine Learning App for Environmental Science”, an open-source application available on GitHub66.

The SOM was formed by two layers of artificial neurons, the input layer (nodes or neurons that are computational units) and the output layer (neurons arranged in a two-dimensional grid), both connected through weights65. In SOM analysis, each sample is treated as an n-dimensional input vector defined by its variables67. The input vector is fed by the input layer into a neural network and each is connected to an output vector with a single weight vector68. The SOM output is an ordered two-dimensional map composed of nodes called neurons; each is connected to its neighbors by a neighborhood relationship that dictates the map’s structure.

The basic SOM training algorithm consists of three phases. In the first phase (competitive), the output neuron that presents the weight vector with the smallest distance from the input is considered the winner, also called BMU (Best Matching Unit). In the second phase (cooperative) the neighborhood of this neuron is defined. Finally, the last phase (adaptive) occurs when the weights of the code vectors of the winning neuron and its neighborhood are adjusted, making it possible to detect similarities between the samples.

In the present work, each sample was assigned to an output neuron that has a coefficient vector associated with the input data. The coefficient vector is referred to as a weight vector W between the input and output layers. The weights establish a connection between the input units (deep-sea coral species) and their associated output units (coral sites).

The algorithm can be described as follows: when an input vector × is presented to the SOM, in this case, frequency occurrence data of DSC species (first SOM testing depth differences) and the density (ind m−1) of each DSC species (second SOM testing spatial differences), the neurons in the output layer compete with each other, and the winner (whose weight is the minimum distance from the input vector) is chosen. The winner and its predefined neighbors in the algorithm update their weight vectors according to the SOM learning rules as follows:

$${m}_{{ij}}(t+1)={w}_{{ij}}+\alpha \, (t).{h}_{{jc}}(t)\, [{{{{{\boldsymbol{x}}}}}}\,{{{{{\rm{i}}}}}}\, \left(t\right)-{m}_{{ij}}(t)]$$

where mij (t) represents the weight between a node i in the input layer and a node j in the output layer at iteration time t. The learning rate factor α (t) is a decreasing function of iteration time t. The neighborhood function h jc (t) is a smoothing kernel defined over the points of the lattice that determines the size of the neighborhood of the winning node (c) to be updated during the learning process. This learning process continues until all the weight vectors have completed their interactions.

In the present work, as neighborhood functions between neurons were tested in the bubble function and the iterative training process, we chose to use 500 iterations. After training, the WardD2 linkage method was applied to a hierarchical cluster analysis (HCA) to highlight sets of real objects on the map69. The “elbow method (or knee method)” was used to determine the ideal number of clusters70. Therefore, the final map will show similar data samples close to each other (groups) and dissimilar data samples far from each other. As error quantification criteria, the average quantization error (QE) and the topological error (TE) are calculated. The average quantization error is the average distance between each data vector and its ‘best matching unit’ (BMU), measuring the resolution of the map. For the Kohonen map grid, the topological error indicates the accuracy of mapping in terms of preserving the topology65.

A Mantel test (Pearson’s correlation) with 5000 permutations71, was used to test the hypothesis that assemblage structure and richness of DSC are correlated to vertical and horizontal geographic distances between DSC sites (linear distance - meters). The analyses were performed in the R statistical environment (The R Development Core Team, 2009) using functions from the vegan package72.

Vertical distribution - upper and middle slope

In the statistical analyses of vertical distributions, areas A, B and C were used to assess the differences between coral community assemblies at different depths, eliminating areas D and E to reduce possible interference from additional factors in more geographically distant areas. In addition, to avoid underestimation errors, species frequency data was used (instead of density data - individuals.m−1), because areas A and B contain dense reefs of Desmophyllum pertusum (formerly Lophelia pertusa) that made it impossible to count live coral colonies, obscuring the density determination.

In the first PERMANOVA analysis, the depth data were divided into upper slope and middle slope to facilitate sampling design analysis and the visualization of DSC variation along the vertical gradient. Thus, two depth strata (241–494 m - upper slope, and 827–945 m - middle slope) were the prediction factors for the DSC assemblage frequency occurrence (log10(x + 1) transformed data – Jaccard similarity matrices) and species richness (log10(x + 1) transformed data – Euclidean distance matrices).

In the first training process, the SOM used an input layer comprising 47 nodes (one by taxa) connected to the sampling datasets of 2 depth strata. Prior to analysis, the species frequency was log10(x + 1) transformed for the SOM training run. The subsequent learning phase determined the ‘best matching unit’ based on the Euclidean distance measure. Further ordering and tuning phases were repeated to adjust the neighboring units.

In the Mantel test, to evaluate the effects of geographic distances at vertical strata in the DSC structure, we used two biological matrices. The first was the DSC assemblage frequency (Jaccard similarity measure) and the second was based on the richness of DSC (Euclidean similarity measure). The spatial matrix was the linear geographic distance (distance similarity measure) between the centroid of areas A, B and C.

Horizontal distribution - middle slope

In the second PERMANOVA analysis, we tested the hypothesis that the DSC assemblages varied with areas. Areas C, D, and E (all coral mound sites within the same depth strata) were used as predictors for assemblage structure. The dependent variables were DSC group and assemblage (log10(x + 1) transformed density data – Bray Curtis similarity matrices), species richness, and density of live DSC per meter of each studied coral mound (log10(x + 1) transformed data – Euclidean distance matrices). To avoid underestimation errors, coral abundance data from areas A and B were removed from this particular analysis as these areas contained dense D. pertusum reefs that obscured determination of density (see Fig. 3).

The second training process for the SOM used an input layer including 55 nodes (one by taxa) connected to the sampling datasets from three areas. Prior to analysis, the species density data were log10(x + 1) transformed for the SOM training run. The subsequent learning phase determined the ‘best matching unit’ based on the Bray-Curtis index.

To evaluate the effects of geographic distances at horizontal distribution of the DSC structure, we used two biological matrices. The first was the DSC assemblage density (Bray-Curtis similarity measure) and the second a richness of DSC (Euclidean similarity measure). The spatial matrix was a linear geographic distance (distance similarity measure) between the centroid of areas C, D and E.

Relationship between environmental variables and coral assemblages

To summarize multivariate patterns in DSC assemblages across different temperatures, salinities and density of seawater, PCO was performed. Displayed species showed a Spearman correlation coefficient (|rSP|) ≥0.2 with one or more axes. All multivariate analyses were performed using PRIMER v6 with the PERMANOVA + software package. Generalized additive models (GAM) were used to evaluate the influence of environmental variables on DSC richness, due to the expectancy of the non-linear response of species and predictor variables75. A GAM with a Gamma distribution and the log-link function was used for the modeling. The Gamma distribution was chosen after exploring several alternative distributions (e.g., Gaussian, Poisson, and negative binomial). This distribution is appropriate for continuous, positive, and asymmetric data. Model selection was performed using Akaike information criterion (AIC). The relative importance of each predictor and complete model were evaluated by the R2 statistic.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.