Introduction

Phytoplankton stabilise Earth’s climate by sequestering atmospheric carbon dioxide (CO2) via the Biological Carbon Pump (BCP), a suite of biological and physical processes that transport photosynthetically-fixed organic matter to the deep ocean1,2. Phytoplankton are also the gateway for carbon and energy to enter food-webs, and are fundamental to supporting highly-productive ecosystems, for example, as the primary food source for keystone species such as Antarctic krill3. Both the biogeochemical and trophic roles of phytoplankton are dependent on the taxonomic community composition4,5,6, and consequently, there is increasing focus on understanding how specific Southern Ocean phytoplankton groups will respond to future climate scenarios7,8,9,10,11. Advances in understanding the spatial variability in phytoplankton community composition over broad scales in the Southern Ocean are currently limited but urgently required.

The Southern Ocean is characterised by a sequence of environmentally distinct oceanographic zones (Fig. 1), distinguished by physio-chemical properties including temperature, salinity, and nutrients12,13,14. These different environmental conditions exert strong bottom-up control on phytoplankton biomass and diversity, which subsequently influence the efficiency of CO2 sequestration and ecosystem productivity15,16. The most prominent hydrodynamic regional feature is the Antarctic Polar Front (APF), a distinctive oceanographic boundary between Antarctic and Subantarctic waters12,13. The APF separates cold, fresh, macronutrient-rich Antarctic Surface Water (AASW), from warmer, saltier, and macronutrient rich but silicate-poor Subantarctic water (SAW)17, with the denser AASW subducting beneath the lighter SAW12. The APF is recognised as a “Biogeochemical Divide”18, characterised by discontinuity in the partial pressure of CO2 (pCO2) and silicate concentration between Antarctic and Subantarctic water19,20, reflecting differences in the contributions of biological and physical mechanisms to the control of export production and air-sea CO2 exchange18. These changes in physical and chemical properties also influence the biosphere, with high species level endemicity observed among many non-mammalian fauna either side of the APF21.

Fig. 1: The circumpolar distribution of pigment sampling locations.
figure 1

Pigment distributions across the six oceanographic zones (Subtropical Zone, Subantarctic Zone, Polar Frontal Zone, Permanently Open Ocean Zone, Seasonal Sea Ice Zone, and the Antarctic Shelf; (A) and sampling frequency in respective years (B) and months (C). Note that the year 2006 shows an anomalously high number of samples due to two large sampling campaigns (CORSACS49 and BROKE-West50).

These geospatial differences in biogeochemistry18 imply a corresponding divide in phytoplankton community composition and functionality, but this has not been demonstrated empirically. In this manuscript, we reveal consistent zonal differences in phytoplankton community structure over the circumpolar Southern Ocean, based on a large dataset of phytoplankton pigments samples (n = 14,824) amassed over 26 years from 58 oceanographic voyages between 32°S and the Antarctic coast (see Methods). We present analysis of this pigment dataset using the inversion method “phytoclass22, which determined the differences in the chlorophyll a (Chl a) biomass and proportions of seven phytoplankton groups mainly at Class level (diatoms, haptophytes, cryptophytes, dinoflagellates, green algae, pelagophytes, and Synechococcus) across the Southern Ocean. Further, we identify the distributions of four distinct community types, revealed by cluster analysis, and show their distribution relative to the APF. The analysis identifies a phytoplankton “Class Divide” that exists as the biological analogue to the Biogeochemical Divide.

Results and Discussion

Phytoclass analysis and data distributions

The phytoclass software provided robust quantification of the Chl a biomass for phytoplankton groups estimated from the pigment data. The median root mean square error associated with each run was 0.012 mg Chl a m−3 against a median biomass of 0.42 mg Chl a m−3 and ranged from <0.01 mg Chl a m−3 to 28 mg Chl a m−3 (a coastal station in the Western Antarctic Peninsula). There were strong relationships between phytoplankton pigment markers and the Chl a biomass for each respective group (Supplementary Table 1).

The HPLC data used in this study provided high-quality measurements of Chl a and phytoplankton accessory pigments concentration between the Subtropics and Antarctic Shelf from 1996 to 2022, spanning the duration of the Ocean Colour satellite record, but was highly biased to the summer months (Fig. 1). The majority (85%) of samples were collected in austral summer, primarily in January (56%). Therefore, we do not report on the seasonal dynamics of phytoplankton groups; although seasonal patterns and succession are important15,23,24, they vary regionally and are hence blurred somewhat by zonal averaging. The Antarctic Shelf region (hereafter referred as ‘Shelf’) was the most sampled, comprising 6,544 (44%) of total samples, largely due to sustained long-term research efforts in the Ross Sea and Western Antarctic Peninsula (WAP). Sample size typically decreased in northerly latitudes, with the Subtropical Zone (STZ) having the fewest samples (n = 470).

Zonal variation of phytoplankton biomass and community composition

Figure 2 summarises phytoplankton community biomass and taxonomic composition, displaying the mean total Chl a (TChl a) biomass and phytoplankton group proportion in each region and oceanographic zone. The mean circumpolar phytoplankton biomass across all oceanographic zones was 0.89 mg Chl a m−3, with the highest mean biomass of 1.5 mg Chl a m−3 in the Shelf, 3 times greater than the Seasonal Sea Ice Zone (SSIZ) and Permanently Open Ocean Zone (POOZ) (0.4 and 0.5 mg Chl a m−3, respectively; Fig. 2). North of the APF, TChl a in the Polar Frontal Zone (PFZ), Subantarctic Zone (SAZ) and STZ ranged between 0.4 to 0.6 mg Chl a m−3. Averaging across all zones, diatoms were the most prevalent group with a mean biomass of 0.4 mg Chl a m−3 and mean proportion of 42% of TChl a, with haptophytes being the second-most abundant group with a mean biomass of 0.3 mg Chl a m−3 and 33% relative contribution (Supplementary Table. 3). Cryptophytes were the only other group to make a notable contribution (>10% of TChl a) to community biomass, although this was only in the Shelf (Supplementary Table. 2).

Fig. 2: Mean proportion of the main phytoplankton groups by oceanographic zone (rows) and region (columns).
figure 2

The horizontal black line represents the Antarctic Polar Front (APF), where the diatom-dominated circumpolar community composition shifts to haptophyte dominance. The zones include the seasonal sea ice zone (SSIZ), the Permanently Open Ocean Zone (POOZ), the Polar Frontal Zone (PFZ); the Subantarctic Zone (SAZ) and the Subtropical Zone (STZ). The colour of the wedges indicates the phytoplankton group, and the size of the pie reflects the mean Chl a biomass, as identified in the keys on the right of the figure. Sample size is indicated by the number to the bottom right of each pie and “Insufficient data” indicates where sample size is less than 10.

A clear feature of Fig. 2 is the shift in phytoplankton community composition at Class level at the APF, emphasising its role as a major boundary between diatom-dominated Antarctic zones and haptophyte-dominated sub-Antarctic zones. Although the mean TChl a biomass was similar either side of the APF, the community composition differed greatly (Fig. 2), with haptophytes dominating all regions of the PFZ whereas diatoms dominated all regions in the POOZ. Furthermore, with few exceptions, the proportions of phytoplankton groups/classes in communities south of the APF were remarkably consistent, even though the total biomasses differed, whereas the relative compositions of communities north of the APF were more varied.

With respect to zonal variability, mean diatom biomass south of the APF was greatest in the Shelf region (0.7 mg Chl a m−3), relative to the SSIZ and POOZ at 0.2 and 0.3 mg Chl a m−3, respectively. Circumpolar dominance of diatoms was a prevalent feature of all zones south of the APF, with a mean proportion of 46%, whereas the proportion of haptophytes was lower at 32%. The only exception to this pattern was in the Pacific region where haptophytes dominated with a mean biomass of 0.75 mg Chl a m−3, relative to 0.5 mg Chl a m−3 for diatoms, reflecting the large proportion of Phaeocystis spp. in the Ross Sea25 (supplementary Table. 3). The mean proportion of flagellated groups - green algae (6%), cryptophytes (6%), and pelagophytes (5%) - was also consistent across all zones south of the APF, with slightly elevated proportion of cryptophytes (8.5%) and dinoflagellates (5%) in the Shelf region. Although community composition was consistent between zones when averaged across time and space, biomass varied greatly, signalling that environmental factors that determine community composition and biomass likely differ.

Phytoplankton community composition and biomass were also consistent across the zones to the north of the APF with respect to haptophyte dominance over diatoms, but differences emerged at the circumpolar scale, with greater variance between regions compared with south of the APF (Fig. 2). Haptophytes typically showed the highest proportion north of the APF, with a mean contribution to TChl a of 40%, reflecting the dominance of coccolithophores (e.g., E. huxleyi)26 that have a lower iron (Fe) requirement19, whereas diatom growth is limited by silicate and iron27. The haptophyte dominance north of the APF is also consistent with the location of the Great Calcite Belt in which sub-Antarctic waters are characterised by the elevated particulate inorganic carbon and calcite synthesis of coccolithophorid haptophytes28,29. Conversely, south of the APF the haptophytes are primarily Phaeocystis antarctica, a species which exists in both solitary and colonial forms, that has greater Fe requirement than coccolithophores30,31. In addition, green algae were an important component north of the APF, with a greater mean proportion than diatoms, and were occasionally dominant in regions of the STZ and SAZ, where they reached a maximum of 24%. Synechococcus were more prevalent north of the APF with the highest biomass (0.04 mg Chl a m−3) in the warmer STZ accounting for 10% of TChl a.

The 26 years of pigment data revealed the presence of a north-south Class Divide at the APF between haptophyte dominance and higher green algae biomass to the north, and diatom dominance to the south. Regionally constrained Southern Ocean pigment studies on phytoplankton communities typically show agreement with this study. For example, over a latitudinal transect between the Antarctic and Equatorial Pacific, DiTullio et al.32 showed negligible diatom biomass between the APF and STF32, concurrent with diminished silicate concentrations; pelagophyte and haptophyte biomass on the other hand were elevated. Pigment-based studies, such as Schlüter et al.33, have also confirmed higher biomass of coccolithophores (such as E. huxleyi) type haptophytes in the Subantarctic33. In higher latitudes, other studies have also observed haptophyte dominance in the Ross Sea25,34,35 and higher proportions of cryptophytes in the WAP have previously been identified through pigment analysis36. Although our dataset indicates haptophyte (Phaeocystis Spp.) dominance in the Ross Sea, this region regularly harbours large diatom blooms, especially around Terra Nova Bay34,37, which is also reflected in our dataset. Although there are clear similarities between this study and others, we acknowledge that any seasonal succession such as the transition of haptophytes to diatoms from spring to summer15, will not be apparent in this analysis.

Cluster analysis identifies regional community distributions

Cluster analysis of the proportion of phytoplankton groups in each sample revealed four different community types (supplementary Table. 4) – a haptophyte-dominated community (cluster-H), a mixed community co-dominated by diatoms and haptophytes, with greater proportions of flagellates (cluster-M), a diatom-dominated community (cluster-D), and a community primarily comprised of haptophytes and green algae (cluster-HG). Clusters were derived using hierarchical clustering techniques on the proportions of phytoplankton classes and the DynamicTreeCut package38 was used to prune dendrogram branches (see Methods). Despite no a priori information on the geographical co-ordinates in the analysis, clear spatial patterns emerged either side of the APF (Fig. 3). Clusters-H, M and D were predominantly located south of the APF, with only a few samples to the north (Supplementary Table. 4), whereas cluster-HG only contained samples from north of the APF. In zones south of the APF, there did not appear to be any association between specific clusters and geographical location (Fig. 3), apart from clear regional differences between the Ross Sea and the WAP. In the Ross Sea, there was higher density of cluster-H samples near the Ross Ice Shelf while cluster-D were located mainly around Terra Nova Bay; conversely there was a higher density of cluster-M and cluster-D in the WAP but no clear spatial patterns in their relative distributions.

Fig. 3: Circumpolar distribution of phytoplankton community types derived from cluster analysis.
figure 3

A represents cluster distributions on a circumpolar scale, B is within the Western Antarctic Peninsula and C is in the Ross Sea. Cluster-H represents haptophyte domination, Cluster-M represents a mixed community, Cluster-D represents diatom domination, and Cluster-HG represents co-domination by haptophytes and green algae. Some samples have been spatially displaced to limit overlap of samples taken at a similar location. The Antarctic Polar Front is depicted by the bold line.

All clusters reflect a mix of phytoplankton groups, and as such, are not directly comparable to the proportion of phytoplankton groups displayed in Fig. 2. Yet the Class Divide between characteristic phytoplankton communities at the APF also separated cluster-HG north of the APF from the other clusters south of the APF. The overlapping distribution of clusters H, M and D south of the APF was consistent with the similarity in mean proportion of phytoplankton classes across zones (i.e., no cluster was found solely within a particular zone).

The identified Class Divide provides a biological analogue to the well-known discontinuity in environmental variables “the Biogeochemical Divide”18, demonstrating the role of phytoplankton community composition in nutrient cycling and fate of carbon either side of the APF. These findings also highlight the importance of physico-chemical factors that shape phytoplankton communities as differences in nutrient and environmental conditions are large at this divide18,19,20.

Many modelling studies and experiments have investigated how taxonomic groups will respond to a changing climate, and in turn how this will affect food-web dynamics and biogeochemical cycles2,39. Given the pressing need to understand spatial patterns in phytoplankton community structure, and how they relate to environmental parameters, the results presented here represent a substantial advance in characterising the distribution of phytoplankton groups throughout the Southern Ocean. Our results highlight the importance of incorporating phytoplankton community composition in ocean biogeochemical and earth system models. Recognition of the Class Divide and associated community consistency either side of the APF, and adoption of the cluster analysis to reduce the dimensionality of phytoplankton diversity, would simplify representation of phytoplankton groups in models and future projections of the Southern Ocean.

Methods

Geographical scope

Phytoplankton pigment data from 58 voyages between 1996 – 2022 were acquired from personal correspondence with scientists, publicly available repositories, and voyages associated with publications (Supplementary Table. 5).

To ensure consistency amongst major algal taxa, only pigment samples with all the following pigments were considered for analysis: Fucoxanthin (Fuco), 19’-Butanoyloxyfucoxanthin (But), 19’-Hexanoyloxyfucoxanthin (Hex), Chlorophyll b (Chl b), Zeaxanthin (Zea), Peridinin (Per), Prasinoxanthin (Pras), Alloxanthin (Allo), and Chlorophyll a (Chl a). This eliminated many of the pigment samples associated with the Palmer Long Term Ecological Research Centre, data prior to 1995, and much of the data in the MAREDAT database (Peloquin et al., 2013). Dependent upon availability, the following pigments were also considered: Neoxanthin (Neo), Lutein (Lut), Violaxanthin (Viol), Chlorophyll c2 monogalactosyldiacylglyceride ester Chl c2-MGDG [14:0/14:0] from Chrysochromulina polylepis, (MDGD-14), Chlorophyll c2 monogalactosyldiacylglyceride ester Chl c2-MGDG [18:4/14:0] from Emiliania huxleyi (MDGD-18). The pigment data then underwent quality control with data omitted if Chl a concentration was zero. Further to this, the distributions of pigments were inspected for outliers by examining frequency distributions; if present they were examined relative to other pigments at the sampling location and, if spurious, omitted from the dataset. Spatial co-ordinates and dates were also analysed to ensure format homogeneity.

Phytoclass inversion

Prior to inversion, the pigment samples were geospatially divided into the oceanographic zones listed above, using the analysis package “SF” in the programming language “R”40. As sea surface height (SSH) contours determined from remote sensing products do not correspond to the position of oceanographic fronts, there is currently little evidence of recent meridional frontal migration41, and so our zonal division of the samples was based on the static frontal boundaries defined by Orsi et al.12, which include the STF, SAF, and APF12. The American National Snow and Ice Data Centre (NSIDC) baseline median value of the maximum winter sea-ice extent between (1991 – 2020)42 was used to determine the boundary of the SSIZ. To further divide samples from the Antarctic continental shelf, we used the bathymetry data of Amblas et al., (2018) for the Antarctic shelf break43.

Prior to analysis, samples were hierarchically clustered based on their pigment to chlorophyll a ratios. First, pigment samples with zero concentration were imputed to 0.1% of their minimum concentrations to allow for the division by Chl a. Pigment ratios were then transformed using the Box-Cox method44. Manhattan distances were calculated between the standardised pigment ratios and then clustered using the Ward method45 to account for the high dimensionality of the pigment data, and to identify outliers. The dynamcTreeCuti R- package38 was used to determine the height at which to cut the dendrogram. This method allows for dynamic pruning of dendrogram branches based on their significance and has been proven effective at clustering potential outliers38. A minimum cluster size was set to 13 to account for sensitivity to sample sizes, as suggested by Hayward et al., 202322. The samples split into 720 clusters, which were inverted to phytoplankton groups using the phytoclass program22.

The phytoclass program is an inversion tool that uses the diagnostic marker pigments within phytoplankton groups to estimate their contribution to total chlorophyll a biomass. The method inverts mass concentrations of phytoplankton pigments into predefined phytoplankton groups and returns the biomass of each group in units of chlorophyll a concentration. This approach is similar to the widely used CHEMTAX method46, as both address the problem that diagnostic pigments may be shared between taxa with unknown ratios, however, phytoclass has higher accuracy22 and can be implemented on a large number of datasets. The phytoclass software was set with an iteration count of 1000 and a step of 0.006, increasing the probability of the algorithm finding the global minimum of the solution space.

The following phytoplankton groups were considered in the analysis: diatoms, haptophytes, cryptophytes, dinoflagellates, green algae, pelagophytes, and Synechococcus. To maintain a low condition number (i.e., ensure that the matrix inversion has a solution), and improve the accuracy of the matrix inversion, we opted for simplicity in the set-up of the phytoclass analysis, by omitting haptophyte subgroups. If a major diagnostic pigment was absent from a cluster, then the relevant phytoplankton group would be omitted from the analysis, as for example, with chlorophyll b and the green algae lineage. Due to the latitudinal dependence and thermal threshold of Synechococcus in the Southern Ocean47, and its potential mismatch with flavobacteria48, this group was omitted from analysis in the Shelf and SSIZ regions. As most pigment samples did not contain the ‘gyroxanthin’ pigment, dinoflagellate type-2 (e.g., Karenia species) were also omitted.

To ensure the program adequately fit phytoplankton groups to their respective pigments, the root mean square error (RMSE) was assessed for each cluster and pigment-to-chlorophyll a ratios were also inspected for outliers. If outliers were present, or at the border of their limits, as defined in the literature, the cluster was inspected and reanalysed with the removal of an outlying pigment. Regression and multiple linear regression analysis were then performed between phytoplankton groups and their respective pigment concentrations, to ensure no non-linear relationships between pigments and phytoplankton groups were present.

Cluster analysis

Following the inversion of pigments into respective biomasses for phytoplankton groups, samples were then divided by total chlorophyll a to determine the relative contribution of each group to community biomass. Phytoplankton proportions were Box-Cox transformed and then Euclidean distances between the relative biomass of phytoplankton groups were generated by applying the “DynamicCutTree” package. This method pruned the hierarchical dendrogram at sensible heights, with a minimum sample size of 1000 to ensure a manageable number of clusters38.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.