Introduction

Dissolved inorganic P concentration of many ocean gyres is very low (Karl, 2000; Wu et al., 2000), near or below the detection limits of analytical methods (Karl and Tien, 1992), and it was shown at times to be the nutrient-limiting productivity (Wu et al., 2000; Sañudo-Wilhelmy et al., 2001; Thingstad et al., 2005; Mather et al., 2008; Lomas et al., 2010). Dissolved organic P concentrations in the open ocean are approximately five times higher than those of dissolved inorganic P, varying between different oceanic regions and depths (for example, 0.22 μM in the upper 100 m in the ALOHA station (Karl and Björkman, 2002)). Phosphonates (Pns) are organic P compounds containing C–P bonds that comprise up to 25% of the high-molecular-weight dissolved organic P pool in the open ocean (Clark et al., 1998). In contrast to the more labile N–P, S–P and O–P linkages, the C–P linkage is resistant to photolysis, thermal decomposition, chemical hydrolysis and phosphatases (Kononova and Nesmeyanova, 2002; Quinn et al., 2007). The chemical stability of Pns may explain their use in membrane lipids and exopolysaccharides, which might be susceptible to degradation owing to their extracellular location (Quinn et al., 2007; White and Metcalf, 2007). Pns bioavailability was suggested to influence markedly bacterial primary production (Karl and Björkman, 2002; Gilbert et al., 2009; Thomas et al., 2010), possibly providing an evolutionary advantage to species able to utilize Pn. The gene encoding the Pn-binding protein (phnD) of the ABC-type Pn transporter is found in genomes of different marine bacteria and is used as one of several proxies for the ability to assimilate Pns in natural microbial assemblages (Dyhrman et al., 2006; Ilikchyan et al., 2009, 2010).

Although different phosphonate (Pn) utilization pathways exist (McGrath et al., 1995; Kononova and Nesmeyanova, 2002; Quinn et al., 2007; Martinez et al., 2010; Gomez-Garcia et al., 2011), the uptake mediated by the ABC-type transporter, Pn-binding-protein (PhnD) is shared across different microbial phyla. The phnD gene is found in numerous marine bacterial genomes, including genomes of the globally abundant marine bacteria Trichodesmium (Dyhrman et al., 2006), Synechococcus, Prochlorococcus and Pelagibacter (the SAR11 group). Enrichment in SAR11 Pn utilization genes or peptides was reported in the P-depleted eastern Mediterranean Sea (via metagenomics (Feingersch et al., 2010)) and the Sargasso Sea (via metaproteomics (Sowell et al., 2009) and metagenomics (Coleman and Chisholm, 2010)), whereas cyanobacterial phnD genes were shown to be expressed in P-deficient media and in the environment (Dyhrman et al., 2006; Ilikchyan et al., 2009, 2010; Beversdorf et al., 2010).

In this work, we studied the distribution of organic P-utilizing microbes in different marine systems using the phnD gene, a gene-proxy indicator for organic Pn utilization. Prochlorococcus genes were found to be the dominant phnD genes in the environments and were therefore expressed in Escherichia coli and their binding to different Pn molecules was monitored in order to find the corresponding Pn substrates.

Materials and methods

Abundance and classification of microbial P and Pn utilization genes in different Global Ocean Sampling (GOS) stations

The ‘GOS: all ORF peptides (P)’ database, downloaded from CAMERA (https://portal.camera.calit2.net/gridsphere/gridsphere), was first BLASTp queried with six PhnD and seven PstS sequences from representative taxonomic phyla (-p blastp -e 1e-5 –F F). The result was a list of sequences that were blasted further against the NCBI non-redundant database in order to determine if they are PhnD and PstS (-p blastp -e 1e-5 –F F). Hits with E-value <1 × 10−20 were categorized and classified according to the top high-scoring sequence pair of a query as determined by the NCBI taxonomic identifier. The PhnD (315 aa average) and PstS (352 aa average) hits were normalized for each GOS station as the stations vary in sequencing depth and sampling methods. In order to compensate for this variation, the average copy number of the genes was calculated as previously described (Beszteri et al., 2010) and can be see in Equation (1):

where is the average genome size, Im the gene length and the average copy number of the gene concerned.

The average genome size in every station was calculated with the method developed by Raes et al. (2007) (Equation (2)) using 35 COGs of single copy marker genes obtained from the STRING protein DB version. 9.0. (Szklarczyk et al., 2011).

Equation (2):

where Ls is the average read length of sample rm,s the count of reads annotated as marker m from sample s and Rs the total number of base pairs sequenced from sample s.

The pstS or phnD gene sequences from GOS that were similar to Psychroflexus torquis were classified as ‘other’, because the genome sequence of the P. torquis contains small fragments of unknown environmental DNA, as surmised from rDNA in unassembled reads (Howard et al., 2006).

PhnD protein expression

The two phnD genes from Prochlorococcus MIT9301 and the phnD gene from E. coli were synthesized and cloned in the expression vector pJexpress406 (T5 promoterA, Kanr, Ampr, Clmr, pUC originC) by DNA2.0 (Menlo Park, CA, USA) as His-tagged and signal sequence-less (Rizk et al., 2006) proteins, and the codon usage was optimized to fit E. coli usage. One Shot TOP10 E. coli cells (Invitrogen, Carlsbad, CA, USA) transformed with the expression vector carrying the appropriate PhnD inserts were plated on LB agar plates with ampicillin (100 μg ml−1), and grown overnight at 37 °C. Single colonies were suspended in 15 ml of LB with ampicillin (100 μg ml−1), grown overnight at 37 °C and used to inoculate TB medium (10 ml of overnight culture in 1 l) supplemented with ampicillin (50 μg ml−1). Cultures were grown at 210 r.p.m. and 37 °C until OD600=0.45–0.8. IPTG (0.1 mM) was then added and the cells were grown overnight at 210 r.p.m. and 18 °C for overnight expression.

Cells from 1 l cultures were harvested and resuspended in 45 ml binding buffer (30 mM imidazole, 0.5 M NaCl, 20 mM phosphate buffer, pH 7.4). The cells were disrupted by two passages through a French-Press at room temperature. Cell extract was centrifuged (30 500 g for 30 min at 4 °C), and supernatant was transferred to a new centrifuge tube. The soluble fraction containing the recombinant proteins was purified by Fast Protein Liquid Chromatography using the AKTA Explorer (Pharmacia, Uppsala, Sweden) system equipped with 5 ml His-trap column (HisTrap HP Columns, GE Healthcare, Uppsala, Sweden) and eluted with elution buffer (0.5 M imidazole, 0.5 M NaCl, 20 mM phosphate buffer, pH 7.4) according to the manufacturer's instructions.

Titrations of microcalorimetry measurements

Titrations of microcalorimetry measurements were performed with a VP-isothermal microcalorimetry (ITC) calorimeter (Microcal, Northampton, MA, USA). Purified PhnD protein solutions for ITC were dialyzed twice for 12 h, 5 ml against 3 lof buffer A (50 mM Tris HCl, pH 7.4, 500 mM NaCl and 0.02% NaN3). Pn solutions were prepared by dilution with buffer A. Aliquots (10 μl) of the Pn solutions at about 10 times the PhnD protein concentration were added by means of a 280-μl rotating stirrer-syringe to the reaction cell containing 1.41 ml of 0.1 μM of the different PhnD protein solutions. The heat of dilution was determined to be negligible in separate titrations of the Pn solutions into the buffer solution. Calorimetric data analysis was carried out using Origin 7.0 software (MicroCal).

Isothermal titration calorimetery between the PhnD proteins and the different Pn ligands can be described by the following direct relation shown in Equation (3):

where (qi) is the amount of heat released or observed proportional to the bound ligand, v is the volume of the reaction cell and ΔLi is the increase in the concentration of bound ligand after the ith injection.

The quantity ΔLi is the difference between the concentration of bound ligand in the ith and (i–1)th injections, and its functional form depends on the specific binding model. For the simplest case, in which the protein has one binding site, it becomes Equation (4):

where Ka is the binding constant and [L] is the concentration of free ligand (Leavitt and Freire, 2001).

Results and discussion

In order to account for the distribution and prevalence of organic P-utilizing microbes in different marine systems, we calculated the relative abundance of gene-proxy indicators for organic Pn utilization (the phnD gene) in the GOS project (Rusch et al., 2007), and compared it with inorganic P utilization (based on the pstS gene encoding for the ABC-type transporter phosphate-binding protein (Martiny et al., 2009)). GOS is currently the largest collection of metagenomic datasets that includes samples from the Atlantic, Pacific and Indian Oceans. Although different microbial groups show the potential for inorganic P utilization (Figure 1a), the BLAST results imply that the main factors in organic P utilization in surface waters are Prochlorococcus (Figure 1b in the Pacific and Indian Ocean stations and in one station in the Sargasso Sea) and to a lesser extent SAR11 bacteria (Figure 1b in several Atlantic Ocean stations).

Figure 1
figure 1

Relative abundance of microbial inorganic P and Pn utilization genes at different GOS stations. (a) The relative quantity of pstS genes of various microbial groups (rows) at each station (columns) is represented by the area of the corresponding spot (if any). (b) The relative quantity of phnD genes of various microbial groups. The figure was constructed using the ggplot2 package (Wickham, 2009) in R. See station locations overlaid on a P concentrations map in Figure S3.

Prochlorococcus cell abundance alone could not explain the high abundance of Prochlorococcus phnD genes, as SAR11s are the dominant bacteria in these GOS stations (SAR11 is eight times more abundant over the entire GOS sets compared with Prochlorococcus based on 16S gene recruitments (see Table 8 in Rusch et al., 2007)). One possible explanation is that our phylogenetic group assignments are based on homology searches and are therefore prone to errors such as wrong affiliations as a result of possible horizontal gene transfers. Another possible explanation is the existence of SAR11 types that completely lack any copy of the phnD gene, or alternatively the existence of Prochlorococcus strains having more than one copy of the phnD gene. Indeed, a search through different SAR11 and Prochlorococcus genomes revealed the absence of the phnD gene from the genomes of two SAR11 isolates (‘Candidatus Pelagibacter ubique’ HTCC1062 (Rappé et al., 2002; Giovannoni et al., 2005) and HTCC1002 (Rappé et al., 2002); see Table 1), and the existence of two different phn operons (coding for the different components of ABC-type Pn transporters) in two Prochlorococcus genomes (the high-light (HL) adapted MIT9301 and the low-light (LL) adapted MIT9303 ecotypes; see Table 1). As already observed by Martiny et al. (2009), the same Prochlorococcus ecotypes also show an increase in the number of pstS genes in their genomes. Interestingly, this trend is also observed among marine Synechococcus strains (Moore et al., 2005). Moreover, not only does the gene organization of the two phn operons in Prochlorococcus differ (phn D CE and phnC D E), the proteins themselves are also distinct (possessing 27% aa identity) and cluster separately based on protein phylogeny (Figure 2). The Prochlorococcus phnCDE operon is found in the vicinity of another operon that contains two novel Pn utilization genes (phnY and phnZ) that were identified recently via functional screening in E. coli (Martinez et al., 2010). Both the phnCDE and phnYZ operons are located in a genomic island in Prochlorococcus MIT9301 (Kettler et al., 2007). It is important to note that the phnD primers previously used by Ilikchyan et al. (2009, 2010) to follow Prochlorococcus phnD expression will recognize only the phnD gene from the phn D CE and not the copy in the phnCDE operon.

Table 1 Presence/absence of phnD and pstS genes in Prochlorococcus and SAR11 genomes
Figure 2
figure 2

Neighbor-joining tree of Prochlorococcus PhnD proteins. Evolutionary distances for PhnD proteins were determined from the alignment of 320 aa positions using a neighbor-joining analysis. Neighbor-joining and maximum parsimony analyses were conducted using PAUP. Bootstrap resampling (1000) of neighbor-joining and maximum parsimony trees were performed in all analyses to provide confidence estimates for the inferred topologies. Bootstrap values (neighbor-joining/maximum parsimony) greater than 50% are indicated above the branches. Prochlorococcus PhnD proteins are shown in green. HL – high-light adapted strains, LL – low-light adapted strains. Different phn operon organizations in Prochlorococcus MIT9301 and MIT9303 and in different reference genomes are shown on the right; gray ORF represents genes not related to the phn operon.

In order to determine Pn-binding specificities for the two different Prochlorococcus PhnD proteins (each one possessing less than 30% aa identity to the E. coli PhnD homolog), the phnD genes from the phn D CE (locus P9301_07261; phnD1) and phnC D E (locus P9301_12511; phnD2) operons of the HL-adapted Prochlorococcus MIT9301 (as MIT9301 often shares the highest sequence similarity with GOS sequences (Kettler et al., 2007)) were expressed as His-tagged proteins in E. coli and tested against a battery of different simple Pns, phosphite and phosphate using ITC measurements. ITC measures the heat capacity change upon the interaction between a macromolecule and a ligand. Titration of the ligand versus the macromolecule results in an isotherm-binding curve that can be fitted to a simple binding equation to calculate the equilibrium constant. As expected, the purified control His-tagged E. coli PhnD protein bound different Pns with the previously reported affinities (Rizk et al., 2006) (2-aminoethylphosphonate (2-AEPn) ethylphosphonate (EPn) > methylphosphonate (MPn) phosphonoacetate (PnAc) aminomethylphosphonate (AMPn); see Table 2 for Kd values and Figure S1 for raw data. It is important to note that using ITC measurements, we failed to detect any binding of the E. coli PhnD protein either to phosphite or to phosphate. These were previously suggested to be P substrates in E. coli (Metcalf and Wanner, 1991; Rizk et al., 2006)). Interestingly, both Prochlorococcus PhnD proteins showed a completely different specificity range: one (PhnD2) binds strongly only to MPn, EPn and inorganic phosphite, whereas the other (PhnD1) binds strongly to inorganic phosphite and with very weak affinities to MPn and phosphate (see Table S1 for dissociation constants). Although this suggests that simple Pns and phosphites could be utilized by Prochlorococcus possessing PhnDs similar to PhnD2 (observed in only few cultured ecotypes), it also implies that in the environment, Prochlorococcus possessing PhnDs similar to PhnD1 (observed in all cultured ecotypes isolated so far) use phosphites. Indeed, Martinez et al. (in press) were able to show that Prochlorococcus MIT9301 can grow on phosphite as a sole P source. However, under the growth condition tested, MIT9301 failed to grow on different Pns (2-AEPn, EPn or phosphonoalanine) as a P source. Based on these findings, Martinez et al. (in press) suggested that the second Pn operon phnCDE (re-annotated as ptxABC by the authors) is a phosphite utilization operon along with the neighboring phnY and phnZ genes and an adjacent newly identified phosphite dehydrogenase (ptxD). However, other Prochlorococcus strains tested, which lack the ptxABCD operon but contain the phnDCE operon, failed to grow on phosphite or Pns (2-AEPn, EPn or phosphonoalanine) under the growth conditions tested (Martinez et al., in press). Although our biochemical binding assay predicted that the transporter encoded by the phnCDE (ptxABC) operon would transport simple Pns (MPn and EPn) as well as phosphite, it also predicted that the transporter encoded by the phnDCE operon is a phosphite transporter. This could not be resolved by Martinez et al. (in press) under the growth conditions used in their experiments. Further experiments using different growth conditions are therefore needed in order to confirm whether the transporter encoded by the phnDCE operon transport phosphite and the transporter encoded by the phnCDE (ptxABC) operon could also transport simple Pns (MPn and EPn). No data currently exist on phosphite abundance in the marine environment (David Karl, personal communication). The current widely used methodologies (Lomas et al., 2010) of total dissolved P measurement and subsequent calculation of organic P by the difference between phosphate and total dissolved P would hide any phosphite as organic P.

Table 2 Dissociation constants of selected Pns to the E. coli PhnD and to two different PhnD versions of Prochlorococcus MIT9301 measured at 30 °C.

Examination of publicly available marine metatranscriptomic datasets (Hewson et al., 2009; Stewart et al., 2010) revealed that both Prochlorococcus phnD1 and phnD2 genes are expressed in several Atlantic Ocean surface water sites but not in Pacific Ocean sites (Table 3). This is in agreement with recent observations of the enrichment of P utilization genes (Prochlorococcus and SAR11) in the BATS station (Bermuda Atlantic Time Series, North Atlantic) compared with the HOT station (Hawaii Ocean Time Series, North Pacific) (Coleman and Chisholm, 2010, 2011).

Table 3 Prochlorococcus phnD transcript presence (absolute numbers) in different site locations inferred by BLASTn (E-value < 1e-5) of the two Prochlorococcus marinus MIT9301 phnD genes

Besides the two different PhnD proteins from Prochlorococcus MIT9301, a phylogenetic tree constructed with all Prochlorococcus-like PhnDs from GOS revealed the existence of other potential phosphite or PhnD families (some of which having less than 70% identity to PhnDs from cultured Prochlorococcus (marked with black arrows in Figure S2)), with the majority (more than 90%) belonging to the PhnD1 enlarged family.

The marine cyanobacteria of the genera Prochlorococcus are globally important marine primary producers (Chisholm et al., 1988; Partensky et al., 1999) and have undergone extensive genome streamlining (Partensky and Garczarek, 2010). It seems that Prochlorococcus ecotypes adapted to low-P environments use at least three different strategies to deal with low-P availability: (i) they substitute their phospholipids with non-P membrane lipids such as sulphoquinovosyldiacylglycerol (Van Mooy et al., 2009) and therefore reduce the cellular demand for P by half; (ii) they increase the number of organic (Table 1) and inorganic P transporters (Martiny et al., 2009); and (iii) they have an arsenal of transporters with different affinities to phosphite and different Pns (Table 2) to be ready on demand. Future in vivo growth experiments using different Prochlorococcus strains are needed to test which Prochlorococcus cells can supplement growth using phosphite or simple Pns (like MPn and EPn) as P sources.