Introduction

Lytic bacteriophages are obligate parasites, thus their survival depends on the ability to encounter and reproduce inside a suitable host. In planktonic communities, the likelihood of finding such a host is a function of the concentration of both host and phage. However, as lytic phages kill their hosts in the course of reproduction, they reduce the density of susceptible hosts and select for variants in the host population that are resistant to infection (Bohannan and Lenski, 2000; Buckling and Rainey, 2002). Phage mutants capable of infecting resistant hosts often emerge (Lenski and Levin, 1985; Qimron et al., 2006; Poullain et al., 2008; Meyer et al., 2010; Paterson et al., 2010). Such phage–host coevolution is now realized to be a central process that generates and maintains extensive diversity among microbes (Rodriguez-Valera et al., 2009; Avrani et al., 2011; Cordero and Polz, 2014). However, much less attention is given to the mechanisms underlying the generation and preservation of viral diversity (Paterson et al., 2010; Mizuno et al., 2014).

Antagonistic host–phage coexistence in nature is often explained by reciprocal selection, whereby phages select for resistant hosts and resistant hosts select for phages capable of re-infecting resistant hosts (Avrani et al., 2012; Martiny et al., 2014). In experimental systems, such mutation and counter-mutation cycles that lead to an arms race result in selective sweeps in which host and phage mutants with increasing resistance and infectivity ranges replace their predecessors (Buckling and Rainey, 2002; Mizoguchi et al., 2003; Marston et al., 2012). However, resistance that cannot be overcome by phage mutants is sometimes observed (Lenski and Levin, 1985; Duffy et al., 2006; Lennon and Martiny, 2008). In addition, ever increasing fitness costs associated with these mutations are expected to eventually limit the propagation of the arms race (Gomez and Buckling, 2011; Hall et al., 2011).

In an alternative ecological model of fluctuating selection that takes fitness costs into consideration, such as that proposed in the ‘kill-the-winner’ model, antagonistic coexistence is explained by density-dependent fluctuations of susceptible and resistant host types (Woolhouse et al., 2002; Winter et al., 2010; Avrani et al., 2012). These fluctuations result from alternating selective forces, whereby susceptible hosts and phages fluctuate in predator–prey-like cycles: phages infecting susceptible hosts increase in abundance, selecting for resistant hosts; consequently, phage numbers decrease as they cannot infect the resistant hosts, relieving phage predation pressure; thereupon, resistant hosts are outcompeted by susceptible hosts, as the latter are not burdened by pleiotropic costs of resistance (Gomez and Buckling, 2011; Hall et al., 2011). This model does not require the emergence of resistance-breaking mutants (RBMs) but does require pleiotropic costs of resistance. Note that we refer to mutant phages capable of re-infecting resistant hosts as ‘resistance-breaking mutants’, following terminology used by the plant virus community (Harrison, 2002; Palloix et al., 2009), rather than ‘host range mutants’, as these phages infect an allelic variant of the same host rather than a truly new host.

Bacteria–phage coevolution studies abound (reviewed in Buckling and Brockhurst (2012); Koskella and Brockhurst (2014) and Martiny et al. (2014)) with many studies focused on the host, both on the mutations that confer resistance as well as their fitness effects (Lenski, 1988a; Lennon et al., 2007; Forde et al., 2008; Avrani et al., 2011). Clearly, the diverse mechanisms of phage resistance in bacteria (Hyman and Abedon, 2010; Labrie et al., 2010) have dictated diverse mechanisms allowing phages to overcome resistance (Samson et al., 2013). However, one of the most common modes of resistance is by mutations in cell surface structures that prevent adsorption of phage to the cell surface of the host (Bohannan and Lenski, 2000). In these cases, resistance-breaking phage mutants arise by changes in structural genes mediating host recognition and attachment. In tailed phages such mutations typically modify the tail and the tail fiber (Ravin et al., 2002; Qimron et al., 2006; Paterson et al., 2010; Scanlan et al., 2011; Meyer et al., 2012; Perry et al., 2015), although mutations in other genes have also been found (Paterson et al., 2010; Marston et al., 2012). Typically, RBMs retain their ability to infect the original susceptible host, and in some cases result in a reduction in phage fitness (Chao et al., 1977; Duffy et al., 2006; Ford et al., 2014).

An example of coexistence in the environment is that of unicellular marine picocyanobacterium, Prochlorococcus and the phages infecting it. Prochlorococcus, found in warm sunlit waters of the ocean in concentrations often exceeding 105 cells per ml (Johnson et al., 2006; Zwirglmaier et al., 2008), are the world’s most numerous photosynthetic organism (Partensky et al., 1999; Flombaum et al., 2013). Coexisting with Prochlorococcus are phages (Sullivan et al., 2003; Dekel-Bird et al., 2015) that are thought to influence their hosts’ abundance, diversity, metabolism and evolution (Mann, 2003; Lindell et al., 2004; Muhling et al., 2005; Avrani et al., 2011; Thompson et al., 2011; Parsons et al., 2012). Cyanophages isolated to date have double-stranded DNA genomes and belong to the three morphologically defined families of the tailed bacteriophages, the Myoviridae, Podoviridae and Siphoviridae (Suttle and Chan, 1993; Waterbury and Valois, 1993; Mann, 2003; Sullivan et al., 2003; Wang and Chen, 2008; Sabehi et al., 2012). Of relevance to this work are the cyanopodophages that resemble the enteric T7 phage.

The similarity of T7-like cyanophages to the T7 archetype is found for a number of different characteristics such as: virion structure (Liu et al., 2010); genome organization, including the presence of many homologous genes, their conserved arrangement along the genome and their grouping according to functional modules (Chen and Lu, 2002; Sullivan et al., 2005; Pope et al., 2007; Labrie et al., 2013); and their expression program (Lindell et al., 2007). T7-like cyanophages are typically very host-specific, infecting only one or two closely related hosts (Sullivan et al., 2003; Dekel-Bird et al., 2013; Dekel-Bird et al., 2015). However, it remains unclear how such narrow host range phages survive when resistance dominates.

Prochlorococcus resistance to T7-like cyanophages occurs at a high frequency and is often through the prevention of phage attachment to the cell surface (Avrani et al., 2011). Many of these phage-resistant strains have fitness costs of lower growth rate or enhanced susceptibility to other phages. Most phage resistance mutations in Prochlorococcus are in genes of unknown function, and occur in genomic islands (Avrani et al., 2011), hypervariable genomic regions with different gene content among closely related isolates (Coleman et al., 2006). From the phage perspective, the abundant Prochlorococcus population is composed of numerous subpopulations with different phage susceptibilities, with only a fraction of these being available to any given phage type (Avrani et al., 2011). A high degree of resistance would lead to low encounter rates between susceptible hosts and co-occurring phages, and was previously suggested as a mechanism facilitating cyanobacterial–cyanophage coexistence (Waterbury and Valois, 1993; Avrani et al., 2011).

That evolution of phage resistance drives host diversity in genomic susceptibility regions is now well described (Rodriguez-Valera et al., 2009; Avrani et al., 2011; Kashtan et al., 2014); however, its consequence for phage diversity and evolution remains unclear. In an experimental coevolution study in a related system, several arms race cycles were observed between a marine Synechococcus host and a T4-like cyanophage (Marston et al., 2012). Phage mutations that likely evolved in response to host resistance occurred in genes of unknown function and their effect on phage fitness remains unknown.

Here we set out to investigate whether coexistence between the highly abundant Prochlorococus and T7-like cyanophages is facilitated by an arms race. To this end, we tested the ability of T7-like cyanophages to overcome resistance in Prochlorococcus and characterized RBMs for host range, fitness costs and genetic diversity. In agreement with the arms race model, some RBMs were isolated that had an expanded host range and no fitness cost. Furthermore, the mutations identified were in tail-related genes that are highly diverse among isolates and natural populations of T7-like cyanophages. However, RBMs were not detected in most interactions investigated. This dearth of mutants indicates that ecological coexistence between these antagonists is unlikely to be solely due to continuous arms race cycles. Rather we suggest a model whereby coexistence and genetic diversity of hosts and phages is due to the combination of an arms race, cost-driven density-dependent fluctuations and passive host-switching.

Materials and methods

Cyanobacterial strains and growth conditions

Prochlorococcus MED4 (henceforth MED4-wild type (WT)) and phage-resistant strains derived from Prochlorococcus MED4 (MED4-R#) were the main cyanobacterial strains used in this study. The phage-resistant strains (MED4-R1, MED4-R5, MED4-R6 and MED4-R8) were previously isolated by Avrani et al. (2011) as colonies that grew on plates in the presence of a selecting phage (Supplementary Table S1). These strains were chosen because they have been genotypically and phenotypically characterized (Supplementary Table S1; Avrani et al., 2011) and had not lost their resistance due to continued evolution (Avrani and Lindell, 2015) by the onset of this study. However, the MED4-R8 strain underwent changes in its resistance range during the period of this study (Avrani and Lindell, 2015) and was therefore used only for screening of resistance-breaking phages. Several other strains of Prochlorococcus and Synechococcus were used for phage–host range characterization and as controls for adsorption assays (see below). The doubling time of the cyanobacterial strains used in this study was between 1 and 2 days.

Prochlorococcus was grown in PRO99 medium (Moore et al., 2007) with a Mediterranean seawater base. Cultures were grown at 22 °C under cool white light at an intensity of 10–20 μmol photons m–2 s–1, with a 14:10 h light:dark cycle. Synechococcus was grown under the same conditions in the ASW artifical seawater medium (Lindell et al., 1998). Growth in liquid cultures was monitored by chlorophyll a (chl a) fluorescence measured using a Synergy MX microplate reader (BioTek, Winooski, CA, USA), with excitation and emission wavelengths at 440±20 and 680±20 nm, respectively. Relative cell concentrations were determined from chl a fluorescence, and verified to have the same number of cells in the different strains by colony plating.

Prochlorococcus colonies and lawns were grown in pour plates of the PRO99 medium supplemented with 0.28% low-melting-point agarose (UltraPure, Invitrogen, Carlsbad, CA, USA) and 1 mM Na2SO3 (Lindell, 2014). To achieve high plating efficiencies, a heterotrophic ‘helper’ strain, Alteromonas sp. EZ55, was added to the medium (Morris et al., 2008).

Cyanophage strains, their propagation and enumeration

Three T7-like cyanophage strains that infect Prochlorococcus MED4 were used in this study: P-SSP7, P-GSP1 and P-TIP38. Their ability to infect the resistant Prochlorococcus strains is shown in Supplementary Table S1. Phages were propagated from single plaques in liquid cultures of exponentially growing host cells. Cell debris were removed from viral lysates by centrifugation (10 000 g at 20 °C for 15 min) and filtration (0.22 μm Millex GV syringe filter, Millipore (Cork, Ireland); or 0.2 μm Rapidflow filter unit, Nalgene, Rochester, NY, USA). Phages were enumerated by plaque assays in which serially diluted phages were plated in pour plates after a 1–4 h adsorption step to host cells in liquid under host growth conditions.

Screening for RBM phages

We screened for RBMs of T7-like cyanophages on phage-resistant strains of Prochlorococcus MED4. Two approaches were employed that differ in the number of rounds of evolutionary selection allowed. The first method, plaque formation, allowed for a single round of selection. Six interactions were tested based on the resistance range of the MED4-derived resistant strains (Supplementary Table S1): the P-SSP7 phage was tested on the MED4-R1, MED4-R5, MED4-R6 and MED4-R8 strains and the P-GSP1 and P-TIP38 phages were tested on the MED4-R8 strain. Large volume lysates were produced from single plaques on the WT MED4 host. Lysates were concentrated using a centrifugal filtration at 4 °C (Amicon Ultracel 100 KDa, Millipore) and plated on lawns of phage-resistant host strains such that only resistant-breaking mutants would form plaques. Three to four assays with independent phage lysates were carried out for each host–phage pair.

The second method, host-mixing, allowed for multiple evolutionary steps. Phages were added to mixed host populations in liquid consisting of a minority of susceptible hosts and a majority of phage-resistant cells (Benmayor et al., 2009). Phages were serially transferred weekly for 10 rounds: 1% of a 0.2 μm filtrate was transferred to naive, mixed host populations each round. Spontaneous mutations that arise during replication of the WT phage on the susceptible host and that enable infection of resistant cells are thus highly favored and lead to a decline in the resistant host population. Four mixed cell compositions were used with susceptible hosts added at frequencies of 0.01%, 0.1%, 1% and 10%, and the remainder made up of resistant cells. A total of 8–9 × 107 cells (colony-forming units) were used each round, and the first round of infection contained 5.7 × 106 plaque-forming units of the WT P-SSP7 phage (multiplicity of infection=0.07). Cultures of susceptible or resistant hosts only, as well as non-infected cultures, were used as controls for phage infection and cell growth, respectively. Four replicate phage populations were evolved on each of the mixed cell compositions and on the controls for each host–phage interaction.

Isolation of RBM phages

RBM phages of P-SSP7 were isolated from assays employing both single and multiple rounds of selection. Well-isolated plaques from the single-round assays were plaque-purified a second time and then propagated in liquid on the same ‘resistant’ host. Six RBMs were isolated in this manner: RBM1–RBM6 (Supplementary Table S1). An additional six RBMs were isolated from the host-mixing assays that allowed for multiple rounds of selection (RBM7–RBM12): mutants from round 2 did not form plaques and were isolated using a liquid dilution to extinction assay, whereas mutants from rounds 4 and 6 formed plaques and were plaque-purified before propagation in liquid. WT control phages (WT1–WT3) were isolated from serial dilutions of each of the initial lysates using plaque assays on the WT MED4 host and served as paired controls for the phenotypic and genotypic characterization of the mutant phages (Supplementary Table S1).

Phenotypic characterization of isolated resistance-breaking phage mutants

Phenotypic characterization was carried out for the first three independently isolated mutants (RBM1, RBM2 and RBM3) that also underwent whole-genome sequencing (see below).

Host range

Phage–host range was tested on Prochlorococcus MED4-WT and three of the four resistant MED4 strains as well as on 10 other Prochlorococcus and Synechococcus strains (see Supplementary Table S1 for a list of strains used). MED4-R8 was not used due to continued evolution that led to changes in its susceptibility after RBM screenings were completed (Avrani and Lindell, 2015). Host range was tested from growth curves of liquid cultures in microtiter plates. Cyanobacterial lysis was monitored by a decline in chl a fluorescence relative to uninfected control cultures on the MED4-WT, MED4-R1, MED4-R5 and MED4-R6 strains. Spot assays were used to determine host range on the other cyanobacterial strains: Five microliter of phage lysates was spotted onto 2–3-day-old lawns of cyanobacteria and monitored for appearance of plaque-like clearings. Host ranges were determined using four to six replicate infections for liquid cultures and three replicates for spot assays.

Phage fitness

The production of phage progeny over multiple rounds of infection was used to determine relative phage fitness. Exponentially growing Prochlorococcus cultures were diluted to an equal concentration (~2 × 108 cells per ml) and were infected at an multiplicity of infection of 0.0001–0.001. These low phage:host ratios were used to ensure that host availability did not limit viral replication over the 2–3-day period of the experiment (Supplementary Figure S1). This long period was chosen as cell doubling and phage replication times are on the order of days and hours, respectively. The number of free phages (0.22 μm filtrate after 100-fold dilution) was measured by plaque assays on lawns of MED4-WT at the onset (N0) and at the end of the assay (Nt) at t=2 or 3 days (Supplementary Figure S1). Fitness is expressed as the number of phage population doublings per day as (log2(Nt/N0))/t (Bull et al., 1997), where t is the assay time in days. When a mix of phages (both WT and RBM) was tested in direct competition in the same vessel, the abundance of each phage type was determined from the ability of 100 picked plaques to lyse the WT and resistant strains in replicate liquid assays. A plaque that lysed both hosts was determined to be a RBM phage, whereas those that lysed only the WT host were determined to be WT phages. Six to thirteen replicate infections were carried out for fitness measurements of each phage–host interaction.

Phage adsorption

Single-point adsorption assays were performed in which percent adsorption was determined from the number of unattached phage at 4 h relative to immediately after phage addition, as previously described (Avrani et al., 2011). Briefly, exponentially growing cells were concentrated by centrifugation (12 000 g at 21 °C for 10 min) to ~2 × 109 colony-forming units per ml and mixed with phages at a low multiplicity of infection of 0.003–0.014. The free phage titer in the 0.22 μm filtrate (filtered after 100-fold dilution) was determined by plaque assay immediately after phage addition (0 h) and after 4 h. Five to eight replicates were carried out to measure adsorption for each phage–host interaction.

Genetic characterization of RBMs

Mutations in the RBMs of P-SSP7 were determined from whole-genome sequencing of three RBMs (RBM1, RBM2 and RBM3) and were compared to the sequenced genomes of three paired-control, WT phages. Phage genomic DNA was purified from concentrated lysates using the Lambda mini kit (Qiagen, Hilden, Germany) and sheared (Covaris E200, Woburn, MA, USA). DNA-sequencing libraries were constructed using TruSEQ DNA preparation kit (Illumina, San Diego, CA, USA). Paired-end sequencing (2 × 250 bp) was carried out on a MiSEQ sequencer (Illumina) at the Technion Genome Center (Haifa, Israel). Sequencing reads were analyzed on the GALAXY platform (Goecks et al., 2010). Quality analysis was done using FastQC. Read ends were trimmed uniformly leaving only positions in which 75% of the reads or more had a PHRED quality score of >20. PCR duplicates were removed by rmdup using SAMtools (V.0.1.18) (Li et al., 2009). Paired-end mapping against the P-SSP7 reference genome (GenBank accession: NC_006882) was carried out using the BWA package for Illumina (V.0.6.1; default settings except max. edit distance=3). Variants were identified by mPileup and were inspected manually using the Integrative Genomics Viewer (V. 2.2.5; Robinson et al., 2011). Mapped reads are available in the Sequence Read Archive of the National Center for Biotechnology Information under Bioproject PRJNA352672. Mutations found in the genomes of RBMs different to those in the paired controls were identified as resistance-breaking-associated mutations (Supplementary Table S2). Verification of the mutations, as well as genotyping of other phage mutants (RBM4–RBM12), was done for genes identified by whole-genome sequencing. Four independent PCRs were carried out for each gene in each phage mutant (Supplementary Table S3), pooled and Sanger sequenced.

To search for homologs of genes of interest the conserved domain and the non-redundant databases of the National Center for Biotechnology Information were searched using BLAST, PSI-BLAST (Altschul et al., 1997) and HMMER (Finn et al., 2011). In addition, the Pfam database was searched using HHpred (Söding, 2005) with a query of protein sequences aligned by MUSCLE (Edgar, 2004).

Diversity of resistance-breaking genes

Core genes in genomes of isolates

To compare the diversity among T7-like core genes, we calculated the average information content of each gene by applying Shannon’s diversity as described previously (Avrani et al., 2011). Briefly, amino-acid sequences of the homologs were globally aligned with minimum gaps (MAAFT —legacygappenalty —localpair —maxiterate 1000; Katoh et al., 2002). Position-specific information content, −p(a)log2(p(a)), where p(a) is the proportion of sequences containing amino acid a, was averaged across all alignment positions having at least five non-gap characters. The information content calculation was implemented using a custom R script (Supplementary File 1). Core genes were compared between eight genomes (P-SSP7, P-SSP10, P-GSP1, P-SSP9, P-HP1, P-SSP2, P-RSP5 and Syn5) spanning the phylogenetic diversity of T7-like cyanophages (Labrie et al., 2013).

Environmental diversity

Metagenomic recruitment was used to identify natural populations of T7-like cyanophage isolates that are present in publically available marine viral metagenomes. A phage was defined to be present in the population when reads recruited to the reference genome at >90% BLASTn identity, had a median coverage >5 and were evenly distributed across the genome (mean/median <10). Sequences from viral metagenomes (Supplementary Table S4) were first filtered by mapping to the cyanophage reference genome using Bowtie2 (Langmead and Salzberg, 2012) set to the very-sensitive-local parameters (-D 20 -R 3 -N 0 -L 20 -i S,1,0.5). Successfully mapped reads were then recruited based on the best hit to the reference genome by BLASTn (E-value <1 × 10−5, >50 nt and >90% identity; Camacho et al., 2009). Metagenomic islands were detected and defined largely as described in Mizuno et al. (2014), as stretches having <20% of the median coverage but with a minimum BLASTn identity of 90% (rather than 98%). This enabled identification of somewhat more divergent phage populations. The minimum island length was set at 300 bp (rather than 500 bp) to allow identification of islands corresponding to small genes.

Statistics

Before statistical analyses, the data were tested for normal distribution using the Shapiro–Wilk normality test (P>0.05; Shahbaba, 2012). The Welch two-sample t-test was used when the data were normally distributed, whereas the nonparametric Mann–Whitney test was used when they were not. P-values were adjusted for multiple testing by the method of Benjamini and Hochberg (1995). Pearson’s r coefficient was computed to assess the relationship between phage adsorption and fitness using the means of the measurements. Bootstrap resampling of observed metagenomic islands was used to generate, for each gene in each phage genome, a null distribution under the assumption that islands are randomly distributed in the genome. P-values were then estimated by comparing the observed occurrence of a gene in islands with the null distribution (Supplementary Methods). All statistical analysis were done in R (V.3.2.3 (R Development Core Team, 2015)).

Results and discussion

Phage resistance of some Prochlorococcus strains cannot be broken by phage mutants

Previously, we isolated strains of Prochlorococcus MED4 that were resistant to infection by host-specific T7-like cyanopodophages (Avrani et al., 2011; Supplementary Figure S2). Here we investigated six phage–host interactions for the ability of the phages to overcome resistance in these cyanobacterial strains (Table 1). Specifically, populations of the P-SSP7 phage were screened for RBMs on four resistant strains (MED4-R1, MED4-R5, MED4-R6 and MED4-R8), as was the ability of the P-GSP1 and P-TIP38 phages to overcome resistance on MED4-R8. Initially an assay with a single selection step was used, whereby RBMs form plaques on lawns of resistant cells (see Materials and methods). Mutant P-SSP7 phages were found on two resistant strains, MED4-R5 and MED4-R6 (Table 1). In contrast, no RBMs were found on the other two resistant strains, MED4-R1 (with the P-SSP7 phage) and MED4-R8 (with the P-SSP7, P-GSP1 and P-TIP38 phages), despite screening over 1011 infective phage particles for each interaction (Table 1). Thus, RBMs were not found for four of the six interactions, suggesting that phage mutations enabling reinfection of some resistant strains are rare or even non-existent. Alternatively, multiple sequentially selected mutations may be required to overcome resistance (Scanlan et al., 2011; Marston et al., 2012), or RBMs may not be able to form plaques.

Table 1 Frequencies of resistance-breaking mutants in populations of T7-like cyanophages forming plaques on Prochlorococcus MED4 phage-resistant strains

To address these possibilities, we employed a host-mixing experiment described by Benmayor et al. (2009) that facilitates selection of resistance-breakers over multiple steps of phage evolution in liquid. In this experiment, viral populations are serially transferred on mixed bacterial populations consisting of a minority of susceptible hosts and a majority of phage-resistant cells (see Materials and methods). We tested the P-SSP7 phage on two resistant hosts: one for which P-SSP7 mutants were detected in the single-step method (MED4-R6) and one for which they were not (MED4-R1). Consistent with our previous findings, resistance-breaking phage mutants were evident after three transfers in the presence of MED4-R6, as seen from lysis of the mixed susceptible and resistant host population (Figure 1). Mutants infecting MED4-R6 evolved in all host-mixing ratios tested (Supplementary Figure S3A), and were evident earlier when higher proportions of the susceptible host were present (Supplementary Figure S3B). This likely resulted from an increase in phage population size due to replication on a larger population of susceptible hosts, leading to an increased chance for resistance-breaking mutations to occur. In contrast, no RBMs capable of infecting MED4-R1 were observed even after 10 transfers of the evolving P-SSP7 population at all susceptible to resistant host ratios used (Figure 1; Supplementary Figure S3). Thus, our findings suggest that resistance-breaking phage mutants cannot emerge for some host mutations, and that such resistant hosts pose a genetic hurdle that limits the propagation of the arms race.

Figure 1
figure 1

Experimental evolution of P-SSP7 resistance-breaking mutants. Four populations of the P-SSP7-WT phage were evolved on a host mix composed of 10% susceptible host cells (MED4-WT) and 90% phage-resistant cells: MED4-R6 (in left panels) and MED4-R1 (in right panels). P-SSP7-WT phages were added on day 0 (multiplicity of infection=0.07). Every 7 days, for a total of 10 transfers, 1% of each of the four evolving phage populations were transferred to a naive host population of the same initial composition. Growth curves of host populations infected with descendants of the same evolving phage population are shown by the same color. Growth of these host populations without phage is shown in black (n=8). Cell types infected separately served as controls for the ability of the evolving phage populations to infect the susceptible strain (MED4-WT) and for the inability to infect resistant strains (MED4-R1 and MED4-R6). a.u., arbitrary units. See Supplementary Figure S3 for similar results with different host mix ratios.

Inspection of our resistant strains did not yield any general insights into the types of mutations that permitted resistance-breaking versus those that posed a genetic barrier. Three of the four strains had mutations in a single cluster of non-core genes, whereas the fourth had a mutation in a core gene, all of which are located in the same genomic island (Supplementary Table S1A). Rather, the ability to overcome resistance appears related to the particular resistance gene harboring the mutation. The two resistant strains that yielded resistance-breaking phage mutants (MED4-R5 and MED4-R6) had different mutations in the PMM1247 gene, a gene of unknown function in Prochlorococcus MED4. However, a second mutation common to both MED4-R5 and MED4-R1 (in the PMM1249 gene) appears unrelated to the resistance-breaking phenomenon as the MED4-R1 strain did not yield any RBMs nor could it be infected by RBMs isolated on other strains.

Phenotype of RBMs

Six RBMs derived from P-SSP7 were isolated from plaques formed on lawns of the MED4-R5 and MED4-R6 strains during the single selection assay (Supplementary Table S1B). We began phenotypic characterization of three independently isolated phage mutants, RBM1, RBM2 and RBM3, by assessing their host range. All three RBMs maintained their ability to infect the WT MED4 and expanded their host range to infect both MED4-R5 and MED4-R6 (Supplementary Figure S4). However, they could not infect MED4-R1, or other Prochlorococcus or Synechococcus strains tested (Supplementary Table S1B). Thus, these mutants had a limited expansion in host range among some resistant MED4 types, but not beyond.

An expansion of host range, even if limited, is clearly beneficial to resistance-breaking phage mutants, providing them with an increased pool of susceptible hosts. However, the relative fitness of a resistance-breaking phage may not be the same on the original (MED4-WT) and acquired (MED4-R6) hosts. To evaluate their relative fitness, we measured phage fitness on each host as the phage progeny yield over multiple cycles of infection. Indeed, we found that all three of the RBMs had different fitness levels on their two hosts (Figure 2). Fitness of RBM1 and RBM2 was considerably lower on the acquired MED4-R6 host than on the original host by 54% and 66% for the two phages, respectively (n=7–13, P=0.001–0.0002). In contrast, the fitness of RBM3 was high on both its hosts, being only 8% greater on the acquired host (n=7–10, P=0.01). It is important to point out, however, that the lower fitness on the new host, as found for RBM1 and RBM2, should not be considered a cost, as this is a marked improvement over the WT phage that cannot infect this host at all.

Figure 2
figure 2

Fitness of resistance-breaking mutants (RBMs) and wild-type (WT) P-SSP7 phages. (a) Fitness is shown as the number of phage population doublings per day (W). Fitness was measured for all phages at 2 (triangle and diamond shapes) or 3 days (squares) after infection (Supplementary Figure S1) when infecting the MED4-WT host as well as for RBMs when infecting MED4-R6. Fitness on MED4-WT includes results from infections of RBM and WT phages separately and in direct competition (Supplementary Figure S9). Bars indicate means of all experiments (n=6–13). Experiments carried out at the same time have the same shaped data symbols. Significant differences in fitness are indicated (*P<0.05; **P<0.01; ***P<0.001). The WT phage used is the paired control for the specific RBM. (b) Relative fitness of RBMs is shown as the ratio between mean fitness of RBMs on the tested host and the WT phage on the MED4-WT host (set as 1). Error bars show the propagated s.e.m. for the ratios of the fitness measurements in a. (c) Relationship between fitness on the original WT host (MED4-WT) and on the acquired resistant host (MED4-R6) for the three RBMs. Each datum point represents the mean±s.e.m. of the measurements shown in a.

Next, we assessed how the fitness of the RBMs compared to that of the ancestral WT phage (P-SSP7-WT) on their common MED4-WT host. Surprisingly, the fitness of RBM1 and RBM2 was significantly greater (30%) on the MED4-WT host relative to the ancestral WT phage (n=6–10; P=0.001). However, the fitness of RBM3 was significantly lower (16%) than that of the WT phage on their common host (n=7–10; P=0.001; Figure 2). Thus, the ability of RBM3 to efficiently infect the new host came with a cost on the original host, whereas less efficient infection of the acquired host by RBM1 and RBM2 did not lead to a fitness cost on the original host.

We expected that the gain in ability to infect resistant hosts would be manifested through adsorption to the resistant strains, as resistance of MED4-R5 (Avrani, 2011) and MED4-R6 (Figure 3a) impaired adsorption of P-SSP7-WT to the resistant strains. Consistent with this expectation, adsorption of RBM3 was significantly greater than that of P-SSP7-WT to MED4-R6 (P=0.005). However, adsorption of RBM1 and RBM2 to this host remained low with no significant difference detectable in comparison to the WT phage (Figures 3a, P=0.57 for RBM1 and P=0.24 for RBM2). As adsorption is an essential first step for infection, some level of adsorption must have been regained and we assume that the increase was too small to be detected using the adsorption assay. Indeed, average phage fitness was linearly correlated with average phage adsorption (Figure 3b), suggesting that an important determinant of phage fitness that evolved in these phages was their capacity to adsorb to the resistant hosts.

Figure 3
figure 3

Attachment of resistance-breaking mutants (RBMs) and wild-type (WT) P-SSP7 phages to Prochlorococcus. (a) Percent attachment of the three RBMs and their WT ancestral phages to MED4-WT and MED4-R6 4 h after phage addition. Prochlorococcus sp. MIT9515 served as a control for attachment to a non-host strain. Significant differences in attachment between each of the RBMs and the WT phage are shown for each of the three hosts (*P<0.05; **P<0.01). Attachment of P-SSP7-WT to MED4-R6 is lower than to WT host (P=0.027). Attachment to resistant and non-host strains was similar for all phages (P=0.95) except RBM3. Bars indicate means of all experiments (n=5–8). Experiments performed at the same time have the same shaped data symbols. (b) Positive correlation between phage attachment and fitness (calculated for the means). Each data point represents the mean±s.e.m. of the measurements shown in Figures 2a and 3a. Results from all three paired-control WT phages are combined.

Interestingly, adsorption of RBM3 to the non-host Prochlorococcus strain, MIT9515, also increased significantly relative to the P-SSP7-WT phage (Figures 3a, P=0.02). In contrast, adsorption of RBM1 and RBM2 to this non-host did not differ significantly from P-SSP7-WT (Figures 3a, P=0.86 for RBM1 and P=0.24 for RBM2). This suggests that the benefit of efficient adsorption of RBM3 to the MED4-R6 host comes with a cost of decreased adsorption specificity. This is likely to be quite detrimental in natural communities in which susceptible hosts are only a minute fraction of the community (Waterbury and Valois, 1993).

In summary, we found that RBM3, which efficiently infects the resistant host, suffers decreased fitness on its original host as well as lower adsorption specificity, whereas RBM1 and RBM2 are both less efficient in infecting the new host and have no measurable fitness cost associated with their mutation. Although our sample size of RBMs is small, these findings suggest a negative relationship between infectivity on the new host relative to fitness on the WT host (Figure 2c).

Genotype of RBM phages

To determine the mutations responsible for the resistance-breaking phenotypes, we sequenced the whole genome of the RBM1, RBM2 and RBM3 phages. Whole-genome sequencing revealed that they all had amino-acid changing mutations in two regions (Table 2; Figure 4a): all three mutants harbored the exact same mutation in gene 23 (numbering is according to the P-SSP7 genome). This gene encodes a short protein (93 aa) with a proline-rich N terminus and is positioned upstream of the portal protein gene in the P-SSP7 genome. In addition, each phage mutant had one to two mutations in genes 37 and 38, which together with gene 39 form a cassette of three paralogous genes in P-SSP7 (Supplementary Figure S5) located directly downstream of the tail fiber gene, gene 36 (Sullivan et al., 2005; Liu et al., 2010). The RBM1 mutant also had a non-synonymous substitution in the exonuclease gene (gene 19) in a non-conserved position (Table 2).

Table 2 Characteristics of P-SSP7 resistance-breaking mutants (RBMs)
Figure 4
figure 4

Resistance-breaking mutations are in genes occurring in metagenomic islands in wild populations of P-SSP7. (a) Convergence of resistance-breaking mutations in three independent phage mutants. The genomic map of P-SSP7 is shown with the positions of resistance-breaking genes marked in red. The portal protein and tail fiber genes are marked in green and blue, respectively. Triangles mark the position of mutations in the genomes of each of the three completely sequenced P-SSP7 resistance-breaking mutants (RBMs) compared with the reference genome (NC_006882). Mutations unique to the RBM are in red, whereas mutations shared with wild-type paired-control phages are in white. (see Table S2 and Supplementary Results and Discussion for details). (b) The locations of metagenomic islands that under-recruited to the genome of P-SSP7 are shown for 12 viral metagenomes for which P-SSP7 was present (detailed in Supplementary File 3). Significant enrichment of genes in observed metagenomic islands compared with randomly positioned islands of equivalent size in 10 000 bootstrap replicates are indicated (*P<0.05; ***P<0.001). (c, d) Examples of recruitment plots of viral metagenomes from the Indian (c, TARA station 38, 5 m) and Pacific (d, ALOHA station, 120 m) oceans. These are representative of data sets with (c) and without (d) metagenomic islands for gene 23, encoding the proline-rich protein. Recruited sequence fragments (in blue) are mapped along the P-SSP7 genome (x axis) at their identity level to that genome (y axis). At the top of each recruitment plot, the base-specific read coverage of the metagenomic fragments with over 90% BLASTn identity is shown in log scale. The dashed line in this plot marks the coverage threshold used for designating regions as metagenomic islands. The positions of resistance-breaking genes are indicated by yellow boxes and shading in all panels. Metagenomic islands are indicated by red boxes in c and d.

Genotyping of nine additional isolates of RBMs by sequencing PCR amplicons of genes 23, 37 and 38 revealed all but one to have mutations in at least one of these genes (Table 2). Such evolutionary convergence of resistance-breaking mutations suggests that the protein products of genes 23, 37 and 38 are important for determining host range in P-SSP7. Furthermore, our data suggest that a mutation in gene 23 is typically the first to enable resistance-breaking and that a subsequent mutation in genes 37 or 38 increases their virulence (Supplementary Results and Discussion). However, as 1 of the 12 mutants did not have a mutation in either of these genes, mutations in other regions can also confer the resistance-breaking phenotype.

Putative structural function of mutant genes

As genes 23, 37 and 38 appear to be involved in host recognition and attachment of the phage to the cell surface of the host, one would expect them to form part of the tail structure. Although P-SSP7 gp23, gp37 and gp38 proteins are found in virion particles (Lindell et al., 2007), their exact location in the virion is unknown (Liu et al., 2010). Nonetheless, two lines of evidence suggest that these genes are likely to be tail-related: their distant homology to tail-related genes in other phages and the conservation of genomic position relative to homologous tail-related genes in the genome of T7.

We identified homologs of the P-SSP7 gene 23 in nearly all T7-like cyanophage genomes based on proline richness of the N terminus (Supplementary Figure S6; see Supplementary Results and Discussion). Furthermore, the position of this gene is conserved, being upstream of the portal protein gene (Supplementary Table S5). In coliphage T7, gene 7.3 is in this same genomic position and it shares remote homology with the C terminus of the cyanophage proline-rich proteins (Supplementary Results and Discussion). In T7, gp7.3 is involved in host recognition and attachment (Dunn and Studier, 1983; Kemp et al., 2005). Although its exact function and location in the T7 virion remains unknown (Kemp et al., 2005; Cuervo et al., 2013), it is presently thought to act as a scaffold for tail assembly (Kemp et al., 2005).

Unlike gene 23, the P-SSP7 genes 37 and 38 have no homologs in other T7-like cyanophages (Supplementary Results and Discussion). Nevertheless, they do have a low level of identity to a tail fiber-like protein of a T4-like cyanophage (Supplementary Results and Discussion). The genomic position of these genes downstream of the tail fiber gene further alludes to a potential role in tail fiber structure.

Although direct evidence is lacking for the structural role of the protein products of genes 23, 37 and 38, current information suggests that these are previously unrecognized host-recognition proteins that function in tail assembly and in the tail fiber structure. Furthermore, these findings suggest that similar to other T7-like phages (Studier, 1975; Garcia et al., 2003; Qimron et al., 2006; Paterson et al., 2010; Scanlan et al., 2011), over-coming resistance in these T7-like cyanophages is related to tail fiber genes and genes involved in tail assembly.

Hyperdiversity of resistance-breaking genes

Coevolution of phages with their host populations can drive the rapid divergence of phage genes involved in recognition and attachment to the host relative to the rest of the phage genome (Paterson et al., 2010; Scanlan et al., 2011; Koskella and Brockhurst, 2014). To test for signals of hyperdiversity in resistant-breaking mutant genes, we analyzed their diversity across the genomes of T7-like cyanophages. The occurrence of genes 37 and 38 in the P-SSP7 genome alone indicates that these genes are part of the flexible genome and that their hyperdiversity is at the level of gene content. In contrast, we identified homologs of gene 23 in all T7-like cyanophage genomes but one. Thus, we consider this to be a core gene (Supplementary Results and Discussion). A comparison of protein sequence diversity of gene 23 to that of other T7-like cyanophage core genes showed that it is among the most diverse core genes (Figure 5). The two genes that are more diverse, the tail fiber protein and one of the three internal core proteins that form the channel that transverses the host cell envelope (Hu et al., 2013), are also likely to be involved in host range in T7 and other T7-like phages (Qimron et al., 2006; Paterson et al., 2010; Scanlan et al., 2011). Thus, the most diverse core genes in T7-like cyanophages function in host recognition, and their hyperdiversity is likely the result of rapid coevolution with the host.

Figure 5
figure 5

Diversity of the proline-rich proteins (black) relative to proteins of other core genes (gray) in T7-like cyanophages. Diversity across eight genomes was calculated as information content (IC; Avrani et al., 2011), ranging from 0, which indicates identity of all sequences, to 4.3, indicating no identity across the entire alignment. For genes of unknown function, the P-SSP7 protein numbers are indicated. DNA and RNA polymerases were not included in the analysis due to the occurrence of frameshifts in gene sequence for many genomes that hinders the inference of the corresponding amino-acid sequence; however, these are among the most conserved core genes. SSB, single-strand binding; RNR, ribonucleotide reductase.

Next, we examined the diversity of these resistance-breaking genes in natural populations in the oceans using recruitment analysis. We focused on 12 viral metagenomes: 3 from the Pacific Ocean and the others from the Indian Ocean (Supplementary Figure S7), in which P-SSP7-like phages were present. The presence of P-SSP7 in these metagenomes is evident from the high coverage of metagenomic sequences along most of the P-SSP7 genome (see Materials and methods). However, some genomic regions have very low coverage. Such under-recruiting regions, known as metagenomic islands, have been previously observed for bacteria and phages (Rodriguez-Valera et al., 2009; Mizuno et al., 2014) and can be explained by a high level of sequence divergence or even absence of these genes from natural populations.

The most under-represented genomic region of P-SSP7 in the metagenomes is that of the tail fiber gene and the four genes downstream of it, including resistance-breaking genes 37 and 38 (Figure 4). The occurrence of these genes in metagenomic islands (that is, their divergence or absence in viromes) is significantly higher than would be expected had the islands been randomly distributed over the genome (P<0.012). A similar pattern of under-recruitment downstream of the tail fiber gene was found for seven of the eight other T7-like cyanophages that are well represented in viral metagenomes (Supplementary Figure S8). These findings are in agreement with previous comparative genomic analyses showing variable gene content downstream of the tail fiber gene in T7-like cyanophages (Island III in Labrie et al. (2013)) and in viral metagenomes mapping to T7-like pelagiphages (Mizuno et al., 2014).

In P-SSP7, gene 23 was also significantly under-represented in viral metagenomes, having a low recruitment in 8 of 19 of the metagenomic data sets (Figure 5). Of the reads that did recruit to gene 23, half (49% of 600) had polymorphisms in at least one of the three amino-acid positions for which we experimentally found resistance-breaking mutations (in positions 41, 42 and 43). Only two code for an amino acid identical to that in our mutants (N41T) with the vast majority found in position 43. These data indicate that the diversity of gene 23 is both at the level of sequence diversification and gene content in natural populations of P-SSP7-like phages. It should be noted that unlike for P-SSP7 the proline-rich homologs of gene 23 were not under-represented in any of the other T7-like cyanophages (Supplementary Figure S8).

In summary, our results directly link evolution of host recognition with diversity in T7-like cyanophages and suggest that the coevolutionary process drives this diversity. Indeed, resistance-breaking loci identified experimentally are highly diverse in wild populations of T7-like cyanophages in both gene sequence and gene content. The diversity of these resistance-breaking loci is reminiscent of the diversity of susceptibility regions observed in Prochlorococcus (Avrani et al., 2011). Thus, antagonistic coexistence has led to rapid evolution and population-level diversity in genomic islands harboring genes involved in phage recognition of hosts in both host and phage genomes.

A combination of models is needed to explain coexistence in nature

Our combined findings suggest that an integrative model incorporating multiple coevolutionary processes is required to adequately explain antagonistic coexistence between Prochlorococcus and its T7-like cyanophages. On one hand, our ability to isolate resistant hosts (Avrani et al., 2011) and RBMs with an expanded host range for at least some resistant Prochlorococcus hosts (this study) demonstrates that arms race cycles can occur (Figure 6). Furthermore, hyperdiversity of resistance and resistance-breaking loci in hosts and phages suggests that mutations and counter-mutations with reciprocal selection cycles occur in natural populations and are not purged by selective sweeps or genetic barriers to coevolution. On the other hand, our finding of many cases for which RBMs did not evolve suggests that phage counter-mutations alone cannot explain phage survival. Indeed, if resistant hosts sweep through the population, host-specific phages would remain host-less and become extinct. Such cases would constitute a dead end to the cyclic nature of the arms race.

Figure 6
figure 6

Model of multiple coevolutionary processes proposed to act at the same time on different interacting partners to enable community-level coexistence of Prochlorococcus and T7-like cyanophages. Each evolutionary process is shaded by a different color. Hosts and phages in different phenotypic states with respect to the interaction with each other are shown in white squares. Different manifestations of these states are shown in white circles. Descriptions of transitions between the different states are indicated by arrows. Phages that overcome the hurdle of a genetic barrier by passive host-switching avoid extinction and return to the evolutionary playing field. The hurdle imposed by a genetic barrier (marked by a white circle) is the same hurdle that is overcome by passive host-switching (marked by the red and white bar).

Alternatively, density-dependent fluctuations between subpopulations of hosts that differ in their genetic susceptibility to phages could explain coexistence, even in the absence of resistance-breaking phage mutants (Avrani et al., 2011; Avrani et al., 2012). These fluctuations require, and are driven by, costs of resistance such as reduced growth rate and greater susceptibility to other phage types (Avrani et al., 2011; Figure 6). However, apparent cost-less phage resistance mutations (Duffy et al., 2006; Lennon et al., 2007; Avrani et al., 2011; Ford et al., 2014) and costs that are ameliorated with continued evolution (Lenski, 1988b; Kashiwagi and Yomo, 2011; Avrani and Lindell, 2015) are known to occur. Thus, these situations would still leave host-specific phages host-less, leading to their extinction.

Thus, a mechanism that enables phage persistence in the face of a reduction in the pool of susceptible hosts must exist. Previously, we suggested that passive host-switching can balance such host loss without a mutation occurring in the phage to overcome host resistance (Avrani et al., 2012). In this scenario, a phage acquires a new host as a result of a mutation in a non-host cell that turns it into a host. This can be the result of continued evolution that returns a resistant host to susceptibility (Meyer et al., 2010; Avrani and Lindell, 2015) or due to a mutation that grants a cell resistance to one phage type while making it susceptible to another, as in enhanced infection (Lennon et al., 2007; Avrani et al., 2011; Marston et al., 2012; Castillo et al., 2014; Figure 6). These processes can allow the recoupling of a phage with the original host it could previously infect or enable a phage to infect a different host entirely. Thus, in the context of the community, passive host-switching would prevent phage extinction despite resistance mutations in hosts that cannot be overcome by counter-mutations in phages.

In light of the above, we propose a three-pronged model, whereby a limited evolutionary arms race, density-dependent fluctuating selection and passive host-switching all work together to enable host–phage coexistence in the wild (Figure 6). This model paints a picture of complex host–phage coexistence that shifts between fluctuating selection and arms race dynamics as a function of fitness costs in the host and their elimination by continued evolution. And, when genetic barriers arise to the emergence of RBMs in the phage, passive host-switching supplies alternative hosts from within the diverse host community. This combined model is compatible with the high degree of microbial and viral diversity seen in wild communities of both Prochlorococcus and their phages in genomic regions previously hypothesized to be (Rodriguez-Valera et al., 2009; Labrie et al., 2013; Kashtan et al., 2014), and now empirically shown to be (Avrani et al., 2011; Avrani and Lindell, 2015, this study), involved in phage recognition of hosts. This population-level diversity leads to a mix of host subpopulations with a variety of susceptibilities to co-occurring phages and a mix of phage subpopulations with varying abilities to infect hosts. The composition and structure of such populations would constantly change due to the coevolutionary process, leading to a complex and dynamic network of host–phage interactions that, perhaps counter-intuitively, result in a stable community in nature. Indeed, such diversity-induced stable coexistence likely extends to many host–phage systems (Rodriguez-Valera et al., 2009; Flores et al., 2011; Cordero and Polz, 2014; Mizuno et al., 2014).

Our findings show a clear asymmetry in the evolutionary capacity between Prochlorococcus and T7-like cyanophages. We found that resistant hosts are commonly isolated (Avrani et al., 2011), yet phage mutants that overcome this resistance were often not found (this study). Even when phage RBMs were observed, they occurred at a frequency that is ~100–1000-fold lower than the frequencies reported for phage resistance in the hosts (1 in 3 × 104 (2 × 104 s.d.) resistant hosts (Avrani et al., 2011) versus 1 in 6 × 107 (4 × 107 s.d.) resistance-breaking phages (Table 1)). Evolutionary asymmetry between hosts and phages is not unique to our system. This asymmetry, in which hosts outplay phages, has been observed in different model systems from a variety of environments including the gut, soil, phyllosphere and the oceans (Lenski, 1984; Bohannan and Lenski, 2000; Gomez and Buckling, 2011; Koskella and Parr, 2015; Scanlan et al., 2015, this study), although the form in which this asymmetry is manifested may vary for different model systems. The advantage for the host may be due to the relative ease of acquiring a loss-of-function mutation in a host cell surface molecule, whereas obtaining a specific gain-of-function mutation by the phage is likely to be more limited. This would be most severe when a host mutation leads to complete loss of the phage receptor, as was previously suggested by Bohannan and Lenski (2000). Irrespective of the reasons for the asymmetry, these findings indicate that the coevolutionary mechanisms supporting coexistence are not necessarily a mirror image between hosts and their phages. Rather, multilayered coevolutionary processes, of arms race, fluctuating selection and passive host-switching, are likely to be in play at any one time among different subpopulations of each of the antagonists and these may well differ between microbial hosts and their viruses.