Introduction

The type III secretion system (T3SS) is believed to confer a major fitness advantage to a wide range of Gram-negative bacteria that are pathogens of plants or animals. By delivering effectors that suppress the defense mechanisms of hosts (Hueck, 1998; Cornelis and Van Gijsegem, 2000; Block and Alfano, 2011), it contributes to creating more favorable conditions for bacterial multiplication in the host. The functions and evolutionary origins of this secretion system have been intensively explored because of its critical role in pathogenicity (Troisfontaines and Cornelis, 2005). Three complementary analytical approaches are needed to attain a full understanding of the significance of the T3SS for bacterial fitness, pathogenicity and evolution. These approaches are as follows: mutational analyses of model strains and population genetics and comparative genomics of strains representative of pathogen diversity. For plant pathogenic bacteria, the latter two have been hindered by the research focus on highly virulent crop pathogens and a concomitant neglect of the pathogen diversity present on wild plants and in the environment in general. This neglect of pathogen diversity is partly due to the narrow species descriptions of plant pathogenic bacteria, which include the ability to induce a hypersensitive response (HR) on tobacco (see for example the LOPAT test used to identify Pseudomonas syringae, (Lelliot et al., 1966)). This narrow species description automatically excludes plant pathogen species members that might be missing a T3SS as induction of an HR is an indicator of a functional T3SS (He, 1996). Strains deficient in their capacity to induce HR (HR) have recently been described for Erwinia pyrifoliae (Jock et al., 2003), Pectobacterium sp. (Kim et al., 2009) and P. syringae (Mohr et al., 2008; Clarke et al., 2010; Kniskern et al., 2011). All strains examined in these studies were isolated from plants (wild or cultivated) or from leaf litter in cultivated fields. The HR strains of Pectobacterium represent two species, P. carotovorum and P. wasabiae. But the P. syringae strains are remarkably homogenous. All HR strains of P. syringae characterized to date represent a single phylogenetic group and consistently lack the canonical hrp/hrc cluster (Mohr et al., 2008; Clarke et al., 2010; Kniskern et al., 2011). Furthermore, these strains harbor a novel hrp/hrc cluster with limited effectiveness in expressing and/or delivering effectors to plant cells. HR strains of E. pyrifoliae and P. syringae were reported to be incapable of causing disease on the hosts tested, whereas the HR strains of Pectobacterium spp. maintained pathogenicity on the original host from which they were isolated. The considerable role of pectolytic enzymes in the pathogenicity of Pectobacterium spp. might explain why there was no loss in pathogenicity for this group of bacteria.

Recent work on the ecology of P. syringae in nonagricultural habitats revealed the existence of nine new clades of this bacterium that had not been previously observed (Morris et al., 2010). All phylogenetic descriptions of this species before the study of nonagricultural habitats focused on strains from plants. The extensive phylogenetic diversity observed in nonagricultural habitats could also be a source of genetic diversity of the T3SS and of pathotypes. Strains in these clades are clearly apt at surviving in a wide range of habitats, such as snowpack, water and epilithic biofilms. The key traits for adaptation to these specific habitats, and for enabling many strains of P. syringae to be fit in multiple habitats including plant tissues, have not been identified. In particular, the role that the T3SS has in this adaptation has not been explored. Hence, we have explored the frequency of occurrence and the genetic diversity of strains of P. syringae, from nonagricultural habitats, that are HR in tobacco. This work reveals that HR strains are frequent in nonagricultural habitats and have diverse phylogenetic origins. Furthermore—and surprisingly—their incapacity to induce hypersensitivity is not necessarily indicative of their incapacity to cause disease on plants. This work provides a unique comparative foundation for identifying and studying the genetic factors that underlie evolutionary adaptation and ecological specialization of P. syringae.

Materials and methods

Origin of strains

A total of 63 strains of P. syringae were deployed in this study, including 40 strains unable to incite a hypersensitive reaction (HR) and 23 strains capable of inciting a hypersensitive reaction (HR+) obtained from 21 samples of water, snowpack, epilithic biofilms and wild plants as described in the Supplementary Table 1. Six reference strains were also included. Samples were part of a survey for hypersensitive incompetent strains that included a range of environmental substrates collected from 2004 to 2010, mostly in alpine regions and outside of agricultural fields (Table 1) and processed as described previously (Morris et al., 2007, 2008, 2010) to determine the density of P. syringae populations. Phenotypic traits that confirm identity as P. syringae were determined as described previously (absence of arginine dihydrolase and cytochrome oxidase). The 40 strains of P. syringae unable to incite a hypersensitive reaction used for this study were selected from that survey to represent the range of sample sites, substrates and phylogenetic diversity. Reference strains included the HR strains Psy508 and Psy642 (Mohr et al., 2008; Clarke et al., 2010) and the HR+ strains B728A (from the stock culture of S Hirano, University of Wisconsin, Madison, WI, USA), DC3000 and 1448A from J Greenberg (The University of Chicago), and Cit7 from SE Lindow (University of California, Berkeley, CA, USA). All strains were stored at −80 °C in 20% glycerol, and all phenotypic and molecular tests were conducted on cultures prepared from a single transfer of the storage cultures.

Table 1 Characteristics of environmental samples, from alpine regions and outside of cropped fields, used in survey for populations of Pseudomonas syringae incapable of inducing a hypersensitive reaction in tobacco (HR)

Hypersensitivity, pathogenicity and growth dynamics of bacteria in plants

For hypersensitivity tests, aqueous bacterial suspensions at 108 colony-forming units (c.f.u.) ml−1 from 48 h cultures on King's medium B (King et al., 1954) were infiltrated with a needleless syringe into the leaves of tobacco (Nicotiana tabacum L. cv. Xanthi). Hypersensitivity was scored as negative if necrotic patches did not develop in the infiltrated zone within 48 h after infiltration. Assays were repeated three times and included an HR+ strain as a control.

The capacity of strains to induce disease symptoms on cantaloupe seedlings was determined for strains, regardless of their capacity to induce hypersensitivity on tobacco. Aggressiveness on cantaloupe plants (Cucumis melo var. cantalupensis Naud. cv. Védrantais) was quantified by injecting 50 μl of bacterial suspensions (prepared as for hypersensitivity tests) into the junction of the cotyledons of 12 plants at the cotyledon stage (ca. 10 days after sowing) as described previously (Morris et al., 2010). Plants were incubated for 7 days at 25 °C during the daylight period (16 h) and 18 °C at night (8 h), and symptoms were observed daily after inoculation. Final disease severity was noted at 7 days after inoculation on a scale of 0–4: 0 (no symptoms), 1 (1 cotyledon wilted or discolored), 2 (both cotyledons wilted or discolored), 3 (both cotyledons completely wilted and the upper stem of the plant collapsed) and 4 (the plant collapsed at the base and the entire plant had fallen on the soil surface).

To determine population dynamics of strains in cantaloupe plants at the cotyledon stage, plants were inoculated and incubated in the same way as for pathogenicity tests. At each sampling time for up to 7 days after inoculation, 3–6 plants were randomly sampled from the 12 plants inoculated with each strain, weighed, and individually macerated in 10 ml of 0.1 M phosphate buffer. Aliquots of the macerate were diluted and plated on KBC medium (Mohan and Schaad, 1987), the same medium used to isolate these strains from the environment. Colonies were counted after 4 days of incubation at 22–25 °C in the dark. All colonies were fluorescent under ultraviolet light as expected and there were no indications of contamination of the plants with other bacteria.

PCR detection of genes in the hrp/hrc cluster

PCR detection was conducted on DNA purified from 48 h cultures on KB medium from strains stored at −80 °C as described above. Bacterial suspensions (1012 c.f.u. ml−1) prepared in sterile distilled water were used for DNA extraction with a Puregene DNA purification system cell and tissue kit (Gentra Systems, Minneapolis, MN, USA) following the manufacturer's instructions.

For detection of hrp/hrc cluster genes (hrpK1, hrpL, hrcC), CEL effectors (avrE1, hrpW1) and hopI1, primers described by Mohr et al. (2008) were used for amplification, whereas for phage detection we designed forward primer PhF (5′-TCAGGCCGCTTTCAACTTGGC-3′) and reverse primer PhR (5′-CCATAYGGCGCAGTGGTGG- 3′) based on the sequence of a putative transcriptional regulator gene (locus tag ABX64468) located within the prophage at the hrp/hrc locus of Psy508 (Mohr et al., 2008). To detect the presence of the hrcC ortholog of Psy642, colony PCR was performed as previously described (Clarke et al., 2010). Amplification conditions for detection of these eight genes are described in detail in the Supplementary Information section.

Southern blotting

To validate the results of PCR reactions, Southern blot analyses were performed for a set of strains representing the different profiles of absence and presence of the six conserved genes as described previously (Vinatzer et al., 2006; Mohr et al., 2008). Genomic DNA of P. syringae pv. syringae B728a, P. syringae 508, and hypersensitive incompetent P. syringae UB15, UB185, UB186, UB190, UB192, UB193, UB195, UB246, UB313, SZ30, SZ37, SZ45, SZ124, SZ131 and TA043 was digested with HindIII and EcoRI (New England Biolabs, Frankfurt, Germany). Digested DNA was electrophoresed on a 1% Tris-borate-EDTA agarose gel. Southern blotting, hybridization and detection was performed following the manufacturer's instructions (Roche Diagnostics, Indianapolis, IN, USA) using a mixture of digoxigenin-labeled PCR products of the six genes amplified from strain B728a and the digoxigenin-labeled PCR product of the amplification of a housekeeping gene (gapA) from strain B728a as probes.

Gene sequencing and phylogenetic analysis

From previous studies (Mohr et al., 2008; Clarke et al., 2010; Morris et al., 2010), the clade identity was known for 19 strains used here relative to the phylogenetic tree reported by Morris et al. (2010), and based on four housekeeping genes (rpoD, gyrB, cts and gapA). For the remaining strains, assignment to clades was based on our previous results that strains could be accurately assigned to clades based on DNA similarity in the cts sequence using a threshold dissimilarity of 1.8% (Morris et al., 2010). The sequence of the cts gene was determined as previously described (Morris et al., 2010). On the basis of the aligned and truncated (ca. 400 bp) cts sequences (DAMBE version 5.1.1 (Xia and Xie, 2001)), the genetic distance (dissimilarity) of each strain from each of the other 121 strains of P. syringae that constitute our previously reported phylogenetic tree (Morris et al., 2010) was determined using MEGA, version 3.1 (Kumar et al., 1994). There were no cases in which strains had ambiguous similarities (that is, there were no cases in which the similarity criterion was met for more than one clade).

If the similarity criterion was not met, the other three housekeeping genes (rpoD, gyrB and gapA) were sequenced. Details of PCR amplifications are described in the Supplementary Information section. A phylogenetic tree was constructed with the concatenated four housekeeping gene sequences from these strains and those from the ensemble of P. syringae, P. viridiflava and P. fluorescens used in a previous work (Morris et al., 2010). All strains fell into previously described clades.

Sequences of the PCR-amplified bacteriophage gene fragment were aligned with DAMBE version 5.1.1 and truncated to 302 bp.

Results

HR strains are abundant in nonagricultural habitats and can dominate the P. syringae population in some cases

HR strains were detected in 65% of the 126 environmental samples analyzed (Table 1) and were found in headwaters of rivers, snowpack, epilithic biofilms, wild plants and leaf litter. HR strains were also found in some samples of freshly fallen rain but not in any of the samples of fresh snowfall. When detected, the HR fraction of the total P. syringae population varied greatly, from 2 to 100%, and HR strains attained population densities of up to 105 c.f.u. l−1 of river water or 109 c.f.u. g−1 of plant tissue. In 20% of the samples, HR strains constituted 50% or more of the total P. syringae population (Figure 1). The frequency of HR strains in the total P. syringae population was not related to the P. syringae population density (Figure 1). For the samples analyzed here, P. syringae population density did not exceed 107 c.f.u. g−1 or c.f.u. l−1, except in one case. Strains from some of these samples were further characterized for the presence of components of the T3SS and for their pathogenicity to cantaloupe seedlings (Table 2).

Figure 1
figure 1

Fraction of HR strains in the total P. syringae populations in environmental samples in terms of the density of the P. syringae population in the sample. Population densities are expressed as log c.f.u. g−1 for biofilms, plant and leaf litter or log c.f.u. l−1 for water, rain and snow (as snowmelt). The data constitute values from 100 of the 126 samples described in Table 1. For these 100 samples, they were processed to permit quantification of the density of the HR component population as well as that of the total P. syringae population.

Table 2 Genetic and pathogenic diversity of strains of Pseudomonas syringae according to the phylogenetic context of the strains. The presence or absence of genes was based on PCR detection

HR strains lack at least one conserved effector gene

The conserved type III effector gene hopI1 (Jelenska et al., 2007) was absent from all HR strains. In most of these HR strains, a varying number (or even all) of the canonical hrp/hrc locus genes hrpK1, hrpL and hrcC and the conserved effector locus effector genes avrE1 and hrpW1 were also absent based on PCR detection and subsequent Southern blotting (Table 2). Interestingly, for about a dozen HR+ strains we were also unable to detect one or more of the six tested genes (Table 2). For the other 16 HR+ strains, all six genes were detected with PCR. Among the HR strains, the most frequent molecular profile was similar to that previously reported for strain Psy508 (Mohr et al., 2008) and Psy642 (Clarke et al., 2010) and for HR strains from Arabidopsis thaliana (Kniskern et al., 2011), viz. strains lacked all six conserved genes tested (Table 2). An example of the results observed with PCR detection and subsequent Southern blotting are presented in the Supplementary Figure S1.

HR strains lacking all six tested hrp/hrc genes and conserved effector genes are found in multiple distinct clades

In all previous descriptions of HR P. syringae lacking the six tested genes, all strains fell into a single phylogenetic group, clade 2C (Mohr et al., 2008; Clarke et al., 2010; Kniskern et al., 2011). Clade 2C corresponds to clade SZ-030 in our previous work (Morris et al., 2010)) and we will thus continue calling this clade SZ-030 here. In this present work, HR strains lacking all six conserved genes of the canonical hrp/hrc locus were found in clade SZ-030 and in three additional phylogenetic groups, clades TA-002, UB-246 and Group 2b (Table 2). In addition to the previously reported strains from plants (cultivated and wild plants, weeds, debris), strains lacking the canonical hrp/hrc locus were isolated from snow pack and headwaters of rivers.

Previous descriptions of strains in the SZ-030 clade indicated that they possess a distantly related variant of the hrcC gene (Clarke et al., 2010), called hrcCPsy642 from here on. The presence of hrcCPsy642 was observed only in HR strains lacking all the six conserved genes in the canonical T3SS. This included all strains in the SZ-030 clade tested here and one strain outside of this clade, strain USA-050. The sequence of a 560-bp long fragment of the USA-050 hrcC allele was found to be 97% similar to that of strain Psy642.

Some but not all P. syringae strains from French sources that lack the canonical hrp/hrc locus island carry identical sequences originating from a bacteriophage

Previous work reported that strain Psy508 lacking the hrp/hrc locus harbors a prophage downstream of the associated tRNAleu (Mohr et al., 2008). We sought to determine whether all strains lacking the canonical hrp/hrc locus island had this same sequence because its placement in the genome is suggestive of its potential role in the absence of this island. PCR with primers for one of the prophage genes of Psy508, a putative transcriptional regulator, revealed the presence of the prophage sequence in 21 strains lacking the hrp/hrc locus, including the reference strain Psy508. Strains containing the phage sequence were found in all but one of the clades that contained strains without the hrp/hrc locus, viz. clades SZ-030, TA-002 and UB-254 (Table 2). However, the phage sequence was not present in all strains lacking the hrp/hrc locus; it was found in 18 of 23 strains in clade SZ-030, in 1 of 12 strains in clade TA-002, in 2 of 2 strains in clade UB-246 and not in the strain from the Group 2b clade. It was not detected in any strain for which at least one gene in the canonical hrp/hrc cluster was detected. The nucleotide sequence of the prophage was determined. The 380-bp sequence was identical among all strains collected in France and showed 91% similarity with the prophage sequence reported for strain Psy508 isolated from New York (USA; Mohr et al., 2008).

Incapacity to induce HR is not necessarily indicative of the incapacity to cause disease or to persist in plant tissue

For many species of plant pathogenic bacteria, and in particular for P. syringae, the ability to induce hypersensitivity has been considered to be an indicator of the capacity to cause disease in susceptible host plants (He, 1996). Similarly, the inability to induce hypersensitivity is considered indicative of the absence or incapacitation of a functional T3SS and hence of the incapacity to cause disease. Therefore, it is not surprising that most HR strains were not pathogenic in an assay on cantaloupe seedlings. In all clades, with the exception of clade TA-002, HR strains caused at most weak disease symptoms (score=1) in 1 of the 12 inoculated seedlings after 7 days of incubation. However, HR strains in clade TA-002 were among the most aggressive on cantaloupe of the strains tested in this study. In particular, of 15 strains characterized in the TA-002 clade, 13 were HR and 12 lacked all six genes in the canonical hrp/hrc locus; 8 of these strains caused compatible disease reactions on 75% or more of the cantaloupe seedlings that were inoculated (Table 2; Figure 3). The range of pathogenic capacity of these strains, indicated by their aggressiveness on cantaloupe, was similar to that observed among HR+ strains in the broad host range clades Groups 2a and 2b (Morris et al., 2010).

When inoculated into cantaloupe seedlings, HR+ strains with complete canonical T3SS (CC0654, DC3000) or with a partial set of the canonical alleles (USA-032) increased their mean population size by a factor of 10–30 within 4 days after inoculation (Figure 2a). For CC0654 and USA-032, this corresponded to marked disease symptoms, but for strain DC3000 only one plant manifested slight symptoms (disease score=1) during the incubation period (Figure 2b). HR strain TA-043 showed a slight population increase at 2 days after inoculation and afterwards stabilized around the initial population size although disease symptoms were expressed. For the HR strains SZ-030 and CC1504, population size decreased slightly and stabilized at around 107 c.f.u. per plant. As expected, no disease symptoms were observed for these strains. For all strains considered together, plants with no symptoms harbored population sizes up to 3 × 108 c.f.u. per plant. Populations in plants with a disease score of 1 ranged in size from 2 × 107 to 1.5 × 108 c.f.u. per plant; those with a disease score of 2 or more had populations of 5 × 107 to 6 × 108 c.f.u. per plant.

Figure 2
figure 2

Population dynamics of P. syringae (a) and disease (b) in cantaloupe seedlings after inoculation with either HR+ or HR strains. HR+ strains are represented by solid lines and symbols (♦ CC0654; DC3000; ▪ USA-032) and HR strains by dashed lines and open symbols ( TA-043; SZ-030; □ CC1504). At each sampling time, 3–6 seedlings were cut off at the soil level and individually macerated to determine population densities. Disease was rated on each of the 3–6 plants sampled. Plants inoculated with strains SZ-030 and CC1504 did not show any disease symptoms. Only one plant inoculated with strain DC3000 showed disease symptoms at 2 days after inoculation. For strain TA-043, half of the inoculated plants showed symptoms at 7 days after inoculation. Error bars indicate the s.e.m.

Figure 3
figure 3

Reaction of cantaloupe plants at the cotyledon stage to inoculation with HR+ and HR strains of P. syringae after 7 days of incubation: plants were inoculated with (a) sterile distilled water, (b) HR strain CC1524 in the clade of the same name (c) HR+ strain CC0094 in the Group 2b clade, or (d) HR strain TA043 in the TA002 clade. Plants shown here were chosen at random among the 12 plants inoculated for each strain.

Discussion

The natural occurrence of variants of bacterial plant pathogens with apparent deficiencies in their T3SS has raised questions about the significance of the T3SS for the fitness of these bacterial species. For P. syringae, it has been proposed strains deficient in their overall pathogenicity have adapted to reduce the fitness costs of host-specific virulence in an environment where there is a series of ephemeral host plants (Kniskern et al., 2011). Absence of an effective T3SS could widen the range of plant species on which strains can grow without triggering an HR. In fact, many of the HR strains have the same level of reproductive fitness in plants as pathogenic strains—in absence of symptoms (Clarke et al., 2010; Kniskern et al., 2011). For the strains tested here, induction of disease symptoms in cantaloupe was associated with increases of population densities of about 1 order of magnitude. However, HR strains were able to persist in plants at densities above 106 c.f.u. per plant (or 106 c.f.u. g−1, data converted to a per g basis not shown here), which is on the high range of P. syringae population densities naturally observed in wild plants and symptomless crops (Morris et al., 2008). Ephemeral weeds and wild plants might be part of the selective pressures leading to the successful survival of strains of P. syringae deficient in their T3SS. However, P. syringae is clearly apt at inhabiting a range of nonplant and nonliving substrates, thereby suggesting that this species has considerable fitness in these substrates. The proliferation of strains deficient in the T3SS might result from selection pressures specifically related to these substrates. Here we have shown that T3SS-deficient P. syringae can be found not only in wild plants but also in epilithic biofilms, snow pack and river and lake waters in relatively pristine environments and can compose the dominant fraction of the P. syringae population in these substrates. These observations strongly suggest that deficiency in the canonical T3SS does not confer a marked loss of fitness of P. syringae in the context of the ensemble of habitats it exploits under natural conditions. T3SSs are considered to be all-purpose tools for undermining eukaryotic biology, thereby assuring an advantage for the bacterium (Pallen et al., 2005) not only during pathogenesis but also during symbiosis (Downie, 2010; Marchetti et al., 2010) and to foster mutualism or commensalism in the rhizosphere (Lugtenberg and Kamilova, 2009) or the mycosphere (Nazir et al., 2010). Co-existence of HR+ and HR strains would suggest that in certain environmental habitats, the subversion of eukaryotic cell biology—that of the wild plant host or of the algal or fungal co-inhabitants of the same biofilm, for example, has no decisive role in the fitness of P. syringae. This might be because the population densities that they attain under natural conditions are not large enough to trigger detrimental host responses. However, we cannot rule out that T3SSs in some HR strains, for example, the T3SS in clade SZ-030 (Clarke et al., 2010; O’Brien et al., 2010), have a role in the interaction with eukaryotes and are equally effective as the canonical T3SS in suppressing host responses.

We have also observed that pathogenicity does not necessarily depend on the canonical T3SS. The surprising capacity of HR strains to cause disease symptoms in cantaloupe was confined to one clade (TA-002), but the symptoms induced were of the same range of severity as those caused by HR+ strains across diverse clades. The type III injectisome is usually one of many different munitions in the whole arsenal used to attack plant cells, and is often complimented by toxins and cell wall-degrading enzymes. Pathogenicity of strains in the TA-002 clade might involve a range of arms independent of a type III injectisome. However, no syringomycin-like broad host range toxins have been detected in these strains according to bioassays (data not shown). Alternatively (or in addition), pathogenicity of strains in the TA-002 clade might be the result of an unknown T3SS that does not elicit hypersensitivity in tobacco but that is capable of delivering effectors. Genome sequencing has led to an important broadening of the known diversity of T3SSs (Pallen et al., 2005). Current efforts to analyze fully sequenced genomes of P. syringae strains from nonagricultural habitats, including HR strains, will eventually reveal the full range of potential pathogenicity factors of P. syringae. Nevertheless, this result is troubling because it weakens the utility of the hypersensitivity test in disease diagnostics. This test has been a key criterion to separate pathogens from saprophytes. Furthermore, all papers reporting HR strains of P. syringae have referred to them as ‘nonpathogenic’. If strains in the TA-002 clade have not been reported as the causal agent of epidemics of crop disease it might be due to their incapacity to do so under field conditions, but more likely it is due to the fact that a positive hypersensitive reaction is generally used early in the diagnostic process as a criterion for targeting candidate causal agents.

The absence of the canonical T3SS in multiple disparate clades raises interesting questions about the processes leading to the acquisition or the loss of the canonical T3SS in P. syringae. There is strong evidence that the canonical T3SS was acquired by the most recent common ancestor of the P. syringae species complex (Sawada et al., 1999) and has evolved via diversifying selection involving mostly mutations (Guttman et al., 2006). However, those phylogenetic studies ignored the existence of nine clades of P. syringae, five of which are more deeply rooted in the phylogenetic tree than the other previously described clades (Morris et al., 2010), the most deeply rooted being clade UB-246. We speculate that the canonical T3SS was acquired at the time of diversification of UB-246 from all of the other clades. The most parsimonious explication for T3SS deficiency in the other clades is subsequent loss by the ancestor of the clades in which it is currently absent. However, clade TA-002 contains several strains that effectively induce hypersensitivity and for which some, but not all, of the six conserved genes were detected. This suggests a diversification of the genes beyond what could be detected with the primers used here, designed based on only three completely sequenced P. syringae genomes (Mohr et al., 2008). For all the strains in the CC1524 clade, closely related to TA-002, we could only detect some of the six conserved genes in the canonical T3SS. Similarly, for all strains in clades UB-370, USA-102 and USA-032, which are together in another branch of the phylogenetic tree, some, but not all, of the six conserved genes were detected in these otherwise HR+ strains. This suggests that divergence of the alleles in the canonical T3SS is more rapid in some branches than elsewhere in the phylogenetic tree of P. syringae. Our observations highlight that natural habitats likely harbor a greater diversity of alleles of the genes in the T3SS of P. syringae than currently observed in strains from crops thereby raising exciting questions about the nature of the drivers of this diversification.

The presence of a bacteriophage sequence in strain Psy508 near the site of the (missing) canonical hrp/hrc cluster (Mohr et al., 2008) raised questions about the link between the presence of this phage and the absence of the canonical T3SS. Here we show that, although this prophage sequence is present only in strains lacking the canonical T3SS, it is not universally present in all strains with this deficiency. Furthermore, acquisition of the phage seems to have occurred multiple times and perhaps at distinctly different dates as indicated by sequence divergence between the French and US strains. This suggests that there were multiple and very recent acquisitions of the same phage in clades TA-002, UB-246 and SZ-030 in France, and that strain Psy508 acquired a variant of the same phage or had acquired it at a much earlier date. We must also consider that some strains in which we did not detect the phage may have at one time acquired it and then lost it again.

For the two functional T3SSs described to date in P. syringae (Clarke et al., 2010; Tampakaki et al., 2010), our observations suggest an exclusivity of the type of T3SS harbored in the genomes of P. syringae. Similarly in P. viridiflava, strains harbor only one of the two types of T3SS identified by Bergelson and colleagues (Araki et al., 2006). However, Pallen et al. (2005) reported that certain bacteria (for example, Burkholderia spp.,) can accumulate different T3SSs. The secrets of T3SS polymorphism in P. syringae will be disclosed by genomic analyses that incorporate the novel strains described here.

Our results also have implications for understanding which habitats have been important in the evolutionary history of P. syringae. Compared with other members of the genus such as P. fluorescens, P. syringae has a much reduced capacity to use carbon sources (Sands et al., 1970). It has been suggested that the ancestor of P. syringae lost a range of metabolic functions, as an adaptation to the nutritionally constrained environment characteristic of the plants compared with the more nutritionally diverse and copious environments encountered by P. fluorescens as a saprophyte (Mithani et al., 2011). This analysis was based on genomes of P. syringae in the Groups 1, 2 and 3 clades. In light of the new clades of P. syringae that our work has revealed, it would be important to consider the metabolic adaptations of the strains in the UB-246 clade, the most deeply rooted of the known P. syringae clades and represented solely by HR strains. If they resemble those of P. syringae in general, then this would suggest that adaptation to a nutritionally constrained environment would be most probably for success as a resident of pristine freshwater habitats. We propose that loss of metabolic capacities in P. syringae might have originally been an adaptation for the oligotrophic conditions of certain freshwater habitats. Dissolved organic carbon sources in freshwater generally originate from plants (leaching and degradation). The ability to use sparse quantities of plant-derived carbon sources could have prepared P. syringae for a more intimate association with plants, but the intimate association with plants itself might not have been the initial selection pressure for adaptation to oligotrophic conditions.

The persistence of nonpathogenic variants of plant and animal pathogens in the environment has been a springboard for the development of biological control for plant pathogens (van Alfen, 1982; Kerr, 1987; Larkin et al., 1996) and of diseases of human and animal intestinal and urinary tracts (Kruis et al., 2004; Schneitz, 2005). The diversity, abundance and ubiquity of nonpathogenic strains of P. syringae suggest that competitive exclusion of pathogen strains by nonpathogenic conspecifics as a means to control disease might be worth exploring. Competitive exclusion between ice nucleation active and inactive variants of P. syringae has successfully limited frost damage induced by this bacterium under experimental conditions (Lindow, 1987). Inundative biological control is the most obvious of the potential practical applications of this work, but strategies for plant breeding or crop management that foster proliferation of nonpathogenic strains of P. syringae might also have potential to mitigate disease development.