The impact of modern agriculture on the evolutionary trajectory of plant pathogens is a central question for crop sustainability. The Green Revolution replaced traditional rice landraces with high-yielding varieties, creating a uniform selection pressure that allows measuring the effect of such intervention. In this study, we analyzed a unique historical pathogen record to assess the impact of a major resistance gene, Xa4, in the population structure of Xanthomonas oryzae pv. oryzae (Xoo) collected in the Philippines in a span of 40 years. After the deployment of Xa4 in the early 1960s, the emergence of virulent pathogen groups was associated with the increasing adoption of rice varieties carrying Xa4, which reached 80% of the total planted area. Whole genomes analysis of a representative sample suggested six major pathogen groups with distinctive signatures of selection in genes related to secretion system, cell-wall degradation, lipopolysaccharide production, and detoxification of host defense components. Association genetics also suggested that each population might evolve different mechanisms to adapt to Xa4. Interestingly, we found evidence of strong selective sweep affecting several populations in the mid-1980s, suggesting a major bottleneck that coincides with the peak of Xa4 deployment in the archipelago. Our study highlights how modern agricultural practices facilitate the adaptation of pathogens to overcome the effects of standard crop improvement efforts.
The emergence of plant pathogens in agricultural ecosystems represents a threat to food security. Selection for underrepresented virulent clones has become a hallmark of modern agriculture . While natural selective processes affecting the biology of plant pathogens have been defined, the contribution of human interventions in shaping pathogen populations in agricultural systems is poorly understood. The exponential growth of monocultures during the Green Revolution, as a response to increasing human population, has given some opportunities to understand the effect of directional selection on biotrophic pathogens with a high reproduction rate.
In the Philippines as many other countries in Asia, rice is a staple food. Early archeological records of rice production date back to sometime after 1500 B.C. , but the first signs of crop selection did not start until 1901 . For the most part of the 20th century, farmers used locally adapted landraces that were grown in relative isolation from other parts of Asia. The situation changed in the early 1960s after the International Rice Research Institute (IRRI) was established and the country emerged as a natural laboratory for testing rice varieties during the Green Revolution. This is where semi-dwarf high-yielding varieties, such as IR8 were first bred, tested, and rapidly spread across Asia, replacing traditional landraces . Nevertheless, IR8 was soon found susceptible to bacterial blight caused by Xanthomonas oryzae pv. oryzae (Xoo). In 1967, breeders released IR20 carrying Xa4, a bacterial blight resistance gene located at chromosome 11 . The genomic region carrying this gene was then frequently used in breeding programs for decades to come. By the mid-80s, Xa4 has spread in millions of hectares across Asia within IR64, one of the first rice mega varieties developed . Xa4 appears to encode a cell wall-associated kinase that promotes cellulose synthesis and reinforcement during pathogen attack .
The fact that virulent Xoo populations emerged in the Philippines after the deployment of Xa4 opened the case to understand the effect of human interventions in the population structure of a plant pathogen. While other resistance genes (i.e. xa5, xa13, Xa7, or Xa21) were used in the breeding programs, their deployment appears to be marginal or not significant compared with Xa4. In the current study, we took advantage of a unique historical record from the Philippines archipelago to examine the effect of a major selection process in the population of Xoo that occurred during the Green Revolution. We traced the presence of Xa4 in the overall rice germplasm and showed that continuous deployment reached at least ~80% of the total rice-farming area. This massive selection pressure, in fact, favored the emergence of Xa4-overcoming groups. Evidence of selective sweep suggests that several populations were affected during short periods of time. Overall, we documented how modern breeding schemes were instrumental to shape contemporary pathogen populations. This finding is particularly important because it expands our understanding on how to contain emerging pathogens in modern agriculture.
Material and methods
Mining of Xa4 alleles in the 3000 Rice Genomes Project
We used the Xa4 gene sequences described in Hu et al.  and BLASTn  in the indica rice genome 93-11 . The physical location of Xa4 in the 93-11 genome was at locus position 610,197 to 615,135 base pair (bp) (Os9311_04g012750). We used this information to extract the gene sequences of Xa4 in the 3000 rice panel stored in the Rice single nucleotide polymorphisms (SNP)-seek Database . The genes were translated into amino acid sequences and aligned using MUSCLE . We further focused on the 45th position of Xa4 and assessed allele variants (Additional Data S1). The alleles observed at that position: D/D, E/E, D/E, and stop codon. The glycine residue (D/D) is predicted to be associated with the resistance phenotype .
Evaluation of Xa4 activity in a 1.2K diversity panel
To characterize the rice diversity panel, a total of 1263 accessions were grown in the quarantine area at the IRRI plot field. All plants were inoculated with three strains of Xoo from different races. The isolates used were PXO61 (Race 1), PXO86 (Race 2), and PXO99A (Race 6). The experiment consisted of three replications in a randomized design. IRBB near-isogenic lines (Table S1) were used as controls. Due to the varying maturity of the accessions, staggered planting was performed. Bacteria were grown for 72 h at 28 °C on modified Wakimoto’s medium [0.5 g of Ca(NO3)2·4H2, 0.82 g of Na2HPO4, 5 g of peptone, 20 g of sucrose, 300 g of potato 15 g agar per liter of water]. These were suspended in distilled water, adjusted to 109 CFU/ml, and used as inoculums. Inoculation was done at maximum tillering stage (45–50 days after sowing) via leaf-clip method as described by Mew et al. . Disease assessment was scored by lesion length in centimeters 14 days post inoculation (dpi) and highlighted varieties with disease reaction as resistant (R < 5 cm) or susceptible (S > 15 cm).
Assessment of Xa genes in modern rice varieties in the Philippines
To determine the varietal share or proportion of area planted with released varieties in the Philippines, we first obtained the publicly available data from a consolidated source at the National Seed Quality Control Services and the Philippine Rice Research Institute. Surveys were performed from 1985 to 2009 . We categorized released varieties into Xa4-containing and noncontaining varieties. In addition, a total of 109 varieties, released between 1970 and 2010, were selected to detect the presence of Xa genes. These varieties include irrigated and lowland ecosystems. Genomic DNA from selected accessions was extracted using the standard CTAB method  and was quantified and normalized by ND-1000 NanoDrop spectrophotometer (NanoDrop Technologies; http://www.nanodrop.com) to a concentration of 50 ng/μl. Molecular markers for Xa4, xa5, and Xa21 specific alleles were used in PCR to discriminate the Xa-containing genotypes as described [14, 15].
Bacterial strain collection and pathogenicity test
We accessed 1822 randomly collected Xoo live cultures maintained at IRRI, some of these records were reevaluated from Quibod et al.  or are part of recent collections. The metadata includes isolate code, isolate phenotype, collection site, and year of collection (Additional Data S2). All strains were randomly isolated from naturally infected epidemics occurring from 1972 to 2015 in different regions in the Philippines. The race of each strain was determined based on the pathogenicity reactions to differential varieties carrying single-resistance genes  or a set of the near-isogenic lines. A complete list of IRBB lines is shown in at Table S1. Strains collected were cultured and revived using modified Wakimoto’s medium.
For this study, we used 91 Xoo strains from different races and locations for subsequent pathogenicity tests (Additional Data S3). We phenotypically recharacterized the 91 Xoo strains based on seven known resistance genes (Table S1) and IR24. The seven lines used are as follows: IRBB4, IRBB5, IRBB7, IRBB10, IRBB13, IRBB14, and IRBB21. A split plot design with rice cultivar as the main plot and bacterial strain as the subplot was employed in virulence tests. The selected five fully expanded leaves in each of the three plants were inoculated as described above. Xoo was grown and maintained following the methods described above. Disease reactions were scored by measuring the expressed lesion length in centimeters at 14 dpi with the following designations: <5 cm = resistant (R), 5–10 cm = moderately resistant (MR), 10–15 cm = moderately susceptible (MS), and >15 cm = susceptible (S).
Genome sequencing, assembly, annotation, pan-genome, and SNP mining
We selected 80 representative Xoo strains from different Philippine Xoo races (Additional Data S3) which were isolated across various parts of the Philippines for sequencing. The other Xoo strains were already sequenced using PacBio [16, 18, 19]. Ion Proton was used to sequence single-end reads and reads between 1.2 and 2.6 million with mean read length of 158–174 per strain were acquired. The Xoo cultures were grown overnight at 30 °C and extraction of DNA was done using the Easy-DNA kit (Invitrogen, USA) following the manufacturers’ protocol. De-novo assembly and error read correction were performed using MIRA 3.4.1 . As suggested by Baez-Ortega et al. , the optimized parameter group of importance for the assemblies are as follows: Align (-AL:mo = 19, -AL:ms = 15), Assembly (-AS:nop = 5, -AS:rbl = 3, -AS:urd = no, -AS:ardml = 200, -AS:mrl = 40), Clipping (-CL:qcmq = 20), and MISC (-MI:lcs = 500). The resulting contigs from the assembly were evaluated using the quality assessment tool QUAST 2.3  and aligned to the reference genomes of Xoo strains PXO99A and PXO86. In addition, contigs with a total length of less than 500 bp were removed. The number of contigs obtained ranges from 803 to 1188 with a mean total size of 4.7 Mb. The assembled genome coverage was from 36.5× to 81.8×. Furthermore, the final Xoo contigs were oriented through ABACAS 1.1  using the nearest complete Xoo genome strains as the reference according to Fig. 2a. Prokka  was applied for gene calling and annotation. After annotation, pan-genome analyses were constructed for genes and IGRs through Roary 3.11.0  and Piggy , respectively. Core genome alignment and variant calling were performed using Parsnp which is part of the Harvest suite 1.1.2 . The core genome of all the 91 Xoo strains is ~3.3 Mb with ~12,900 SNPs.
Phylogenetic, molecular clock, and population genetic structure analyses
Phylogenetic reconstruction was evaluated using the core genome alignment and executed via a maximum-likelihood approach implemented in RaxML 8.2.9 . Statistical confidence for each node was passed to 1000 bootstrap run, employing the general time reversal model of nucleotide substitution with the Gamma model of rate heterogeneity. With the same parameters, we also performed a global phylogenetic analysis using SNPs from strains listed in Additional Data S3.To perform the molecular clock analysis, metadata of each PX-A lineage strains were included. SNPs with putative recombination events were masked. We used TempEst v1.5.1  to estimate the divergence rate via a root-to-tip branch length linear regression. BEAST v1.8.4  was used for a Bayesian time-calibrated phylogenetic reconstruction with five replicates under a strict clock and three different population models. The nucleotide substitution configuration was selected with the general time-reversible model. Each run was allowed to continue for 100 million iterations, sampling the posterior every 1000th iteration. Run logs were combined using LogCombiner v1.8.4. Trees were summarized using TreeAnnotator v1.8.4. The effective sample size for all the parameters of interest was greater than 200. Each branch of maximum clade credibility tree was reported at greater than 50% posterior probability. The trees produced in the analysis were visualized using the R package ggtree .
To understand the level of substructuring we performed a hierarchical-clustering approach implemented in HeirBAPS  using the SNP data. Estimated BAPS clusters were obtained through five independent runs through two levels of nested clustering with a prior upper bound interval of 10–30. The nested genetic populations of the first and second levels are 6 and 20, respectively. Clustering was also validated using Principal Component Analysis (data not shown). Using the core genome, we calculated the genome-wide nucleotide diversity (pi), Wright’s fixation index (FST), and Tajima’s D with the assistance of the R package PopGenome . Each of the measured genetic analyses used sliding windows of 1, 5, and 10 kb pair at different population structure groups which is also subdivided as before or after the mid-1980s.
Detection of recombination and selection
ClonalFrameML 1.25  was used to detect recombination events in the Xoo core genome alignment obtained from the Harvest suite. In addition, the tree obtained from RAxML was loaded as the starting tree. The standard ClonalFrame model was operated to collect an initial result for inferred recombination with 1000 bootstrap. For selection detection, we assembled a codon alignment from SNPs mapped from the PXO99A genome annotation. Signals for selection were inferred applying FUBAR  which is implemented in HyPhy 2.2.4 . For FUBAR parameters, 10 MCMC chains were used, each consisting 1,000,000 iterations. The first 50,000 were discarded as burn-in and 300 postburn-in samples were drawn with the Dirichlet prior set to 0.5. Evidence of positive selection for each codon was identified at posterior probability ≥ 0.90. Genes with codons under strong-positive selection were grouped into different Gene Ontology (GO) terms using Blast2GO . GO enrichment analysis was performed with the R package clusterProfiler  and the p-values were corrected for multiple comparisons under Benjamini–Hochberg procedure. Enriched levels have adjusted p-value of less than 0.05 and q-value of less than 0.05.
Identification of Xa4-associated interaction
Microbial association study was performed using the R package treeWAS . The genotypic data input used was a combination of SNPs and the Roary and Piggy presence–absence matrix. The phenotypic data for the association study were from the lesion length produced in IRBB4. The phenotypic data are used in three ways: (1) binary which is either susceptible (S) or resistant (R), (2) discrete which is susceptible, MS, MR, or resistant, and (3) continuous which is the measured lesion length data in cm. Table S2 shows how the qualitative phenotypic data are grouped according to lesion length data. treeWAS uses three kinds of association test based on population structure correction and these are simultaneous, subsequent, and terminal. A genomic variant is considered linked to Xa4 if the p-value is <0.01 in nine or eight intersecting sets on phenotype-treeWas association. Bonferroni correction was used.
Xoo diversity from an endemic area in the Philippines
To detect the representation of Xoo diversity in a field, we established a trapping system by deploying 30 near-isogenic lines with single and multiple Xa resistance genes (Table S1) in a farmer’s field where high disease pressure of bacterial blight was constantly observed. This experiment was conducted in two consecutive rice growing periods during the wet season in 2014 and 2015 in Victoria, Laguna. The established field was divided into microplots. Each near-isogenic line was grown in 3-m2 microplot area. The experiment was laid in randomized complete design with three replications per line. Rice seedlings were transplanted at 21 days after sowing with 20 × 20 cm planting distance between hills. A total of 45 hills were transplanted per microplot, each hill consisting of three seedlings. IR24, a susceptible line, served as border between microplots with two rows transplanted at the same time as the tested lines. Natural infection of bacterial blight on rice plants was assessed during the peak of the disease between the late tillering to heading growth stages of the plants. The assessment for disease severity was done by randomly selecting seven hills in each microplot. In each hill, five diseased leaves were randomly recorded for the percentage diseased leaf area following the IRRI Standard Evaluation System for Rice 5th edition. Rice leaves were collected in all near-isogenic lines for isolation and strain characterization through pathotyping as mentioned above.
Results and discussion
A major resistance gene defined the Green Revolution in the Philippines
Allele frequency of resistance genes tends to experience drastic changes in agriculture compared with natural systems . To estimate the frequency of Xa4 before the introduction of improved varieties, we investigated the allelic diversity and phenotypic pattern of Xa4 in two different rice diversity panels representing similar genetic groups. Recently, Hu et al.  speculated that residue D45 might be associated with the resistance phenotype. We first mined 3000 rice genomes  and found that alleles containing such residues are present in more than 62.5% of the accession worldwide (Fig. S1A, B; Additional Data S1). However, when we characterized the phenotypic response of 1263 nonredundant rice landraces to Xoo tester strains, less than 1.27% showed phenotypic patterns that are consistent with Xa4 activity (Fig. S2). This inconsistency suggests that D45 might not be associated with resistance in rice, or that an unknown suppressing mechanism in the pathogen prevent the allele from activating immunity. Nevertheless, it is highly likely that landraces used in the Philippines through traditional agriculture maintained resistance alleles of Xa4 at very low frequency.
Apparently, this situation changed with the rapid introduction and adoption of improved varieties during the Green Revolution. Between 1960 and 2010, more than 100 rice varieties were released in the Philippines’ national system . Using Xa4-linked markers, we found that more than 70% of those lines carry a genomic introgression with Xa4. In contrast, the proportion of xa5-  or Xa21-containing  varieties was not represented during the same period (Fig. S3). While previous reports suggested that Xa4 strengthens cell-wall integrity and confers beneficial agronomic traits [6, 45], it is unclear whether this effect contributed to its selection during field trials or it was a completely unintentional event . Likewise, farmers’ adoption rates of these varieties followed rapid changes in the landscape. By 1985, nearly 82% of the total rice-cultivated areas in the Philippines were under Xa4-containing varieties. Selection pressure of Xa4 reached a plateau in the late 1980s or early 1990s when the area reached a maximum of 91.5% in 1988 (Fig. S4). While the frequency of these varieties decreased gradually during the next decades, Xa4 appeared to be prevalent in the landscape (Fig. S4). Temporal and spatial variation of resistance alleles has been described in broad ranges of situations . However, in here we presented a case in which human intervention drove the expansion of improved varieties, causing significant alterations on the allele frequency of a single resistance gene within a semi-isolated environment.
Xa4 deployment is associated with a major population shift of Xoo
Artificial deployment of a single-dominant gene might generate enough selection to drive significant changes in the population structure of a plant pathogen in short periods of time . To investigate the effect of Xa4 deployment on Xoo populations, we profiled the pathogenicity reaction of 1822 isolates representing disease outbreaks that occurred in the Philippines between 1970 and 2015. Each entry was inoculated in a set of tester rice lines carrying different resistance genes (Table S1). The isolates were characterized into 11 races and four derived groups based on its phenotypic reaction (Additional Data S2). Overall, we found that Xa4-overcoming strains increased in frequency and this expansion peaked sometime in the early 90s (Fig. 1a). In the early 1970s, around 20% of field strains were able to grow on Xa4. Scientists started noticing a significant increase in the frequency of Xa4 virulent strains recovered after the release of IR20 . However, it was not until early 1990, which is also the peak of Xa4 deployment in the archipelago, when the frequency of Xa4 virulent outbreaks increased dramatically to 84% (Fig. 1a). To further characterize the population shift, we dissected the composition of Xoo races recovered before and after the outbreaks in the 90s. Interestingly, we noticed the emergence of a new race complex called 9com (Fig. 1b). Other races with different phenotypic background, such as Race 2 and Race 3, were systematically recovered during the intensification of Xa4 cultivation across the country (Fig. 1b; Fig. S4). Therefore, it is highly likely that the continuous cultivation of Xa4-containing varieties drove the expansion of Xa4-overcoming groups in a short period of time. Theoretical models that measure the effect of applying strong selection on field microbes have been described . These models can be used to describe an incidence in which human interventions change the evolutionary trajectory of a plant pathogen [48,49,50]. For instance, Yr17-overcoming strains of yellow rust were only detected in wheat varieties in Northern European countries where those varieties were planted in large areas . A similar scenario can explain the emergence of stem rust Ug99 overcoming Sr31 in wheat  or the spread of Fusarium wilt TR4 in banana . However, our study adds an extra layer of detail by using a traceable record of disease outbreaks in a semi-isolated environment over long periods of time to establish a clear correlation with agricultural deployment.
Xoo populations in the Philippines maintain distinct genomic features
To further characterize the genetic composition of key Xoo groups in the Philippines, we sequenced 91 strains to assess genetic ancestry, population structure, and genomic distribution of coding and noncoding elements (Additional Data S3). The assembled genomes represent ten races isolated across the archipelago during a period of 37 years (Fig. 2a–c). Using a maximum-likelihood analysis, we identified three groups matching reported Philippine Xanthomonas lineages, namely PX-A, PX-B, and PX-C . As reported earlier, PX lineages appear to be diverse (Fig. S5) and match ancestral lineages in South Asia  and Southeast Asia [54, 55]. Recently, Carpenter et al.  used whole-genome analysis to link PX groups with Indian Xoo lineages and found a similar pattern.
Population structure analysis based on allele frequencies grouped existing lineages into six modern populations: PX-A1, PX-A2, PX-A3, PX-B1, PX-B2, and PX-C1 (Fig. 2a) that indicate that events of local host adaptation and genetic exchange might follow the putative colonization of the archipelago and elsewhere. As suggested within other bacterial systems , we hypothesized that dispensable coding and noncoding regions will be shared by Xoo individuals with similar evolutionary histories. To test this hypothesis, we classified the 401,285 predicted open reading frames and 221,849 intergenic regions (IGR) into 15,716 orthologous genes and 11,741 IGR clusters. We plotted all single-copy genes and IGRs that were present in at least two strains and showed that all members of the same population tend to have similar sets of dispensable genomic features as well (Fig. S6A, B).
Pan-genomic dispensable elements across all populations account for 62% of the genes and 55% of the IGRs. However, the number of dispensable elements in each population ranges from 24 to 52% of the genes and 31 to 56% of the IGRs (Figs. S6A, B and S7), which indicates a highly diverse population. Even though several genes might be missing due to limitations during sequencing and assembly, the trend could indicate similar capabilities in each population as a result of ancestry but also of parallel adaptation to the local host. Furthermore, almost 21% of the pan-genome is comprised of mobile genetic element (MGE) gene clusters. This finding is also aligned with the distribution of transposable elements in Xoo  as those might play critical roles in spreading fitness-related genes in Xanthomonas genomes . Whether or not dispensable genomic features, such as genes or mobile elements, provided alternative functions with selective advantage is unclear , but under an evolutionary perspective our data is consistent with the idea that members of the same population share similar genes, ancestry, and might have similar adaptation potential.
Xoo populations show distinct signatures of mutation and recombination
Mutation and recombination are intrinsic forces that influence the evolutionary potential of a pathogen population . To investigate the evolutionary trajectory of Xoo, we estimated the rate of mutation and recombination across different PX populations. Overall, the ratio of recombination to mutation (R/θ) showed distinct patterns, suggesting that each PX population has a unique signature, and therefore a unique evolutionary trajectory (Fig. 3; Table S3). R/θ in the sample was 0.452 (95% CI: 0.451–0.453), the average fragment length was 938 bp (95% CI: 937–940 bp), and the divergence of each fragment was 0.006647 (95% CI: 0.006643–0.006651). Thus, genetic variation through mutation was twice more likely to occur than recombination (Fig. 4a), but because each recombination event introduces approximately six substitutions, recombination introduced substitution over mutation is 2.82 (95% CI: 2.81–2.83) times higher within recombinant fragments. These values were similar to those found in Indian Xoo populations  or what has been reported for other Xanthomonas species [59, 60].
Interestingly, R/θ values show different patterns across PX populations (Figs. 3 and 4a; Table S3). We found that recombination is more frequent in PX-A2, PX-A1, and PX-A3 where the genomes of these groups account for 64% of the overall recombinant SNPs found in genic and IGR (Figs. 3 and 4a, b), compared with PX-B1 and PX-B2 which both account for 35% of recombination signal. A total of 691 recombination events were observed, majority of which involved small recombination fragments with a length of less than 1000 bp. While the highest fragment detected was 13,696 bp, the average recombination fragment length in each population was quite small (Figs. 3 and 4b), suggesting that contribution might be rather ancestral and most likely impact general fitness. For instance, we found a recombinant hotspot in the genomic position between the PXO_02079 and PXO_02062 genes (Fig. S8) which are enriched in components from the restriction-modification system components that are involved in bacterial defense systems, epigenetic-related mechanisms, and recombination and genome rearrangements .
On the other hand, genetic variations due to mutation show significant differences across Xoo populations (Fig. 3a) with values ranging between 22.95% (PX-A3) and 0.13% (PX-C1). To assess the impact of these mutations in the fitness of each population, we calculated the rate of synonymous and nonsynonymous codon changes (dN–dS ratio) per population (Figs. 3c and S9A, D) using a rapid hierarchical Bayesian method . While the overall average dN–dS ratio per codon was 1.83 (Fig. S9A), we found signatures of positive selection (p-value < 0.05) enriched within genes involved in membrane trafficking, carbohydrate metabolism, proteolysis, and pathogenicity (Fig. S9B, C). Furthermore, we discovered that PX-A populations have significantly higher dN–dS ratios on average than PX-B populations (Fig. S9D). However, genes undergoing selection processes are enriched in populations PX-A2, PX-A3, and PX-B2 (Fig. 4c) which also coincide with the frequency of mutational SNPs (Fig. 4a). Our data support different evolutionary signals across Xoo populations in the archipelago. However, whether those changes were shaped by host adaptation or other environmental factors remains unclear.
Xoo appear to evolve different mechanisms to overcome Xa4
Plant pathogens evolved multiple strategies to evade or suppress host immunity . To investigate the genetic bases of Xoo adaptation to Xa4, we performed a phylogenetic-based genome-wide association  between SNPs, dispensable genes, and IGRs (Fig. S10) and three phenotypic datasets. Overall, we discovered candidate genetic regions linked to Xa4 virulence (Fig. S11A). We found 1544 genomic variants (SNP, presence/absence genes, and IGRs) showing significant association with the phenotype (Fig. S10). We focused on SNP-based data because it accounts for 85.16% of the significant hits (p ≥ 0.01). Furthermore, a total of 31 Xa4-associated SNPs was selected based on 8–9 intersecting sets of phenotype-treeWas association (Fig. S10). These mutations are located within coding regions or in the vicinity of 14 highly targeted genes (Fig. S11A; Additional Data S4). The mutations are likely to cause nonsynonymous substitution, frameshift, or expression polymorphism in known pathogenicity functions such as secretion system, cell-wall degradation, detoxification of reactive oxygen species, and lipopolysaccharide production (Fig. S11C). Interestingly, a number of selected SNP variations were also within genes subjected to purifying selection (Fig. S11B), which might indicate that variations target genes with important virulence functions. Our findings align with reports on other pathogens like Pseudomonas syringae which found that the majority of effector families undergo purifying selection as their main evolutionary pathway . Meanwhile, two genes (PXO_01955 and PXO_01954), which are also near the secretion system regulon hrpX and hrpG, showed positively selected SNPs (Fig. S11C). The hrpX and hrpG are crucial genes in the signaling and coordination of virulence effectors inside the host . While the genome-wide association’s resolution is relatively limited due to sample size, it is strong enough to point out to a range of pathogenicity-related regions in the Xoo genome. The multiple genetic signals also suggest that Xoo might evolve more than one way to adapt to Xa4, a strategy that has been suggested for bacterial adaptation to antibiotics .
Xa4 is a wall-associated kinase with pleiotropic effects including cell-wall reinforcement. Incompatible interaction with Xoo produces a rapid induction of the gene, leading to a resistance phenotype . Xoo likely evades recognition by modulating the components of the secretion system or preventing enzymatic degradation of the host cell. Another alternative is that unknown effector genes actively suppress rice innate immunity. While the identity of AvrXa4 is still unknown, the forthcoming functional characterization of these candidate genes will bring insights into the specific mechanisms of adaptation to Xa4.
PX-A1 emerge recently and became dominant in the Philippines archipelago
The overall data suggest that PX-A1 emerged during the early 1990s as a response to the accumulative activity of Xa4 in the Philippines. To estimate the time of the most recent common ancestor of this population, we used a Bayesian model over 10,251 SNPs. Interestingly, the time-scaled tree estimated a quite recent appearance of this population, likely after the 1950s (Fig. S12). Most of the PX-A1 members were detected for the first time during the wet season of 1991–1992 (Fig. 1a, b), suggesting that PX-A1 was already present in the country before the Green Revolution.
During the next two decades, however, PX-A1 became the dominant population in the Philippines (Fig. 1a, b). PX-A1 involves a number of races with similar virulent patterns , but it is not clear that all the races are prevailing. If PX-A1 emerges after a recent event of recombination, we speculated that in the absence of a major selection pressure other than Xa4, PX-A1 members will maintain the same fitness under local conditions. To assess this question, we used a diversity trapping system consisting of 30 near-isogenic lines planted in a disease endemic area during 2 consecutive years. More than 95% of the strains recovered from symptoms belonged to PX-A1 but were assigned to different races based on their phenotypic reaction with other Xa genes (Fig. S13, Table S1). All the recovered races have the capability to overcome Xa4 but also segregate in response to Xa7 (Fig. 2a), suggesting that PX-A1 maintains several races with similar fitness in the field.
We looked closely into the region comprising AvrXa7, the effector recognized by Xa7 . We found that the effector lies close to an unstable 20-kb pair region with a high recombination rate (Figs. S8 and S14A). Sequence alignment of this region suggests several events of gene transfer with members from PX-B populations, including AvrXa7 and the genetic cluster of XopAA (Fig. S14A and B). This observation aligned with other reports which link the evolution of Xoo effector genes to active recombination events . Probably the strong host selection imposed by Xa4 drove the expansion of PX-A1 in nature and the absence of other forces (e.g. lack of Xa7) reduced the impact of gene conversion on overall fitness to avoid clone selection.
Genome-wide analysis suggests recent selective sweep in Xoo
Strong directional selection is capable to favor changes in the phenotype of a population within a few generations . This bottleneck hypothesis implies that genetic diversity within each population changed drastically in a short period of time, leaving signatures behind it. We used quantitative data from outbreaks collected before and after the early 1990s to show a significant change in the virulence reaction of strains to Xa4 compared with xa5, Xa7, and Xa10 (Figs. 1 and S3). To test for signatures of demography and selection, we used a genome-wide allele frequency distribution on each population. We found Xoo populations showing genome-wide Tajima’s D, Fixation index (FST), and nucleotide diversity (pi) values significantly different from the null expectation (Table S4). For instance, pi values in PX-A2, PX-A3, and PX-B1 were significantly higher than the rest of the populations with an average of 19-fold (Table S4). Furthermore, Tajima’s D values also support a range of nonrandom events driving the population structure of Xoo in the archipelago. We found that PX-A1 (Tajima’s D = −1.9) and PX-B1 (Tajima’s D = −1.8) having significantly lower values of diversity compared with what is expected under neutrality (Table S4), suggesting that Xoo populations might experience different evolutionary trajectories, some driven by ancestry but also by stochastic events such as selection, demographic fluctuation, or genetic drift (Fig. 5a). Evidence of these mechanism shaping pathogenicity factors within the genome of barley and wheat pathogens have been reported [70, 71].
To investigate if the deployment of Xa4 caused a selective sweep during the Green Revolution, we used allele frequency distribution in populations before and after the early 1980s and found evidence of strong directional selection (Fig. 5a; Table S5). Apart from PX-A1 negative values, PX-A2 and PX-B1 also appear to experience a substantial reduction of Tajima’s D values after the 1980s. However, the removal of genetic diversity in each population might have different contexts. In the case of PX-A1 which emerged after the 1990s, the bottleneck might be recent and directional as it removed genetic variation homogeneously across the genome of this group (Fig. S15). This finding is consistent with our field observations that indicate PX-A1 as the prevalent group in the archipelago. The case of PX-B1 population, on the other hand, matches the emergence of Race 2 soon after Xa4 was deployed . While selection favored few genotypes that became dominant, the populations quickly expanded across the geography, accumulating neutral mutations. Interestingly, our data showed that the genetic diversity of PX-B1 after the early 1980s was significantly higher and is distributed in particular regions of the genome (Figs. 5b and S15). In both cases, PX-A1 or PX-B1, the data showed nonrandom changes in allele frequency, which is consistent with a dramatic event of selective sweep. This effect is illustrated in Fig. 6. While Xa4 deployment appears to have a causative effect, we cannot exclude alternative explanations, such as cryptic sampling bias or unknown demographic patterns, that might render similar results.
Coevolution of host and pathogens across agricultural ecosystems is a major concern for long-term strategies of food security. In such a uniform setup, human interventions are likely to increase the magnitude and frequency of emerging virulent pathogens. In this report, we documented a case in which a modern breeding revolution actually shaped the population of a pathogen in a short period of time. This selection force leaves distinct genetic signatures in the genome and causes irreversible changes in the overall fitness of the pathogen. The report provides important insight into the adaptation of pathogens to modern agriculture and advice on the directions of future strategies that include the spatial and temporal deployment of valuable resistance genes.
The sequences produced in this study are stored under BioProject PRJNA525332. Selection analysis results are under Additional Data S4. Genomic files in genbank format are available at https://figshare.com/articles/The_impact_of_the_Green_Revolution_on_the_population_structure_of_rice_pathogens/7842026.
McDonald BA, Linde C. Pathogen population genetics, evolutionary potential, and durable resistance. Annu Rev Phytopathol. 2002;40:349–79.
Snow BE, Shutler R, Nelson DE, Vogel JS, Southon JR. Evidence of early rice cultivation in the Philippines. Philipp Q Cult Soc. 1986;14:3–11.
De Leon JC. Rice that Filipinos grow and eat. Makati City, Philippines: Philippine Institute for Development Studies; 2012.
Mackill DJ, Khush GS. IR64: a high-quality and high-yielding mega variety. Rice. 2018;11:18.
Yoshimura S, Yoshimura A, Iwata N, McCouch SR, Abenes ML, Baraoidan MR, et al. Tagging and combining bacterial blight resistance genes in rice using RAPD and RFLP markers. Mol Breed. 1995;1:375–87.
Hu K, Cao J, Zhang J, Xia F, Ke Y, Zhang H, et al. Improvement of multiple agronomic traits by a disease resistance gene via cell wall reinforcement. Nat Plants. 2017;3:17009.
Altschul S. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402.
Yu J, Hu S, Wang J, Wong GK-S, Li S, Liu B, et al. A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science. 2002;296:79–92.
Alexandrov N, Tai S, Wang W, Mansueto L, Palis K, Fuentes RR, et al. SNP-Seek database of SNPs derived from 3000 rice genomes. Nucleic Acids Res. 2015;43:D1023–7.
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.
Mew TW, Vera Cruz CM, Medalla ES. Changes in race frequency of Xanthomonas oryzae pv. oryzae in response to rice cultivars planted in the Philippines. Plant Dis. 1992;76:1029–32.
Brennan JP, Malabayabas A. Australian Centre for International Agricultural Research. International Rice Research Institute’s contribution to rice varietal yield improvement in South-East Asia. Canberra, A.C.T: Australian Centre for International Agricultural Research; 2011.
Doyle J, Doyle JL. Isolation of plant DNA from fresh tissue. Focus. 1990;12:13–5.
Ma B, Wang W, Zhao B, Zhou Y, Zhu L, Zhai W. Studies of PCR marker for the rice bacterial blight resistance gene Xa-4. Hereditas. 1999;21:9–12.
Zaka A, Grande G, Coronejo T, Quibod IL, Chen C-W, Chang S-J, et al. Natural variations in the promoter of OsSWEET13 and OsSWEET14 expand the range of resistance against Xanthomonas oryzae pv. oryzae. PLOS ONE. 2018;13:e0203711.
Quibod IL, Perez-Quintero A, Booher NJ, Dossa GS, Grande G, Szurek B, et al. Effector diversification contributes to Xanthomonas oryzae pv. oryzae phenotypic adaptation in a semi-isolated environment. Sci Rep. 2016;6:34137.
Cottyn B, Mew TW. Bacterial blight of rice. In: Goodman RM, ed. Encyclopedia of plant and crop science. New York, NY: Marcel Dekker; 2004. p. 79–83.
Booher NJ, Carpenter SCD, Sebra RP, Wang L, Salzberg SL, Leach JE, et al. Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships. Micro Genomics. 2015;1:e000032.
Grau J, Reschke M, Erkes A, Streubel J, Morgan RD, Wilson GG, et al. AnnoTALE: bioinformatics tools for identification, annotation, and nomenclature of TALEs from Xanthomonas genomic sequences. Sci Rep. 2016;6:21077.
Chevreux B, Wetter T, Sándor S. Genome sequence assembly using trace signals and additional sequence information. Comput. Sci. Biol. Proc. Ger. Conf. Bioinform. 1999. pp 45–56.
Baez-Ortega A, Lorenzo-Diaz F, Hernandez M, Gonzalez-Vila CI, Roda-Garcia JL, Colebrook M, et al. IonGAP: integrative bacterial genome analysis for Ion Torrent sequence data. Bioinformatics. 2015;31:2870–3.
Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29:1072–5.
Assefa S, Keane TM, Otto TD, Newbold C, Berriman M. ABACAS: algorithm-based automatic contiguation of assembled sequences. Bioinformatics. 2009;25:1968–9.
Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–9.
Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31:3691–3.
Thorpe HA, Bayliss SC, Sheppard SK, Feil EJ. Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria. GigaScience. 2018;7:giy015.
Treangen TJ, Ondov BD, Koren S, Phillippy AM. The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes. Genome Biol. 2014;15:524.
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3.
Rambaut A, Lam TT, Max Carvalho L, Pybus OG. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2016;2:vew007.
Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29:1969–73.
Yu G, Smith DK, Zhu H, Guan Y, Lam TT-Y. ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2017;8:28–36.
Cheng L, Connor TR, Siren J, Aanensen DM, Corander J. Hierarchical and spatially explicit clustering of DNA sequences with BAPS software. Mol Biol Evol. 2013;30:1224–8.
Pfeifer B, Wittelsbürger U, Ramos-Onsins SE, Lercher MJ. PopGenome: an efficient Swiss army knife for population genomic analyses in R. Mol Biol Evol. 2014;31:1929–36.
Didelot X, Wilson DJ. ClonalFrameML: efficient inference of recombination in whole bacterial genomes. PLOS Comput Biol. 2015;11:e1004041.
Murrell B, Moola S, Mabona A, Weighill T, Sheward D, Kosakovsky Pond SL, et al. FUBAR: a fast, unconstrained bayesian approximation for inferring selection. Mol Biol Evol. 2013;30:1196–205.
Pond SLK, Frost SDW, Muse SV. HyPhy: hypothesis testing using phylogenies. Bioinformatics. 2005;21:676–9.
Gotz S, Garcia-Gomez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, et al. High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 2008;36:3420–35.
Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS J Integr Biol. 2012;16:284–7.
Collins C, Didelot X. A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. PLOS Comput Biol. 2018;14:e1005958.
Laine A-L, Burdon JJ, Dodds PN, Thrall PH. Spatial variation in disease resistance: from molecules to metapopulations. J Ecol. 2011;99:96–112.
Wang W, Mauleon R, Hu Z, Chebotarov D, Tai S, Wu Z, et al. Genomic variation in 3,010 diverse accessions of Asian cultivated rice. Nature. 2018;557:43–49.
Raitzer DA, Sparks AH, Huelgas Z, Maligalig R, Balangue Z, Launio C, et al. Is rice improvement still making a difference? Rome, Italy: CGIAR Independent Science and Partnership Council; 2015.
Iyer AS, McCouch SR. The rice bacterial blight resistance gene xa5 encodes a novel form of disease resistance. Mol Plant Microbe Interact. 2004;17:1348–54.
Song W-Y, Wang G-L, Chen L-L, Kim H-S, Pi L-Y, Holsten T, et al. A receptor kinase-like protein encoded by the rice disease resistance gene, Xa21. Science. 1995;270:1804–6.
Krattinger SG, Keller B. Resistance: double gain with one gene. Nat Plants. 2017;3:17019.
Thrall PH, Burdon JJ. Host-pathogen dynamics in a metapopulation context: the ecological and evolutionary consequences of being spatial. J Ecol. 1997;85:743–53.
Lo Iacono G, van den Bosch F, Paveley N. The evolution of plant pathogens in response to host resistance: factors affecting the gain from deployment of qualitative and quantitative resistance. J Theor Biol. 2012;304:152–63.
Bayles RA, Flath K, Hovmøller MS, de Vallavieille-Pope C. Breakdown of the Yr17 resistance to yellow rust of wheat in northern Europe. Agronomie. 2000;20:805–11.
Zhan J, Mundt CC, Hoffer ME, McDonald BA. Local adaptation and effect of host genotype on the rate of pathogen evolution: an experimental test in a plant pathosystem. J Evol Biol. 2002;15:634–47.
Mundt CC. Durable resistance: a key to sustainable management of pathogens and pests. Infect Genet Evol. 2014;27:446–55.
Singh RP, Hodson DP, Huerta-Espino J, Jin Y, Bhavani S, Njau P, et al. The emergence of Ug99 races of the stem rust fungus is a threat to world wheat production. Annu Rev Phytopathol. 2011;49:465–81.
Ploetz RC. Fusarium wilt of banana. Phytopathology. 2015;105:1512–21.
Midha S, Bansal K, Kumar S, Girija AM, Mishra D, Brahma K, et al. Population genomic insights into variation and evolution of Xanthomonas oryzae pv. oryzae. Sci Rep. 2017;7:40694.
Adhikari TB, Cruz CMV, Zhang Q, Nelson RJ, Skinner DZ, Mew TW, et al. Genetic diversity of Xanthomonas oryzae pv. oryzae in Asia. Appl Env Microbiol. 1995;61:966–71.
Poulin L, Grygiel P, Magne M, Gagnevin L, Rodriguez-R LM, Forero Serna N, et al. New multilocus variable-number tandem-repeat analysis tool for surveillance and local epidemiology of bacterial leaf blight and bacterial leaf streak of rice caused by Xanthomonas oryzae. Appl Environ Microbiol. 2015;81:688–98.
Carpenter SCD, Mishra P, Ghoshal C, Dash PK, Wang L, Midha S, et al. A strain of an emerging Indian Xanthomonas oryzae pv. oryzae pathotype defeats the rice bacterial blight resistance gene xa13 without inducing a clade III SWEET gene and is nearly identical to a recent Thai isolate. Front Microbiol. 2018;9:2703.
Medini D, Donati C, Tettelin H, Masignani V, Rappuoli R. The microbial pan-genome. Curr Opin Genet Dev. 2005;15:589–94.
Ferreira RM, Oliveira ACP, de, Moreira LM, Belasque J, Gourbeyre E, Siguier P, et al. A TALE of transposition: Tn3-like transposons play a major role in the spread of pathogenicity determinants of Xanthomonas citri and other Xanthomonads. mBio. 2015;6:e02505–14.
Bansal K, Midha S, Kumar S, Patil PB. Ecological and evolutionary insights into Xanthomonas citri pathovar diversity. Appl Environ Microbiol. 2017;83:e02993–16.
Jibrin MO, Potnis N, Timilsina S, Minsavage GV, Vallad GE, Roberts PD, et al. Genomic inference of recombination-mediated evolution in Xanthomonas euvesicatoria and X. perforans. Appl Environ Microbiol. 2018;84:e00136–18.
Vasu K, Nagaraja V. Diverse functions of restriction-modification systems in addition to cellular defense. Microbiol Mol Biol Rev. 2013;77:53–72.
Toruño TY, Stergiopoulos I, Coaker G. Plant-pathogen effectors: cellular probes interfering with plant defenses in spatial and temporal manners. Annu Rev Phytopathol. 2016;54:419–41.
Rohmer L, Guttman DS, Dangl JL. Diverse evolutionary mechanisms shape the type III effector virulence factor repertoire in the plant pathogen Pseudomonas syringae. Genetics. 2004;167:1341–60.
Guo Y, Figueiredo F, Jones J, Wang N. HrpG and HrpX play global roles in coordinating different virulence traits of Xanthomonas axonopodis pv. citri. Mol Plant Microbe Interact. 2011;24:649–61.
Munita JM, Arias CA. Mechanisms of antibiotic resistance. Microbiol Spectr. 2016;4. https://doi.org/10.1128/microbiolspec.VMBF-0016-2015.
Vera Cruz CM, Bai J, Ona I, Leung H, Nelson RJ, Mew T-W, et al. Predicting durability of a disease resistance gene based on an assessment of the fitness loss and epidemiological consequences of avirulence gene mutation. Proc Natl Acad Sci. 2000;97:13500–5.
Yang B, Zhu W, Johnson LB, White FF. The virulence factor AvrXa7 of Xanthomonas oryzae pv. oryzae is a type III secretion pathway-dependent nuclear-localized double-stranded DNA-binding protein. Proc Natl Acad Sci. 2000;97:9807–12.
Erkes A, Reschke M, Boch J, Grau J. Evolution of transcription activator-like effectors in Xanthomonas oryzae. Genome Biol Evol. 2017;9:1599–615.
Hendry AP, Kinnison MT. Perspective: the pace of modern life: measuring rates of contemporary microevolution. Evolution. 1999;53:1637.
Mohd-Assaad N, McDonald BA, Croll D. Genome-wide detection of genes under positive selection in worldwide populations of the barley scald pathogen. Genome Biol Evol. 2018;10:1315–32.
Hartmann FE, McDonald BA, Croll D. Genome-wide evidence for divergent selection between populations of a major agricultural pathogen. Mol Ecol. 2018;27:2725–41.
Vera Cruz CM, Ardales EY, Skinner DZ, Talag J, Nelson RJ, Louws FJ, et al. Measurement of haplotypic variation in Xanthomonas oryzae pv. oryzae within a single field by rep-PCR and RFLP analyses. Phytopathology. 1996;86:1352–9.
Authors would like to thank Epifania Garcia, Ismael Mamiit, and Isabelita Oña for technical assistance. Scientists at IRRI are partially funded by the Research Program on Rice Agri-food System (RICE) from the Consortium for International Agricultural Research (CGIAR). Funding support also came from the Department of Science and Technology (DOST)–Philippine Council for Industry and Emerging Technology Research and Development. We also thank the DOST–Advanced Science and Technology Institute (DOST-ASTI) for free access to high-performance computing services. Veronica Roman-Reyna was funded by the Newton Fund.
Conflict of interest
The authors declare that they have no conflict of interest.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Quibod, I.L., Atieza-Grande, G., Oreiro, E.G. et al. The Green Revolution shaped the population structure of the rice pathogen Xanthomonas oryzae pv. oryzae. ISME J (2019) doi:10.1038/s41396-019-0545-2