Abstract
The introgression of chromosome segments from wild relatives is an established strategy to enrich crop germplasm with disease-resistance genes1. Here we use mutagenesis and transcriptome sequencing to clone the leaf rust resistance gene Lr9, which was introduced into bread wheat from the wild grass species Aegilops umbellulata2. We established that Lr9 encodes an unusual tandem kinase fusion protein. Long-read sequencing of a wheat Lr9 introgression line and the putative Ae. umbellulata Lr9 donor enabled us to assemble the ~28.4-Mb Lr9 translocation and to identify the translocation breakpoint. We likewise cloned Lr58, which was reportedly introgressed from Aegilops triuncialis3, but has an identical coding sequence compared to Lr9. Cytogenetic and haplotype analyses corroborate that the two genes originate from the same translocation event. Our work sheds light on the emerging role of kinase fusion proteins in wheat disease resistance, expanding the repertoire of disease-resistance genes for breeding.
Similar content being viewed by others
Main
Bread wheat (Triticum aestivum; 2n=6x=42; AABBDD genome) is one of the most widely cultivated crops worldwide4. The fungal pathogen Puccinia triticina (Pt) causes leaf rust, which is among the most widespread and devastating wheat diseases, resulting in annual production losses of ~25 million tons5,6. Of the 69 known leaf rust (Lr) resistance genes in wheat, 28 were introgressed from the secondary and tertiary gene pools through interspecific hybridizations7. The genus Aegilops (annual goat grasses) comprises diploid and polyploid species that serve as reservoirs to increase wheat genetic diversity8,9. The two resistance genes Lr9 and Lr58 were introgressed into bread wheat from the U-genomes of Aegilops umbellulata (diploid; 2n=2x=14; UU genome) and Aegilops triuncialis (tetraploid; 2n=4x=28; UtUtCtCt genome), respectively2,3. The Lr9 introgression is of historic importance, as it was the first disease-resistance gene transferred into bread wheat in the 1950s using irradiation, although the Ae. umbellulata donor was not specified2. The resulting bread wheat line, ‘Transfer’, carried an Ae. umbellulata translocation at the end of chromosome arm 6BL2,10. The Lr58 introgression reportedly occurred spontaneously through homoeologous recombination while backcrossing Ae. triuncialis accession TA10438 to the susceptible bread wheat line WL711 (ref. 3). Lr58 was mapped to chromosome arm 2BL in the resulting backcross line TA5605 (ref. 3). Wheat cultivars carrying Lr9 were released in the late 1960s. Lr9 is still effective in many wheat growing areas, although virulent Pt isolates emerged a few years after the release of the first Lr9-carrying wheat cultivars11. Lr58 has not been widely deployed in wheat breeding, and Pt races virulent on Lr58 have been reported3.
Both Lr9 and Lr58 conferred strong resistance against multiple Pt isolates in controlled conditions (Fig. 1a and Extended Data Fig. 1). To clone the two genes, we screened ethyl methanesulfonate (EMS)-induced mutant populations in the genetic backgrounds of ThatcherLr9 (TcLr9; Transfer/6*Thatcher; 919 M2 families) and TA5605 (Lr58 carrier; ~7,400 M2 families) for loss of resistance. We identified and validated 17 and 104 susceptible mutants in these two backgrounds, respectively (Fig. 1b and Extended Data Fig. 2). Because alien introgressions often show suppressed recombination1, we developed MutIsoSeq to clone Lr9 and Lr58. MutIsoSeq combines isoform sequencing (Iso-seq) of WT parental lines and transcriptome deep sequencing (RNA-seq) of EMS mutants to identify altered transcripts for candidate genes and does not require genetic mapping (Fig. 1c). First, we generated Iso-seq reads derived from RNA isolated from Pt-inoculated TcLr9 and TA5605 leaves. We then mapped RNA-seq reads from 10 TcLr9 and 10 TA5605 susceptible mutants against their respective parents and identified EMS-type (G/C to A/T) mutations. For both sets of mutants, we only identified one transcript with EMS-type point mutations in all 10 sequenced mutants (Table 1, Supplementary Note 1 and Supplementary Figs. 1 and 2). Surprisingly, the putative Lr9 and Lr58 coding sequences were identical (Supplementary Fig. 3a–c) suggesting that (1) two identical genes were introgressed independently from two different U-genome species or (2) Lr9 and Lr58 are actually the same original introgression and named differently from pedigree or mapping errors. Several lines of evidence indicate that Lr9 and Lr58 are from the same original introgression, that Ae. triuncialis accession TA10438 is not the donor of Lr58 and that Lr58 is likely Lr9 (Supplementary Note 2).
The putative 3,504-bp Lr9/Lr58 (Lr9 hereafter) coding sequence encoded a 1,167 amino acid protein with an N-terminal tandem kinase domain followed by a von Willebrand factor A (vWA) domain and a Vwaint domain in the C-terminus (Fig. 1d). Wheat tandem kinases (WTKs) have recently emerged as a new prominent player involved in disease resistance in Triticeae, the botanical tribe including wheat and barley12,13,14,15,16,17,18. The fusion of a WTK to another domain, however, is unique and highly unusual. vWA and Vwaint domains often occur together (subsequently referred to as vWA/Vwaint domain) and are thought to participate in protein–protein interactions in plants19,20. The vWA-containing copine proteins in Arabidopsis and rice are regulators of disease resistance and the Arabidopsis copine protein BON1 might be guarded by the NLR immune receptor SNC1, supporting a potential role of vWA-containing proteins in defense19,21,22. The Lr9 kinase domains belong to the LRR_8B subfamily (cysteine-rich receptor-like kinases), which is the most frequent kinase subfamily found in WTK proteins23. To reflect the introduced nomenclature, we refer to the Lr9 candidate as WTK6-vWA hereafter. A PCR marker derived from WTK6-vWA showed complete linkage with disease resistance in F2 populations derived from crosses between TcLr9 and the susceptible wheat cultivar Avocet S (136 F2 plants) and TA5605 with Avocet S (128 F2 plants) (Extended Data Figs. 3 and 4), confirming that WTK6-vWA identified through whole-transcriptome sequencing co-segregates with the gene mediating leaf rust resistance in TcLr9 and TA5605.
We amplified and Sanger-sequenced the WTK6-vWA genic region from the remaining 101 Lr9 mutants and detected missense or nonsense mutations in all but one mutant in the TcLr9 background, which might harbor second-site mutations or a mutation in the regulatory region. In total, we defined 97 independent mutation events in Lr9 across the 120 mutants (that is, different nucleotide transitions or identical transitions in the two genetic backgrounds). We identified 19 premature stop codon mutations, 67 nonredundant amino acid substitutions, four splice-site mutations, and one start codon mutation (Fig. 1d, Extended Data Fig. 5 and Supplementary Table 1). We further validated the identity of Lr9 by virus-induced gene silencing (VIGS). Silencing of WTK6-vWA increased leaf rust susceptibility (Fig. 1e). Together, the mutant analysis, genetic mapping and silencing confirm that WTK6-vWA is Lr9.
The 67 nonredundant amino acid substitution mutations enabled us to establish a detailed map of critical residues required for WTK6-vWA function. One amino acid substitution mapped to the N-terminal loop, 48 in kinase 1, 13 in kinase 2 and 5 in the C-terminal vWA/Vwaint domain. Protein modeling revealed a marked difference in the effects of individual amino acid substitutions between the two kinase domains (Fig. 2). Thirty-one of 48 amino acid substitutions in kinase 1 were predicted to be surface-localized, while most (8 of 13) of the substitutions in kinase 2 affected internal residues (Fig. 2 and Supplementary Table 1). Substitutions at surface-localized amino acids are likely to interfere with protein function, while substitutions of internal amino acids might affect protein structure and stability24,25. Seventeen amino acid substitutions in kinase 1 affected conserved subdomains and loops, including ATP-binding sites, compared to only three amino acids for the conserved kinase 2 subdomains (Extended Data Fig. 5)26. In conserved subdomain III, kinase 2 carried an aspartic acid (D) instead of the highly conserved glutamic acid (E) found in kinase 1 (Extended Data Fig. 5), indicating that kinase 2 might be a pseudokinase13. Most WTK proteins involved in disease resistance in wheat and barley have been suggested to harbor a kinase and a pseudokinase domain12,13,14,15,16,17,18. Substitution of the subdomain III glutamic acid in the Arabidopsis EVR kinase abrogated kinase activity27. Thus, we conclude that the two kinase domains may have different roles in WTK6-vWA.
WTK6-vWA-like genes represent a very small gene family in bread wheat, with zero to two full-length copies identified in wheat reference assemblies (Supplementary Table 2)28,29,30,31,32,33. All bread wheat WTK6-vWA homologs represented Lr9 orthologs located on chromosome arms 2AL and 2BL. The wheat lines ArinaLrFor and Fielder carried the closest Lr9 orthologs on chromosome 2A, with 90.1% amino acid sequence identity to Lr9. We also identified one to eight genes encoding proteins with a single kinase domain followed by a vWA/Vwaint domain (referred to as kinase-vWAs) in individual assemblies of bread wheat cultivars and various grass species (Extended Data Fig. 6 and Supplementary Table 2)28,29,30,31,32,33,34,35,36,37,38,39,40,41,42. Phylogenetic analyses provide evidence that the fusion of kinase and vWA domains occurred at least three times during the evolution of grasses (Extended Data Fig. 6 and Supplementary Figs. 4 and 5). We hypothesize that kinase-vWA proteins have a role in PAMP-triggered immunity and are targets of pathogen virulence effectors. The acquisition of a second kinase domain in WTK6-vWA might have resulted in a new function, whereby kinase 2 and the vWA/Vwaint domain serve as decoy23,43, while kinase 1 activity is required to initiate a defense response after effector perception. All five amino acid substitutions in the vWA/Vwaint domain were surface-localized, supporting that this domain might be involved in effector recognition (Fig. 2 and Supplementary Table 1).
We used a diagnostic Kompetitive Allele Specific PCR (KASP) marker derived from a polymorphism that distinguished WTK6-vWA alleles from TcLr9 and Ae. triuncialis accession TA10438 (Supplementary Note 2) to screen a diversity panel comprising 59 Ae. umbellulata and 205 Ae. triuncialis accessions (Supplementary Fig. 6 and Supplementary Table 3). No Ae. triuncialis accession carried Lr9, and only one Ae. umbellulata accession (TA1851) was positive for Lr9. We generated contig-level assemblies of TcLr9 and the putative Ae. umbellulata Lr9 donor TA1851 using PacBio circular consensus sequencing (CCS)44 to determine the genomic sequence of Lr9, including surrounding regions (Table 2). We assembled a 5.66-Mb and 14.99-Mb Lr9-containing contig in TcLr9 and TA1851, respectively (Fig. 3a). The Lr9 gene sequence spanned 10.63 kb and had 14 exons (Extended Data Fig. 7a). Lr9 transcript levels slightly increased after inoculation with an avirulent Pt isolate (Extended Data Fig. 7b). The Lr9 gene sequence was identical between TcLr9 and TA1851 and the two contigs showed 99.99% sequence identity over 5.58 Mb, confirming that TA1851 is the Lr9 donor (Fig. 3a). The few polymorphisms mainly represented short insertion/deletions and are likely considered sequencing errors. The TcLr9 contig had two large deletions of 24.8 and 46.0 kb compared to TA1851, most likely caused by the irradiation treatment (Fig. 3a). Mapping rates for TA5605, Thatcher and Ae. triuncialis TA10438 Illumina reads confirmed that TA5605 carries the same introgression as TcLr9, while the mapping rates for Thatcher and TA10438 reads were low (Fig. 3b), thus further supporting that Lr9 and Lr58 come from the same original introgression.
To reconstruct the history of the Lr9 translocation, we first estimated the size of the Lr9 translocation to be ~35–50 Mb by genomic in situ hybridization (GISH; Fig. 3c). We then mapped TA1851 k-mers to the TcLr9 assembly, which identified 13 TcLr9 contigs with high k-mer mapping rates (Fig. 3d and Supplementary Table 4). The 13 TcLr9 contigs were 0.28–5.66 Mb in size and matched five contigs in TA1851. We ordered and oriented the five TA1851 contigs based on a published Ae. umbellulata genetic map45 (Supplementary Table 5). One TcLr9 contig spanned a 585-bp gap between two neighboring TA1851 contigs, which left four gaps of unknown size. Most of the gaps between the TcLr9 contigs were small (<4 kb) based on the corresponding TA1851 sequence, suggesting that we covered most of the Lr9 translocation. TcLr9 contig TcLr9ptg000262l marked the terminal end of the translocation, with Lr9 locating 6.05 Mb proximal to the end. We identified the proximal breakpoint of the Ae. umbellulata translocation in TcLr9 contig TcLr9ptg002383l (Fig. 3d). k-mer mapping rates and dot plots revealed the exact breakpoint between bread wheat chromosome 6B and the Ae. umbellulata translocation (Fig. 3e and Extended Data Fig. 8), which was located in the putative promoter region of an NLR gene in bread wheat, while the breakpoint in Ae. umbellulata corresponded to a retrotransposon. These results confirmed that the use of irradiation led to a fusion of nonhomologous chromosome segments. The Ae. umbellulata translocation spanned ~28.4 Mb, in agreement with the cytogenetic estimate. Bread wheat line Chinese Spring was used as a recurrent parent to generate Transfer, which allowed us to compare the exact positions of the translocation breakpoints in TcLr9 (carrying the Lr9 translocation from Transfer) and the Chinese Spring reference genome28,30. The translocation breakpoint was at position 725.61 Mb (RefSeq v2.1), which was 5,580,341 bp proximal to the end of Chinese Spring chromosome 6B, indicating that this 5.58-Mb segment including 87 high-confidence genes is missing in Lr9-containing wheat lines (Fig. 3f and Supplementary Table 6). Alien introgressions, including the Lr9-containing Ae. umbellulata translocation, are often associated with reduced yields46. Our results show that such yield decline might be caused both by undesirable alleles on the Ae. umbellulata introgression (linkage drag) and by the loss of this 5.58-Mb chromosomal segment.
In summary, we identified an unusual class of kinase fusion proteins involved in disease resistance. Our results indicate that the two kinase domains have diverse functions and we hypothesize that the second kinase and the vWA/Vwaint domain are integrated decoys. A second example of an ‘integrated decoy kinase fusion protein’ is the protein encoded by the stem rust resistance gene Sr4347. Sr43 contains a kinase domain fused to two domains of unknown function that might serve as decoys. Together, these two studies highlight the emerging role of unusual kinase fusion proteins in wheat disease resistance13,14,15,16,17,18,48,49,50,51, which will form the basis to use this protein class in breeding.
Methods
Plant material
Bread wheat accessions Transfer (TA5524), WL711, TA5605, Ae. umbellulata accession TA1851 and Ae. triuncialis accession TA10438 were obtained from the Wheat Genetics Resource Center (WGRC). TcLr9 (Transfer/6*Thatcher) is a near-isogenic line carrying Lr9 from Transfer in the genetic background of the susceptible wheat line Thatcher. TcLr9 and TA5605 were crossed with leaf rust susceptible wheat cultivar Avocet S to develop segregating F2 populations for Lr9 (136 F2 plants) and Lr58 (128 F2 plants), respectively. An F2 population derived from a cross between TcLr9 and TA5605 (133 F2 plants) was also generated.
Leaf rust inoculations
P. triticina isolates B9414, 93012, 94015, 95012, 96007 and 96237 are avirulent on Lr9 (refs. 52,53). Isolate MNPSD is virulent on Lr9 (ref. 54). Isolates were propagated on seedlings of the susceptible wheat cultivar Thatcher. Freshly collected urediniospores were used for inoculation experiments. Inoculations were performed using a high-pressure air sprayer or a settling tower. For the spray inoculation, urediniospores were suspended in FC-43 oil (3M Fluorinert FC-43) and sprayed onto plants using a glass sprayer connected to a high-pressure air pipe. For the settling tower inoculation, plants were grown for 12 d. Then, the second leaf was removed and the first leaf was pinned to the soil with the adaxial side facing upward. A mixture of 10 mg urediniospores and 300 mg lycopodium powder (Sigma-Aldrich, 19108) was blown over the plants in a settling tower55. Inoculated plants were placed in an inoculation box equipped with a humidifier overnight and transferred to a walk-in Conviron growth room (16 h day/8 h night, 18 °C/16 °C). Symptoms were evaluated 12 d after inoculation. Infected leaves were scanned using an Epson Perfection V600 Photo scanner.
EMS mutagenesis and mutant screening
Grains of wheat lines TcLr9 and TA5605 were evenly divided and soaked in two 1-l flasks with 200 ml water at 4 °C for 16 h. Grains were washed three times in distilled water. Then, 200 ml of 0.45% and 0.55% EMS solutions (Sigma-Aldrich, M0880) were added and grains were incubated on a shaker at 80 rpm at room temperature for 16 h. Then, grains were washed three times for 45 min each, and then transferred into mesh bags and put under running tap water for 30 min. After the washing, grains were put into 55 × 28 cm black plastic trays with blotting paper at the bottom, which were later covered with plastic lids and kept at 4 °C for 36 h. The treated grains were planted into 18-well trays filled with Stender soil, 12 to 14 grains per well. Plants were grown in a greenhouse under speed-breeding conditions (22 h day/2 h night, 21 °C/18 °C)56. We performed three EMS experiments for TA5605 (with approximately 2,000, 4,000 and 5,000 grains per experiment), and one for TcLr9 (~2,000 grains). Spikes from the first and second batches of TA5605 M1 plants were bulk collected. Single spikes were collected from the third batch of TA5605 M1 plants and from TcLr9 M1 plants. For the mutant screening, M2 grains from a single spike were planted in one well of 18-well or 24-well trays and grown in a greenhouse (22 h day/2 h night, 21 °C/18 °C) or growth room (16 h day/8 h night, 20 °C) for 3–4 weeks. Batches of six trays were spray-inoculated with 30 mg fresh urediniospores of Pt isolate B9414 suspended in 20 ml FC-43 oil, and then kept in an inoculation box overnight. Plants were transferred and grown in a walk-in Conviron growth room for 10–14 d. Susceptible plants were transferred into single pots. In total, 919 and ~7,400 M2 families were screened for TcLr9 and TA5605, respectively.
Iso-seq library preparation and sequencing
Total RNA from infected TcLr9 and TA5605 seedling leaves (48, 72, 96 and 120 h after inoculation with the avirulent Pt isolate B9414) was extracted using the Maxwell RSC Plant RNA Kit (Promega, AS1500). To prepare the Iso-seq libraries, 300 ng of total RNA was used as input for complementary DNA (cDNA) synthesis. Each sample was first barcoded (Supplementary Table 7) and then subjected to cDNA amplification using 12 cycles. Purified cDNAs were pooled in equal molarity and then subjected to library preparation using the SMRTbell Express Template Prep Kit 2.0 (Pacific Biosciences, 100-938-900) following the Iso-seq protocol. Libraries were prepared for sequencing by annealing primer v4 with Sequel II Binding Kit 2.1 and the Internal Control Kit 1.0 (Pacific Biosciences, 101-843-000). Two SMRT Cells 8M (Pacific Biosciences, 101-389-001), one for TcLr9 and one for TA5605, were sequenced on the PacBio Sequel II system using the Sequencing Kit 2.0 (Pacific Biosciences, 101-820-200).
RNA-seq of susceptible mutants
Total RNA from the Pt-inoculated fourth to sixth leaves of ten susceptible TcLr9 M2 mutants and ten TA5605 M3 mutants was extracted using the Maxwell RSC Plant RNA Kit (Promega, AS1500). RNA-seq was performed as a service at Novogene. In brief, mRNA samples were enriched using oligo(dT) beads and then used to prepare strand-specific cDNA libraries. Sequencing was done on Illumina Novaseq 6000, which produced an average of 233.2 and 262.7 million paired-end reads (2 × 150 bp) for TcLr9 and TA5605 mutants, respectively.
MutIsoSeq
The principle of MutIsoSeq is to use full-length transcripts produced with PacBio Iso-seq as a reference, map RNA-seq reads of mutants to the reference, and identify transcripts that carry multiple independent EMS-type point mutations (Fig. 1c). The workflow of MutIsoSeq is illustrated in Supplementary Fig. 1. The raw data from PacBio Sequel II were processed following the Iso-seq pipeline (v.3; https://github.com/PacificBiosciences/IsoSeq) for the generation of CCS reads and full-length, nonconcatemer (FLNC) reads. FLNC reads were merged and clustered into ‘clustered.hq.fasta.gz’ files containing 300,066 and 283,330 high-quality transcripts for TcLr9 and TA5605, respectively. Transposable elements were masked by Repeatmasker (v4.0.7; http://www.repeatmasker.org) using the TREP database57 (v.2019) as an external library, which resulted in 6.29% and 6.08% masked sequences, respectively. Then, SeqKit v2.2.0 (ref. 58) was used to sort all the transcripts from longest to shortest, and the fasta files generated during this step were used as references. We used BBMap (v.38.96)59 (parameters—ambiguous=best; subfilter=1; trimreaddescriptions=t nodisk) to map the Illumina reads from the susceptible mutants to the references. To exclude nonspecific alignments as much as possible, we used the parameter ‘subfilter=1’, which removes alignments with more than one substitution. Iso-seq captures different isoforms originating from the same gene. If a gene had multiple isoforms, Illumina reads would be randomly assigned to different isoforms by default, which would reduce the average coverage per isoform. We thus used the parameter ‘ambiguous=best’, which assigns reads to the first best site. Because transcripts were ordered by size, this step resulted in the mapping of most reads to the longest isoform of a gene. The resulting SAM files were sorted and indexed by SAMtools (v1.6)60. We followed the MutChromSeq pipeline for the remaining steps (https://github.com/steuernb/MutChromSeq)61. Briefly, sorted SAM files were converted into the mpileup format using SAMtools (v1.6)60. Individual pileup files were subsequently converted into XML format using Pileup2XML.jar (parameter, -a 0.01 -c 5). Reports were given by MutChromSeq.jar using the parameters -n 10 -c 10 -a 0.01 -z 2 (TcLr9-M13 and TA5605-Spt1 were used with the parameter '-w' because we did not use the wild-type (WT) TcLr9 and TA5605 RNA-seq data) (Supplementary Fig. 2). For both Lr9 and Lr58, only one candidate transcript was identified that had EMS-type G/C-to-A/T transitions in all 20 sequenced mutants. We performed a BLAST search with the identified candidate transcripts against the high-quality transcripts using TBtools (v1.09873)62, which identified 72 and 61 transcripts for Lr9 and Lr58, respectively (Supplementary Fig. 3a,b). Only transcripts that encoded a full-length protein spanning all the detected mutants were retained, resulting in 16 transcripts each for Lr9 and Lr58. Twelve and 11 of the retained Lr9 and Lr58 transcripts encoded the same 1,167 amino acid protein, respectively, and they mainly differed in the length of the 5′ and 3′ untranslated regions (Supplementary Fig. 3c). The transcripts encoding this protein version accounted for the majority of the total transcripts (68.15% and 73.26% coverage, which were calculated using the coverage number of transcripts, representing the number of molecules sequenced in the PacBio SMRT cells) (Supplementary Fig. 3a,b). We also found four Lr9 transcripts (representing 2.54% coverage) and four Lr58 transcripts (2.18% coverage) that encoded full-length proteins lacking 12 to 24 amino acids at the N-terminus, and one Lr58 transcript (0.41% coverage) encoding a full-length protein with 28 extra amino acids. The transcripts encoding the 1,167 amino acid protein were considered the main protein version because of the high transcript abundance.
Sanger sequencing of EMS mutants
We performed a BLAST search with the WTK6-vWA CDS against the genome sequence of bread wheat cultivars29,31 and found full-length matches on chromosome arms 2AL and 2BL of wheat cultivars ArinaLrFor and Fielder. The corresponding genomic sequences were downloaded and aligned to WTK6-vWA full-length transcript/46588 of TcLr9 using Geneious Prime (v.2020.2.4). Primers were designed from the UTRs and the CDS of WTK6-vWA and intron sequence of ArinaLrFor and Fielder (Supplementary Table 8). PCR amplifications were done in 20 μl reactions containing 1× GoTaq Green Master Mix (Promega, M7122), ∼100 ng genomic DNA and 100 nM primers. Primer pairs K1-F/K1-R, K2-F2/K2-R1 and vWA-F2/vWA-R1 were used to amplify 1,949-bp, 2,313-bp and 1,050-bp fragments from the DNA samples of mutants. Amplicons were Sanger-sequenced using primers K1-F/K1-MR/K1-R, K2-F2/K2-MF1/K2-MR/K2-R1 and vWA-F2/vWA-R1, respectively. For the mutants from the last batch of mutant screening (TA5605-Spt124 to Spt186), primer pair K1-F/K1-R was used to amplify the 1,949-bp fragment, and nested PCR63 primer pairs K2-F1/vWA-R1 (for the first round PCR) and K2-F2/vWA-R2 (for the second round PCR) were used to amplify a second 3,740-bp fragment from the mutants, which were subsequently Sanger-sequenced using primers K1-exon1-R/K1-MF1/K1-MF2/K1-R and K2-F2/K2-F3/K2-F4/K2-R1/vWA-F1/vWA-R2. Sanger sequencing files were aligned to the WTK6-vWA CDS and genomic sequence to find polymorphisms using Geneious Prime (v.2020.2.4). Primer sequences are provided in Supplementary Table 8. We chose a conservative approach and only considered mutants as independent if they contained nucleotide transitions at different positions in the Lr9 gene or at the same position but in two different genetic backgrounds (TA5605 and TcLr9), which resulted in 97 independent mutation events.
Virus-induced gene silencing
To develop specific VIGS probes, we first performed a BLAST search with the WTK6-vWA CDS against various wheat genomes29. WTK6-vWA sequence stretches with little homology in the wheat assemblies were selected as targets (Fig. 1d). Primers for the two targets and the phytoene desaturase genes (PDS)64 were used to amplify a 223-bp (VIGS1), 226-bp (VIGS2) and 400-bp (PDS) fragment from the TA5605 cDNA, respectively (Supplementary Table 8). Fragments were cloned into the BSMV-γ (barley stripe mosaic virus (BSMV)) vector64, resulting in constructs BSMV-γVIGS1, BSMV-γVIGS2 and BSMV-γPDS. These constructs together with BSMV-γ (empty vector), BSMV-γGFP32, BSMV-α and BSMV-β were transformed into Agrobacterium tumefaciens strain GV3101. Agrobacteria were cultured overnight at 28 °C in lysogeny broth with appropriate antibiotics. Cells were collected by centrifugation, then re-suspended and adjusted to OD600 = 0.7. Equal volumes of BSMV-α and BSMV-β were mixed with BSMV-γVIGS1, BSMV-γVIGS2, BSMV-γGFP, BSMV-γ and BSMV-γPDS, respectively. After 3 h incubation at 28 °C, cultures were infiltrated into Nicotiana benthamiana leaves. Infiltrated leaves were collected 5 d after infiltration and homogenized with PBS buffer (Gibco, 10010023) containing 1% Celite (Thermo Fisher Scientific, 68855-54-9). The buffer containing viral particles was rub inoculated on seedlings of TcLr9 and TA5605 at two-leaf stage. After 2 weeks, when viral symptoms were clearly visible, plants were spray-inoculated with P. triticina isolate B9414 as described above. Disease symptoms were evaluated and recorded 12 d after inoculation.
Protein structure prediction
To predict the 3D structure of WTK6-vWA, we used the open-source code of AlphaFold v2.0 (ref. 65). The input was the amino acid sequence of WTK6-vWA. The output consisted of five PDB format text files containing the predicted structure exactly as predicted by the model ('unrelaxed models'), which were later refined by an Amber relaxation procedure ('relaxed models') to provide the relaxed predicted structure reordering by model confidence based on the Local Distance Difference test (lDDT) score ('ranked models'). We used one of the ranked models with a reported lDDT score of 0.7507 of very good accuracy (lDDT > 0.6 is considered reasonable models and lDDT > 0.8 is considered great models, accuracy-wise). The positions of 67 nonredundant amino acid substitutions were visualized as spheres using PyMOL (v.2.3.0). Substituted residues that did not exhibit any visual surface on the 3D structure model were characterized as internal residues.
Cytogenetic analysis
The procedures for mitotic chromosome preparation, FISH and GISH, were adapted from refs. 66,67. A 3,740-bp fragment was amplified from TcLr9, TA5605 and TA10438 using nested PCR63 primer pairs K2-F1/vWA-R1 (for the first round PCR) and K2-F2/vWA-R2 (for the second round PCR) and Phusion High-Fidelity DNA Polymerase (New England Biolabs, M0530), which was subsequently Sanger-sequenced using primers K2-F2/K2-MR/K2-MF2/K2-R2/K2-R1/LastIntron-F/vWA-F3/vWA-R2 (Supplementary Table 8). The 3,740-bp fragment from TcLr9 was cloned into pJET1.2/blunt (Thermo Fisher Scientific, K1231) and used in FISH as a probe. Probe DNA was labeled with either digoxigenin-11-dUTP or biotin-16-dUTP following the manufacturer’s instructions (Roche, CA). Unlabeled total genomic wheat DNA was used as a blocker in GISH experiments. After post-hybridization washes, the probe hybridization signals were detected with Alexafluor 488 streptavidin (Invitrogen) for biotin-labeled probes, and rhodamine-conjugated anti-digoxigenin (Roche) for dig-labeled probes. Images were captured using a Zeiss Axioplan 2 microscope (Carl Zeiss Microscopy) mounted with a cooled CCD camera CoolSNAP HQ2 (Photometrics) and operated with AxioVision 4.8 software. The final contrast of the images was processed using Adobe Photoshop CS5.
Identification of WTK6-vWA homologs
WTK6-vWA homologous genes were retrieved by BLAST search against whole-genome assemblies of 12 bread wheat cultivars, durum wheat, wild emmer wheat and six Aegilops species at WheatOmics 1.0 (http://202.194.139.32/blast/blast.html)29,30,31,32,33,34,35,36,68; two rye cultivars and one hexaploid oat (OT3098) at GrainGenes (https://wheat.pw.usda.gov/blast/)37,38, Thinopyrum intermedium, Brachypodium distachyon, Dichanthelium oligosanthes and Eleusine coracana at Phytozome (https://phytozome-next.jgi.doe.gov)39,40; Eragrostis tef and Eragrostis curvula at GoGe (https://genomevolution.org/coge/)41,42 and Ae. umbellulata TA1851. Corresponding sequences were downloaded. Nonannotated sequences were manually annotated by MUSCLE alignment in Geneious Prime (v.2020.2.4). Translated protein sequences were used as input in the ‘NCBI Conserved Domain Search’ (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) for protein domain annotation.
Phylogenetic analyses
In total, 186 putative kinase and pseudokinase domains of tandem kinase proteins, tandem kinase-vWAs, and kinase-vWAs from different grass species were used13,14,15,16,17. The vWA domains of tandem kinase-vWA and kinase-vWA proteins from Supplementary Table 2, and vWA-containing proteins retrieved from proteomes of barley Morex v3 (https://wheat.pw.usda.gov/blast/), Aegilops speltoides AEG-9674-1 (http://202.194.139.32/blast/blast.html) and rice69 were used for the phylogenetic analyses. Conserved kinase and vWA domains were identified using NCBI Conserved Domain Search (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi). Phylogenetic trees (neighbor-joining tree) for kinase and vWA domains were computed with Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/) and drawn with iTOL (https://itol.embl.de/).
KASP assay
Five microliters reaction containing 2.5 μl of KASP Master Mix (Low ROX KBS-1016-016), 0.07 μl of assay mix (KASP-Lr9-F/KASP-TA10438-F/KASP-R; Supplementary Table 8) and 2.5 μl (50 ng) of DNA was used for the Lr9 KASP assay. PCR cycling was performed using an ABI QuantStudio 6 Flex Real-Time PCR machine as follows: preread at 30 °C for 60 s and hold the stage at 94 °C for 15 min, followed by ten touchdown cycles (94 °C for 20 s; touchdown at 61 °C, decreasing by 0.6 °C per cycle for 60 s) and 29 additional cycles (94 °C for 20 s; 55 °C for 60 s). Plates were read at 30 °C for endpoint fluorescence measurement.
HiFi library preparation, sequencing and assembly of TcLr9 and TA1851
High molecular weight (HMW) DNA was extracted from young seedlings of TcLr9 and TA1851 using a modified Qiagen Genomic DNA extraction protocol (https://doi.org/10.17504/protocols.io.bafmibk6)70. DNA was sheared to the appropriate size range (15–20 kb) using Megaruptor 3 (Diagenode) for the construction of PacBio HiFi sequencing libraries. Library preparation was done with the Express Template Prep Kit 2.0 (100-938-900 + Enzyme Clean up 2.0 (101-932-600)), and size was selected with a PippinHT System (Sage Science, HTP0001). Final SMRTbell QC was assessed with Qubit dsDNA High Sensitivity (Thermo Fisher Scientific, Q33230) and FEMTO Pulse (Agilent Technologies, P-0003-0817). Sequencing was performed on PacBio Sequel II/IIe systems in CCS mode. The 169.0 Gb of TcLr9 PacBio HiFi reads and 130.9 Gb of TA1851 PacBio HiFi reads were assembled using hifiasm (v.0.16.1)71 with default parameters. The genome assemblies were evaluated using QUAST 5.0.2 (ref. 72) and BBMap (v.38.96)59.
Re-sequencing of TA5605, Thatcher and TA10438
Genomic DNA was extracted from leaf samples of bread wheat accessions TA5605, Thatcher and Ae. triuncialis accession TA10438 using a CTAB DNA extraction method. Library preparation and sequencing were done as a service by Novogene. In brief, sequencing libraries were generated using the NEBNext Ultra II DNA Library Prep Kit and sequenced on an Illumina NovaSeq 6000 system. Illumina sequencing resulted in about 564.3 Gb, 542.5 Gb and 561.8 Gb of 2 × 150 bp paired-end reads for TA5605, Thatcher and TA10438, respectively, corresponding to about 36-fold, 35-fold and 53-fold genome coverage for TA5605, Thatcher and TA10438, respectively.
Assembly of the Lr9 translocation
To calculate the average nucleotide identity (ANI) between contigs TcLr9ptg001727l (TcLr9 contig carrying Lr9) and TA1851ptg000193l (TA1851 contig carrying Lr9), we aligned them using Minimap2 (v.2.21; parameter ‘-ax asm5’)73. InDels and single-nucleotide polymorphisms (SNPs) were manually evaluated in nonoverlapping sliding windows of 10 kb along the two contigs. The assemblies of TcLr9 and TA1851 were merged and used as a reference for the Illumina read mapping of TA5605, Thatcher and TA10438. Illumina reads of TA5605, Thatcher and TA10438 were aligned against the combined PacBio assembly using BBMap (v.38.96; parameters—ambiguous=random; trimreaddescriptions=t; pairedonly=t; perfectmode=t nodisk). From the output SAM file, the read coverage for the contig TcLr9ptg001727l has been calculated in nonoverlapping sliding windows of 50 kb using bedtools (v2.30.0)74 and mosdepth (v. 0.3.3)75. The heatmap representing the read coverage was generated with TBtools (v1.09873)62. We used a k-mer-based approach to reconstruct the Lr9 translocation. Briefly, we counted the canonical 51-mers from the 130.9 Gb TA1851 raw HiFi reads using Jellyfish (v. 2.2.10)76, then filtered the 51-mers with at least four occurrences. The dump file containing the 51-mers was converted into fasta format using AWK, which generated 2,726,203,170 k-mers. Then, the 51-mers were mapped against the TcLr9 assembly using BWA-MEM (v.0.7.17) allowing only perfect matches (parameter, -k 51 -T 51), generating a bam file of k-mer mapping, which was imported to Geneious Prime (v.2022.1.1) and sorted by mean coverage from highest to lowest (Supplementary Table 4). Thirteen contigs with average mapping coverage above 20 were considered originating from Ae. umbellulata (Supplementary Table 4). We performed a BLAST search using nonrepetitive sequences of 13 TcLr9 contigs against the TA1851 assembly and identified five corresponding contigs in the Ae. umbellulata genome. To validate, order and orient the five selected TA1851 contigs, we anchored 3,009 markers (SNP tags) of the Ae. umbellulata linkage map45 to the TA1851 assembly using BBMap (v.38.96) (parameters—ambiguous=random; trimreaddescriptions=t; subfilter=1 nodisk)59 (Fig. 3d and Supplementary Table 5). k-mer mapping revealed an introgression breakpoint in contig TcLr9ptg002383l. To further investigate the breakpoint, we extracted 10 kb sequences upstream and downstream of the breakpoint from TA1851ptg000319l, TcLr9ptg002383l and the long arm of CS chromosome 6B. Dot plot graphs were generated for 20 kb sequences from TA1851 and CS against TcLr9 using Geneious Prime (v.2020.2.4), which revealed the same breakpoint as with k-mer mapping (Extended Data Fig. 8). We amplified the PCR markers SCS5550 (ref. 77) and Xncw-Lr58-1 (ref. 78) from TcLr9 and TA5605 and sequenced the PCR products using Sanger sequencing. Both markers showed the same sequence from TcLr9 and TA5605, and they were anchored to contigs TcLr9ptg001727l and TA1851ptg000193l, respectively. The Lr58 linked markers BCD410, KSUD23, KUSF11, BG123 and Xcfd50 (ref. 3) were also anchored to the TcLr9 contigs using published probe or SSR sequences (Fig. 3d).
Real-time quantitative PCR
Leaves of TcLr9 were collected 0, 24, 48, 72, 96, 120, 144 and 192 h after inoculation with the avirulent Pt isolate B9414. Total RNA was extracted using the Maxwell RSC Plant RNA Kit. First-strand cDNA was synthesized using the iScript cDNA Synthesis Kit (Bio-Rad). RT–qPCR was performed using the primer pairs KKV-qPCR-F/KKV-qPCR-R and Ta-GAPDH-qPCR-F/Ta-GAPDH-qPCR-R (Supplementary Table 8) on a QuantStudio 6 Flex Real-Time PCR system (Applied Biosystems) using PowerUp SYBR Green Master Mix (Thermo Fisher Scientific, A25742). The wheat glyceraldehyde 3-phosphate dehydrogenase gene (GAPDH) was used as the endogenous control. The 2−ΔΔCT method was used to normalize and calibrate transcript values relative to the endogenous Ta-GAPDH control79.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Data supporting the findings of this work are available within the paper and its Supplementary Information. The raw Iso-seq and RNA-seq data used for MutIsoSeq, the PacBio CCS reads used for de novo whole-genome assemblies, the Illumina raw reads of TA5605, TA10438, Thatcher and sorted TA5605 chromosome 2B were deposited in the European Nucleotide Archive (ENA) under study number PRJEB53839. The Lr9 genomic and mRNA sequences were deposited in NCBI Genbank under accession numbers ON872164 and ON872165. The genome assemblies of TcLr9 and Aegilops umbellulata accession TA1851, the assembly of sorted TA5605 chromosome 2B, the clustered Iso-seq transcripts of TcLr9 and TA5605 and the CDS and genomic sequence of Lr9 are available on the DRYAD database under https://doi.org/10.5061/dryad.gxd2547pw.
Code availability
All software used in this study are publicly available as described in the 'Methods' section and the Reporting Summary.
References
Wulff, B. B. & Moscou, M. J. Strategies for transferring resistance into wheat: from wide crosses to GM cassettes. Front. Plant Sci. 5, 692 (2014).
Sears, E. R. Brookhaven Symposia in Biology Vol. 9, pp. 1–21 (1956).
Kuraparthy, V. et al. A cryptic wheat-Aegilops triuncialis translocation with leaf rust resistance gene Lr58. Crop Sci. 47, 1995–2003 (2007).
Tadesse, W. et al. Genetic gains in wheat breeding and its role in feeding the world. Crop Breed. Genet. Genom. 1, e190005 (2019).
Savary, S. et al. The global burden of pathogens and pests on major food crops. Nat. Ecol. Evol. 3, 430–439 (2019).
Kolmer, J. Leaf rust of wheat: pathogen biology, variation and host resistance. Forests 4, 70–84 (2013).
Hafeez, A. N. et al. Creation and judicious application of a wheat resistance gene atlas. Mol. Plant 14, 1053–1070 (2021).
Said, M. et al. Development of DNA markers from physically mapped loci in Aegilops comosa and Aegilops umbellulata using single-gene FISH and chromosome sequences. Front. Plant Sci. 12, 689031 (2021).
Molnar-Lang, M., Ceoloni, C. & Dolezel, J. Alien Introgression in Wheat (Springer, 2015).
Schachermayr, G. et al. Identification and localization of molecular markers linked to the Lr9 leaf rust resistance gene of wheat. Theor. Appl. Genet. 88, 110–115 (1994).
Huerta-Espino, J., Singh, R. P. & Reyna-Martinez, J. First detection of virulence to genes Lr9 and Lr25 conferring resistance to leaf rust of wheat caused by Puccinia triticina in Mexico. Plant Dis. 92, 311 (2008).
Brueggeman, R. et al. The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases. Proc. Natl Acad. Sci. USA 99, 9328–9333 (2002).
Klymiuk, V. et al. Cloning of the wheat Yr15 resistance gene sheds light on the plant tandem kinase-pseudokinase family. Nat. Commun. 9, 3735 (2018).
Chen, S. S. et al. Wheat gene Sr60 encodes a protein with two putative kinase domains that confers resistance to stem rust. N. Phytol. 225, 948–959 (2020).
Lu, P. et al. A rare gain of function mutation in a wheat tandem kinase confers resistance to powdery mildew. Nat. Commun. 11, 680 (2020).
Gaurav, K. et al. Population genomic analysis of Aegilops tauschii identifies targets for bread wheat improvement. Nat. Biotechnol. 40, 422–431 (2022).
Yu, G. et al. Aegilops sharonensis genome-assisted identification of stem rust resistance gene Sr62. Nat. Commun. 13, 1607 (2022).
Arora, S. et al. A wheat kinase and immune receptor form host-specificity barriers against the blast fungus. Nat. Plants. 9, 385–392 (2023).
Liu, J. X., Jambunathan, N. & McNellis, T. Transgenic expression of the von Willebrand A domain of the BONZAI1/COPINE1 protein triggers a lesion-mimic phenotype in Arabidopsis. Planta 221, 85–94 (2005).
Whittaker, C. A. & Hynes, R. O. Distribution and evolution of von Willebrand/integrin A domains: widely dispersed domains with roles in cell adhesion and elsewhere. Mol. Biol. Cell 13, 3369–3387 (2002).
Yin, X. et al. Rice copine genes OsBON1 and OsBON3 function as suppressors of broad-spectrum disease resistance. Plant Biotechnol. J. 16, 1476–1487 (2018).
Yang, S. H. & Hua, J. A haplotype-specific resistance gene regulated by BONZAI1 mediates temperature-dependent growth control in Arabidopsis. Plant Cell 16, 1060–1071 (2004).
Klymiuk, V., Coaker, G., Fahima, T. & Pozniak, C. J. Tandem protein kinases emerge as new regulators of plant immunity. Mol. Plant Microbe Interact. 34, 1094–1102 (2021).
Huang, C., Yoshino-Koh, K. & Tesmer, J. J. A surface of the kinase domain critical for the allosteric activation of G protein-coupled receptor kinases. J. Biol. Chem. 284, 17206–17215 (2009).
Bajaj, K., Chakrabarti, P. & Varadarajan, R. Mutagenesis-based definitions and probes of residue burial in proteins. Proc. Natl Acad. Sci. USA 102, 16221–16226 (2005).
Hanks, S. K., Quinn, A. M. & Hunter, T. The protein kinase family: conserved features and deduced phylogeny of the catalytic domains. Science 241, 42–52 (1988).
Leslie, M. E., Lewis, M. W., Youn, J.-Y., Daniels, M. J. & Liljegren, S. J. The EVERSHED receptor-like kinase modulates floral organ shedding in Arabidopsis. Development 137, 467–476 (2010).
International Wheat Genome Sequencing Consortium. Shifting the limits in wheat research and breeding through a fully annotated and anchored reference genome sequence. Science 361, eaar7191 (2018).
Walkowiak, S. et al. Multiple wheat genomes reveal global variation in modern breeding. Nature 588, 277–283 (2020).
Zhu, T. et al. Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly. Plant J. 107, 303–314 (2021).
Sato, K. et al. Chromosome-scale genome assembly of the transformation-amenable common wheat cultivar ‘Fielder’. DNA Res. 28, dsab008 (2021).
Athiyannan, N. et al. Long-read genome sequencing of bread wheat facilitates disease resistance gene cloning. Nat. Genet. 54, 227–231 (2022).
Aury, J.-M. et al. Long-read and chromosome-scale assembly of the hexaploid wheat genome achieves high resolution for research and breeding. GigaScience 11, giac034 (2022).
Maccaferri, M. et al. Durum wheat genome highlights past domestication signatures and future improvement targets. Nat. Genet. 51, 885–895 (2019).
Zhou, Y. et al. Introgressing the Aegilops tauschii genome into wheat as a basis for cereal improvement. Nat. Plants. 7, 774–786 (2021).
Li, L. F. et al. Genome sequences of five Sitopsis species of Aegilops and the origin of polyploid wheat B subgenome. Mol. Plant 15, 488–503 (2022).
Rabanus-Wallace, M. T. et al. Chromosome-scale genome assembly provides insights into rye biology, evolution and agronomic potential. Nat. Genet. 53, 564–573 (2021).
Li, G. et al. A high-quality genome assembly highlights rye genomic characteristics and agronomically important genes. Nat. Genet. 53, 574–584 (2021).
Studer, A. J. et al. The draft genome of the C3 panicoid grass species Dichanthelium oligosanthes. Genome Biol. 17, 223 (2016).
Hittalmani, S. et al. Genome and transcriptome sequence of finger millet (Eleusine coracana (L.) Gaertn.) provides insights into drought tolerance and nutraceutical properties. BMC Genomics 18, 465 (2017).
Carballo, J. et al. A high-quality genome of Eragrostis curvula grass provides insights into Poaceae evolution and supports new strategies to enhance forage quality. Sci. Rep. 9, 10250 (2019).
VanBuren, R. et al. Exceptional subgenome stability and functional divergence in the allotetraploid Ethiopian cereal teff. Nat. Commun. 11, 884 (2020).
van der Hoorn, R. A. & Kamoun, S. From Guard to Decoy: a new model for perception of plant pathogen effectors. Plant Cell 20, 2009–2017 (2008).
Wenger, A. M. et al. Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome. Nat. Biotechnol. 37, 1155–1162 (2019).
Edae, E. A., Olivera, P. D., Jin, Y. & Rouse, M. N. Genotyping-by-sequencing facilitates a high-density consensus linkage map for Aegilops umbellulata, a wild relative of cultivated wheat. G3 (Bethesda) 7, 1551–1561 (2017).
Ortelli, S., Winzeler, M., Winzeler, H. & Nösberger, J. Leaf rust resistance gene Lr9 and winter wheat yield reduction: II. Leaf gas exchange and root activity. Crop Sci. 36, 1595–1601 (1996).
Yu, G. et al. The wheat stem rust resistance gene Sr43 encodes an unusual protein kinase. Nat. Genet. https://doi.org/10.1038/s41588-023-01402-1 (2023).
Fu, D. et al. A kinase-START gene confers temperature-dependent resistance to wheat stripe rust. Science 323, 1357–1360 (2009).
Faris, J. D. et al. A unique wheat disease resistance-like gene governs effector-triggered susceptibility to necrotrophic pathogens. Proc. Natl Acad. Sci. USA 107, 13544–13549 (2010).
Zhang, Z. et al. A protein kinase–major sperm protein gene hijacked by a necrotrophic fungal pathogen triggers disease susceptibility in wheat. Plant J. 106, 720–732 (2021).
Sánchez-Martín, J. et al. Wheat Pm4 resistance to powdery mildew is controlled by alternative splice variants encoding chimeric proteins. Nat. Plants. 7, 327–341 (2021).
Messmer, M. M. et al. Genetic analysis of durable leaf rust resistance in winter wheat. Theor. Appl. Genet. 100, 419–431 (2000).
Martinez, F., Niks, R. E., Singh, R. P. & Rubiales, D. Characterization of Lr46, a gene conferring partial resistance to wheat leaf rust. Hereditas 135, 111–114 (2001).
Kolmer, J. A. Virulence of Puccinia triticina, the wheat leaf rust fungus, in the United States in 2017. Plant Dis. 103, 2113–2120 (2019).
Wang, Y. et al. Orthologous receptor kinases quantitatively affect the host status of barley to leaf rust fungi. Nat. Plants. 5, 1129–1135 (2019).
Watson, A. et al. Speed breeding is a powerful tool to accelerate crop research and breeding. Nat. Plants. 4, 23–29 (2018).
Wicker, T., Matthews, D. E. & Keller, B. TREP: a database for Triticeae repetitive elements. Trends Plant Sci. 7, 561–562 (2002).
Shen, W., Le, S., Li, Y. & Hu, F. SeqKit: a cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PLoS One 11, e0163962 (2016).
Bushnell, B. BBMap: a fast, accurate, splice-aware aligner. Proceedings of 9th Annual Genomics of Energy & Environment Meeting (LBNL Department of Energy Joint Genome Institute, 2014).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Sánchez-Martín, J. et al. Rapid gene isolation in barley and wheat by mutant chromosome sequencing. Genome Biol. 17, 221 (2016).
Chen, C. et al. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13, 1194–1202 (2020).
Green, M. R. & Sambrook, J. Nested polymerase chain reaction (PCR). Cold Spring Harb. Protoc. 2019, (2019).
Yuan, C. et al. A high throughput barley stripe mosaic virus vector for virus induced gene silencing in monocots and dicots. PLoS One 6, e26468 (2011).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Koo, D.-H., Liu, W., Friebe, B. & Gill, B. S. Homoeologous recombination in the presence of Ph1 gene in wheat. Chromosoma 126, 531–540 (2017).
Koo, D.-H. et al. Extrachromosomal circular DNA-based amplification and transmission of herbicide resistance in crop weed Amaranthus palmeri. Proc. Natl Acad. Sci. USA 115, 3332–3337 (2018).
Ma, S. et al. WheatOmics: a platform combining multiple omics data to accelerate functional genomics studies in wheat. Mol. Plant 14, 1965–1968 (2021).
Karkute, S. G. et al. Genome-wide analysis of von Willebrand factor A gene family in rice for its role in imparting biotic stress resistance with emphasis on rice blast disease. Rice Sci. 29, 375–384 (2022).
Driguez, P. et al. LeafGo: Leaf to Genome, a quick workflow to produce high-quality de novo plant genomes using long-read sequencing technology. Genome Biol. 22, 256 (2021).
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170–175 (2021).
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Pedersen, B. S. & Quinlan, A. R. Mosdepth: quick coverage calculation for genomes and exomes. Bioinformatics 34, 867–868 (2018).
Marcais, G. & Kingsford, C. Jellyfish: a fast k-mer counter. https://www.cbcb.umd.edu/software/jellyfish/jellyfish-manual-1.1.pdf (2012).
Gupta, S. K., Charpe, A., Koul, S., Prabhu, K. V. & Haq, Q. M. R. Development and validation of molecular markers linked to an Aegilops umbellulata–derived leaf-rust-resistance gene, Lr9, for marker-assisted selection in bread wheat. Genome 48, 823–830 (2005).
Kuraparthy, V., Sood, S., Guedira, G.-B. & Gill, B. S. Development of a PCR assay and marker-assisted transfer of leaf rust resistance gene Lr58 into adapted winter wheats. Euphytica 180, 227–234 (2011).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods 25, 402–408 (2001).
Acknowledgements
We are grateful to L. Zou (KAUST) for technical assistance, Y. Zhou (KAUST) and H.I. Ahmed (KAUST) for discussions on bioinformatics analyses and the KAUST Bioscience Core Lab for sequencing support. We thank Z. Dubská, R. Šperková and J. Weiserová (Institute of Experimental Botany) for the preparation of chromosome samples for flow cytometry, M. Said and P. Cápál (Institute of Experimental Botany) for chromosome 2B sorting and J. Raupp from the Wheat Genetics Resource Center for providing germplasm. We thank J. A. Kolmer from the USDA-ARS Cereal Disease Laboratory, St. Paul, MN, for providing P. triticina isolate MNPSD. We thank P. Lu from IGDB, CAS, for providing TKP sequences for phylogenetic analysis. This publication is based upon work supported by the King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) under Award OSR-CRG2018-3768. D.K. was supported by WGRC/IUCRC and NSF (grant 1822162). I.M. received support from the Marie Curie Fellowship grant ‘AEGILWHEAT’ (H2020-MSCA-IF-2016-746253) and from the Hungarian National Research, Development and Innovation Office (K135057). M.K., K.H. and J.D. received support from the ERDF project 'Plants as a tool for sustainable global development' (CZ.02.1.01/0.0/0.0/16_019/0000827). Computational resources were supplied by the project 'e-Infrastruktura CZ' (e-INFRA LM2018140) provided within the program Projects of Large Research, Development and Innovations Infrastructures.
Author information
Authors and Affiliations
Contributions
Y.W. and S.G.K. designed the overall concept of the research. Y.W. performed molecular experiments and rust inoculations. Y.W. and N.A. generated mapping populations. Y.W., M.A. and E.C. performed assemblies and analyzed genomic data. D.K. and J.P. supplied germplasm and performed fluorescence in situ hybridization and GISH experiments. S.G. and L.J. performed protein modeling. M.K., I.M. and J.D. performed chromosome flow sorting. K.H. sequenced flow-sorted chromosome 2B. Y.W. and S.G.K. wrote the initial version of the manuscript. All authors contributed to subsequent versions and have read and approved the manuscript.
Corresponding author
Ethics declarations
Competing interests
All authors declare no competing interests.
Peer review
Peer review information
Nature Genetics thanks Tzion Fahima, Zhiyong Liu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Lr9 and Lr58 confer resistance to multiple Puccinia triticina (Pt) isolates.
Seedling inoculations on the Lr58-carrying wheat line TA5605 (near isogenic line in the genetic background of bread wheat line WL711) and the Lr9-containing bread wheat line ThatcherLr9. Images were taken 12 days after inoculation with seven different Pt isolates. Isolates B9414, 93012, 96237, 95012, 94015, and 96007 are avirulent on Lr9. Pt isolate MNPSD is virulent on Lr9. Scale bar = 1 cm.
Extended Data Fig. 2 Representative images showing leaf rust symptoms on susceptible TA5605 EMS mutants.
TA5605 M3 mutants (a) and M2 mutants (b) were inoculated with P. triticina isolate B9414. Shown are leaves 12 days after inoculation. Spt = Suceptible to Puccinia triticina. Scale bar = 1 cm.
Extended Data Fig. 3 WTK6-vWA co-segregated with Lr9-mediated disease resistance and a published Lr9 marker.
Shown are representatives images of an F2 population derived from a cross between ThatcherLr9 and Avocet S inoculated with the avirulent P. triticina isolate B9414. M-VWA represents a PCR marker developed from the WTK6-vWA coding sequence. SCS5550 represents a PCR marker reported to be linked to Lr9. Markers were genotyped as present (+) or absent (-). R = resistant, S = susceptible. Images were taken 12 days after inoculation. Scale bar = 1 cm.
Extended Data Fig. 4 WTK6-vWA co-segregated with Lr58-mediated disease resistance and a published Lr58 marker.
Shown are representative images of an F2 population derived from a cross between TA5605 and Avocet S inoculated with the avirulent P. triticina isolate 96007. M-VWA represents a PCR marker developed from the WTK6-vWA coding sequence. Xncw-Lr58-1 represents a PCR marker reported to be linked with Lr58. Markers were genotyped as present (+) or absent (-). R = resistant, S = susceptible. Images were taken 12 days after inoculation. Scale bar = 1 cm.
Extended Data Fig. 5 Non-redundant mutations in WTK6-vWA identified in the EMS mutagenesis.
The kinase 1 domain is surrounded by a red box, kinase 2 by a brown box, the vWA domain by a purple box, and the Vwaint domain by a blue box. Roman numerals represent conserved kinase subdomains. Letters above the Lr9 amino acid sequence represent EMS-induced amino acid substitutions. Letters in gray were found in partially susceptible mutants, indicating a knock-down of protein function. ‘*’ indicates mutations causing a premature stop codon. Black triangles = ATP binding site; red triangles = key conserved residues, black asterisks = putative substrate binding site, blue squares = residue determining RD and non-RD kinases, orange triangle = glutamic acid to aspartic acid substitution in subdomain III of kinase 2.
Extended Data Fig. 6 Schematic representation showing the origin and distribution of WTK6-vWA gene family in Poaceae.
Rectangular boxes in red and brown colors represent kinase domains belonging to the LRR_8B (cysteine rich kinases) and LRR_3 (leucine-rich-repeat receptor kinase subfamily 3) families. Bent ovals in blue, purple and orange represent vWA domains. Blue dots indicate kinase vWA gene fusion events. Numbers in brackets represent the maximum number of genes found in a given accession of each species.
Extended Data Fig. 7 Gene structure and RT-qPCR analysis of WTK6-vWA.
a, WTK6-vWA gene structure. Black boxes = exons; lines = introns; white boxes = 5’UTR and 3’UTR, red arrows indicate the positions of primers used for RT-qPCR. b, Relative expression of WTK6-vWA in ThatcherLr9 after inoculation with the avirulent P. triticina isolate B9414. hai = hours after inoculation. Error bars represent standard deviations of three biological replicates. Statistical analysis was done using a two-tailed t-test.
Extended Data Fig. 8 k-mer mapping rates and dot plots identified the exact breakpoint between the Ae. umbellulata translocation and bread wheat chromosome 6B.
a, Dotplot showing 20 kb sequence alignment between ThatcherLr9 and TA1851 around the translocation breakpoint. The green bar chart indicates mean k-mer mapping coverage (KMC) of TA1851 k-mers acrross the 20 kb region in ThatcherLr9. b, Dot plot showing a 20 kb sequence alignment between ThatcherLr9 and Chinese Spring around the translocation breakpoint.
Supplementary information
Supplementary Information
Supplementary Notes 1 and 2 and Supplementary Figs. 1–9.
Supplementary Tables
Supplementary Tables 1–8.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, Y., Abrouk, M., Gourdoupis, S. et al. An unusual tandem kinase fusion protein confers leaf rust resistance in wheat. Nat Genet 55, 914–920 (2023). https://doi.org/10.1038/s41588-023-01401-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41588-023-01401-2
This article is cited by
-
Integrated genome-wide association and transcriptomic analysis to identify receptor kinase genes to stripe rust resistance in wheat germplasm from southwestern China
BMC Plant Biology (2024)
-
SPL50 Regulates Cell Death and Resistance to Magnaporthe Oryzae in Rice
Rice (2024)
-
Plant pangenomes for crop improvement, biodiversity and evolution
Nature Reviews Genetics (2024)
-
A kinase fusion protein from Aegilops longissima confers resistance to wheat powdery mildew
Nature Communications (2024)
-
Wheat powdery mildew resistance gene Pm13 encodes a mixed lineage kinase domain-like protein
Nature Communications (2024)