Effector diversification within compartments of the Leptosphaeria maculans genome affected by Repeat-Induced Point mutations

Rouxel, Thierry; Grandaubert, Jonathan; Hane, James K.; Hoede, Claire; van de Wouw, Angela P.; Couloux, Arnaud; Dominguez, Victoria; Anthouard, Véronique; Bally, Pascal; Bourras, Salim; Cozijnsen, Anton J.; Ciuffetti, Lynda M.; Degrave, Alexandre; Dilmaghani, Azita; Duret, Laurent; Fudal, Isabelle; Goodwin, Stephen B.; Gout, Lilian; Glaser, Nicolas; Linglin, Juliette; Kema, Gert H. J.; Lapalu, Nicolas; Lawrence, Christopher B.; May, Kim; Meyer, Michel; Ollivier, Bénédicte; Poulain, Julie; Schoch, Conrad L.; Simon, Adeline; Spatafora, Joseph W.; Stachowiak, Anna; Turgeon, B. Gillian; Tyler, Brett M.; Vincent, Delphine; Weissenbach, Jean; Amselem, Joëlle; Quesneville, Hadi; Oliver, Richard P.; Wincker, Patrick; Balesdent, Marie-Hélène; Howlett, Barbara J.

doi:10.1038/ncomms1189

Download PDF

Article
Open access
Published: 15 February 2011

Effector diversification within compartments of the Leptosphaeria maculans genome affected by Repeat-Induced Point mutations

Thierry Rouxel¹^na1,
Jonathan Grandaubert¹^na1,
James K. Hane²,
Claire Hoede³,
Angela P. van de Wouw⁴,
Arnaud Couloux⁵,
Victoria Dominguez³,
Véronique Anthouard⁵,
Pascal Bally¹,
Salim Bourras¹,
Anton J. Cozijnsen⁴,
Lynda M. Ciuffetti⁶,
Alexandre Degrave¹,
Azita Dilmaghani¹,
Laurent Duret⁷,
Isabelle Fudal¹,
Stephen B. Goodwin⁸,
Lilian Gout¹,
Nicolas Glaser¹,
Juliette Linglin¹,
Gert H. J. Kema⁹,
Nicolas Lapalu³,
Christopher B. Lawrence¹⁰,
Kim May⁴,
Michel Meyer¹,
Bénédicte Ollivier¹,
Julie Poulain⁵,
Conrad L. Schoch¹¹,
Adeline Simon¹,
Joseph W. Spatafora⁶,
Anna Stachowiak¹²,
B. Gillian Turgeon¹³,
Brett M. Tyler¹⁰,
Delphine Vincent¹⁴,
Jean Weissenbach⁵,
Joëlle Amselem³,
Hadi Quesneville³,
Richard P. Oliver¹⁵,
Patrick Wincker⁵,
Marie-Hélène Balesdent¹ &
…
Barbara J. Howlett⁴

Nature Communications volume 2, Article number: 202 (2011) Cite this article

12k Accesses
356 Citations
20 Altmetric
Metrics details

Subjects

Abstract

Fungi are of primary ecological, biotechnological and economic importance. Many fundamental biological processes that are shared by animals and fungi are studied in fungi due to their experimental tractability. Many fungi are pathogens or mutualists and are model systems to analyse effector genes and their mechanisms of diversification. In this study, we report the genome sequence of the phytopathogenic ascomycete Leptosphaeria maculans and characterize its repertoire of protein effectors. The L. maculans genome has an unusual bipartite structure with alternating distinct guanine and cytosine-equilibrated and adenine and thymine (AT)-rich blocks of homogenous nucleotide composition. The AT-rich blocks comprise one-third of the genome and contain effector genes and families of transposable elements, both of which are affected by repeat-induced point mutation, a fungal-specific genome defence mechanism. This genomic environment for effectors promotes rapid sequence diversification and underpins the evolutionary potential of the fungus to adapt rapidly to novel host-derived constraints.

Complete telomere-to-telomere genomes uncover virulence evolution conferred by chromosome fusion in oomycete plant pathogens

Article Open access 30 May 2024

Implications of the three-dimensional chromatin organization for genome evolution in a fungal plant pathogen

Article Open access 24 February 2024

Quantitative pathogenicity and host adaptation in a fungal plant pathogen revealed by whole-genome sequencing

Article Open access 02 March 2024

Introduction

Fungi are the most important pathogens of cultivated plants, causing about 20% yield losses worldwide. Such diseases are a major cause of malnutrition worldwide¹. Their phenotypic diversity and genotypic plasticity enable fungi to adapt to new host species and farming systems and to overcome new resistance genes or chemical treatments deployed in attempts to limit losses to crop yields². Along with such genotypic plasticity, natural or anthropogenic long-distance dispersal of fungi allows the emergence of novel, better-adapted phytopathogens and more damaging diseases. These processes of adaptation are exemplified by Leptosphaeria maculans 'brassicae' (Phyllum Ascomycota, class Dothideomycetes), which causes stem canker (blackleg) of oilseed rape (Brassica napus) and other crucifers. This fungus has been recorded on crucifers (mainly cabbages) since 1791, but only began to cause substantial damage to broad acre Brassica species and spread around the world in the last four decades³. Other phytopathogens often rapidly cause lesions on plants to ensure asexual reproduction. In contrast, L. maculans shows an unusually complex parasitic cycle with alternating saprotrophy associated with sexual reproduction on stem debris, necrotrophy and asexual sporulation on leaf lesions, endophytic and symptomless systemic growth, and a final necrotrophic stage at the stem base³.

Some features of filamentous fungal genomes are remarkably constant; for instance, size (20–60 Mb typically about 34 Mb), gene number (10,000–13,000), gene content, intron size and number, and the low content of repeated sequences⁴. Comparative genomic approaches have shown that most of the candidate 'pathogenicity genes' (for example, those encoding hydrolytic enzymes that can degrade plant cell walls, or involved in formation of infection structures) analysed in the last decade in a gene-by-gene approach are shared by saprobes and pathogens⁴. These genes were probably recruited as pathogenicity factors when phytopathogens evolved from saprobes, but they do not account for host range or host specificity of phytopathogens. Such roles are played by 'effector' proteins, which modulate host innate immunity, enable parasitic infection and are generally genus, species, or even isolate-specific^5,6. Such effector genes include those with a primary function as avirulence genes or encoding toxins or suppressors of plant defense. While bacteria produce few effectors (typically <30), which mostly seem to suppress plant innate immunity⁷, hundreds of candidate effectors have been identified in oomycetes^8,9,10. In fungi, in contrast, such a catalogue of effectors has only been established to-date in the hemibasidiomycete pathogen of maize, Ustilago maydis, in which many of the effector genes are organized as gene clusters¹¹.

In L. maculans, the only characterized effectors include a toxic secondary metabolite, sirodesmin PL¹² and the products of three avirulence genes, AvrLm1, AvrLm6 and AvrLm4-7, of which at least one, AvrLm4-7, is implicated in fungal fitness^13,14,15. These three avirulence genes show typical features of effector genes; that is, they are predominantly expressed early in infection, encode small proteins predicted to be secreted (SSPs) into the plant apoplast and have no or few matches in databases. Intriguingly, all three are located within large AT-rich, heterochromatin-like regions that are mostly devoid of other coding sequences^13,15.

In this paper, we describe the genome of L. maculans. We speculate how the genome, characterized by a distinct division into guanidine–cytosine (GC)-equilibrated and AT-rich blocks of homogenous nucleotide composition, has been reshaped following massive invasion by and subsequent degeneration of transposable elements (TEs). We also predict the repertoire of pathogenicity effectors for the first time in an ascomycete genome and we propose how the unusual genome structure may have led to the diversification and evolution of effectors.

Results

General features of the L. maculans genome

The haploid genome of strain v23.1.3 of L. maculans 'brassicae' was sequenced using a whole-genome shotgun strategy. This fungus is closely related to Phaeosphaeria (Stagonospora) nodorum, Pyrenophora tritici-repentis and Cochliobolus heterostrophus, as seen in the phylogeny based on sequence analysis of a range of genes (Supplementary Table S1; Fig. 1). The genome assembly had a total size of 45.12 Mb, scaffolded into 76 SuperContigs (SCs; 30 large SCs >143 kb; Tables 1 and 2; Supplementary Table S2). The correspondence of SCs to chromosomes was inferred by a combination of approaches (Fig. 2; Supplementary Figs S1 and S2). Conglomerated data are consistent with the presence of 17 or 18 chromosomes, ten of which correspond to single SCs (Supplementary Fig. S1; Supplementary Table S2).

**Figure 1: Phylogenetic relationships between Dothideomycetes and an example of microsynteny between related species.**

Table 1 Assembly statistics for the L. maculans genome.

Full size table

Table 2 Features of genomes of L. maculans and other related Dothideomycetes.

Full size table

**Figure 2: Main features of the *L. maculans* genome as exemplified by chromosome 5 SuperContig 1.**

Gene models were identified using the EuGene prediction pipeline (Supplementary Tables S3 and S4), and the resulting total of 12,469 genes is consistent with that in other Dothideomycetes (Table 2). Expression of 84.4% of predicted genes was detected using NimbleGen custom-oligoarrays in free-living mycelium or during early stages of oilseed rape infection (Table 3). About 10% of the genes were significantly overexpressed during infection (Table 3). Taking into account expressed-sequence-tag (EST), transcriptomic, and proteomic support, 84.8% of the gene models were biologically validated (Table 3). The genes are shorter than those in the other Dothideomycetes whose genomes have been sequenced (Table 2). Intergenic distances are shorter than those of P. nodorum, the closest relative to have been sequenced, and bi-directional promoters are common (Supplementary Table S5).

Table 3 Comparative features of SSP-encoding genes occurring in diverse genome environments.

Full size table

Automated finding and annotation of repeated elements in the genome using the REPET pipe-line (http://urgi.versailles.inra.fr/index.php/urgi/Tools/REPET) showed that they comprise one-third of the genome compared to 7% in P. nodorum (Table 2). Although most of the repeat elements are truncated and occur as mosaics of multiple families, their origin as TEs is clear (Supplementary Data S1 and S2). Class I elements (see ref. 16 and Table 4 for classification of TEs) dominate with nine families comprising 80% of the repeated elements (Table 4, Supplementary Data S1). Of these, just four families comprise 11.37 Mb, which is 25% of the genome assembly. Very few, if any, of the TEs are transcribed, as shown by EST inspection and transcriptomic analysis. TEs are clustered in blocks distributed across SCs, and the number of TE copies per SC correlates with size of the SC (R²=0.86; Supplementary Fig. S3).

Table 4 Main families and characteristics of transposable elements and other repeats in the L. maculans genome.

Full size table

The TEs are RIP affected

Alignment and comparison of repeat families also showed a pattern of nucleotide substitution consisting mainly of C-to-T and G-to-A changes, suggesting the presence of repeat-induced point mutation (RIP). RIP is a premeiotic repeat-inactivation mechanism specific to fungi and has been previously experimentally identified in L. maculans¹⁷. The L. maculans genome possesses orthologues of all the Neurospora crassa genes currently postulated to be necessary for RIP¹⁸ (Supplementary Table S6). Analysis using RIPCAL, a quantitative alignment-based method¹⁹, indicated that C bases within CpA dinucleotides were mutated to T, more frequently than the sum of CpC, CpG and CpT dinucleotides, confirming the action of RIP on all of the TEs (Supplementary Figs S4 and S5; Supplementary Data S2).

The compartmentalized genome of L. maculans

The L. maculans genome is larger and has a lower overall GC content (44.1% GC) than those of the related Dothideomycetes P. nodorum, Alternaria brassicicola, C. heterostrophus, P. tritici-repentis or the more divergent species Mycosphaerella graminicola (Table 2). As previously reported for a broader range of fungi²⁰, the larger size is consistent with the genome having been extensively invaded by TEs. The GC content of ESTs and other known coding sequences is 50.5%, and the low genome GC content is due to the compartmentalized structure of the genome into GC-equilibrated regions (51.0% GC content, sizes between 1 and 500 kb, average 70.4 kb; henceforth denoted as GC-blocks) alternating with AT-rich regions (henceforth denoted as AT-blocks; averaging 33.9% GC content; with sizes between 1 and 320 kb, average of 38.6 kb). Whole-genome analysis identified 413 AT-blocks and 399 GC-blocks (Supplementary Table S7). The AT-blocks cover 36% of the genome and are distributed within the large SCs, comprising between 23.1 and 49.2% (Fig. 2c, Supplementary Table S7; Supplementary Fig. S6). SC22, corresponding to a minichromosome,²¹ contains nine AT-blocks amounting to 92.5% of the SC (Supplementary Table S7).

As well as differences in GC content, the two types of genomic regions are dissimilar in terms of recombination frequency and gene content. The number of crossovers (CO) along a chromosome ranges between 1.16 and 3.31, depending on size of the chromosome, with one CO every 820 kb on average. The recombination frequency is significantly higher between marker pairs located within GC-blocks than those located on each side of one AT-block (F-Fisher=5.873, P=0.019; Fig. 2d, Supplementary Fig. S7).

GC-blocks contain 95% of the predicted genes of the genome, at a higher density (4.2 per 10 kb) than in other Dothideomycetes (Table 2) and are mostly devoid of TEs. In contrast, AT-blocks are gene-poor, comprising only 5.0% of the predicted coding sequences, and mainly contain mosaics of TEs mutated by RIP, thus resulting in a low-GC content of TEs. There are three categories of AT-blocks: telomeres, which include a Penelope retroelement²² (Supplementary Fig. S8); large AT-blocks (216 sized 13–325 kb); and mid-sized AT-blocks (197 sized 1–13 kb; Supplementary Fig. S9), mostly corresponding to single integrations of only two families of DNA transposons (Supplementary Table S8).

In almost half of the cases where pairs of orthologues are on the same SC, the genes flanking AT-blocks in L. maculans have orthologues in P. nodorum that are either two consecutive genes or genes separated by only a few others (Fig. 1a; Supplementary Data S3). A similar pattern was observed for C. heterostrophus and P. tritici-repentis, suggesting that the TEs invaded the genome after the separation of Leptosphaeria from other species of suborder Pleosporineae 50–57 million years ago (MYA; Fig. 1b).

The ribosomal DNA repeat is extensively affected by RIP

In eukaryotes, the ribosomal DNA (rDNA) comprises a multigene family organized as large arrays of tandem repeats. The core unit is a single transcription unit that includes the 18S or Small Subunit, 5.8S, and 28S or Large Subunit separated by internal transcribed spacers (ITS1 and ITS2). Each transcription unit is separated by the Intergenic Spacer (Fig. 3a). Although essential duplicated regions would be expected to be protected from RIP mutations, the rDNA repeats in L. maculans are in part affected by RIP (Fig. 3b,c, Supplementary Fig. S10). The number of rDNA repeats ranges between 56 and 225 in different L. maculans isolates²³. The assembly of strain v23.1.3 has >150 repeats, only two of which are highly similar (99.6% identity) and are not affected by RIP. Fifty complete rDNA units and 107 incomplete units are present, and most of them are on extreme ends of SC2 and SC19, which are not complete chromosomes. Many of these repeats are severely affected by RIP (Fig. 3, Supplementary Fig. S10). Selker²⁴ has suggested that rDNA repeats in the nucleolus organizer region are protected from RIP. Our data indicate that this is not the case in L. maculans, at least for a part of the array of tandem repeats.

**Figure 3: Repeat-induced point (RIP) mutation in ribosomal DNA of *L. maculans* shown as RIPCAL output.**

AT-blocks as niches for effectors

As described above, AT-blocks have few genes. Furthermore, 76% of these genes are located close to the borders with GC-blocks; only 24% (148 genes) are located within AT-blocks (Table 3; Supplementary Data S4 and S5). Protein comparisons and Gene Ontology (GO) analysis indicate that AT-blocks are enriched in genes likely to have a role in pathogenicity (Supplementary Fig. S11). These include orphan genes such as those encoding SSPs, genes involved in response to chemical or biotic stimuli (Supplementary Fig. S11), as well as non-ribosomal peptide synthetases and polyketide synthases, which encode enzymes involved in biosynthesis of secondary metabolites (Supplementary Tables S9 and S10; Supplementary Fig. S12).

One hundred and twenty-two (∼20%) of the genes located in AT-blocks encode putative SSPs (Table 3; Supplementary Data S4). Only 4.2% of the genes in the GC-blocks encode SSPs (529 genes), and these lack many features of known effectors of L. maculans (Table 3). In contrast, the SSPs encoded in AT-blocks have features indicative of effectors such as low EST support in in vitro grown cultures, low abundance in in vitro secretome samples, increased expression upon plant infection, lack of recognizable domains or homologues in other fungi, and high cysteine content (Table 3; Supplementary Data S4). Three TEs, the retrotransposon, RLx_Ayoly, and two DNA transposons, DTF_Elwe and DTx_Gimli, are significantly over-represented in the immediate vicinity of SSPs (Supplementary Fig. S13). Although SSPs are never embedded within a single TE, four SSPs are inserted between two tandemly repeated copies of the DNA transposon DTM_Sahana.

As well as the avirulence genes, two SSPs, LmCys1 and LmCys2, have been functionally analysed. LmCys1 contributes to fungal growth in planta, whereas LmCys2 contributes to suppression of plant defence responses, reflecting their roles as effectors (I. Fudal, unpublished data). Expression of 70.2% of the SSP-encoding genes was detected (Table 3). Of these, 72.7% of the SSP-encoding genes located within AT-blocks (compared with 19.1–22.2% in GC-blocks) were over-expressed at early stages of infection of cotyledons compared with in vitro mycelium growth (Table 3; Supplementary Fig. S14). Accordingly, these are postulated to be effectors. In addition, 45% of the predicted SSPs in AT-blocks show a presence/absence polymorphism in field populations, as is the case for avirulence genes in L. maculans and other fungi²⁵. The SSPs in GC-blocks include 110 (20.8%) with best BLAST hits to hypothetical proteins from P. nodorum. In contrast, very few SSPs in AT-blocks have identifiable orthologues; only two (1.8%) had a best match to a predicted protein of P. nodorum (Supplementary Data S4). In addition to their lack of orthologues, SSPs in AT-blocks also lack paralogues; only seven genes belong to gene families comprising one to four paralogues. Biases in codon usage occur: in GC-blocks, the preferred codon for each of the 20 amino acids ends with a C or a G and the preferred stop codon is TGA, whereas in SSP genes located in AT-blocks, the preferred codon ends with an A or T for 13 amino acids and the preferred stop codon is TAA (Supplementary Table S11). This, however, only has a limited impact on amino acid favoured usage by SSPs (Supplementary Table S12).

Motifs resembling the RxLR translocation motifs of oomycetes were sought²⁶ following the validation that one such motif, RYWT, present in the N-terminal part of AvrLm6 allows translocation into plant and animal cells²⁶. Searches for 〈[RKH] X [LMIFYW] X〉 or 〈[RKH] [LMIFYW] X [RKH]〉 showed that up to 60% of SSPs in AT-blocks and up to 73% of SSP in GC-blocks have putative 'RxLR-like' motifs, implicating these SSPs as candidate effectors that enter plant cells (Supplementary Data S4).

History of genome invasion by TEs

A range of 278–320 MYA is estimated for the origin of the Dothideomycetes with the crown radiation of the class during the Permian (251–289 MYA; Fig. 1b). The origins of the plant pathogenic Pleosporineae is determined at 97–112 MYA, placing it in the Cretaceous at a time when flowering plants were beginning to become widespread and eudicots were emerging, during the late Cretaceous and Paleocene. Leptosphaeria likely diverged from the other species analysed between 50 and 57 MYA (Fig. 1b). Phylogenetic analyses suggest three main features of genome invasion by TEs: transposition bursts mostly after separation of L. maculans from other species of suborder Pleosporineae as indicated by a 'recent' divergence of the TE families, estimated to 4–20 MYA (Fig. 4a); a single or few wave(s) of massive transposition(s) followed by a 'rapid' decay, with some cases like DTM_Sahana where divergence between copies is extremely low; and no on-going waves of genome invasion by TEs (Fig. 4b). Like other organisms with a high density of TEs, the L. maculans genome exhibits 'nesting', where repeats occur within previously inserted TEs. In this fungus, TEs are commonly invaded by other TEs generating a complex 'nesting network'. Eighty-five % of these cases correspond to TEs invading one other TE (primary nesting relationship). Most of the retrotransposon families investigated can invade or be invaded to similar extents (Supplementary Table S8). They also can invade TEs from the same family (self-nests), but usually at a very low frequency compared with invasion of retrotransposons from other families. In contrast, the DNA transposons are more commonly invaded (23.3% of the cases) than acting as invaders (3.5% of the cases; Supplementary Table S8). In accordance with overlapping divergence time estimates (Figs 1b and 4), these data indicate periods of overlapping transpositional activity for the long terminal repeats retrotransposons that form the major part of AT-blocks. In such a scenario, the later insertions would be preferentially tolerated in existing decayed transposons. These TEs, having undergone RIP in their turn, would initiate a positive reinforcement loop that would create large AT-rich and gene-poor blocks of homogeneous nucleotide composition.

**Figure 4: Dynamics of transposable elements in the *L. maculans* genome.**

Discussion

The peculiar genomic structure of the L. maculans genome is reminiscent of that discovered in mammals and some other vertebrates: the base composition (GC-content) varies widely along chromosomes, but locally, base composition is relatively homogenous. Such structural features have been termed 'isochores'²⁷. In L. maculans, AT-blocks are gene-poor, rich in TEs and deficient in recombination compared with GC-blocks, as in mammals²⁷. However, despite these similarities, these genomic landscapes seem to result from different mechanisms. In mammals, the evolution of GC-rich isochores is most likely driven by recombination: genomic regions sized between 100 kb to several Mb with a high recombination rate tend to increase in GC content relative to the rest of the genome. This pattern is not due to a mutational effect of recombination, but most probably due to biased gene conversion²⁸. In L. maculans, variations in base composition occur at a much finer scale (the isochore-like blocks are about 10–20 times smaller than in mammals), and it is unknown whether biased gene conversion contributes to increase the GC content of GC-blocks. Conversely, L. maculans isochores can be attributed to the AT-biased mutational pattern induced by RIP mutation of TEs and their flanking regions, thus leading to the evolution of AT-rich isochores.

Although the evolutionary forces we postulate shaped the L. maculans genome are common to many species, no fungal genome characterized so far has a similar isochore-like structure. This structure reflects extensive genome invasion by TEs that are nonetheless tolerated by the pathogen and the existence of an active RIP machinery (Supplementary Table S6) that has so far been restricted to the Pezizomycotina subphylum of the Ascomycota and maintenance of sexual reproduction (necessary for RIP). Whereas many species seem to have maintained an active RIP machinery, most of the sequenced fungal genomes are poor in TEs, indicating that run-away genome expansion is normally deleterious. Also, many fungal species have lost the ability to cross in nature (for example, Fusarium oxysporum, Magnaporthe oryzae) and no case of large-scale sculpting of repeat-rich regions is found in these species, only some ancient signatures of RIP are found²⁹.

On the basis of the characteristics of avirulence genes in L. maculans, we have described a comprehensive repertoire of putative effectors, which has not previously been done for an Ascomycete. In L. maculans, AT-rich blocks are enriched in effector-like sequences. Location of effector genes has been investigated in only some eukaryotic genomes. A few of the effectors of M. oryzae are subtelomeric³⁰, as are those in protozoan parasites of animals, such as Plasmodium and Trypanosoma³¹. Genomes of many Fusarium species contain supernumerary 'B' chromosomes enriched in strain-specific effectors and accounting for the host range of each 'forma specialis'^32,33. The genome of the oomycete Phytophthora infestans has a plethora of effector candidates embedded in repetitive DNA, and diversification of these effectors is postulated to occur via segmental duplication and variation in intraspecific copy number, resulting in rapidly diverging multigene families⁸. The association between one family of effectors and a LINE in Blumeria graminis, the barley powdery mildew fungus, is proposed to provide a mechanism for amplifying and diversifying effectors³⁴. Diversification of effectors in the species mentioned above is postulated to be associated with TE-driven gene duplication and generation of multigene families. In the L. maculans genome, SSP-encoding genes are associated with only a few TE families, which may indicate the ability of TEs to 'pickup and move' effectors. In contrast to the above examples, duplicated effector genes are not present in L. maculans, a finding consistent with the steady inactivation of TEs by RIP and with ancient transposition activity before underdoing RIP.

The origins of some effector genes might be at least partially ascribed to lateral gene transfer, a specialty of species within the Pezizomycotina^35,36,37. Regardless of the origin of the effector genes, our data suggest that RIP is an important mechanism for generating diversity for genes occurring within AT-blocks of the genome of L. maculans, in a manner not previously documented in any other species. RIP has previously been reported to be restricted to duplicated DNA, but most SSPs or other genes in AT-blocks are present in single copies. How, then, can RIP act on SSP-encoding genes? Studies in N. crassa indicated that the RIP machinery can occasionally overrun the repeated region into adjacent single-copy genes³⁸. The embedding of SSPs within RIP-degenerated TEs would then favour such RIP leakage (Supplementary Fig. S4c), while selection pressure to maintain functional effectors would prevent them from becoming extinct due to an excessive degree of RIP. This would result in extensive mutation of the affected gene and could account for the mutation rate required for diversifying selection. In contrast, effector genes that became detrimental to pathogen fitness, such as avirulence genes subjected to resistance gene selection, would be lost rapidly as alleles that have undergone extensive RIP are selected for³⁹. Evidence for this scenario is provided by examination of RIP indices and in alignment-based studies of alleles of SSPs³⁹. The genes (including SSPs) within AT-blocks had higher TpA/ApT indices than those in GC-blocks (Table 3; Supplementary Fig. S15), consistent with former genes having been RIP affected. RIP indices for the effectors located within AT-blocks thus would be a compromise between values leading to complete degeneration of the sequence and values enabling sequence diversification while retaining functionality. In plant-pathogen systems, diversifying selection operates on effector genes whose products interact with host proteins²⁵. This has been demonstrated for both resistance and avirulence genes, but mechanisms for the diversifying selection of effectors have not been proposed. RIP is shown here to be a potential factor to create the genetic (hyper)variation needed for selection to occur in L. maculans, and this process may also act on effectors in other fungi⁴⁰.

These findings allow speculation about an evolutionary scenario for birth of isochore-like structures in the L. maculans genome and its incidence on effector diversification. First, the genome was invaded by a few families of TEs over a (relatively) short time period, mostly after the separation of L. maculans from other related fungi. This TE invasion is unlikely to have been targeted to pre-existing effector-rich genome regions as seen in microsynteny analyses (Fig. 1a). Accordingly, the most recent invader, DTM_Sahana, is not specifically associated with SSPs. Second, waves of overlapping transposition occurred with probable transduction, translation or duplication of genes, resulting in the large amplification of a few families. Such transpositions were primarily targeted to other TEs as shown by the nesting of retrotransposons within other TEs. In parallel, duplicated copies of TEs and genes (either duplicated or not) hosted within TE-rich regions underwent RIP either to extinction for TEs or to generate gene diversity in cases where a strong selection pressure to retain genes was exerted. This eventually resulted in complete inactivation of transposition events, and the sculpting of the genome in an isochore-like structure. Effector genes were maintained in AT-blocks to favour rapid response to selection pressure^39,41 and probable epigenetic concerted regulation of their expression (Supplementary Fig. S14b). L. maculans shows intriguing evolutionary convergence with both higher eukaryotes in terms of an isochore-like genome structure, and with oomycetes in terms of hosting effectors in highly dynamic 'plastic' regions of the genome⁸. It differs in exploiting a RIP-based mechanism for diversification and inactivation of effector genes.

The sequencing of genomes of several species or subspecies of the recent and more ancient outgroups that derived from a common ancestor with L. maculans will provide more information on origin of effectors, genome invasion by TEs and the subsequent effect on generation/diversification of effectors, and thus test the validity of the proposed evolutionary scenario.

Methods

Phylogenetic analysis

A taxon set containing representatives of most classes in Ascomycota was selected from the data matrices produced in two previous papers^42,43. Sequences were concatenated from the Small Subunit and Large Subunit of the nuclear ribosomal RNA genes and three protein coding genes, namely the translation elongation factor-1α and the largest and second largest subunits of RNA polymerase II (Supplementary Table S1). A phylogenetic analysis was performed using RAxML v. 7.0.4 (ref. 44) applying unique model parameters for each gene and codon. A combined bootstrap and maximum likelihood (ML) tree search was performed in RAxML with 500 pseudo replicates. The best scoring ML tree was analysed in the program R8sv1.7 (ref. 45) to produce a chronogram (Fig. 1b).

Sequencing and assembly

L. maculans 'brassicae' isolate v23.1.3. was sequenced because it harbours numerous avirulence genes, three of which have been cloned by a map-based strategy involving large-scale sequencing of surrounding genomic regions^13,15,41. Isolate v23.1.3. results from a series of in vitro crosses between European field isolates⁴⁶ and is representative of the populations of the pathogen prevalent in Europe in the mid-1990s.

DNA was provided as agarose plugs containing partly digested conidia²¹. Whole-genome shotgun sequencing of three types of libraries (high-copy-number plasmids with 3.3 kb inserts; low-copy-number plasmids with 10 kb inserts and fosmids with inserts 35 or 40 kb; Supplementary Table S13) was performed, and also six cDNA libraries, including ones derived from infected plants, were sequenced (Supplementary Table S14). Sequencing reads were assembled using Arachne⁴⁷ (Table 1) and the correspondence of SCs to chromosome was inferred by aligning the genetic map to the genome sequence, hybridization of single-copy markers to chromosomal DNA separated by pulsed-field gel electrophoresis, identification of telomere-specific repeats, and by mesosynteny analyses (conserved gene content) with genomes of other Dothideomycetes (Supplementary Table S2).

L. maculans genome annotation

Automated structural annotation of the genome was performed using the URGI genomic annotation platform, including pipelines, databases and interfaces, developed or locally set up for fungi. The EuGene prediction pipeline v. 3.5a (ref. 48), which integrates ab initio (Eugene_IMM, SpliceMachine and Fgenesh 2.6 (www.softberry.com)) and similarity methods (BLASTn, GenomeThreader, BLASTx), was used to predict gene models. The functional annotation pipeline was run using InterProScan⁴⁹. Genome assembly and annotations are available at INRA (http://urgi.versailles.inra.fr/index.php/urgi/Species/Leptosphaeria).

Genome assemblies together with predicted gene models and annotations were deposited at DNA European Molecular Biology Laboratory/GenBank under the accession numbers FP929064 to FP929139 (SC assembly and annotations). ESTs were submitted to dbEST under accession nos. FQ032836 to FQ073829.

Full description and associated references for sequencing, assembly and gene annotation are provided as Supplementary Methods.

Annotation and analysis of repeated elements

TEs¹⁶ were identified and annotated using the 'REPET' pipeline (http://urgi.versailles.inra.fr/index.php/urgi/Tools/REPET), optimized to better annotate nested and fragmented TEs. Repeats were searched with BLASTER for an all-by-all BLASTn genome comparison, clustered with GROUPER, RECON and PILER, and consensuses built with the MAP multiple sequence alignment program. Consensuses were classified with BLASTER matches, using tBLASTx and BLASTx against the Repbase Update databank⁵⁰ and by identification of structural features such as long terminal repeats, terminal inverted repeats¹⁶ and so on. Additional steps of clustering and manual curation of data were performed, resulting in a series of consensuses used as an input for the REPET annotation pipeline part, comprising the TE detection software BLASTER, RepeatMasker and Censor, and the satellite detection softwares RepeatMasker, TRF and Mreps.

Analysis of the dynamics of genome invasion by TEs was first based on phylogenetic analysis of each family of repeats, retracing the evolutionary history, regardless of truncation, insertion in other TEs and deletion events⁵¹. After elimination of all RIP targets, the tree topology was used to retrace the dynamics and demography of TE invasion in the genome. Terminal fork branch lengths from the trees were used to calculate the age of the last transposition events of the copies in the genome. The divergence values were converted in estimated divergence time using a substitution rate of 1.05×10⁻⁹ nucleotide per site per year as applied to fungi^52,53.

Dynamics of TE aggregation over time was also analysed by a visual analysis of nesting relationships between TEs. Following the long join annotation, mosaics of TEs were visualised using Artemis v. 12.0 (http://www.sanger.ac.uk/Software/Artemis/) in SC0-22 and a data matrix recording the frequency with which a given TE family was inserted into another one (invader) and the frequency with which one given TE was recipient of an insertion from one or multiple other TEs (invaded TE) was generated (Supplementary Table S8). The statistical identification and significance of the favoured invasion of other TE families as compared with random association was evaluated with a χ²-test for given probabilities with simulated P-values, based on 20,000 replicates, as implemented in R.

RIP and DeRIP analyses

Automated analysis of RIP in L. maculans genomic DNA repeats was performed using RIPCAL (http://www.sourceforge.net/projects/ripcal), a software tool that performs both RIP index and alignment-based analyses¹⁹. In addition, RIP indices such as TpA/ApT and (CpA+TpG)/(ApC+GpT) were used to evaluate the effect of RIP on genes or genome regions for which multiple alignments could not be generated. DeRIP analyses, which predict putative ancient pre-RIP sequences, were performed using an updated version of RIPCAL, including the Perl script 'deripcal' and ripcal_summarise.

Analysis of AT-blocks

AT- and GC-blocks were manually discriminated from each scaffold using Artemis (http://www.sanger.ac.uk/Software/Artemis/), and a Python script was used to extract sequences and features of AT-blocks. TE content of AT- and GC-blocks was analysed using the REPET pipeline. Size distribution of AT-blocks, occurrence of AT-blocks on chromosomes and relationship between AT-blocks, TE content and chromosome length were calculated.

To evaluate meiotic recombination differences between AT- and GC-blocks, micro- and minisatellites located either in GC-blocks or located on both sides of a single AT-block were mapped in a reference cross, and the number of CO between two consecutive markers was calculated. The recombination frequency between two successive markers was calculated, plotted against the physical distance between the two markers and subjected to analysis of variance and a non-parametric test (Mann–Whitney test) using XLStat, to compare recombination frequencies between and within GC-blocks.

Intergenic distances were compared between AT- and GC-blocks in L. maculans, and also compared with those of the closely related Dothideomycete, P. nodorum (Supplementary Table S5).

GO annotations were compared between genes occurring in AT- and GC-blocks using the blast2GO program.

Identification and features of SSPs

Non-repeated regions within AT-blocks were identified following masking of TEs with RepeatMasker. The EMBOSS:GETORF program was used on these genomic regions to refine the identification of genes encoding SSPs with a size limit set at 600 amino acids (lower limit: 60 amino acids). A dedicated script combined the outputs of GETORF, FgeneSH and EuGene and a pipeline written in Python screened the predicted proteins according to their size and the presence of signal peptide and transmembrane domains (SignalP 3.0, TargetP and TMHMM). Base composition of the genes encoding SSPs (percent of each base in the sequence, GC content and GC3 content) and amino-acid count of the SSPs (as % of each amino acid in the protein) were calculated by custom Python scripts. Statistical bias in amino acid occurrence was evaluated by an F-test to determine if the variances were equal in both sets, followed by Student's t-test (95% confidence level) to compare the mean use of each amino acid in each set of predicted proteins. Biases in codon usage were evaluated using EMBOSS:CHIPS. A χ²-test for given probabilities with simulated values (20,000 replicates) as implemented in R was performed to test random association of SSP-encoding genes in AT-blocks with specific TEs. Motifs similar to the RxLR motif necessary for oomycete effectors to be translocated within plant cells were sought in predicted SSPs, using a Python script aiming at identification of motifs (〈[RKH] X [LMIFYW] X〉 or 〈[RKH] [LMIFYW] X [RKH]〉).

Analysis of expression patterns of SSP-encoding genes were compared between in vitro (mycelium grown in axenic medium) and in planta (3, 7 and 14 days after inoculation of oilseed rape cotyledons), either using the L. maculans whole-genome expression array (manufactured by NimbleGen Systems) or by quantitative reverse transcription-PCR on a selected subset of SSP-encoding genes.

Additional information

Acccession codes: Genome assemblies together with predicted gene models and annotations have been deposited in the DNA European Molecular Biology Laboratory/GenBank nucleotide core database under the accession numbers FP929064 to FP929139 (SC assembly and annotations). ESTs were submitted to dbEST under accession numbers FQ032836 to FQ073829. Oligo array data has been deposited in the gene Expression Omnibus under accession code GSE27152.

How to cite this article: Rouxel, T. et al. Effector diversification within compartments of the Leptosphaeria maculans genome affected by repeat-induced point mutations. Nat. Commun. 2:202 doi: 10.1038/ncomms1189 (2011).

Accession codes

Accessions

Gene Expression Omnibus

GSE27152

NCBI Reference Sequence

References

Skamniotia, P. & Gurr, S. J. Against the grain: safeguarding rice from rice blast disease. Trends Biotechnol. 27, 141–150 (2009).
Article Google Scholar
Oliver, R. P. & Solomon, P. S. Recent fungal diseases of crop plants: is lateral gene transfer a common theme? Mol. Plant-Microbe Interact. 21, 287–293 (2008).
Article CAS PubMed Google Scholar
Rouxel, T. & Balesdent, M. H. The stem canker (blackleg) fungus, Leptosphaeria maculans, enters the genomic era. Mol. Plant Pathol. 6, 225–241 (2005).
Article CAS PubMed Google Scholar
Soanes, D. N. et al. Comparative genome analysis of filamentous fungi reveals gene family expansions associated with fungal pathogenesis. Plos One 3, 1–15 (2008).
Article Google Scholar
Stergiopoulos, I. & de Wit, P. J. G. M. Fungal effector proteins. Annu. Rev. Phytopathol. 47, 233–263 (2009).
Article CAS PubMed Google Scholar
Rouxel, T. & Balesdent, M. H. Avirulence genes. in: Encyclopedia of Life Sciences (ELS) (John Wiley & Sons, 2010) (doi: 10.1002/9780470015902.a00212672010).
Alfano, J. R. Roadmap for future research on plant pathogen effectors. Mol. Plant Pathol. 10, 805–813 (2009).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. Genome sequence and analysis of the Irish potato famine pathogen Phytophthora infestans. Nature 461, 393–398 (2009).
Article ADS CAS PubMed Google Scholar
Jiang, R. H. Y., Tripathy, S., Govers, F. & Tyler, B. M. RXLR effector reservoir in two Phytophthora species is dominated by a single rapidly evolving super-family with more than 700 members. Proc. Natl Acad. Sci. USA 105, 4874–4879 (2008).
Article ADS CAS PubMed Google Scholar
Tyler, B. M. et al. Phytophthora genome sequences uncover evolutionary origins and mechanisms of pathogenesis. Science 313, 1261–1266 (2006).
Article ADS CAS PubMed Google Scholar
Kämper, J. et al. Insights from the genome of the biotrophic fungal plant pathogen Ustilago maydis. Nature 444, 97–101 (2006).
Article ADS PubMed Google Scholar
Elliott, C. E., Gardiner, D. M., Thomas, G., Cozijnsen, A. J., van de Wouw, A. & Howlett, B. J. Production of the toxin sirodesmin PL by Leptosphaeria maculans during infection of Brassica napus. Mol. Plant Pathol. 8, 791–802 (2007).
Article CAS PubMed Google Scholar
Fudal, I. et al. Heterochromatin-like regions as ecological niches for avirulence genes in the Leptosphaeria maculans genome: map-based cloning of AvrLm6. Mol. Plant-Microbe Interact. 20, 459–470 (2007).
Article CAS PubMed Google Scholar
Huang, Y. J., Li, Z. Q., Evans, N., Rouxel, T., Fitt, B. D. L. & Balesdent, M. H. Fitness cost associated with loss of the AvrLm4 function in Leptosphaeria maculans (Phoma stem canker of oilseed rape). Eur. J. Plant Pathol. 114, 77–89 (2006).
Article Google Scholar
Parlange, F. et al. Leptosphaeria maculans avirulence gene AvrLm4-7 confers a dual recognition specificity by Rlm4 and Rlm7 resistance genes of oilseed rape, and circumvents Rlm4-mediated recognition through a single amino acid change. Mol. Microbiol. 71, 851–863 (2009).
Article CAS PubMed Google Scholar
Wicker, T. et al. A unified classification system for eukaryotic transposable elements. Nat. Rev. Genet. 8, 973–982 (2007).
Article CAS PubMed Google Scholar
Idnurm, A. & Howlett, B. J. Analysis of loss of pathogenicity mutants reveals that repeat-induced point mutations can occur in the Dothideomycete Leptosphaeria maculans. Fungal Genet. Biol. 39, 31–37 (2003).
Article CAS PubMed Google Scholar
Espagne, E. et al. The genome sequence of the model ascomycete fungus Podospora anserina. Genome Biol. 9, R77 (2008).
Article PubMed PubMed Central Google Scholar
Hane, J. K. & Oliver, R. P. RIPCAL: a tool for alignment-based analysis of repeat-induced point mutations in fungal genomic sequences. BMC Bioinformatics 9, 478 (2008).
Article PubMed PubMed Central Google Scholar
Feschotte, C., Keswani, U., Ranganathan, N., Guibotsy, M. L. & Levine, D. Exploring repetitive DNA landscapes using REPCLASS, a tool that automates the classification of transposable elements in Eukaryotic genomes. Genome Biol. Evol. 1, 205–220 (2009).
Article PubMed PubMed Central Google Scholar
Leclair, S., Ansan-Melayah, D., Rouxel, T. & Balesdent, M. H. Meiotic behaviour of the minichromosome in the phytopathogenic ascomycete Leptosphaeria maculans. Curr. Genet. 30, 541–548 (1996).
Article CAS PubMed Google Scholar
Gladyshev, E. A. & Arkhipova, I. R. Telomere-associated endonuclease-deficient Penelope-like retroelements in diverse eukaryotes. Proc. Natl Acad. Sci USA 104, 9352–9357 (2007).
Article ADS CAS PubMed Google Scholar
Howlett, B. J., Cozijnsen, A. J. & Rolls, B. D. Organisation of ribosomal DNA in the ascomycete Leptosphaeria maculans. Microbiol. Res. 152, 1–7 (1997).
Article Google Scholar
Selker, E. U. Premeiotic instability of repeated sequences in Neurospora crassa. Annu. Rev. Genet. 24, 579–613 (1990).
Article CAS PubMed Google Scholar
Stukenbrock, E. H. & McDonald, B. A. Population genetics of fungal and oomycete effectors involved in gene-for-gene interactions. Mol. Plant-Microbe Interact. 22, 371–380 (2009).
Article CAS PubMed Google Scholar
Kale, S. D. et al. External lipid PI-3-P mediates entry of eukaryotic pathogen effectors into plant and animal host cells. Cell 142, 284–295 (2010).
Article CAS PubMed Google Scholar
Eyre-Walker, A. & Hurst, L. D. The evolution of isochores. Nat. Rev. Genet. 2, 549 (2001).
Article CAS PubMed Google Scholar
Duret, L. & Galtier, N. Biased gene conversion and the evolution of mammalian genomic landscapes. Annu. Rev. Genomics Hum. Genet. 10, 285–311 (2009).
Article CAS PubMed Google Scholar
Ikeda, K. et al. Repeat-induced point mutation (RIP) in Magnaporthe grisea: implications for its sexual cycle in the natural field context. Mol. Microbiol. 45, 1355–1364 (2002).
Article CAS PubMed Google Scholar
Farman, M. L. Telomeres in the rice blast fungus Magnaporthe oryzae: the world of the end as we know it. FEMS Microbiol. Lett. 273, 125–132 (2007).
Article CAS PubMed Google Scholar
Pain, A. et al. The genome of the simian and human malaria parasite Plasmodium knowlesi. Nature 455, 799–803 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Ma, L. J. et al. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium oxysporum. Nature 464, 367–373 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Coleman, J. J. et al. The genome of Nectria haematococca: contribution of supernumerary chromosomes to gene expansion. PloS Genet. 5, e1000618 (2009).
Article PubMed PubMed Central Google Scholar
Sacristan, S. et al. Coevolution between a family of parasite virulence effectors and a class of LINE-1 retrotransposons. PloS One 4, e7463 (2009).
Article ADS PubMed PubMed Central Google Scholar
Friesen, T. L. et al. Emergence of a new disease as a result of interspecific virulence gene transfer. Nat. Genet. 38, 953–956 (2006).
Article CAS PubMed Google Scholar
Marcet-Houben, M. & Gabaldón, T. Acquisition of prokaryotic genes by fungal genomes. Trends Genet. 26, 5–8 (2010).
Article CAS PubMed Google Scholar
Khaldi, N. & Wolfe, K. H. Elusive origins of the extra genes in Aspergillus oryzae. Plos One 3, e3036 (2008).
Article ADS PubMed PubMed Central Google Scholar
Irelan, J. T., Hagemann, A. T. & Selker, E. U. High frequency repeat-induced point mutation (RIP) is not associated with efficient recombination in Neurospora. Genetics 138, 1093–1103 (1994).
CAS PubMed PubMed Central Google Scholar
Fudal, I. et al. Repeat-induced point mutation (RIP) as an alternative mechanism of evolution towards virulence in Leptosphaeria maculans. Mol. Plant-Microbe Interact. 22, 932–941 (2009).
Article CAS PubMed Google Scholar
Stergiopoulos, I., De Kock, M. J. D., Lindhout, P. & de Wit, P.J.G.M. Allelic variation in the effector genes of the tomato pathogen Cladosporium fulvum reveals different modes of adaptive evolution. Mol. Plant-Microbe Interact. 20, 1271–1283 (2007).
Article CAS PubMed Google Scholar
Gout, L. et al. Genome structure impacts molecular evolution at the AvrLm1 avirulence locus of the plant pathogen Leptosphaeria maculans. Environ. Microbiol. 9, 2978–2992 (2007).
Article CAS PubMed Google Scholar
Schoch, C. L. et al. A class-wide phylogenetic assessment of Dothideomycetes. Stud. Mycol. 64, 1–15S10 (2009).
Article CAS PubMed PubMed Central Google Scholar
Schoch, C. L. et al. The Ascomycota Tree of Life: a phylum-wide phylogeny clarifies the origin and evolution of fundamental reproductive and ecological traits. Syst. Biol. 58, 224–239 (2009).
Article CAS PubMed Google Scholar
Stamatakis, A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690 (2006).
Article CAS PubMed Google Scholar
Sanderson, M. J. r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics 19, 301–302 (2003).
Article CAS PubMed Google Scholar
Balesdent, M. H., Attard, A., Ansan-Melayah, D., Delourme, R., Renard, M. & Rouxel, T. Genetic control and host range of avirulence toward Brassica napus cultivars Quinta and Jet Neuf in Leptosphaeria maculans. Phytopathology 91, 70–76 (2001).
Article CAS PubMed Google Scholar
Jaffe, D. B. et al. Whole-genome sequence assembly for mammalian genomes: Arachne 2. Genome Res. 13, 91–96 (2003).
Article CAS PubMed PubMed Central Google Scholar
Foissac, S. et al. Genome annotation in plants and fungi: EuGene as a model platform. Curr Bioinformatics 3, 87–97 (2008).
Article CAS Google Scholar
Quevillon, E. et al. InterProScan: protein domains identifier. Nucleic Acids Res. 33 (Suppl. 2), W116–W120 (2005).
Article CAS PubMed PubMed Central Google Scholar
Jurka, J., Kapitonov, V. V., Pavlicek, A., Klonowski, P., Kohany, O. & Walichiewicz, J. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110, 462–467 (2005).
Article CAS PubMed Google Scholar
Fiston-Lavier, A. S. Etude de la dynamique des répétitions dans les génomes eucaryotes: de leur formation à leur élimination. PhD Thesis, University Pierre et Marie Curie, Paris, France (2008).
Berbee, M. L. & Taylor, J. W. Dating the molecular clock in fungi—how close are we? Fungal Biol. Rev. 24, 1–16 (2010).
Article Google Scholar
Kasuga, T., White, T. J. & Taylor, J. W. Estimation of nucleotide substitution rates in Eurotiomicete fungi. Mol. Biol. Evol. 19, 2318–2324 (2002).
Article CAS PubMed Google Scholar
Rouxel, T., Balesdent, M.H., Amselem, J. & Howlett, B. J. GnpGenome: a Genome Browser for Leptosphaeria maculans structural annotation (2010) http://urgi.versailles.inra.fr/cgi-bin/gbrowse/lmaculans_pub/.
Hane, J. K. et al. Dothideomycete plant interactions illuminated by genome sequencing and EST analysis of the wheat pathogen Stagonospora nodorum. Plant Cell 19, 3347–3368 (2007).
Article CAS PubMed PubMed Central Google Scholar
Cuiffetti, L. M. Pyrenophora tritici-repentis database (2008) http://www.broadinstitute.org/annotation/genome/pyrenophora_tritici_repentis/Home.html.
Turgeon, B. G. Cochliobolus heterostrophus C5 whole genome project (2008) http://genome.jgi-psf.org/CocheC5_1/CocheC5_1.home.html.
Lawrence, C. B. Alternaria brassicicola whole genome project (2006) http://genome.jgi-psf.org/Altbr1/Altbr1.home.html.
Goodwin, S. B. & Kema, G. H. J. Mycosphaerella graminicola whole genome project (2008)http://genome.jgi-psf.org/Mycgr3/Mycgr3.home.html.

Download references

Acknowledgements

We acknowledge Marc-Henri Lebrun (INRA-Bioger) and Francis Martin (INRA, Interactions arbres/micro-organismes, Champenoux, France) for support and advice. The genome sequencing of L. maculans was funded by the Genoscope, Institut de Génomique, CEA, France. The establishment of databases and interfaces was funded by Agence Nationale de la Recherche (GnpAnnot project; ANR-07-GPLA-051G). Whole-genome effector analysis was funded by ANR (FungEffector project; ANR-06-BLAN-0399). Recombination analysis and P.B. were funded by ANR (AvirLep project; ANR-07-GPLA-015). B.J.H and R.P.O. thank the Australian Grains Research and Development Corporation for funding. C.L.S was supported in part by the Intramural Research Program of the NIH, National Library of Medicine. J.W.S acknowledges support from the U.S. National Science Foundation, grant number DEB-0717476. B.M.T. was supported in part by the U.S. National Science Foundation, grant number IOS-0924861. Thanks are due to INRA-SPE department, the 'Leptosphaeria maculans' scientific and applied community, and the 'Dothideomycete' community for support of the L. maculans genome initiative.

Author information

Thierry Rouxel and Jonathan Grandaubert: These authors contributed equally to this work.

Authors and Affiliations

INRA-Bioger, UR1290, Avenue Lucien Brétignières, BP 01, Thiverval-Grignon, F-78850, France
Thierry Rouxel, Jonathan Grandaubert, Pascal Bally, Salim Bourras, Alexandre Degrave, Azita Dilmaghani, Isabelle Fudal, Lilian Gout, Nicolas Glaser, Juliette Linglin, Michel Meyer, Bénédicte Ollivier, Adeline Simon & Marie-Hélène Balesdent
Murdoch University, South Street, Murdoch, 6150, Western Australia, Australia
James K. Hane
INRA-URGI, Route de Saint Cyr, Versailles Cedex, F-78026, France
Claire Hoede, Victoria Dominguez, Nicolas Lapalu, Joëlle Amselem & Hadi Quesneville
School of Botany, University of Melbourne, 3010, Victoria, Australia
Angela P. van de Wouw, Anton J. Cozijnsen, Kim May & Barbara J. Howlett
GENOSCOPE, Centre National de Séquençage, Institut de Génomique CEA/DSV, 2, rue Gaston Crémieux, CP 5706, Evry Cedex, F-91057, France
Arnaud Couloux, Véronique Anthouard, Julie Poulain, Jean Weissenbach & Patrick Wincker
Department of Botany and Plant Pathology, Cordley Hall 2082, Oregon State University, Corvallis, 97331-2902, Oregon, USA
Lynda M. Ciuffetti & Joseph W. Spatafora
Laboratoire Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Lyon 1, 43 Bld du 11 Novembre 1918, Villeurbanne cedex F-69622, France.,
Laurent Duret
USDA-ARS, Crop Production and Pest Control Research Unit, Purdue University, 915 West State Street, West Lafayette, 47907-2054, Indiana, USA
Stephen B. Goodwin
Department of Biointeractions and Plant Health, Wageningen UR, Plant Research International, P.O. Box 69, Wageningen 6700 AB, The Netherlands.,
Gert H. J. Kema
Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State University, Blacksburg, 24061-0477, Virginia, USA
Christopher B. Lawrence & Brett M. Tyler
NIH/NLM/NCBI, 45 Center Drive, MSC 6510, Bethesda, 20892-6510, Maryland, USA
Conrad L. Schoch
Institute of Plant Genetics, Polish Academy of Sciences, Strzeszynska 34, Poznan PL-60479, Poland.,
Anna Stachowiak
Deparment of Plant Pathology & Plant-Microbe Biology, Cornell University, Ithaca, 14853, New York, USA
B. Gillian Turgeon
INRA, UMR1202 BIOGECO, 69 Route d'Arcachon, Cestas, F-33612, France
Delphine Vincent
Australian Centre for Necrotrophic Fungal Pathogens, Curtin University, Perth, Western Australia 6845, Australia.,
Richard P. Oliver

Authors

Thierry Rouxel
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Grandaubert
View author publications
You can also search for this author in PubMed Google Scholar
James K. Hane
View author publications
You can also search for this author in PubMed Google Scholar
Claire Hoede
View author publications
You can also search for this author in PubMed Google Scholar
Angela P. van de Wouw
View author publications
You can also search for this author in PubMed Google Scholar
Arnaud Couloux
View author publications
You can also search for this author in PubMed Google Scholar
Victoria Dominguez
View author publications
You can also search for this author in PubMed Google Scholar
Véronique Anthouard
View author publications
You can also search for this author in PubMed Google Scholar
Pascal Bally
View author publications
You can also search for this author in PubMed Google Scholar
Salim Bourras
View author publications
You can also search for this author in PubMed Google Scholar
Anton J. Cozijnsen
View author publications
You can also search for this author in PubMed Google Scholar
Lynda M. Ciuffetti
View author publications
You can also search for this author in PubMed Google Scholar
Alexandre Degrave
View author publications
You can also search for this author in PubMed Google Scholar
Azita Dilmaghani
View author publications
You can also search for this author in PubMed Google Scholar
Laurent Duret
View author publications
You can also search for this author in PubMed Google Scholar
Isabelle Fudal
View author publications
You can also search for this author in PubMed Google Scholar
Stephen B. Goodwin
View author publications
You can also search for this author in PubMed Google Scholar
Lilian Gout
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Glaser
View author publications
You can also search for this author in PubMed Google Scholar
Juliette Linglin
View author publications
You can also search for this author in PubMed Google Scholar
Gert H. J. Kema
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Lapalu
View author publications
You can also search for this author in PubMed Google Scholar
Christopher B. Lawrence
View author publications
You can also search for this author in PubMed Google Scholar
Kim May
View author publications
You can also search for this author in PubMed Google Scholar
Michel Meyer
View author publications
You can also search for this author in PubMed Google Scholar
Bénédicte Ollivier
View author publications
You can also search for this author in PubMed Google Scholar
Julie Poulain
View author publications
You can also search for this author in PubMed Google Scholar
Conrad L. Schoch
View author publications
You can also search for this author in PubMed Google Scholar
Adeline Simon
View author publications
You can also search for this author in PubMed Google Scholar
Joseph W. Spatafora
View author publications
You can also search for this author in PubMed Google Scholar
Anna Stachowiak
View author publications
You can also search for this author in PubMed Google Scholar
B. Gillian Turgeon
View author publications
You can also search for this author in PubMed Google Scholar
Brett M. Tyler
View author publications
You can also search for this author in PubMed Google Scholar
Delphine Vincent
View author publications
You can also search for this author in PubMed Google Scholar
Jean Weissenbach
View author publications
You can also search for this author in PubMed Google Scholar
Joëlle Amselem
View author publications
You can also search for this author in PubMed Google Scholar
Hadi Quesneville
View author publications
You can also search for this author in PubMed Google Scholar
Richard P. Oliver
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Wincker
View author publications
You can also search for this author in PubMed Google Scholar
Marie-Hélène Balesdent
View author publications
You can also search for this author in PubMed Google Scholar
Barbara J. Howlett
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.K.H., C.H., A.P.vdW, A.C. and V.D. contributed equally to this work as second authors. J.A., H.Q., R.P.O., P.W., M.H.B. and B.J.H. coordinated genome sequencing, annotation and data analyses, and made equivalent contributions as senior authors. Individual contributions were as follows: T.R., M.H.B. and B.J.H. initiated the sequencing project; M.H.B was responsible for DNA production; P.W. and J.W. coordinated the sequencing; J.P., A.C. and V.A. performed the sequencing and assembled the genome; J.A. was responsible for annotation pipelines, databases and interfaces; V.D., C.H., and J.A. did the ab initio annotation of gene models; M.M., A.J.C. and B.J.H. provided EST/cDNA information; and V.D., C.H. and J.A. did the cDNA clustering, defined the training set for ab initio gene finder steps and inserted annotation data in the database. Genome statistics were performed by J.A., H.Q., and J.G. J.K.H., R.P.O. and J.G. carried out the genome synteny analyses; M.H.B., P.B., L.G., A.J.C., A.P.vdW., A.St., and J.G. identified and designed mini- and microsatellite markers; M.H.B and A.P.vdW. built the genetic maps; J.K.H. and R.P.O. performed mesosynteny and RIPCAL analyses; A.J.C. and K.M. hybridized electrokaryotypes and annotated NRPSs and PKSs; H.Q., V.D., L.G. and T.R. analysed TEs; V.D. and J.G. estimated time for TE transposition events; J.G., T.R. and M.H.B. analysed TE nesting; N.L. performed automated functional analysis; N.L. and S.B. performed GO analyses; B.M.T. and J.G. carried out RXLR analysis of effector candidates; I.F., A.Si., J.G., B.O. and J.L. designed microarrays and analysed microarray data; A.De., B.O., N.G. and I.F. performed expression analysis of effectors by reverse transcription-PCR and quantitative reverse transcription-PCR; A.Di. analysed polymorphism of effectors in field populations. D.V. carried out proteomic and secretomic analyses. L.M.C., S.B.G., C.B.L., G.H.J.K. and B.G.T. contributed to comparative genomics approaches. L.D. analysed isochores-like blocks and contributed to comparative analysis with those of mammals. C.L.S. and J.W.S. performed phylogenetic analyses and estimated divergence time. T.R. organized co-ordination between groups. T.R. wrote and edited the paper with major input from B.J.H. and R.P.O. Final editing of the text, Tables and Figures was done by S.B., J.G., M.H.B., B.J.H. and T.R.

Corresponding author

Correspondence to Thierry Rouxel.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Figures, Supplementary Tables, Supplementary Methods and Supplementary References.

Supplementary Figures S1-S15, Supplementary Tables S1-S15, Supplementary Methods and Supplementary References. (PDF 1458 kb)

Supplementary Data 1

De novo identification of repeated elements within the genome of Leptosphaeria maculans using the REPET pipeline. (XLS 62 kb)

Supplementary Data 2

Repeat Induced Point mutation indices of transposable elements in the genome of Leptosphaeria maculans, and improvement of annotation following deRIP (XLS 35 kb)

Supplementary Data 3

Synteny between Leptosphaeria maculans and Phaeosphaeria nodorum in the surroundings of AT-rich regions (XLS 85 kb)

Supplementary Data 4

Characteristics of Small Secreted Proteins-encoding genes identified in the genome of Leptosphaeria maculans (XLS 187 kb)

Supplementary Data 5

List and homologues of predicted genes occurring in AT-rich regions. (XLS 76 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

Reprints and permissions

About this article

Cite this article

Rouxel, T., Grandaubert, J., Hane, J. et al. Effector diversification within compartments of the Leptosphaeria maculans genome affected by Repeat-Induced Point mutations. Nat Commun 2, 202 (2011). https://doi.org/10.1038/ncomms1189

Download citation

Received: 20 April 2010
Accepted: 11 January 2011
Published: 15 February 2011
DOI: https://doi.org/10.1038/ncomms1189

This article is cited by

Hybrid de novo genome assembly and comparative genomics of three different isolates of Gnomoniopsis castaneae
- Silvia Turco
- Angelo Mazzaglia
- Carmen Morales-Rodríguez
Scientific Reports (2023)
Three new pathogenicity genes in Leptosphaeria maculans identified by Agrobacterium-mediated insertional mutagenesis
- Andrew S. Urquhart
- Alexander Idnurm
Australasian Plant Pathology (2023)
Comparative genomics reveals low levels of inter- and intraspecies diversity in the causal agents of dwarf and common bunt of wheat and hint at conspecificity of Tilletia caries and T. laevis
- Somayyeh Sedaghatjoo
- Bagdevi Mishra
- Wolfgang Maier
IMA Fungus (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

General features of the L. maculans genome

The TEs are RIP affected

The compartmentalized genome of L. maculans

The ribosomal DNA repeat is extensively affected by RIP

AT-blocks as niches for effectors

History of genome invasion by TEs

Discussion

Methods

Phylogenetic analysis

Sequencing and assembly

L. maculans genome annotation

Annotation and analysis of repeated elements

RIP and DeRIP analyses

Analysis of AT-blocks

Identification and features of SSPs

Additional information

Accession codes

Accessions

Gene Expression Omnibus

NCBI Reference Sequence

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links