Abstract
Lettuce downy mildew caused by Bremia lactucae is the most important disease of lettuce globally. This oomycete is highly variable and rapidly overcomes resistance genes and fungicides. The use of multiple read types results in a high-quality, near-chromosome-scale, consensus assembly. Flow cytometry plus resequencing of 30 field isolates, 37 sexual offspring, and 19 asexual derivatives from single multinucleate sporangia demonstrates a high incidence of heterokaryosis in B. lactucae. Heterokaryosis has phenotypic consequences on fitness that may include an increased sporulation rate and qualitative differences in virulence. Therefore, selection should be considered as acting on a population of nuclei within coenocytic mycelia. This provides evolutionary flexibility to the pathogen enabling rapid adaptation to different repertoires of host resistance genes and other challenges. The advantages of asexual persistence of heterokaryons may have been one of the drivers of selection that resulted in the loss of uninucleate zoospores in multiple downy mildews.
Similar content being viewed by others
Introduction
Oomycetes are genetically and biochemically distinct from fungi1 but have similar infection strategies and architectures. Oomycetes include successful and diverse plant and animal pathogens with global economic impacts2,3. These include the downy mildews caused by biotrophic members of the Peronosporaceae that are challenging to study due to their obligate reliance on their host. These pathogens are highly variable; plant resistance genes and fungicide treatments are often rapidly overcome4,5,6,7. Various mechanisms have been proposed for rapid generation of genetic diversity including hyper-mutability of genomic regions that encode effectors8, changes in ploidy9,10, and parasexuality11.
Heterokaryosis, the state of having multiple genetically distinct nuclei in a single cell, is an important life history trait in some true fungi12. While transient heterokaryosis has been suggested and detected in oomycetes5,10,13,14,15, the impacts of heterokaryosis remain poorly understood and rarely considered. The life cycles of many oomycetes are not conducive to the propagation of stable heterokaryons because most produce multiple flagellated, mono-nucleic, motile spores from sporangia15,16; heterokaryons are consequently broken every asexual generation15. However, some downy mildew species including Bremia lactucae do not produce zoospores and germinate directly from multinucleate sporangia17,18, transmitting multiple, possibly genetically distinct nuclei in each asexual generation.
Bremia lactucae is an obligate biotroph that causes lettuce downy mildew, the most important disease of lettuce worldwide. Numerous races and population shifts have been documented in Europe, Australia, Brazil, and California19,20,21. Resistance genes are rarely durable in the field and curative fungicides have become ineffective4,5,6,7. Several mechanisms for variation have been documented. B. lactucae is predominantly heterothallic with two mating types and sexual reproduction can generate new virulence phenotypes22. Asexual variation also occurs but is less well understood. Somatic fusion resulting in either polyploids or heterokaryons has been observed5,14, but it remained unclear whether heterokaryosis or polyploidy are significant sources of stable phenotypic variation of B. lactucae. Previously, sexual progeny of B. lactucae have been generated to build genetic maps23,24, study the genetics of (a)virulence and metalaxyl insensitivity4,6,25, and infer the presence of accessory chromosomes26. Only limited genomic studies had been conducted due to the difficulties of studying this biotrophic species27.
This study presents a near-chromosome-scale genome assembly of B. lactucae using multiple sequencing technologies and assembly approaches. This resource, combined with genome size estimates generated by flow cytometry, demonstrates the prevalence of heterokaryosis in multiple B. lactucae isolates and the absence of polyploidy. Heterokaryons are shown to be somatically stable and fitter on non-selective hosts compared with homokaryotic derivatives. Homokaryotic components differ in (a)virulence phenotypes and confer viability on selective hosts. Selection should be considered as acting on a population of nuclei within a coenocytic mycelium to maximize somatic hybrid vigor.
Results
Genome assembly
Bremia lactucae isolate SF5 was initially assembled into 885 scaffolds over 1 Kb with a contig N50 of 30.6 Kb and a scaffold N50 of 283.7 Kb. The haploid genome size of this isolate and 38 others were estimated to be ~152 Mb (+/−3 Mb) by flow cytometry (Fig. 1a and Supplementary Table 1). This 115 Mb assembly contained 91 Mb of sequence plus 24 Mb of gaps. Subsequently, 87.9 Mb (96.5%) of the assembled sequence was placed into 22 scaffolds over 1 Mb using Hi-C; these totaled 112 Mb including gaps. The resultant assembly was highly collinear and comparable with the highly contiguous v3.0 assembly of Phytophthora sojae28, which cross-validates the high quality of both assemblies (Fig. 1b). The heterozygosity of isolate SF5 was 1.17% and ranged from 0.77% to 1.29% for other isolates. These levels ranked high compared with other oomycetes, the majority of which had less than 1% heterozygosity (Fig. 1c). This level of heterozygosity resulted in some alleles not being collapsed into a consensus sequence, necessitating multiple rounds of condensation to achieve a close to haploid consensus assembly.
The discrepancy between the final assembly size (91 Mb without gaps) and genome size measured by flow cytometry (152 Mb) is due to collapsed repeats in the assembly. Single-copy k-mers were present in the predicted proportions in the assembly; most of the homozygous k-mers and approximately half of the heterozygous k-mers were distributed across the two peaks as expected (Fig. 1d). BUSCO29 analysis with the protist database (v9) also revealed 98.3% completeness similar to other well-assembled oomycetes (Supplementary Table 2). Therefore, the assembly contains most of the single-copy portion of the genome. Repeat annotation followed by masking determined that 63 Mb of the 91 Mb assembled sequence was repetitive (Table 1). The majority of the annotated repeats were recently diverged long terminal repeat retrotransposons (LTR-RTs). Annotation identified 6.3 Mb as Copia (RLC) and 53.3 Mb as Gypsy (RLG) elements (Table 1). The average coverage of sequences annotated as repeats in the assembly was 2.1-fold higher than that of the annotated genes. Therefore, the 63 Mb repeat portion of the assembly is present at least twice in the haploid genome accounting for the 61 Mb difference between the assembly and the genome size determined by flow cytometry.
Divergence of long terminal repeat (LTR) pairs showed that the majority of these repeat elements were recently expanded (Supplementary Fig. 1), when compared with previously published downy mildews (Fig. 2b). B. lactucae had a high density of recently diverged LTRs similar to Phytophthora spp. (Fig. 2a). This suggests that B. lactucae and Phytophthora species surveyed have undergone recent expansions of Copia and Gypsy elements. The larger genome assemblies of both S. graminicola isolates contained the largest number of annotated LTR-RTs of any downy mildew surveyed (Fig. 2b), although LTR pairs were more diverged than in B. lactucae (Fig. 2a). Significantly, when LTR-RTs from each species were used to mask the assemblies, B. lactucae contained the highest proportion of LTR-RTs with 74.2% of its contig sequence masked (Fig. 2c). In most of the other species studied, 26 to 46% of contig sequences were masked, except for the larger assemblies of S. graminicola (two isolates) and P. infestans that had 72.1%, 70.7%, and 62.6% masked, respectively (Fig. 2 c). The high frequency of low-divergence repeat sequences in B. lactucae combined with high heterozygosity (Fig. 1c) may have confounded assembly algorithms and slowed the generation of an accurate assembly, as well as prevented the construction of whole chromosomal molecules. Interestingly, LTR divergence of Plasmopara viticola, which is a closer relative to B. lactucae than any Phytophthora spp. (Fig. 3; see below), does not have the same recent expansion of LTRs; this implies that this expansion of LTR-RTs was not ancestral to these species.
Phylogenomics
Phylogenetic analysis of 18 proteins identified with BUSCO29 that were ubiquitously single-copy in the species surveyed supported polyphyly of downy mildew species within the nine Phytophthora clades analyzed (Fig. 3). B. lactucae clustered with the two Plasmopara spp. This clade was closely associated with Phytophthora clade 1, which includes P. infestans and P. cactorum. B. lactucae did not cluster with Peronospora effusa, Pseudoperonospora cubensis, Sclerospora graminicola, and Hyaloperonospora arabidopsidis. This second downy mildew clade clustered closer to P. agathidicida in Phytophthora clade 5. This is consistent with the biotrophic downy mildews having evolved at least twice from hemi-biotrophic Phytophthora-like ancestors. These results are similar to previous, less extensive studies17,30. Additional genome sequencing of both biotrophic and hemi-biotrophic Peronosporales species will enable further clarification with regards to the origin(s) of the downy mildews.
Gene annotation
Ab initio annotation identified 9781 protein-encoding genes. More than half (5009) lacked introns, 1949 had one intron, 1063 had two introns, and 1760 had three or more introns. A maximum of 20 introns was observed in two genes. The average gene length was 1679 bp and ranged from 180 bp to 20.7 Kb. The 142 largest genes were conserved with other oomycete gene models. In total, orthologs were identified for 84% of the predicted models. The mean exon length was 664 bp, ranging from 4 bp to 18.3 Kb, while the mean intron length was 104 bp, ranging from 10 bp to 9.3 Kb. Strand specific transcriptional support was detected for 74% of the genes. The total gene space was 16.5 Mb, of which 15.2 Mb was exonic and 1.5 Mb was intronic. This is similar to other obligate biotrophic oomycetes, where the gene space ranges from 13.5 to 25.8 Mb. The hemi-biotrophic P. sojae has the largest reported gene space within the Peronosporaceae at 37.7 Mb (Supplementary Table 3).
Motif searches revealed the repertoire of candidate effector proteins that could be important in pathogenesis. Among a total of 159 candidate secreted RxLR effectors, 66 had a canonical RxLR motif and 93 had degenerate [GHQ]xLR or RxL[GKQ] motifs17,27,31; 63 candidates also encoded a [DE][DE][KR] motif32 and/or a WY domain33,34 (Table 2). Expression inferred by presence in the transcriptome assembly was detected for 109 of these candidate RxLR effectors, 35 of which also had an EER motif or WY domain (Table 2). In addition to the 161 RxLR candidates, there were 24 secreted proteins and 23 proteins lacking a secretion signal predicted with one or more WY domains but no detectable RxLR motif. Of these, 19 of the 24 WY proteins lacking an identifiable RxLR motif with a signal peptide and 13 of the 23 without a signal peptide were detected in the transcriptome (Table 2). Interestingly, an EER or EER-like motif was detected in the first 100 residues from 29 of the 45 WY proteins that lacked an RxLR motif, 20 of which were predicted to be secreted. This is consistent with not all effectors requiring an RxLR motif for translocation in to the host cell, similar to previously reported effectors in animal pathogenic oomycetes35. Two putative secreted Crinklers (CRNs)36,37 were annotated, one of which also contained an RxLQ and DDR motif. An additional 74 CRNs lacking a secretion signal were identified, although only six of these were present in the transcriptome assembly (Table 2). Four of these six had the canonical LFLAK motif and the other two had a LYLA motif36,37. Together, these candidate effectors comprise 1.9% of all genes annotated in B. lactucae. Orthologs of all proteins that have previously been described as inducing a host response38,39,40 were detected in the draft assembly (Supplementary Table 4). An additional 173 proteins (1.8% of all annotated genes) had domains ascribed to putative pathogenic functions in studies of other species (Supplementary Table 5). This is lower than the proportion reported for Phytophthora spp. (2.6 to 3.6%) and consistent with observations for other downy mildews where 1.3 to 1.7% of total annotated proteins had putative pathogenicity domains17.
The majority of genes encoding flagella-associated proteins and calcium-associated domains were missing from the B. lactucae assembly. B. lactucae has lost 55 of 78 orthogroups that contain flagellar proteins (Supplementary Fig. 2). One hundred and twelve proteins from P. infestans were present in these orthogroups; 78 of these proteins were absent in B. lactucae. This is similar to assemblies of other non-flagellate downy mildews that had 34 to 48 proteins in these orthogroups (Supplementary Fig. 2). This is consistent with the loss of zoospore production by B. lactucae. There was also a significant loss of calcium-associated domains, which is also observed in the assemblies of other non-flagellate downy mildews17. B. lactucae had no proteins present in 125 of the 177 calcium-associated orthogroups similar to other non-flagellates, which ranged from 118 to 125. These orthogroups contained 53 proteins from B. lactucae compared with 193 proteins in P. infestans. Other non-flagellate species had 52 to 59 proteins assigned to these orthogroups (Supplementary Fig. 2). The parallel loss of zoospore production and proteins with calcium-associated domains in both clades of downy mildews (Fig. 3) is consistent with the involvement of these proteins in zoosporogenesis41. Genes encoding carbohydrate binding, transporter, and pathogenicity associated domains were also under-represented in B. lactucae, as previously reported for other downy mildews in both clades17. This provided further evidence for the convergent loss of genes encoding these domains during adaptation to biotrophy.
The majority of annotated genes had levels of coverage close to the average sequencing depth (Supplementary Fig. 3), indicating that most genes were each assembled into a single consensus sequence. A minority of genes had a normalized read depth equal to half the sequencing coverage, consistent with divergent haplotypes that had assembled as independent sequences. The BUSCO29 genes had the same distribution (Supplementary Fig. 3). However, genes encoding candidate effectors had variable coverage (Supplementary Fig. 3); this could have been due to a disproportionate number of effector haplotypes being assembled independently and/or a high rate of divergence between haplotypes resulting in poor mapping rates.
Genomic signatures of heterokaryosis
Distinct alternative allele frequency profiles were detected in multiple isolates of B. lactucae (Fig. 4 and Supplementary Fig. 4). Such analysis had previously been used to support polyploidy in P. infestans9,42. The profiles of thirteen isolates, including the reference isolate SF5, were clearly unimodal, seven isolates were trimodal, and nine isolates were bimodal (Supplementary Fig. 4). Two other isolates had profiles that were not clearly bimodal or trimodal (Supplementary Fig. 4). The symmetrical unimodal distribution of SF5 was consistent with a diploid genome; the other distributions were not. However, the genome size for all isolates as measured by flow cytometry varied by less than 3%. In the case of polyploidy, the genome size of triploids and tetraploids would be 150% and 200% that of the diploid, respectively; therefore, there was no evidence for polyploidy in B. lactucae (Fig. 1a and Supplementary Table 1).
Further evidence against polyploidy was provided by analysis of sexual progeny from the segregating F1 population of SF5 (unimodal; Fig. 4a) × California isolate C82P24 (trimodal; Fig. 4b)14,23. Four progeny isolates sequenced to over 50× all had unimodal allele frequency plots (Supplementary Fig. 5). Flow cytometry of 16 progeny isolates estimated the genome sizes to be the same as all other isolates (Fig. 1a and Supplementary Table 1). The outcross origin of these progeny was confirmed by the presence of unique combinations of SNPs inherited from each parent. Therefore, these progeny isolates could not have arisen by apomixis or selfing and all sexual progeny from this unimodal × trimodal cross were diploid. The origins of the gametes in this cross were determined for 38 progeny isolates that had been sequenced to sufficient depth. Pairwise SNP-based kinship coefficients revealed two distinct half-sib families of 29 and 9 individuals (Fig. 5). Therefore, three rather than two nuclei contributed gametes in this cross. The trimodal alternative allele frequency plot and flow cytometry of C82P24 are consistent with this isolate being heterokaryotic with two diploid nuclei.
To confirm that C82P24 was heterokaryotic rather than a mixture of two isolates, 20 asexual derivatives were generated from single sporangia. Kinship of these 20 isolates was as high between one another as with the original isolate, indicating they were identical. Furthermore, all asexual derivatives of C82P24 displayed similar relatedness to all sexual progeny (Fig. 5). Sequencing of 11 asexual derivatives to >50x coverage demonstrated that they retained the trimodal profile, indicating that two distinct nuclei were present in each derivative (Supplementary Fig. 6) with a diploid size of 303 +/− 3 Mb as measured by flow cytometry (Supplementary Table 6). Therefore, C82P24 was heterokaryotic rather than a mixture of isolates.
To demonstrate heterokaryosis in another isolate, 10 asexual derivatives were also generated from isolate C98O622b that also displayed a trimodal alternative allele frequency (Supplementary Fig. 4). In this case, kinship analysis revealed three distinct groups of derivatives (Fig. 6a). Derivatives A to F had a high kinship and an identical virulence phenotype to C98O622b (Fig. 6a, b). Derivatives G to I, formed a second group and derivative J a third; these four derivatives had lower kinship to C98O622b than derivatives A to F. The lowest kinship was between derivatives G to I and J (Fig. 6a). Virulence phenotypes varied between but not within groups (Fig. 6b); C98O622b and derivatives A to F were virulent on both Dm4 and Dm15; derivatives G to I were avirulent on Dm4 and virulent on Dm15, while derivative J was conversely virulent on Dm4 and avirulent on Dm15 (Fig. 6b). The single-spore derivatives of C98O622b were sequenced to >50× coverage to determine their nuclear composition. Derivatives A to F were trimodal (Fig. 6c, Supplementary Fig. 7); all other derivatives were unimodal, which is consistent with the separation of the heterokaryon into its diploid components (Fig. 6c). This conclusion was supported by combining read sets in silico. Combining reads of derivatives G-I did not increase their relatedness to C98O622b, while combining reads of derivatives G, H, or I with those of J resulted in a high kinship to C98O622b (Fig. 6a) and trimodal profile similar to C98O622b (Supplementary Fig. 8). Therefore, C98O622b was also heterokaryotic; however, unlike C82P24, C98O622b was unstable and could be separated into constituent homokaryotic derivatives by sub-culturing from single sporangia.
The trimodal distributions of the derivatives A to F were not identical and could clearly be split into two configurations, suggesting different nuclear compositions of these derivatives. Derivatives B and F were similar to C98O622b, displaying peaks at approximately 0.25, 0.5, and 0.75 (Fig. 6c). The other four heterokaryotic derivatives A, C, D, and E had peaks at approximately 0.33, 0.5, and 0.67 (Fig. 6c). The nuclear composition of these heterokaryotic derivatives was investigated by subsampling SNPs identified as unique to each homokaryotic derivative (G to J). This revealed that in the trimodal distribution of derivatives B and F (peaks at 0.25, 0.5, 0.75) SNPs unique to either constituent nucleus were in peaks at 0.25 and 0.75 (Fig. 6d, Supplementary Fig. 9), consistent with a balanced 1:1 ratio of constituent nuclei (1:3 read ratio of SNPs). For derivatives A, C, D, and E (peaks at 0.33, 0.5, 0.67), SNPs unique to constituent nuclei resembling derivatives G to I were consistently in peaks at approximately 0.17 and 0.83, while SNPs identified as unique to derivative J were consistently in peaks at 0.33 and 0.67 (Fig. 6d, Supplementary Fig. 9), consistent with a 2:1 unbalanced nuclear ratio in favor of nuclei similar to derivative J. This was further supported by combining reads in silico. Combining reads from derivatives G, H, or I with J in equal proportions resulted in trimodal plots similar to those of derivatives B and F (peaks at 0.25, 0.5, and 0.75; Supplementary Fig. 8). Combining reads from derivatives G, H, or I with J in a ratio of 1:2 resulted in frequency profiles like those of derivatives A, C, D, and E (peaks at 0.33, 0.5, and 0.67; Supplementary Fig. 8). This supports an unequal nuclear composition in four of the asexual derivatives of C98O622b.
Heterokaryosis impacts the rate of sporulation
To investigate the potential benefits of heterokaryosis, the fitness of asexual derivatives of C98O622b was assessed on a universally susceptible cultivar and two differential host lines. Derivatives A and B were selected to represent unbalanced and balanced heterokaryons, respectively, while derivatives I and J represented the two homokaryons. When grown in four replicates on the universally susceptible lettuce cultivar (cv.) Green Towers, the heterokaryotic derivatives grew faster than either homokaryotic derivative (Fig. 7a). The balanced heterokaryotic derivative B was significantly fitter than the homokaryotic derivative I. There was no significant difference within heterokaryotic derivatives or within homokaryotic derivatives when grown on cv. Green Towers. This is consistent with heterokaryotic isolates being fitter when unchallenged by host resistance genes. However, when a product of either nucleus of the heterokaryon was detected by a resistance gene (i.e., Dm4 in R4T57D or Dm15 in NumDM15) that differentiates the homokaryotic derivatives (Fig. 6b), the heterokaryotic derivatives were less vigorous than the virulent homokaryotic derivative (Fig. 7b). This suggested that it may be possible to break a heterokaryon by repeated subculture on a selective cultivar, as reported previously5. When the heterokaryotic derivatives were inoculated onto an F1 hybrid of the selective lines expressing both Dm4 and Dm15, neither the heterokaryotic nor homokaryotic derivatives were able to grow. Therefore, combining multiple resistance genes against the entire B. lactucae population into a single cultivar remains a potentially effective strategy to provide more durable resistance to the pathogen.
Discussion
Heterokaryosis in B. lactucae has phenotypic consequences, as well as implications for interpretation of tests for virulence phenotype. Homokaryotic derivatives G, H, and I of heterokaryotic isolate C98O622b are race Bl:5-US and derivative J has a novel virulence phenotype. The original field isolate C98O622b and heterokaryotic derivatives are race Bl:6-US, indicating that two phenotypically distinct isolates may combine to create a new phenotype when characterized on individual resistance genes, although such somatic hybrids may not be able to overcome combinations of these resistance genes in a single cultivar. Therefore, (a)virulence phenotypes on monogenic differentials are not necessarily a good predictor of virulence when heterokaryons are tested. The instability of heterokaryosis may enable a successful infection and proliferation of individual nuclear components. Furthermore, there is no a priori reason why coenocytic mycelia are limited to having only two nuclear types. Multiple rounds of somatic fusion are possible if favored by selection. Allele frequency plots are consistent with some isolates having more than two distinct nuclei (e.g., isolate C04O1017; Fig. 4c). Heterokaryotic isolates of B. lactucae were detected globally, over a 33-year period, from both Europe and the Western United States of America (Supplementary Table 7). Of the 31 isolates sequenced to over 50x coverage, 18 were determined to have allele frequencies deviating from that of a homokaryon. Neither the geographic location of sampling nor year of sampling correlated with heterokaryosis. Therefore, heterokaryotic isolates should be considered as exhibiting somatic hybrid vigor and selection for heterosis in B. lactucae as acting on populations of nuclei within a coenocytic mycelium (Fig. 8) rather than on individual isolates.
Heterokaryosis may be a common phenomenon in other oomycetes that has yet to be investigated extensively. Flow cytometry revealed heterogeneous nuclear sizes in mycelia of P. infestans, although stability over multiple asexual generations was not reported13. Somatic fusion may be a route to allopolyploidy; inter-species somatic fusion could result in transient heterokaryosis before nuclear fusion to form a somatic allopolyploid circumventing gametic incompatibility. Somatic sporangial fusions have also been reported in a basal holocarpic oomycete43, demonstrating the possibility of widespread heterokaryosis within the Oomycota. Heterokaryosis may be more prevalent in non-zoospore producing oomycetes. Production of zoospores with single nuclei during the asexual cycle, exhibited by most oomycetes, breaks the heterokaryotic state each asexual generation15,16. However, some downy mildews and Phytophthora spp. germinate directly from multinucleate sporangia, which potentially maintains the heterokaryotic state, as shown in our data. Increased fitness and phenotypic plasticity of heterokaryosis could be one of the selective forces favoring the loss of zoosporogenesis in multiple lineages of oomycete pathogens17,18.
Heterokaryosis should be considered when implementing strategies for deployment of resistance genes. Cycles of somatic fusion to increase fitness and selection on populations of nuclei provide potentially great phenotypic plasticity without mutation. This could result in rapid changes in pathogen populations in response to changes in host genotypes or fungicide use. Comprehensive knowledge of the prevalence and virulence phenotypes of homokaryotic and heterokaryotic isolates, as well as the population dynamics are necessary to predict the evolutionary potential of a pathogen population.
Methods
Isolation culturing and DNA extraction
Bremia lactucae isolate SF5 has been reported previously14,23,24. Additional field isolates surveyed in this study were either collected from California/Arizona between 1982 and 2015 or were supplied by Diederik Smilde (Naktuinbouw, The Netherlands; Supplementary Table 7). Sexual progeny of SF5 × C82P24 were generated by co-inoculating isolates onto cv. Cobham Green to generate oospores. Oospores were matured for several weeks in decaying plant tissue before maceration to produce a suspension. Sexual progeny were isolated by growing cv. Cobham Green in a dilute suspension of oospores. Oospores were titrated by assessing the number of infected seedlings resulting from a serial dilution of the oospore suspension. The suspension was then diluted so that on average a single seedling would be infected per culture box. Some sexual progeny sequenced have been reported previously23,24. Single-spore isolates were derived from cotyledons that had been sporulating asexually for 1 to 2 days typically 6 to 7 days post-infection (dpi). A single cotyledon was run over a 0.5% water agar plate until clean of spores. Single sporangia were located under a dissection microscope, pulled off the agar using pipette tips, and ejected onto fresh, 7-day-old cotyledons of cv. Green Towers that had been dampened with a drop of deionized water. Plates were incubated at 15 ºC and checked for sporulation after 5 dpi. Successful single-spore infections were transferred to cv. Green Towers seedlings and maintained thereon. The virulence phenotype was determined by inoculation onto the IBEB EU-B standardized differential set (http://www.worldseed.org/wp-content/uploads/2016/05/Table-1_IBEB.pdf) and observed for sporulation at 7, 11, 15, and 21 dpi (Source Data). Fitness was determined by measuring the growth rate of B. lactucae of four replicates of four isolates on 20 cotyledons at 3, 5, 6, 7, and 9 days post-inoculation (dpi) on cv. Green Towers by qPCR to quantify B. lactucae Β-tubulin (BlBtub-F: 5’-ACATACTCCGTGTGCCCTTC-3’ and BlBtub-R: 5’-TCATCAGCGTTTTCAACGAG-3’) relative to L. sativa Actin (LsActin-F: 5’-ATTACCGCTTTAGCCCCGAG-3’ and LsActin-R: 5’-GCTGGAAAGTGCTGAGGGAT-3’; Source Data). The area under the curve was calculated for each replicate and significance tested using a two-tailed t-test with Holm adjustment. Additional virulence tests of heterokaryons were performed on an F1 hybrid of NumDm15 and R4T57D, which confer resistance phenotypes Dm15 and Dm4, respectively. Microscopy was performed on ~2-week-old seedlings of lettuce cv. Green Towers, 5 dpi with B. lactucae isolate C16C1909 (Figs. 8a) or ~2-week-old seedlings of lettuce cv. Cobham Green homozygous for the AtUBI::dsRED transgene, 7 dpi with B. lactucae isolate C98O622b (Fig. 8b). Figure 8a was captured with a Leica TCS SP8 STED 3× inverted confocal microscope using a 40× water immersion objective. Image deconvolution was performed using Huygens Professional (https://svi.nl/Huygens-Professional) and a z-projection of stacked images was made in Fiji44. Figure 8b was captured using a Zeiss LSM 710 laser scanning confocal microscope using a 40× water immersion objective. Z stacks were processed and combined into a single image using the ZEN Black software. Spore pellets of all isolates sequenced were obtained by washing sporangia from infected lettuce cotyledons in sterile water. Spore suspensions were concentrated by centrifugation in 15 ml tubes, resuspended, transferred to microfuge tubes, pelleted, and stored at −80 °C until DNA extraction. For DNA extraction sporangia were vortexed for two minutes in a microcentrifuge tube with approximately 200 μl of Rainex-treated beads and 0.5 ml of 2× extraction buffer (100 mM Tris-HCl pH 8.0, 1.4 M NaCl, 20 mM EDTA, 2% [wt/vol] cetyltrimethylammonium bromide, and Β-mercaptoethanol at 20 μl/ml) and transferred to a fresh 2 ml tube. Material was treated with RNase (20 μl/ml) at 65 °C for 30 min. An equal volume of 1:1 phenol/chloroform was added, mixed, and centrifuged at maximum speed (5200×g) for 15 min. The aqueous phase was retained, mixed with 24:1 chloroform/isoamyl alcohol and again centrifuged as maximum speed for 15 min. The aqueous phase was mixed with 0.7 volumes of isopropanol and DNA precipitated at −20 °C for one hour. This was pelleted by centrifuging at maximum speed for 30 min, washed with 70% ethanol, dried, and suspended in 10 mM Tris-HCl. Quantity and quality of DNA was determined by spectrometry, as well as estimated by TAE gel electrophoresis.
Library preparation and sequencing
Paired-end (300 bp fragments) and mate-pair (2, 5, 7, and 9-Kb) libraries were prepared using Illumina (San Diego, CA), NEB (Ipswich, MA), and Enzymatics (Beverly, MA) reagents following the manufacturers’ protocols. RNAseq libraries were constructed from cotyledons of cv. Cobham Green infected with isolate SF5 through PolyA mRNA enrichment, reverse transcription with dNTP and 2nd-strand cDNA synthesis with dUTP. cDNA was sonicated with a Covaris S220 following the manufacturer’s recommendations to achieve 150-bp fragments prior to end repair and adapter ligation. Subsequent size selection and purification were performed using 1× Agencourt AMPure beads XP (Beckman Coulter, Brea, CA). The dUTP strand was then digested and libraries were amplified enriching for the first strand45. Synthetic long reads were generated by Moleculo (now Illumina) from barcoded libraries. Libraries were sequenced by the DNA Technologies Core at the UC Davis Genome Center (http://genomecenter.ucdavis.edu) on either a HiSeq 2500 or 4000.
The random-shear BAC library was constructed by Lucigen Corporation (Middleton, WI); this provided 10,000 BAC clones with a mean insert size of 100 kb. Sanger sequencing of BAC ends was performed by the Genome Institute at Washington University (St. Louis, MO) and generated sequences averaging 700 bp in length. A fosmid library consisting of over eight million clones with a mean insert size of 40 Kb was generated by Lucigen Corporation and end-sequenced on an Illumina MiSeq. Two SMRTbellTM libraries with mean insert sizes of 3 Kb and 10 Kb were constructed and sequenced by Pacific Biosciences. Hi-C libraries were produced by Dovetail Genomics. Coverage and percent of each read set mapping to the SF5 assembly are available as Supplementary Table 8.
Flow cytometry
Flow cytometry of select isolates was performed on sporulating cotyledons 7 dpi. For each measurement, two sporulating cotyledons were mixed with 1 cm2 of young leaf tissue from Oryza sativa cv. Kitaake (2 C = 867 Mb), which was sufficiently different from the genome size of B. lactucae (2 C and 4 C) for use as the internal reference. The O. sativa 2 C DNA content was determined by calibrating against nuclei from flower buds of Arabidopsis thaliana Col-0, which has a known absolute DNA content of 2 C = 314 Mb46. Nuclei extraction and staining with propidium iodide was done using the CyStain PI Absolute P kit (Sysmex, Lincolnshire, IL). Flow cytometry was done on a BD FACScan (Becton Dickinson, East Rutherford, NJ). For each measurement, 10,000 nuclei were assessed, and each isolate was measured three times. Lettuce nuclei are ~3× larger than rice nuclei and did not interfere with the measurements. Data was analyzed using FlowJo (Ashland, OR). Total nuclear DNA content was averaged over all replicates. Means and standard deviations were calculated from the average nuclear content of each isolate. Haploid genome size was calculated by halving the mean across all isolates.
De novo assembly assessment and annotation
Multiple assembly approaches were tried using a variety of templates. Ultimately, the genome of isolate SF5 was assembled using a hybrid approach using several types of sequences (Supplementary Fig. 10). Moleculo reads were assembled using Celera47 and further scaffolded using mate-pair, fosmid-end, BAC-end, and PacBio data, utilizing first SSPACE v3.048 followed by AHA49. A consensus assembly was obtained by removing the second haplotype using HaploMerger250 and further scaffolded and gap-filled48,51. Misjoins were iteratively detected and broken using REAPR v1.0.18 in ‘aggressive mode’52 in combination with SSPACE48 and GapFiller51. Mitochondrial sequences were detected by BLASTn v2.2.28 and removed before final scaffolding and gap-filling48,51. Hi–C scaffolding was performed by Dovetail Genomics using their Hi-Rise pipeline to infer breaks and joins. One putative effector gene was masked by Ns in the assembly because it was determined by read coverage to be erroneously duplicated multiple times.
The quality of the assembly was assessed in multiple ways. Assembly completeness and duplication was measured by BUSCO v2, protist ensemble library db929, and KAT v2.4.153. Nucleotide collinearity between B. lactucae and Phytophthora sojae was inferred using Promer v3.06 (-l 30) and visualized using Symap v4.254 set with a required minimum of 5 dots. Phylogenetic analysis was performed on nucleotide and amino acid sequences of single-copy proteins predicted by BUSCO; 18 sequences from B. lactucae that were also present as single copies in assemblies of all 20 Phytophthora spp., Plasmopara spp., H. arabidopsidis, S. graminicola, Pseudoperonospora cubensis, and P. effusa surveyed were aligned independently with MAFFT v7.24555, concatenated into single sequences for each species/isolate, and phylogenetically tested with RAxML v8.0.26, run with 1000 bootstraps56. Alignments and trees are provided as Source Data.
A de novo transcriptome assembly was generated by mapping reads with BWA-MEM v0.7.1257 to a combined reference of the B. lactucae de novo assembly and L. sativa assembly58. Reads that mapped to the B. lactucae assembly were assembled with Trinity v2.2.059 and filtered with BLAST to ensure that the transcripts belonged to B. lactucae. Transcripts were translated with TransDecoder v3.0.060.
Primary annotation was performed using MAKER v2.31.861. The RNAseq assembly was first used to predict proteins from the SF5 assembly without hidden Markov models (HMMs). These protein models were then filtered and used to produce HMMs with SNAP v2006-07-2862, which in turn were used with MAKER for ab initio gene model predictions. This process was repeated (bootstrapped) six times. The optimal run was selected based on hit scores from BLAST63 to the Oomycete training protein sequences, % orthology detected with the Oomycete training database as detected by OrthoFinder v2.2.164 and average e-value of Pfam domains65 detected with InterProScan66 with a value under 1e−5. Additional candidate effectors were predicted through regular expression string searches (RxLR and EER) and HMMs (WY and CRN) of all open reading frames (ORFs) over 80 residues long17. A previously published RxLR-EER HMM67 was also applied, though it failed to define additional candidate effectors compared with regular expression string searches. These gene models were filtered and prepared for submission to NCBI using GAG v2.0-rc.168. Genome wide transcriptional support was detected by mapping RNAseq reads back to the annotated genome with STAR v2.6.069 outputting the read counts per gene table. Assembly of putative effectors in the transcriptome was inferred by >=95% tBLASTn identity and <1e−75 e-value scored between the protein and the transcript. Absence of genes encoding domains linked to zoosporogenesis and biotrophy was performed by identifying Pfam65 domains, identified by InterProScan v5.3066, linked to these process that were under-represented in non-flagellate/all downy mildews. Expected and observed distributions were tested with Chi-square, and p-values interpreted with Bonferroni adjustment. The analysis used the protein sequences of Peronospora effusa, Pe. tabacina, Hyaloperonospora arabidopsidis, Plasmopara halstedii, Phytophthora infestans, Ph. sojae, and Ph. ramorum. All proteins containing these underrepresented domains were identified and orthogroups, defined on the entire protein set by OrthoFinder v2.2.164, containing these proteins were visualized17. This dataset was also used to identify the total number of B. lactucae proteins with orthology to other oomycetes. Genomic coverage of gene models was calculated with BEDtools2 v2.25.0 multicov70, multiplying the result by the read length (101 bp), and dividing by the length of the gene. Comparative annotation analysis was undertaken by downloading GFF files of annotated oomycetes from FungiDB71 and using GAG68 to obtain summary statistics of annotations.
Repeat analysis
Repeat libraries were produced independently from RepeatModeler v1.0.8 (http://www.repeatmasker.org/) and a LTRharvest v1.5.772/LTRdigest v1.5.773. Provisional LTRs were identified as being separated by 1–40 Kb with LTRharvest. LTRdigest was used to identify complete LTR-RTs that were then annotated by similarity to elements in TREP. Elements containing sequences annotated as genes were removed. These libraries were combined and run through RepeatMasker v4.0.6 (http://www.repeatmasker.org/). Coverage of each non-overlapping masked repeat region was calculated with BEDTools2 v2.25.0 multicov70 using the coordinates of the repeat elements and the BAM file generated by mapping SF5 reads back to the assembly with BWA-MEM v0.7.1257.
Divergence of LTRs for B. lactucae and additional oomycete assemblies was calculated by running LTRharvest72 and LTRdigest73 as above. Internal domains of annotated LTR-RTs were clustered with VMatch (http://www.vmatch.de/), followed by alignment of 3’ and 5’ LTRs with Clustal-O74. Too few internal domains were detected for P. halstedii to allow clustering, so it was excluded from this analysis. Divergence between aligned 3’ and 5’ LTRs was calculated with BaseML and PAML75 and plotted using R base packages. Divergence between LTR pairs calculated for each species are provided (Source Data). LTR frequency and percentage of the assembly of each species/isolate masked was plotted with ggplot276 (Source Data). These predictions were not filtered for overlaps with gene annotations as multiple assemblies analyzed do not have publicly available annotations (Supplementary Table 3).
Analyses of additional read sets
Whole-genome sequencing data of additional oomycetes were downloaded from NCBI SRA (Supplementary Table 9) and converted to fastq files using the SRA-toolkit. Heterozygosity was calculated with GenomeScope77. Isolates that did not fit a diploid model were excluded from the analysis (Source Data).
Paired-end reads of all sequenced B. lactucae isolates were trimmed and adapter-filtered using BBMap (https://sourceforge.net/projects/bbmap/), filtered for reads of a bacterial origin by mapping to a database of all bacteria assemblies on NCBI, and mapped to the final reference assembly of SF5 using BWA-MEM v0.7.1257. Alternative allele frequency plots were generated by filtering BAM files for quality (>25) then generating pileups using SAMtools mpileup v0.1.1878, followed by BCFtools v0.1.1979 to convert to human-readable format. Bash was used to parse the files and generate the frequency of the alternative allele for every SNP that was covered by >50 reads and had an allele frequency between 0.2 to 0.8. In some instances, this frequency filter was removed to investigate the full spectrum of peaks. Bar charts were plotted with the R base package. Intersections of SNPs common to heterokaryotic and homokaryotic derivatives were obtained with BEDTools2 v2.25.0 intersect70.
Kinship analysis was performed on progeny and derivatives sequenced to a depth greater than 10x. Reads were trimmed, filtered, and mapped as above. Multi-sample pileups were obtained with SAMtools mpileup v0.1.1878, made human readable using BCFtools v0.1.1979, and pairwise kinship was calculated using VCFtools v0.1.14 with the relatedness2 flag80. The two-column table output was transformed into a matrix using bash, and conditional formatting was used to visualize relationships. Raw matrices of these analyses are available (Source Data).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
All sequence data are available at NCBI under the following BioProjects: PRJNA387017 Bremia lactucae asexual single spore progeny WGS. PRJNA387192 Bremia lactucae diversity panel WGS. PRJNA387454 Bremia lactucae SF5 × C82P24 progeny WGS. PRJNA387613 Bremia lactucae whole genome sequencing and de novo assembly. PRJNA523226 RNAseq of lettuce infected with Bremia lactucae. Isolates contained within each BioProject are detailed in Supplementary Table 8. Additional data used in this study are available in Supplementary Table 9. The genome assembly and annotation are available under accession GCA_004359215.1. All other relevant data is available upon request. The source data underlying Figs. 1c, 2, 3, 5, 6a, and 7 are provided as a Source Data file.
Code availability
All software is described and cited in the article. A workflow summary of the assembly is provided (Supplementary Fig. 10a).
References
Sogin, M. L. & Silberman, J. D. Evolution of the protists and protistan parasites from the perspective of molecular systematics. Int. J. Parasitol. 28, 11–20 (1998).
Kamoun, S. et al. The Top 10 oomycete pathogens in molecular plant pathology. Mol. Plant Pathol. 16, 413–434 (2015).
Derevnina, L. et al. Emerging oomycete threats to plants and animals. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 371, 20150459 (2016).
Crute, I. R. & Harrison, J. M. Studies on the inheritance of resistance to metalaxyl in Bremia lactucae and on the stability and fitness of field isolates. Plant Pathol. 37, 231–250 (1988).
Schettini, T., Legg, E. & Michelmore, R. Insensitivity to metalaxyl in California populations of Bremia lactucae and resistance of California lettuce cultivars to downy mildew. Phytopathology 81, 64–70 (1991).
Crute, I. The occurrence, characteristics, distribution, genetics, and control of a metalaxyl-resistant pathotype of Bremia lactucae in the United Kingdom. Plant Dis. 71, 763–767 (1987).
Ilott, T., Durgan, M. & Michelmore, R. Genetics of virulence in Californian populations of Bremia lactucae (lettuce downy mildew). Phytopathology 77, 1381–1386 (1987).
Dong, S., Raffaele, S. & Kamoun, S. The two-speed genomes of filamentous pathogens: waltz with plants. Curr. Opin. Genet Dev. 35, 57–65 (2015).
Li, Y. et al. Changing ploidy as a strategy: the irish potato famine pathogen shifts ploidy in relation to its sexuality. Mol. Plant Microbe Interact. 30, 45–52 (2016).
Bertier, L., Leus, L., D’hondt, L., de Cock, A. W. A. M. & Höfte, M. Host adaptation and speciation through hybridization and polyploidy in Phytophthora. PLoS ONE 8, e85385 (2013).
Spring, O. & Zipper, R. Asexual recombinants of Plasmopara halstedii pathotypes from dual infection of sunflower. PLoS ONE 11, e0167015 (2016).
Strom, N. B. & Bushley, K. E. Two genomes are better than one: history, genetics, and biotechnological applications of fungal heterokaryons. Fungal Biol. Biotechnol. 3, 4 (2016).
Catal, M. et al. Heterokaryotic nuclear conditions and a heterogeneous nuclear population are observed by flow cytometry in Phytophthora infestans. Cytom. A 77, 769–775 (2010).
Hulbert, S. & Michelmore, R. DNA restriction fragment length polymorphism and somatic variation in the lettuce downy mildew fungus, Bremia lactucae. Mol. Plant Microbe Interact. 1, 17–24 (1988).
Long, M. & Keen, N. Evidence for heterokaryosis in Phytophthora megasperma var. sojae. Phytopathology 67, 4 (1977).
Walker, C. A. & van West, P. Zoospore development in the oomycetes. Fungal Biol. Rev. 21, 10–18 (2007).
Fletcher, K. et al. Comparative genomics of downy mildews reveals potential adaptations to biotrophy. BMC Genomics 19, 851 (2018).
Judelson, H. S., Shrivastava, J. & Manson, J. Decay of genes encoding the oomycete flagellar proteome in the downy mildew Hyaloperonospora arabidopsidis. PLoS ONE 7, e47624 (2012).
Castoldi, R. et al. Identification of new Bremia lactucae races in lettuce in São Paulo state. Hortic. Bras. 30, 209–213 (2012).
Trimboli, D. S. & Nieuwenhuis, J. New races of Bremia lactucae on lettuce in Australia. Australas. Plant Dis. Notes 6, 62–63 (2011).
Van Ettekoven, K. & van der Arend, A. in Eucarpia Leafy Vegetables’ 99 (eds Lebeda, A. & Křístková, E.) 171–175 (Palacky University Olomouc, Czech Republic, 1999).
Michelmore, R. W. & Ingram, D. S. Heterothallism in Bremia lactucae. Trans. Br. Mycol. Soc. 75, 47–56 (1980).
Sicard, D. et al. A genetic map of the lettuce downy mildew pathogen, Bremia lactucae, constructed from molecular markers and avirulence genes. Fungal Genet Biol. 39, 16–30 (2003).
Hulbert, S. H. et al. Genetic analysis of the fungus, Bremia lactucae, using restriction fragment length polymorphisms. Genetics 120, 947–958 (1988).
Norwood, J. M., Michelmore, R. W., Crute, I. R. & Ingram, D. S. The inheritance of specific virulence in Bremia lactucae (downy mildew) to match resistance factors 1, 2, 4, 6 and 11 in Lactuca sativa (lettuce). Plant Pathol. 32, 177–186 (1983).
Francis, D. M. & Michelmore, R. W. Two classes of chromosome-sized molecules are present in Bremia lactucae. Exp. Mycol. 17, 284–300 (1993).
Stassen, J. H. et al. Effector identification in the lettuce downy mildew Bremia lactucae by massively parallel transcriptome sequencing. Mol. Plant Pathol. 13, 719–731 (2012).
Tyler, B. M. et al. Phytophthora genome sequences uncover evolutionary origins and mechanisms of pathogenesis. Science 313, 1261–1266 (2006).
Simao, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Bourret, T. B. et al. Multiple origins of downy mildews and mito-nuclear discordance within the paraphyletic genus Phytophthora. PLoS ONE 13, e0192502 (2018).
Anderson, R. G., Deb, D., Fedkenheuer, K. & McDowell, J. M. Recent progress in RXLR effector research. Mol. Plant Microbe Interact. 28, 1063–1072 (2015).
Haas, B. J. et al. Genome sequence and analysis of the Irish potato famine pathogen Phytophthora infestans. Nature 461, 393–398 (2009).
Win, J. et al. Sequence divergent RXLR effectors share a structural fold conserved across plant pathogenic oomycete species. PLoS pathogens 8, e1002400 (2012).
Boutemy, L. S. et al. Structures of Phytophthora RXLR effector proteins: a conserved but adaptable fold underpins functional diversity. J. Biol. Chem. 286, 35834–35842 (2011).
Wawra, S. et al. Host-targeting protein 1 (SpHtp1) from the oomycete Saprolegnia parasitica translocates specifically into fish cells in a tyrosine-O-sulphate-dependent manner. Proc. Natl Acad. Sci. USA 109, 2096–2101 (2012).
Stam, R. et al. Identification and characterisation CRN effectors in phytophthora capsici shows modularity and functional diversity. PLoS ONE 8, e59517 (2013).
Links, M. G. et al. De novo sequence assembly of Albugo candida reveals a small genome relative to other biotrophic oomycetes. BMC Genomics. 12, 503 (2011).
Pelgrom, A. J. E. et al. Recognition of lettuce downy mildew effector BLR38 in Lactuca serriola LS102 requires two unlinked loci. Mol. Plant Pathol. 20, 240–253 (2018).
Giesbers, A. K. J. et al. Effector-mediated discovery of a novel resistance gene against Bremia lactucae in a nonhost lettuce species. New Phytol. 216, 915–826 (2017).
Stassen, J. H. et al. Specific in planta recognition of two GKLR proteins of the downy mildew Bremia lactucae revealed in a large effector screen in lettuce. Mol. Plant Microbe Interact. 26, 1259–1270 (2013).
Ah-Fong, A. M. V., Kim, K. S. & Judelson, H. S. RNA-seq of life stages of the oomycete Phytophthora infestans reveals dynamic changes in metabolic, signal transduction, and pathogenesis genes and a major role for calcium signaling in development. BMC Genomics 18, 198 (2017).
Yoshida, K. et al. The rise and fall of the Phytophthora infestans lineage that triggered the Irish potato famine. eLife 2, e00731 (2013).
Klochkova, T. A., Shin, Y. J., Moon, K.-H., Motomura, T. & Kim, G. H. New species of unicellular obligate parasite, Olpidiopsis pyropiae sp. nov., that plagues Pyropia sea farms in Korea. J. Appl. Phycol. 28, 73–83 (2016).
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676 (2012).
Zhong, S. et al. High-throughput illumina strand-specific RNA sequencing library preparation. Cold Spring Harb. Protoc. 2011, 940–949 (2011).
Bennett, M. D., Leitch, I. J., Price, H. J. & Johnston, J. S. Comparisons with Caenorhabditis (~100 Mb) and Drosophila (~175 Mb) Using Flow Cytometry Show Genome Size in Arabidopsis to be ~157 Mb and thus ~25 % Larger than the Arabidopsis Genome Initiative Estimate of ~125 Mb. Ann. Bot. 91, 547–557 (2003).
Myers, E. W. et al. A whole-genome assembly of Drosophila. Science 287, 2196–2204 (2000).
Boetzer, M., Henkel, C. V., Jansen, H. J., Butler, D. & Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27, 578–579 (2011).
Bashir, A. et al. A hybrid approach for the automated finishing of bacterial genomes. Nat. Biotech. 30, 701–707 (2012).
Huang, S., Kang, M. & Xu, A. HaploMerger2: rebuilding both haploid sub-assemblies from high-heterozygosity diploid genome assembly. Bioinformatics 33, 2577–2579 (2017).
Boetzer, M. & Pirovano, W. Toward almost closed genomes with GapFiller. Genome Biol. 13, R56 (2012).
Hunt, M. et al. REAPR: a universal tool for genome assembly evaluation. Genome Biol. 14, R47 (2013).
Mapleson, D., Garcia Accinelli, G., Kettleborough, G., Wright, J. & Clavijo, B. J. KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies. Bioinformatics 33, 574–576 (2017).
Soderlund, C., Nelson, W., Shoemaker, A. & Paterson, A. SyMAP: A system for discovering and viewing syntenic regions of FPC maps. Genome Res. 16, 1159–1168 (2006).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at arXiv 1303.3997 (2013).
Reyes-Chin-Wo, S. et al. Genome assembly with in vitro proximity ligation data and whole-genome triplication in lettuce. Nat. Commun. 8, 14953 (2017).
Grabherr, M. G. et al. Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data. Nat. Biotechnol. 29, 644–652 (2011).
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-Seq: reference generation and analysis with Trinity. Nat. Protocols 8, https://doi.org/10.1038/nprot.2013.084 (2013).
Cantarel, B. L. et al. MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 18, 188–196 (2008).
Korf, I. Gene finding in novel genomes. BMC Bioinforma. 5, 59 (2004).
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25 3389–3402 (1997).
Emms, D. M. & Kelly, S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 16, 157 (2015).
Finn, R. D. et al. Pfam: the protein families database. Nucleic Acids Res. 42, D222–D230 (2014).
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
Whisson, S. C. et al. A translocation signal for delivery of oomycete effector proteins into host plant cells. Nature 450, 115 (2007).
Geib, S. M. et al. Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission. GigaScience 7, giy018–giy018 (2018).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformarmatics 29, 15–21 (2013).
Quinlan, A. R. BEDTools: the Swiss-army tool for genome feature analysis. Curr. Protoc. Bioinforma. 47, 11.12.1–11.12.34 (2014).
Stajich, J. E. et al. FungiDB: an integrated functional genomics database for fungi. Nucleic Acids Res. 40, D675–D681 (2012).
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinforma. 9, 18 (2008).
Steinbiss, S., Willhoeft, U., Gremme, G. & Kurtz, S. Fine-grained annotation and classification of de novo predicted LTR retrotransposons. Nucleic Acids Res. 37, 7002–7013 (2009).
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7, 539–539 (2011).
Yang, Z. PAML 4: Phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer, New York, 2016).
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Acknowledgements
We thank Melodie Najarro (UC Davis) for preparing B. lactucae cultures and Dave Tricoli for the transgenic DsRED lettuce line used for microscopy. We thank D. Smilde for supplying European (BL) isolates for sequencing. We thank Dovetail Genomics for Hi-C analysis, Dr. C.-C. Wu at Lucigen for construction of the BAC library, and the staff of the DNA Technologies Core of the UC Davis Genome Center for their sequencing efforts. This project was supported by the University of California Davis Flow Cytometry Shared Resource Laboratory with funding from the NCI P30 CA093373 (Cancer Center), and NIH NCRR C06-RR12088, S10 OD018223, S10 RR12964, and S10 RR 026825 grants and with technical assistance from Ms. Bridget McLaughlin and Mr. Jonathan Van Dyke. The work was supported by the NSF/USDA Microbial Sequencing Program award # 2009-65109-05925 and the Novozymes Inc. Endowed Chair in Genomics to R.M.
Author information
Authors and Affiliations
Contributions
K.F. performed the assembly, annotation, and inter-comparative and intra-comparative genomics, as well as drafted the manuscript. R.J.G. performed phenotyping of isolates, culturing, generation of the genetic cross, and DNA and RNA extractions. L.B. performed flow cytometry and confocal microscopy. A.K. performed culturing, DNA extractions, phenotyping of isolates, and obtained asexual single spore derivatives. K.W. prepared the qPCR investigation. L.Z., K.C., and J.W. performed culturing and DNA extractions. S.R. generated the first assembly. C.T. performed culturing, phenotyping of isolates, generation of the genetic cross, and DNA extractions. R.M. supervised and conceptualized the project and made significant contributions to all drafts. All authors contributed to the final manuscript and approved the submission.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Journal peer review information: Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fletcher, K., Gil, J., Bertier, L.D. et al. Genomic signatures of heterokaryosis in the oomycete pathogen Bremia lactucae. Nat Commun 10, 2645 (2019). https://doi.org/10.1038/s41467-019-10550-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-019-10550-0
This article is cited by
-
Root and crown rot caused by oomycetes: an emerging threat to olive trees
Tropical Plant Pathology (2024)
-
Phylogeography and population structure of the global, wide host-range hybrid pathogen Phytophthora × cambivora
IMA Fungus (2023)
-
Multigene analyses with a broad sampling in Phytophthora and related genera provide evidence for the monophyly of downy mildews
Mycological Progress (2023)
-
Phytophthora: an ancient, historic, biologically and structurally cohesive and evolutionarily successful generic concept in need of preservation
IMA Fungus (2022)
-
Variance of allele balance calculated from low coverage sequencing data infers departure from a diploid state
BMC Bioinformatics (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.