Lack of ITS sequence homogenization in Erysimum species (Brassicaceae) with different ploidy levels

Osuna-Mascaró, Carolina; de Casas, Rafael Rubio; Berbel, Modesto; Gómez, José M.; Perfectti, Francisco

doi:10.1038/s41598-022-20194-8

Download PDF

Article
Open access
Published: 07 October 2022

Lack of ITS sequence homogenization in Erysimum species (Brassicaceae) with different ploidy levels

Carolina Osuna-Mascaró^1,2^nAff5,
Rafael Rubio de Casas^2,3,
Modesto Berbel¹,
José M. Gómez^2,4 &
…
Francisco Perfectti^1,2

Scientific Reports volume 12, Article number: 16907 (2022) Cite this article

1053 Accesses
3 Citations
6 Altmetric
Metrics details

Subjects

Abstract

The internal transcribed spacers (ITS) exhibit concerted evolution by the fast homogenization of these sequences at the intragenomic level. However, the rate and extension of this process are unclear and might be conditioned by the number and divergence of the different ITS copies. In some cases, such as hybrid species and polyploids, ITS sequence homogenization appears incomplete, resulting in multiple haplotypes within the same organism. Here, we studied the dynamics of concerted evolution in 85 individuals of seven plant species of the genus Erysimum (Brassicaceae) with multiple ploidy levels. We estimated the rate of concerted evolution and the degree of sequence homogenization separately for ITS1 and ITS2 and whether these varied with ploidy. Our results showed incomplete sequence homogenization, especially for polyploid samples, indicating a lack of concerted evolution in these taxa. Homogenization was usually higher in ITS2 than in ITS1, suggesting that concerted evolution operates more efficiently on the former. Furthermore, the hybrid origin of several species appears to contribute to the maintenance of high haplotype diversity, regardless of the level of ploidy. These findings indicate that sequence homogenization of ITS is a dynamic and complex process that might result in varying intra- and inter-genomic diversity levels.

Gradual polyploid genome evolution revealed by pan-genomic analysis of Brachypodium hybridum and its diploid progenitors

Article Open access 29 July 2020

The complete plastome sequences of invasive weed Parthenium hysterophorus: genome organization, evolutionary significance, structural features, and comparative analysis

Article Open access 18 February 2024

Phylogenetic informativeness analyses to clarify past diversification processes in Cucurbitaceae

Article Open access 16 January 2020

Introduction

Concerted evolution is an evolutionary process by which sequences from the same gene family show higher sequence similarity to each other than to orthologous genes in related species^1,2. Hence, genes evolved in a concerted manner present low polymorphism in their sequences, i.e., the sequences are homogenized. Concerted evolution is particularly notable in multicopy nuclear genes, where homogenization is mainly achieved by unequal crossing over and gene conversion^3,4. One of the best characterized multicopy gene families is the 45S nuclear ribosomal DNA (nrDNA)⁵. It appears arranged as tandem repeated units with hundreds to thousands of copies in one or several loci per genome. These units are composed of the 18S rDNA, internal transcribed spacer 1 (ITS1), 5.8 S rDNA, internal transcribed spacer 2 (ITS2), and 26S rDNA, separated by longer non-transcribed intergenic spacers⁶. Among all these units, the internal transcribed spacers (ITS1 and ITS2) are the best-characterized nrDNA sequences⁷ partly because ITS sequences show characteristics advantageous for phylogenetic studies, such as biparental inheritance, short length, and high evolution rate^4,8,9.

ITS sequences usually present fast concerted evolution with low levels of intra-genomic sequence variation and very few polymorphic positions^10,11. However, in some animals (e.g.,¹²) and especially in plants^{13,14,15,16,17,18}, sequence homogenization remains incomplete across ITS sequences, resulting in relatively high intra-genomic polymorphism. This ITS diversity is often linked to hybridization events^{9,19,20,21,22}. Different ITS sequences may meet after hybridization and become homogenized after a time, but this homogenization may not be consistent among descendant lineages²³. As concerted evolution tends to homogenize sequences rapidly⁸, evidence of non-concerted evolution is mostly expected in recently-formed hybrid species, where both parental ITS sequences may still be present. This phenomenon should be particularly conspicuous in recent allopolyploid species, where the occurrence of different ITS sequences located in distinct chromosomes tends to delay this homogenization²⁴.

Erysimum l. (Brassicaceae) comprises more than 200 species²⁵, mainly from Eurasia, with some species inhabiting North America and North Africa^26,27. The Baetic Mountains (SE Iberia) constitute one of the most important glacial refugia in Europe and a hotspot for this group, with ~ 10 Erysimum species occurring in this small area^28,29. In particular, these Erysimum species show characteristics that may facilitate hybridization and inter-specific gene-flow, such as occasional sympatry and a generalist pollination system³². Thus, previous studies have suggested that several of these taxa could have a hybrid origin^30,31,32. Ploidy levels vary among and, in some cases, within species^28,33, suggesting that a detailed understating of hybridization and allopolyploidization is necessary to shed light on the evolution of this group. However, the effects of hybridization and polyploidization on the genomes of these species are far from being fully understood.

In this study, we explore the homogenization dynamics of ITS, taking into account the interacting effects of concerted evolution, hybridization, and polyploidization. For this purpose, we analyzed polymorphisms at the species, population, and individual levels in ITS1 and ITS2 for seven Erysimum species. These species are closely related and belong to an Iberian clade within this genus³⁴. We sequenced ITS1 and ITS2 by NGS to recover all the ITS copies present in the different genomes^11,35. With these sequences, we then proceeded to quantify the degree of sequence homogenization in both ITS1 and ITS2 within individuals, populations, and species; and the concerted evolution levels in Erysimum spp. These species have been previously studied, showing a mainly outcrossing mating system with weak prezygotic barriers among them^32,33. A cpDNA phylogeny has shown a recent origin (< 2 Mya) for these species³⁶. Moreover, other phylogenies for these same species have shown reticulated patterns with a lack of species clustering in some cases and evidence of cytonuclear discordance, suggesting a recent hybridization scenario with allopolyploidization³³. Due to their recent evolution, we hypothesize that polyploid species of this genus will have less ITS homogenization than diploid species. Any insight into ITS evolution in plants needs to consider the concomitant effects of hybridization and polyploidization on the rates of concerted evolution.

Materials and methods

Taxon sampling

We collected fresh leaves from polyploid and diploid Erysimum species. To determine DNA ploidy levels and assess genome size for each population, we used flow cytometry (see Table 1 for details on species ploidy levels). Specific details on the flow cytometry analyses could be found in Osuna-Mascaró et al.^32,33. In particular, we sampled leaves from five individuals belonging to three different populations of Erysimum baeticum, E. bastetanum, E. mediohispanicum, E. nevadense, and E. popovii, and five individuals from one population of E. lagascae and five from the microendemic E. fitzii. A total of 85 samples (= leaves of each individual) were dried and preserved in silica gel until DNA extraction. Table 1 shows the code, location, and ploidy levels of all samples.

Table 1 Taxonomic assignment, population code, location, elevation, and ploidy level for the Erysimum spp. populations sampled.

Full size table

DNA extraction

We used at least 60 mg of dry plant material from each sample. We disrupted the tissues in liquid N2 using a mortar and pestle. Then, total genomic DNA was isolated using the GenElute Plant Genomic DNA Miniprep kit (Sigma-Aldrich, St. Louis, MO) following the manufacturer's protocol [https://www.sigmaaldrich.com/ES/es/technical-documents/protocol/genomics/dna-and-rna-purification/genelute-plant-genomic-dna-purification-kit]. The quantity and quality of the obtained DNA were checked using a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, United States), and the integrity of the extracted DNA was checked on agarose gel electrophoresis.

ITS1 and ITS2 amplification

We independently amplified ITS1 and ITS2 in each sample. The ITS PCR reactions were performed in 25 μl with the following composition: 5 μL 5 × buffer containing MgCl2 at 1.5 mM (New England Biolabs), 0.1 mM each dNTP, 0.2 µM each primer, and 0.02 U Taq high fidelity DNA-polymerase (Q5 High-Fidelity DNA Polymerase, New England Biolabs). We used a set of long primers developed to have a 5' flanking sequence complementary to the Nextera XT DNA index to facilitate adapter ligation during library construction:

> ITS1-Flabel (for ITS1 amplification)

TCG TCG GCA GCG TCA GAT GT GTA TAA GAG ACA GTC CGT AGG TGA ACC TGC GG

> ITS1-Rlabel (for ITS1 amplification)

GTC TCG TGG GCT CGG AGA TGT GTA TAA GAG ACA GGC TGC GTT CTT CAT CGA TGC

> ITS3-Flabel (for ITS2 amplification)

TCG TCG GCA GCG TCA GAT GTG TAT AAG AGA CAG GCA TCG ATG AAG AAC GCA GC

> ITS4-Rlabel (for ITS2 amplification)

GTC TCG TGG GCT CGG AGA TGT GTA TAA GAG ACA GTC CTC CGC TTA TTG ATA TGC

Reactions included 30 cycles with the following conditions: 94 °C 15 s, 60 °C 30 s, and 72 °C 30 s. Amplified fragments were purified using spin columns (GenElute TM PCR Clean-Up Kit, Sigma-Aldrich) and were checked on agarose gel electrophoresis. Finally, we quantified the starting DNA concentration using the Infinite M200 PRO NanoQuant spectrophotometer (TECAN, Männedorf, Switzerland).

Library construction

We constructed two libraries, one for ITS1 amplicons and one for ITS2 amplicons. The libraries were prepared using the Nextera XT DNA Sample Preparation Kit. In brief, the DNA was tagged by adding a unique adapter label combination to the 3' and 5' ends of the DNA sequence. Then, the DNA was amplified via a nine-cycle PCR. The total volume reaction was 25 μl with the following composition: 5 μL 10 × buffer at 1.0 mM (New England BioLabs), 0.1 mM each dNTP, 0.2 µM each Nextera primer, 0.02 U Taq high fidelity DNA-polymerase (Q5, NEB), and 5 × Q5 High GC Enhancer (NEB). PCR thermocycling conditions were 98 °C during 5 s, 55 °C for 10 s, and 72 °C for 10 s. After that, we purified both libraries using the GenElute PCR Clean-Up Kit (Sigma) to remove short library fragments. Finally, we generated equal volumes of the libraries to prepare equimolar libraries for sequencing, and the final concentration of each library was quantified using the Infinite M200 PRO NanoQuant spectrophotometer (TECAN, Männedorf, Switzerland).

Library sequencing

ITS1 and ITS2 library sequencing were carried out by Novogene Bioinformatics Technology Co., Ltd, with an Illumina MiSeq platform (Illumina, USA) using a paired-end 150 bp sequence read run. The ITS libraries of E. mediohispanicum were sequenced twice due to an unexpected low sequencing output (we constructed new libraries as explained above). This sequencing was done using the Illumina Miseq platform and paired-end chemistry in the Center for Scientific Instrumentation (CIC) of the University of Granada, Spain.

Data analysis

FASTQ files were demultiplexed, and read quality was checked in FastQC v0.11.5³⁷. Then, we did a trimming procedure using first cutadapt v1.15³⁸ to trim the adapters, followed by a quality trimming using Sickle v1.33³⁹. Forward and reverse reads were paired in Geneious R.11⁴⁰. Using the function "Set pair read" with default parameters for Illumina paired-end read technology. Then the paired reads were merged using BBMerge v37.64⁴¹ with a "Low Merge rate" to decrease false positives. Then, to reduce redundancy and noise caused by sequencing errors and tag switching events, we did a cluster analysis using CD-HIT v4.6.8⁴². We clustered the sequences from each sample using an identity threshold of 0.99 (i.e., we merged the sequences with similarity ≥ 99%) and discarded the clusters that included < 5% of the total reads⁴³. This step reduced the contribution of sequencing errors to the reported sequence diversity.

We aligned the sequences from each sample using MAFFT v7.450⁴⁴ with default parameters, generating one alignment per species and marker. We trimmed the alignments using trimAl v1.2⁴⁵, removing poorly aligned regions with the "gappyout" method. We estimated population genetic parameters at intra-species, intra-population, and intra-individual levels using the R package PEGAS v0.1⁴⁶. We used the "nuc.div" function to calculate nucleotide diversity (π), estimated as the average number of nucleotide differences per site between two sequences^47,48. We constructed boxplots in R to depict the nucleotide diversity (π) of each sample for ITS1 and ITS2 using the package ggplot2⁴⁹. Moreover, we estimated the haplotype diversity (Hd), with the "hap.div" function, as the probability of differentiation between two randomly chosen haplotypes. We then used the "haplotype" function to calculate the total number of haplotypes and the haplotype frequency distribution for each species, population, and individual. We represented the number of ITS1 and ITS2 haplotypes per sample for diploid and polyploid species with a boxplot generated in R using ggplot2⁴⁹. We checked for normality using Shapiro–Wilk’s method and then compared the nucleotide and haplotype diversity and the number of haplotypes among polyploid and diploid species and among ITS1 and ITS2 using the Mann–Whitney–Wilcoxon test. All statistical analyses were done in R v 4.1.0 using the package stats v3.6.1⁵⁰.

We investigated potential correlations among ploidy levels and haplotype and nucleotide diversity for ITS1 and ITS2 samples. Also, as these species were described as frequently hybridizing, we studied if there were shared haplotypes among different populations of the same species and among different species. To explore that, we estimated the total number of ITS1 and ITS2 haplotypes and their frequencies.

We analyzed the genetic structure of ITS1 and ITS2 by performing a hierarchical analysis of molecular variance (AMOVA;⁵¹). We used the "amova" function from the R package PEGAS v0.1⁴⁶ to explore the genetic variation explained by populations (i.e., at the population level), among individuals within populations (i.e., at the individual level), and within individuals (i.e., at the intra-genome level). We run an AMOVA for each species, including all the population samples, regardless of population ploidy levels. Moreover, we analyzed the amount of genetic variation in ITS1 and ITS2 explained by interspecific differences by partitioning the variance into three levels: among species, among populations within species, and within populations (i.e., among individuals). For that, we run two different AMOVA analyses first, all the sequences of ITS1 and then all the sequences of ITS2, regardless of the species and ploidy level.

Research involving plants

We obtained permission for collecting plant material from: Junta de Andalucía, Consejería de Medioambiente y Ordenación del Territorio. The sampling complied with all institutional, national, and international guidelines and legislations.

Results

From the initial 85 individuals, we obtained good-quality sequences for a total of 84 ITS1 and 81 ITS2 samples, with 10,156 ± 1233 sequences per individual for ITS1 and 49,428 ± 7678 sequences for ITS2 (Table S1).

Polyploid species (E. baeticum, E. bastetanum, E. popovii) tended to have higher nucleotide diversity than diploid species (Fig. 1) for both ITS1 (Wilcoxon test = 655, p value: 0.04; mean π ± SE; polyploid: 0.012 ± 0.007, diploid: 0.004 ± 0.006) and ITS2 (Wilcoxon test = 663, p value: 0.03; polyploids: 0.003 ± 0.004, diploids: 0.002 ± 0.003). In addition, the polyploid population of E. mediohispanicum (Em71, 4x) showed higher nucleotide diversity than the two diploid populations of this species, marginally significant for ITS1 (Wilcoxon test = 10, p value: 0.05; Em71: mean π = 0.011 ± 0.006; Em39: mean π = 0.006 ± 0.007 for ITS1; Em21: mean π = 0.0004 ± 0.001; Fig. S1) and more pronounced for ITS2 (Wilcoxon test = 8.5, p value: 0.04; Em71: mean π = 0.003 ± 0.002; Em39: mean π = 0.0003 ± 0.001 for ITS2; Em21: mean π = 0.001 ± 0.002 for ITS2; Fig. S1). Furthermore, the correlation between ploidy level and nucleotide diversity was highly significant for ITS1 (Spearman’s rho: 0.48, p value: 2.10 × 10^–6; Fig. S2) and marginally significant for ITS2 (Spearman’s rho: 0.20, p value: 0.06). The difference in the degree of association of ITS1 and ITS2 polymorphisms with ploidy levels might be a consequence of overall diversity. ITS2 samples presented significantly lower nucleotide diversities than ITS1 ones (Wilcoxon test = 5165.5, p value: 3.33 × 10^–7). Nucleotide diversity values for ITS1 and ITS2 at the three levels of analysis (species, population, individual) are shown in Tables S2–S8.

Haplotype diversity showed a similar pattern to that of the nucleotide diversity, with higher haplotype diversity for polyploid species than diploid species, for ITS1 (Wilcoxon test = 343, p value: 2.16 × 10^–6; mean Hd = 0.89 ± 0.38 for polyploid; mean Hd = 0.50 ± 0.49 for diploid) and marginally significant for ITS2 (Wilcoxon test = 632.5, p value = 0.059; mean Hd = 0.39 ± 0.49 for polyploid; mean Hd = 0.28 ± 0.45 for diploid). Moreover, the degree of association between haplotype diversity and ploidy level seemed to differ between ITSs, being highly significant for ITS1 (Spearman’s rho: 0.43, p value: 2.96 × 10^–5) but only marginally significant for ITS2 (Spearman’s rho: 0.18, p value: 0.09). The values of haplotype diversity for both ITS and three levels are shown in Tables S2–S8. ITS2 presented lower haplotype diversity than ITS1 in terms of haplotype numbers (Wilcoxon test = 4458, p value 0.002; Table 2). ITS2 diversity was reduced to a single haplotype (i.e., no polymorphism was detected) in 49 individuals (Tables S2–S8). Conversely, only 30 individuals showed no nucleotide diversity in ITS1 (Tables S2–S8).

Table 2 Average haplotype diversity (Hp) per species, estimated for ITS1 and ITS2 samples. Maximum and minimum values (in parentheses) refer to individual samples.

Full size table

Polyploid species showed higher number of haplotypes than diploid species (Fig. 2). Moreover, several ITS1 haplotypes were shared across species, particularly among some populations of E. bastetanum, E. fitzii, E. mediohispanicum, and E. nevadense. Specifically, we found that the three populations of E. bastetanum studied in this article shared haplotypes with two E. mediohispanicum populations (Em39, Em71) and with the three populations of E. nevadense. In addition, E. bastetanum populations and one population of E. nevadense (En05) shared haplotypes with the E. fitzii population included in the analyses. Conversely, no ITS2 haplotypes were found to be shared across different species (Tables S10, S11, S13, and S14).

The hierarchical AMOVA showed that interspecific differences were a significant source of variation for both ITS (Table 3). The species-level explained 52.63 and 73.50% of the variance for ITS1 (p value < 0.001, Φ = 0.48) and ITS2 (p value < 0.001, Φ = 0.70) respectively, implying ample genetic divergence among species. Conversely, differences among populations were not significant and absorbed a relatively low amount of molecular variance (< 9% for both ITS1 & ITS2; Table 3). When the genetic structure was separately analyzed for each species, we found more complex results. Most of the variance (44.96–100% for ITS1; 29.12–100% for ITS2) resided within-individuals (see Table 4). Differences among populations varied from 0 to 48.07% for ITS1 and from 0 to 70.87% for ITS2. Moreover, the differences were only significant in E. mediohispanicum, E. nevadense, and E. popovii for ITS1 and E. bastetanum for ITS2 (Table 4).

Table 3 Hierarchical AMOVA results for ITS1 and ITS2 regions.

Full size table

Table 4 ITS1 and ITS2 hierarchical AMOVA results for E. baeticum, E. bastetanum, E. fitzii, E.lagascae, E. mediohispanicum, E. nevadense, and E. popovii.

Full size table

Discussion

We observed incomplete sequence homogenization for the 45S rDNA regions in the Erysimum species studied here. Our analyses were based on stringent trimming to avoid false polymorphisms due to sequencing errors. However, despite being so restrictive, we found high nucleotide and haplotype diversities overall, especially for ITS1, and a significant genetic structure that may inform the evolutionary history of these species.

Polyploid Erysimum species presented lower ITS homogenization levels than diploid species. Specifically, polyploid species presented higher nucleotide and haplotype diversity and a higher number of haplotypes, congruent with the hypothesis that polyploids harbor greater genetic diversity even within gene families⁵². The lack of concerted evolution in polyploid species has been previously described in several plant species in which an absence of sequence homogenization could be related to a recent allopolyploid origin^{34,53,54,55,56}. Moreover, some studies have suggested that the number of rDNA loci, usually located in different chromosomes, is expected to be higher in polyploids, hindering sequence homogenization^57,58. The number of rDNA loci and their chromosomic locations in these Erysimum species is unknown. In the genome of the diploid E. cheiranthoides³⁴, the rDNA appears in eight locations in chromosomes 3, 6, 7, and 8, which may be related to the number of rDNA loci for the diploid Erysimum species studied here. In any case, a relatively higher number of rDNA loci is expected for polyploid Erysimum species. Although the number of rDNA loci may coincide with the sum of those of its parents in young allopolyploids, it could be more variable in older polyploids, where some loci are usually lost^59,60,61,62.

We also detected limited sequence homogenization in diploid species, particularly in ITS1. The high molecular variance within diploid genomes (Table 4) could be the result of past hybridization events, which might result in the coexistence of multiple ITS families within individual genomes, particularly if hybrids are young^34,63. This result is congruent with previous studies, in which the genomes of the diploid Erysimum species studied here were found to exhibit signatures of recent hybridization and introgression³³. Moreover, Erysimum phylogenies based on ITS sequences^64,65,66 showed a variable degree of phylogenetic incongruence compatible with hybridization. Here, the influence of hybridization on ITS diversity is further supported by the significant molecular variance among populations detected in some species (i.e., E. mediohispanicum, E. nevadense, for ITS1), showing a non-consistent homogenization pattern in the population level. Thus, these results suggest a different history of hybridization for each population, in concordance with previous studies^32,33.

Our results indicated that sequence homogenization was heterogeneous across the 45S rDNA regions within a general scenario of high diversity (Tables S2–S8). The degree of polymorphism exhibited by ITS1 was much higher than that of ITS2, suggesting that concerted evolution is operating more efficiently on the latter. This result agrees with previous studies that have shown that ITS1 is, on average, more variable than ITS2, which has been described as a very conserved marker^{67,68,69,70,71,72}. This variation between the two spacers might help to analyze evolutionary patterns at different scales. While ITS1 variation might throw light on divergence at the population- or individual-level, our AMOVA results (Table 3) suggest that ITS2 could be useful for species-level characterization, at least in Erysimum spp.

Because of their sensitivity to hybridization, ITS markers have been previously used to identify the parental contributors of hybrid taxa^{17,53,64,73,74}. Our study found shared haplotypes among diploid and polyploid species (specifically among E. bastetanum—a polyploidy—and the diploid species E. fitzii, E. mediohispanicum, and E. nevadense), which could be the result of incomplete lineage sorting or the effect of recent hybridization events. However, we have not found decisive evidence of whether these diploid species could be considered parental species of the polyploid taxon. Moreover, our results indicate hybridization across taxonomic levels (i.e., from individuals to species) since they are more congruent with multiple backcrossings across populations and taxa than with a single, “original” allopolyploidization event. Reticulated evolution seems to be the norm in this genus^{33,36,59,75,76,77}. Thus, for the species analyzed in this study, Osuna-Mascaro et al.³³ have found genomic evidence of rampant introgression between species, including both lilac- and yellow-flowered species. Future studies identifying the alleles co-located on the same chromosome through phased haplotypes⁷⁸ or using PacBio single-molecule sequencing and the PURC method (Pipeline for Untangling Reticulate Complexes;⁷⁹) could be used to identify parental species of the different hybrid taxa and trace back the evolutionary patterns of these Erysimum species.

Despite their evident versatility as molecular evolution markers, the analysis of ITS sequences needs to be undertaken to realize that concerted evolution might often be insufficient to ensure sequence homogenization⁸⁰. Both ITS and, especially, ITS2 have for a long time been used as phylogenetic and barcoding markers in plants^{8,69,81,82,83}. However, many studies have pointed out that evolutionary inferences based on these markers might lead to misleading or erroneous conclusions in species where sequence homogenization is lacking due to hybridization or other genome rearrangement events^9,11. In this study, our results indicate that allopolyploidization and hybridization have severely impaired ITS sequence homogenization in Erysimum, implying that ITS-based phylogenies of this genus should be considered with prudence. Given that these causes of genomic rearrangement are widespread and prevalent among flowering plants¹¹, caution is advised when using ITS for phylogenetic studies without prior knowledge of haplotype distribution, even for diploid species. Hence, intragenomic variation for ITS sequences could be used as an indication of possible recent hybridization.

Data availability

The raw data from this project were submitted to NCBI Sequence Read Archive (SRA) and can be found by the BioSample ID: SUB11440702, BioProject PRJNA835881, and the following accession numbers: E. baeticum (Ebb07: SAMN28120146, Ebb10: SAMN28120147, Ebb12: SAMN28120148); E. bastetanum (Ebt01: SAMN28120149, Ebt12: SAMN28120150, Ebt13: SAMN28120151); E. fitzii: Ef01 (SAMN28120152); E. lagascae (Ela07: SAMN28120153); E. mediohispanicum (Em21: SAMN28120154, Em39: SAMN28120155, Em71: SAMN28120156); E. nevadense (En05: SAMN28120157, En10: SAMN28120158, En12: SAMN28120159); E. popovii (Ep16: SAMN28120160, Ep20: SAMN28120161, Ep27: SAMN28120162).

References

Elder, J. F. Jr. & Turner, B. J. Concerted evolution of repetitive DNA sequences in eukaryotes. Q. Rev. Biol. 70(3), 297–320 (1995).
Article CAS PubMed Google Scholar
Eickbush, T. H. & Eickbush, D. G. Finely orchestrated movements: Evolution of the ribosomal RNA genes. Genetics 175, 477–485 (2007).
Article CAS PubMed PubMed Central Google Scholar
Dover, G. Concerted evolution, molecular drive, and natural selection. Curr. Biol. 4(12), 1165–1166 (1994).
Article CAS PubMed Google Scholar
Ganley, A. R. & Kobayashi, T. Highly efficient concerted evolution in the ribosomal DNA repeats: Total rDNA repeat variation revealed by whole-genome shotgun sequence data. Genome Res. 17(2), 184–191 (2007).
Article CAS PubMed PubMed Central Google Scholar
Feliner, G. N. & Rosselló, J. A. Concerted evolution of multigene families and homoeologous recombination. In Plant Genome Diversity (eds Wendel, J. F. et al.) 171–194 (Springer-Verlag, 2012).
Chapter Google Scholar
Sone, T. et al. Bryophyte 5S rDNA was inserted into 45S rDNA repeat units after the divergence from higher land plants. Plant Mol. Biol. 41(5), 679–685 (1999).
Article CAS PubMed Google Scholar
Long, E. O. & Dawid, I. B. Repeated genes in eukaryotes. Annu. Rev. Biochem. 49(1), 727–764 (1980).
Article CAS PubMed Google Scholar
Baldwin, B. G. et al. The ITS region of nuclear ribosomal DNA: A valuable source of evidence on angiosperm phylogeny. Ann. Mo. Bot. Gard. https://doi.org/10.2307/2399880 (1995).
Article Google Scholar
Álvarez, I. & Wendel, J. F. Ribosomal ITS sequences and plant phylogenetic inference. Mol. Phylogenet. Evol. 29(3), 417–434 (2003).
Article PubMed Google Scholar
Xu, B. et al. ITS non-concerted evolution and rampant hybridization in the legume genus Lespedeza (Fabaceae). Sci. Rep. 7, 40057 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Nieto-Feliner, G. & Rosselló, J. A. Better the devil you know? Guidelines for insightful utilization of nrDNA ITS in species-level evolutionary studies in plants. Mol. Phylogenet. Evol. 44(2), 911–919 (2007).
Article CAS PubMed Google Scholar
Teruel, M. et al. Disparate molecular evolution of two types of repetitive DNAs in the genome of the grasshopper Eyprepocnemis plorans. Heredity 112(5), 531–542 (2014).
Article CAS PubMed Google Scholar
Buckler, E. S. & Holtsford, T. P. Zea systematics: Ribosomal ITS evidence. Mol. Biol. Evol. 13(4), 612–622 (1996).
Article CAS PubMed Google Scholar
Mayol, M. & Rosselló, J. A. Why nuclear ribosomal DNA spacers (ITS) tell different stories in Quercus. Mol. Phylogenet. Evol. 19(2), 167–176 (2001).
Article CAS PubMed Google Scholar
Popp, M. & Oxelman, B. Evolution of an RNA polymerase gene family in Silene (Caryophyllaceae) incomplete concerted evolution and topological congruence among paralogues. Syst. Biol. 53(6), 914–932 (2004).
Article PubMed Google Scholar
Harpke, D. & Peterson, A. Non-concerted ITS evolution in Mammillaria (Cactaceae). Mol. Phylogenet. Evol. 41(3), 579–593 (2006).
Article CAS PubMed Google Scholar
Denk, T. & Grimm, G. W. The oaks of western Eurasia: Traditional classifications and evidence from two nuclear markers. Taxon 59(2), 351–366 (2010).
Article Google Scholar
Xiao, L. Q., Möller, M. & Zhu, H. High nrDNA ITS polymorphism in the ancient extant seed plant Cycas: Incomplete concerted evolution and the origin of pseudogenes. Mol. Phylogenet. Evol. 55(1), 168–177 (2010).
Article CAS PubMed Google Scholar
Bailey, J. A., Liu, G. & Eichler, E. E. An Alu transposition model for the origin and expansion of human segmental duplications. Am. J. Hum. Genet. 73(4), 823–834 (2003).
Article CAS PubMed PubMed Central Google Scholar
Won, H. & Renner, S. S. The internal transcribed spacer of nuclear ribosomal DNA in the gymnosperm Gnetum. Mol. Phylogenet. Evol. 36(3), 581–597 (2005).
Article CAS PubMed Google Scholar
Zheng, X., Cai, D., Yao, L. & Teng, Y. Non-concerted ITS evolution, early origin and phylogenetic utility of ITS pseudogenes in Pyrus. Mol. Phylogenet. Evol. 48(3), 892–903 (2008).
Article CAS PubMed Google Scholar
Drábková, L. Z., Kirschner, J., Štěpánek, J., Záveský, L. & Vlček, Č. Analysis of nrDNA polymorphism in closely related diploid sexual, tetraploid sexual and polyploid species. Plant Syst. Evol. 278(1–2), 67–85 (2009).
Article Google Scholar
Okuyama, Y. et al. Nonuniform concerted evolution and chloroplast capture: heterogeneity of observed introgression patterns in three molecular data partition phylogenies of Asian Mitella (Saxifragaceae). Mol. Biol. Evol. 22(2), 285–296 (2004).
Article PubMed Google Scholar
Soltis, P. S. & Soltis, D. E. The role of hybridization in plant speciation. Annu. Rev. Plant. Biol. 60, 561–588 (2009).
Article CAS PubMed Google Scholar
Polatschek A. Erysimum. Mountain flora of Greece (ed. Strid, A.) 239–247 (Cambridge University Press, 1986).
Warwick, S. I., Francis, A., Al-Shehba, I. & A,. Brassicaceae: species checklist and database on CD-Rom. Plant Syst. Evol. 259, 249–258 (2006).
Article Google Scholar
Al-Shehbaz, I. A. A generic and tribal synopsis of the Brassicaceae (Cruciferae). Taxon 61, 931–954 (2012).
Article Google Scholar
Nieto-Feliner, G. in Erysimum L. Flora Ibérica (Vol. IV. Cruciferae-Monotropaceae) 48–76 (Real Jardín Botánico, CSIC, Madrid, 1993).
Médail, F. & Diadema, K. Glacial refugia influence plant diversity patterns in the Mediterranean Basin. J. Biogeogr. 36(7), 1333–1345 (2009).
Article Google Scholar
Abdelaziz, M. How Species are Evolutionarily Maintained? Pollinator-Mediated Divergence and Hybridization in Erysimum mediohispanicum and Erysimum nevadense. Doctoral Dissertation, Universidad de Granada (2013).
Muñoz-Pajares, A. J. Erysimum mediohispanicum at the Evolutionary Crossroad: Phylogrography, Phenotype, and Pollinators. Doctoral Dissertation, Universidad de Granada (2013).
Osuna Mascaró, C. Hybridization as an Evolutionary Driver for Speciation: A Case in the Southern European Erysimum Species. Doctoral Dissertation, Universidad de Granada (2020).
Osuna-Mascaró, C. et al. Hybridization and introgression are prevalent in Southern European Erysimum (Brassicaceae) species. Ann. Bot. https://doi.org/10.1093/aob/mcac048 (2022).
Article PubMed Google Scholar
Züst, T. et al. Independent evolution of ancestral and novel defenses in a genus of toxic plants (Erysimum, Brassicaceae). Elife 9, e51712 (2020).
Article PubMed PubMed Central Google Scholar
Rauscher, J. T., Doyle, J. J. & Brown, A. H. D. Internal transcribed spacer repeat-specific primers and the analysis of hybridization in the Glycine (Leguminosae) polyploid complex. Mol. Ecol. 11(12), 2691–2702 (2002).
Article CAS PubMed Google Scholar
Osuna-Mascaró, C., Rubio de Casas, R., Landis, J. B. & Perfectti, F. Genomic resources for Erysimum spp (Brassicaceae): Transcriptome and chloroplast genomes. Front. Ecol. Evol. 9, 620601 (2021).
Article Google Scholar
Andrews, S. FastQC: A quality control tool for high throughput sequence data. Available at: www.bioinformatics.babraham.ac.uk/projects/fastqc (2010).
Martin, M. Cutadapt removes adapter sequences from highthroughput sequencing reads. EMBnet J. 17, 10–12 (2011).
Article Google Scholar
Joshi, N. A. & Fass, J. N. Sickle: A sliding-window, adaptive, quality-based trimming tool for FastQ files (Version 1.33) (2011).
Kearse, M. et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28(12), 1647–1649 (2012).
Article PubMed PubMed Central Google Scholar
Bushnell, B., Rood, J. & Singer, E. BBMerge–accurate paired shotgun read merging via overlap. PLoS ONE https://doi.org/10.1371/journal.pone.0185056 (2017).
Article PubMed PubMed Central Google Scholar
Li, W. & Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22(13), 1658–1659 (2006).
Article CAS PubMed Google Scholar
Esling, P., Lejzerowicz, F. & Pawlowski, J. Accurate multiplexing and filtering for high-throughput amplicon- sequencing. Nucleic Acids Res. 43(5), 2513–2524 (2015).
Article CAS PubMed PubMed Central Google Scholar
Katoh, K., Misawa, K., Kuma, K. I. & Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30(14), 3059–3066 (2002).
Article CAS PubMed PubMed Central Google Scholar
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25(15), 1972–1973 (2009).
Article PubMed PubMed Central Google Scholar
Paradis, E. Pegas: A R package for population genetics with an integrated–modular approach. Bioinformatics 26(3), 419–420 (2010).
Article CAS PubMed Google Scholar
Nei, M. & Li, W. H. Mathematical model for studying genetic variation in terms of restriction endonucleases. PNAS 76(10), 5269–5273 (1979).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Nei, M. & Jin, L. Variances of the average numbers of nucleotide substitutions within and between populations. Mol. Biol. Evol. 6(3), 290–300 (1989).
CAS PubMed Google Scholar
Villanueva, R. A. M. & Chen, Z. J. ggplot2: Elegant graphics for data analysis 160–167 (2019).
Team, R. C. R Core Team R: A language and environment for statistical computing. Foundation for Statistical Computing (2020).
Excoffier, L., Smouse, P. E. & Quattro, J. M. Analysis of molecular variance inferred from metric distances among DNA haplotypes: Application to human mitochondrial DNA restriction data. Genetics 131(2), 479–491 (1992).
Article CAS PubMed PubMed Central Google Scholar
Otto, S. P. & Whitton, J. Polyploid incidence and evolution. Annu. Rev. Genet. 34, 401–437 (2000).
Article CAS PubMed Google Scholar
Koch, M. A., Dobeš, C. & Mitchell-Olds, T. Multiple hybrid formation in natural populations: Concerted evolution of the internal transcribed spacer of nuclear ribosomal DNA (ITS) in North American Arabis divaricarpa (Brassicaceae). Mol. Biol. Evol. 20(3), 338–350 (2003).
Article CAS PubMed Google Scholar
Kovarik, A. et al. Concerted evolution of 18–5.8–26S rDNA repeats in Nicotiana allotetraploids. Biol. J. Linn. Soc. 82(4), 615–625 (2004).
Article Google Scholar
Lunerová, J., Renny-Byfield, S., Matyášek, R., Leitch, A. & Kovařík, A. Concerted evolution rapidly eliminates sequence variation in rDNA coding regions but not in intergenic spacers in Nicotiana tabacum allotetraploid. Biol. J. Linn. Soc. 303(8), 1043–1060 (2017).
Google Scholar
Morales-Briones, D. F. & Tank, D. C. Extensive allopolyploidy in the neotropical genus Lachemilla (Rosaceae) revealed by PCR-based target enrichment of the nuclear ribosomal DNA cistron and plastid phylogenomics. Am. J. Bot. 106(3), 415–437 (2019).
Article CAS PubMed Google Scholar
Wendel, J. F. Genome evolution in polyploids. Plant Mol. Evol. 225–249 (2000).
Kovarik, A. et al. Rapid concerted evolution of nuclear ribosomal DNA in two Tragopogon allopolyploids of recent and recurrent origin. Genetics 169(2), 931–944 (2005).
Article CAS PubMed PubMed Central Google Scholar
Clarkson, J. J. et al. Long-term genome diploidization in allopolyploid Nicotiana section Repandae (Solanaceae). New Phytol. 168, 241–252 (2005).
Article CAS PubMed Google Scholar
Chester, M. et al. Extensive chromosomal variation in a recently formed natural allopolyploid species, Tragopogon miscellus (Asteraceae). Proc. Natl. Acad. Sci. 109, 1176–1181 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Rebernig, C. A. et al. The evolutionary history of the white-rayed species of Melampodium (Asteraceae) involved multiple cycles of hybridization and polyploidization. Am. J. Bot. 99, 1043–1057 (2012).
Article CAS PubMed PubMed Central Google Scholar
Weiss-Schneeweiss, H., Emadzade, K., Jang, T. & Schneeweiss, G. Evolutionary consequences, constraints and potential of polyploidy in plants. Cytogenet. Genome Res. 40, 137–150 (2013).
Article Google Scholar
Nieto-Feliner, G., Gutiérrez Larena, B. & Fuertes Aguilar, J. Fine-scale geographical structure, intra-individual polymorphism and recombination in nuclear ribosomal internal transcribed spacers in Armeria (Plumbaginaceae). Ann. Bot. 93(2), 189–200 (2004).
Article CAS PubMed PubMed Central Google Scholar
Abdelaziz, M. et al. Phylogenetic relationships of Erysimum (Brassicaceae) from the Baetic Mountains (se Iberian peninsula). An. Jard. Bot. Madr. 71, 005 (2014).
Article Google Scholar
Gómez, J. M., Perfectti, F. & Klingenberg, C. P. The role of pollinator diversity in the evolution of corolla-shape integration in a pollination-generalist plant clade. Philos. Trans. R. Soc. B. 369, 20130257 (2014).
Article Google Scholar
Moazzeni, H. et al. Phylogenetic perspectives on diversification and character evolution in the species-rich genus Erysimum (Erysimeae; Brassicaceae) based on a densely sampled ITS approach. Bot. J. Linn. Soc. 175(4), 497–522 (2014).
Article Google Scholar
Hershkovitz, M. A. & Zimmer, E. A. Conservation patterns in angiosperm rDNA ITS2 sequences. Nucleic Acids Res. 24(15), 2857–2867 (1996).
Article CAS PubMed PubMed Central Google Scholar
Coleman, A. W. ITS2 is a double-edged tool for eukaryote evolutionary comparisons. Trends Genet. 19(7), 370–375 (2003).
Article CAS PubMed Google Scholar
Chen, S. et al. Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species. PLoS ONE https://doi.org/10.1371/journal.pone.0008613 (2010).
Article PubMed PubMed Central Google Scholar
Buchheim, M. A. et al. Internal transcribed spacer 2 (nu ITS2 rRNA) sequence-structure phylogenetics: Towards an automated reconstruction of the green algal tree of life. PLoS ONE https://doi.org/10.1371/journal.pone.0016931 (2011).
Article PubMed PubMed Central Google Scholar
Wang, X. C. et al. ITS1: A DNA barcode better than ITS 2 in eukaryotes?. Mol. Ecol. Resour. 15(3), 573–586 (2015).
Article CAS PubMed Google Scholar
Yang, R. H. et al. Evaluation of the ribosomal DNA internal transcribed spacer (ITS), specifically ITS1 and ITS2, for the analysis of fungal diversity by deep sequencing. PLoS ONE https://doi.org/10.1371/journal.pone.0206428 (2018).
Article PubMed PubMed Central Google Scholar
Sun, K., Ma, R., Chen, X., Li, C. & Ge, S. Hybrid origin of the diploid species Hippophae goniocarpa evidenced by the internal transcribed spacers (ITS) of nuclear rDNA. Belg. J. Bot. 91–96 (2013).
Hodač, L., Scheben, A. P., Hojsgaard, D., Paun, O. & Hörandl, E. ITS polymorphisms shed light onhybrid evolution in apomictic plants: a case study on the Ranunculus auricomus complex. PLoS ONE https://doi.org/10.1371/journal.pone.0103003 (2014).
Article PubMed PubMed Central Google Scholar
Clot, B. Caryosystématique de quelques Erysimum L. dans le nord de la Péninsule Ibérique. Anal. Jardín Botán. Madrid 49, 215–229 (1992).
Google Scholar
Marhold, K. & Lihová, J. Polyploidy, hybridization and reticulate evolution: Lessons from the Brassicaceae. Plant Syst. Evol. 259(2), 143–174 (2006).
Article Google Scholar
Turner, B. L. Taxonomy and nomenclature of the Erysimum asperum-E.capitatum complex (Brassicaceae). Phytologia 88, 279–287 (2006).
Article Google Scholar
Browning, S. R. & Browning, B. L. Haplotype phasing: Existing methods and new developments. Nat. Rev. Genet. 12(10), 703–714 (2011).
Article CAS PubMed PubMed Central Google Scholar
Rothfels, C. J., Pryer, K. M. & Li, F. W. Next-generation polyploid phylogenetics: Rapid resolution of hybrid polyploid complexes using PacBio single-molecule sequencing. New Phytol. 213(1), 413–429 (2017).
Article CAS PubMed Google Scholar
Song, H. X. et al. The evolution and utility of ribosomal ITS sequences in Bambusinae and related species: Divergence, pseudogenes, and implications for phylogeny. J. Genet. 91(2), 129–139 (2012).
Article CAS PubMed Google Scholar
Hughes, C. E., Eastwood, R. J. & Donovan Bailey, C. From famine to feast? Selecting nuclear DNA sequence loci for plant species-level phylogeny reconstruction. Philos. Trans. R. Soc. Lond. B Biol. Sci. 361(1465), 211–225 (2006).
Article Google Scholar
Mishra, P., Kumar, A., Rodrigues, V., Shukla, A. K. & Sundaresan, V. Feasibility of nuclear ribosomal region ITS1 over ITS2 in barcoding taxonomically challenging genera of subtribe Cassiinae (Fabaceae). PeerJ 4, e2638 (2016).
Article PubMed PubMed Central Google Scholar
Cheng, T. et al. Barcoding the kingdom Plantae: New PCR primers for ITS regions of plants with improved universality and specificity. Mol. Ecol. Resour. 16(1), 138–149 (2016).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors thank the Tatiana López Pérez and the Evoflor group for helping us during several phases of the study. We also thank the Sierra Nevada National Park headquarters for providing access to sampling in the National Park.

Funding

This research is supported by grants from FEDER/Junta de Andalucía-Consejería de Economía y Conocimiento A-RNM-505-UGR18 and P18-FR-3641. This research was also funded by the Spanish Ministry of Science and Innovation (CGL2013-47558-P and PID2021-126456NB-C22), including EU FEDER funds. COM was supported by the Ministry of Economy and Competitiveness (BES-2014-069022). This is a contribution to the Research Unit Modeling Nature, funded by the Consejería de Economía, Conocimiento, Empresas y Universidad, and European Regional Development Fund (ERDF), reference QUALIFICA 00011.

Author information

Carolina Osuna-Mascaró
Present address: Department of Biology, University of Nevada-Reno, Reno, NV, USA

Authors and Affiliations

Departamento de Genética, Universidad de Granada, Granada, Spain
Carolina Osuna-Mascaró, Modesto Berbel & Francisco Perfectti
Research Unit Modeling Nature, Universidad de Granada, Granada, Spain
Carolina Osuna-Mascaró, Rafael Rubio de Casas, José M. Gómez & Francisco Perfectti
Departamento de Ecología, Universidad de Granada, Granada, Spain
Rafael Rubio de Casas
Departamento de Ecología Funcional y Evolutiva, Estación Experimental de Zonas Áridas (EEZA‐CSIC), Almería, Spain
José M. Gómez

Authors

Carolina Osuna-Mascaró
View author publications
You can also search for this author in PubMed Google Scholar
Rafael Rubio de Casas
View author publications
You can also search for this author in PubMed Google Scholar
Modesto Berbel
View author publications
You can also search for this author in PubMed Google Scholar
José M. Gómez
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Perfectti
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.O.M., R.R., and F.P. conceived and designed the study. C.O.M. and M.B. carried out the laboratory procedures and field sampling, with the help of F.P. and J.M.G. C.O.M. analyzed the data with the help of F.P. C.O.M. wrote the first draft. The final version of the MS was redacted with the contribution of all the authors.

Corresponding authors

Correspondence to Carolina Osuna-Mascaró or Francisco Perfectti.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Osuna-Mascaró, C., de Casas, R.R., Berbel, M. et al. Lack of ITS sequence homogenization in Erysimum species (Brassicaceae) with different ploidy levels. Sci Rep 12, 16907 (2022). https://doi.org/10.1038/s41598-022-20194-8

Download citation

Received: 29 May 2022
Accepted: 09 September 2022
Published: 07 October 2022
DOI: https://doi.org/10.1038/s41598-022-20194-8

This article is cited by

Intragenomic rDNA variation - the product of concerted evolution, mutation, or something in between?
- Wencai Wang
- Xianzhi Zhang
- Aleš Kovařík
Heredity (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.