Abstract
The W chromosome of Lepidoptera is typically gene-poor, repeat-rich and composed of heterochromatin. Pioneering studies investigating this chromosome reported an abundance of mobile elements. However, the actual composition of the W chromosome varies greatly between species, as repeatedly demonstrated by comparative genomic hybridization (CGH) or fluorescence in situ hybridization (FISH). Here we present an analysis of repeats on the W chromosome in the willow beauty, Peribatodes rhomboidaria (Geometridae), a species in which CGH predicted an abundance of W-enriched or W-specific sequences. Indeed, comparative analysis of male and female genomes using RepeatExplorer identified ten putative W chromosome-enriched repeats, most of which are LTR or LINE mobile elements. We analysed the two most abundant: PRW LINE-like and PRW Bel-Pao. The results of FISH mapping and bioinformatic analysis confirmed their enrichment on the W chromosome, supporting the hypothesis that mobile elements are the driving force of W chromosome differentiation in Lepidoptera. As the W chromosome is highly underrepresented in chromosome-level genome assemblies of Lepidoptera, this recently introduced approach, combining bioinformatic comparative genome analysis with molecular cytogenetics, provides an elegant tool for studying this elusive and rapidly evolving part of the genome.
Similar content being viewed by others
Introduction
Sex chromosomes have evolved independently in multiple taxa1. In most cases, their evolution seems to follow similar trajectories with predictable outcomes, although various exceptions and derivations are relatively common2. Nevertheless, the classical model of sex chromosome evolution assumes that the sex-determining factor is acquired by one of the homologous chromosomes, resulting in a new pair of sex chromosomes3. The new heteromorphic chromosome (usually Y or W, depending on the type of heterogamety) eventually begins to attract sexually antagonistic genes, and these evolutionarily advantageous mutations can be further fixed, e.g. by inversions. However, such progressive differentiation simultaneously restricts recombination in this region. This is usually followed by gradual degenerative changes such as the accumulation of repetitive sequences and pseudogenization, often leading to heterochromatinization and subsequently even to the dispensability of this particular chromosome. If the sex-determining factor is translocated or substituted, such a sex chromosome may eventually be lost forever and the entire sex chromosome life cycle may be restarted2,4.
The processes of sex chromosome evolution are best studied in diverse groups of organisms that likely share an ancestral karyotype while exhibiting various abnormalities at different evolutionary stages. One of these is the insect order Lepidoptera (moths and butterflies), the largest group with female heterogamety5. Until recently, however, very little was known about the actual molecular composition of their W chromosomes, which were regularly excluded from NGS projects due to their repetitive nature causing assembly problems. So far, only a few genes have been found on the W chromosome, such as the sex-determining female factor generating Fem piRNA6. In contrast, the lepidopteran W chromosomes are typically filled with various types of ubiquitous transposable elements (TE)7,8,9 and may also contain satellite DNA10. Occasionally, some of these sequences are enriched on the W chromosome, such as piSAT1 in Plodia interpunctella11, or even W-specific (i.e. present only on the W), such as CpW2 and CpW5 in Cydia pomonella12.
Apart from Lepidoptera, the accumulation of transposable elements and other repetitive DNA by the heterogametic sex chromosomes has been observed in plants13, fungi14, humans15, other mammals16, birds17, Drosophila flies18, beetles19 and many others20. Such events usually lead to the formation of heterochromatin, which may attract even more TEs, as some of them preferentially insert into silenced regions21. This could be the result of opposing selection forces balancing the deleterious effects of integration on the host cell on the one hand, and the propagation of the specific transposable element in the genome on the other. Some degree of preference for integration sites can be observed in almost all TE types22. Their population dynamics within the genome are described by the deleterious insertion model, which assumes that transposons are eliminated from gene-rich regions by selection pressure, or by the ectopic recombination model, which assumes a higher abundance of transposons in regions with low recombination rates where they cannot be removed by ectopic recombination23. These models offer an explanation for the accumulation of transposable elements on heterogametic sex chromosomes (i.e. Y and W).
Hence, transposable elements may play an important role in the evolution and differentiation of sex chromosomes. In fact, it has already been shown that TEs can influence whole genomes, both in destructive and constructive ways. Destructive forces primarily include insertions into genes or their promoters that impede their function, which may eventually lead to lower fitness or the development of various diseases24. On the other hand, some of our vital proteins originated via domestication of ancient transposable elements, such as RAG proteins that carry out recombination in immunoglobulin genes25, or telomerase which maintains chromosome ends26. In addition, they may contribute to genome size expansion or speciation, as e.g. LINE-1 retrotransposons may reduce gene expression, provide new exons for protein-coding genes27, or serve as “booster stations” for the spread of silencing Xist RNA during X chromosome inactivation in mammals28. To sum up, transposable elements can shape genomes in various ways and influence sex chromosome differentiation, potentially leading to speciation or sexual dimorphism29.
Previous cytogenetic studies in Lepidoptera have shown striking differences between W chromosome composition and size even between closely related species, suggesting rapid evolution30,31,32. One of the most informative cytogenetic tools is comparative genomic hybridization (CGH), which enables to estimate the differentiation level and the ratio of common to female-enriched sequences on the W chromosome. In our recent study30, CGH revealed a high degree of sex chromosome differentiation in many species of Geometridae (Lepidoptera), including the willow beauty (Peribatodes rhomboidaria). This species has a standard lepidopteran karyotype (n = 31, WZ/ZZ), and its normal-sized W chromosome is strongly DAPI positive and contains a large amount of heterochromatin. Furthermore, it shows a remarkable hybridization pattern after CGH, as it is strongly marked by the female genomic probe (Fig. S1; Fig. 8e in our recent study30), suggesting the presence of female-enriched and/or female-specific sequences. The question remained open as to which type of repetitive sequences mainly contribute to the molecular differentiation of the W chromosome in P. rhomboidaria. To address this issue, we performed low coverage NGS sequencing of three individuals of each sex and compared their repeat content using the RepeatExplorer pipeline33. The results describe the repeatome of P. rhomboidaria in general and reveal two W-enriched sequences, contributing to our knowledge of W chromosome evolution in Lepidoptera. Finally, this study validates a novel approach for analysing differences in repeat content that can be applied to other species.
Materials and methods
Insects
Adult specimens of P. rhomboidaria were collected in the field from June to September 2019–2020 in the vicinity of České Budějovice, Czech Republic, using light traps and entomological nets. Females were kept in plastic containers with host plants (mostly leaves of Achillea millefolium) until they laid eggs. After hatching, the larvae were reared to the penultimate instar stage, dissected for chromosome preparations and the remaining tissues were frozen in liquid nitrogen and stored at − 20 °C.
Chromosome preparations
Meiotic chromosomes were obtained from gonads of penultimate instar larvae and spread chromosome preparations were made as previously described34, dehydrated in an ethanol series (70%, 80% and 100%, 30 s each), air-dried and stored at − 20 °C until further use.
DNA isolation and sequencing
Genomic DNA (gDNA) was extracted from one half of the larva using CTAB (hexadecyltrimethylammonium bromide; Sigma-Aldrich, St. Louis, MO, USA) according to the published protocol35 with modifications previously described30. The concentration of isolated DNA was measured using Qubit 3.0 fluorometer (Invitrogen, Carlsbad, CA, USA), and its quality was assessed by the absorbance ratio at 260/280 nm using a Nanodrop 2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). For each of the three broods studied (i.e. offspring of the same mother), we selected a sample of optimal quality from one male and one female (6 samples in total). Paired-end sequencing of 150 bp long reads from a library with 450 bp inserts was performed by Novogene (HK) Co., Ltd. (Hong Kong, China) using the Illumina HiSeq 4000 platform.
NGS data processing and repeatome analysis
The obtained Illumina raw reads were first processed by Trimmomatic version 0.3236, the Illumina sequencing adapters were trimmed and finally all reads were cropped to a final length of 140 bp. The quality of the reads before and after processing was checked using FastQC version 0.10.137. The remaining paired sequences were converted to FASTA format using FastQtoFasta software from the FASTX-Toolkit version 0.0.14 (http://hannonlab.cshl.edu/fastx_toolkit). The following procedures were performed using the RepeatExplorer2 Utilities Kit version 0.3.8-451 (https://repeatexplorer-elixir.cerit-sc.cz). Sequences were interlaced and a random sample of 100,000 reads was generated for each individual (corresponding to approximately 0.02 × genome coverage) and tagged with a specific read name prefix. Alternatively, read sampling was done using the seqtk tool (https://github.com/lh3/seqtk). Samples were concatenated into a single fasta file, which was processed by RepeatExplorer33 using the default parameters for comparative analysis. To identify W-enriched repeats, the number of reads in individual clusters was compared between males and females using R version 4.0.3 in RStudio version 1.4.1103 (https://www.rstudio.com). Only clusters that consisted of at least 0.01% of the analysed reads and showed statistically significant differences (P < 0.05,two sample t-test with unequal variance) were considered. The annotation of the repeats was based on the automatic annotation of RepeatExplorer using Metazoa v3 database. Annotation of selected repeats was further confirmed by protein-based repeat masking in RepeatMasker 4.1.238 (https://www.repeatmasker.org).
PCR and cloning
Based on the RepeatExplorer results, we further analysed the two most abundant W-enriched repeats: cluster 32 (containing the PRW LINE-like retrotransposon) and cluster 79 (containing the PRW Bel-Pao retrotransposon). We designed primers (Table S1) for a selected contigs from cluster 32 and the assembly of three contigs covering most of the cluster 79 using Geneious Prime version 2021.1.1. (https://www.geneious.com). These primers were used in a standard 25 μl PCR mix containing 1× ExTaq buffer (TaKaRa, Otsu, Japan), 2U DNA ExTaq Polymerase (TaKaRa), 200 μM of each nucleotide, 1 μM of each primer and 50 ng of gDNA with the following profile: initial denaturation at 94 °C for 3 min, followed by 30 cycles of 94 °C for 30 s, 56 °C for 30 s and 72 °C for 1 min 30 s; final extension was at 72 °C for 3 min. The results were checked on a standard 1.5% agarose gel in 1× TAE buffer. Gels were stained with ethidium bromide for 20 min and photographed under UV light. To avoid cloning problems due to the size of the PRW Bel-Pao sequence, we generated two shorter overlapping PCR products (using primer pairs PRW_F1 + PRW_R1 and PRW_F2 + PRW_R2; Table S1). All products were purified using ExoSAP-IT (ThermoFisher Scientific) and cloned into a vector using pGEM-T Easy Vector System (Promega, Madison, WI, USA) according to the manufacturer's instructions. Plasmid DNA was isolated using the NucleoSpin Plasmid kit (Macherey–Nagel, Düren, Germany) according to the manufacturer's protocol. The identity of the fragment was verified by Sanger sequencing in SEQme (Dobříš, Czech Republic) with universal M13 primers. All examined sequences (both contigs and cloned fragments) were deposited in GenBank (Table 1).
Coverage analysis
To obtain more information about the representation of selected fragments in the genomes of both sexes, we mapped the trimmed, filtered and paired reads (see above) to the cloned consensus sequences in Geneious Prime version 2021.1.1. using the built-in algorithm. Mapping was done separately for male and female reads (three combined samples from each sex) using the 95% identity threshold, and the mapped reads were manually sorted to ensure accurate results. We also estimated copy number based on total coverage.
Probe labelling and fluorescence in situ hybridization (FISH)
PRW LINE-like mapping was done in Jaén, Spain. 1 µg of PRW LINE-like DNA was labelled by nick translation using the biotin-NT-mix (Roche, Basel, Switzerland). The probe was then precipitated along with 50 μg yeast RNA and 50 μg salmon sperm DNA and redissolved in 50% formamide in 2 × SSC. Fluorescence in situ hybridization (FISH) was performed according to the published protocol39 with listed modifications40.
PRW Bel-Pao mapping was done in České Budějovice, Czech Republic. Both fragments (I and II, which do not contain the deleted region—see below) were labelled by PCR containing 0.04 mM each of dATP, dCTP and dGTP; 0.014 mM dTTP, 0.025 mM biotin-16-dUTP (Jena Bioscience, Jena, Germany), 1× ExTaq buffer (TaKaRa), 1–10 ng plasmid DNA, 1 μM of each primer and 2U DNA ExTaq Polymerase (TaKaRa) under the same conditions as described above. FISH was performed according to the published protocol41, with signal amplification using Cy3-conjugated streptavidin (Jackson ImmunoRes. Labs. Inc, West Grove, PA, USA) at a dilution of 1:1000 with washing blocking buffer.
Results
Characterization of repeats in the genome of P. rhomboidaria
Total genome coverage after read processing was 2.38× for females and 2.24× for males, corresponding to approximately 0.77 × coverage depth/per sample, based on a rough estimate of genome size 1C ~ 0.646 pg performed by flow cytometry in the laboratory of Petr Koutecký according to the published protocol34.
The RepeatExplorer analysis clustered about 44% of input reads, indicating the relative representation of repeats in the genome.
Of all the identified clusters, 220 (representing ~ 24% of the input reads) were subjected to automatic annotation and further analysis, as each of these clusters represented at least 0.01% of the examined reads. However, the automatic annotation was not able to classify most of the top 220 clusters and clusters corresponding to 87% of the input reads remained unknown. Among the annotated clusters, about 0.91% of the input reads were satellites. As for the classified mobile elements, the prevailing groups were long interspersed nuclear elements (LINE) with 1.33% of the reads and long terminal repeats (LTR) with 0.65% of the reads.
Characterization of selected sequences
Out of the top 220 clusters, 12 clusters were statistically significantly female enriched and three clusters were male enriched (Fig. S2). Since the maximum male enrichment was 1.37× , any female-enriched cluster where the female-to-male ratio was below this value was considered a false positive. In the end, we obtained ten clusters representing putative W repeats (Fig. S2). The average female enrichment in these clusters was 6.48× and the maximum enrichment was 197.21×. Most of the putative W repeats were annotated as retrotransposons (Table 2). The two most abundant W repeats represented by cluster 32 (PRW LINE-like) and cluster 79 (PRW Bel-Pao) were further analysed (Fig. 1).
Based on the RepeatExplorer results, cluster 32 (corresponding to PRW LINE-like) was enriched 3.57× in the female data, where it represented about 0.31% of the genome, while in males it was only about 0.09% (Table 2, Fig. 1). Since the RepeatExplorer results are based on data with very low coverage, we mapped all processed reads to the cloned consensus sequence to obtain more accurate information on the abundance of this repeat. The results showed significantly higher coverage in females (774.6×) than in males (39.2×). We also estimated copy number using total coverage information, which was 325 copies in females and 17 copies in males, making up to a 19-fold difference.
Automatic annotation by RepeatExplorer classified the cluster 32 repeat as a retrotransposon from the LINE element group, based on the homology of the two identified reverse transcriptase (RT) domains. However, the structure of the RE graph was relatively complicated (Fig. 2), similarly to the RepeatMasker-based annotation. Apart from the expected two LINE RTs from the LINE/CR1 (which corresponds to our cloned and FISH-mapped sequence) and LINE/L2 subgroups, RepeatMasker also found a DNA transposon similar to Transib in this cluster.
Database search in NCBI BLASTN42 revealed 72% identity with a non-LTR retrotransposon T1Q-like reverse transcriptase gene of Abraxas sylvata (GenBank Acc. No. HQ284333.1), another moth from the family Geometridae. All other relevant hits without closer annotation came from the Tree of Life project43 and ranged from 99% sequence identity in P. rhomboidaria to 70% identity in many related species.
Cluster 79 (corresponding to PRW Bel-Pao) was the most female-enriched repeat cluster identified by RepeatExplorer (197.21×). The results show that this repeat is almost absent in the male genome (0.001%), while it covers more than 0.1% of the female genome suggesting that it is mainly localised on the W chromosome (Fig. 1). This assumption was confirmed by read mapping, which revealed even greater imbalance between the sexes, as coverage was 840× in females and only 2.5× in males. We also found a less frequent version of this element with a 213-bp long deletion. After mapping the reads to this “deleted” region, the coverage dropped to 698.5× in females, while it remained the same in males. The estimated copy number is 353 copies in females, out of which 60 have the deleted region, and 1–2 copies in males.
Both RepeatMasker and RepeatExplorer classified this cluster 79 repeat as an LTR transposon from the Bel-Pao group, based on the group-specific antigen (GAG), integrase (INT) and protease (PROT) domains. Although the graph of this repeat was linear (Fig. 3), indicating a less variable sequence of this repeat between copies, we could not recover all crucial protein-coding domains for this type of element, and the long terminal repeats were also missing.
NCBI BLASTN search yielded hits for P. rhomboidaria (98% identity) and for other Geometridae species, including the W chromosome of Crocallis elinguaria (73%) and the W chromosomes of two other Lepidoptera species, Clostera curtula (Notodontidae, 70%) and Hydraecia micacea (Noctuidae, 68%), with all sequences coming from the Tree of Life project43.
PCR amplification of selected repeats from gDNA
Both examined fragments were amplified by PCR (in the case of PRW Bel-Pao, we used the entire non-divided product from primer F1 to primer R2), using equal amounts of male and female gDNA. While the PCR products were present in both sexes, the female band was much stronger (Fig. 4). These results suggest that both elements are mainly present on the W chromosome.
FISH mapping
To verify the predicted location of selected fragments, we carried out physical mapping using FISH. Indeed, both probes labelled the W chromosome (Figs. 5A, 6A), which was also identified by its heterochromatinization and corresponding DAPI positivity (Figs. 5B, 6B). In the case of PRW LINE-like (Fig. 5), the probe showed significant signal accumulation on the W chromosome, and multiple small scattered signals were also detected on other chromosomes, which were equally abundant in both sexes. In PRW Bel-Pao (Fig. 6), the probe hybridized exclusively on the W chromosome, which was labelled almost along its entire length, except for a small gap at one of the chromosomal ends. Accordingly, no signal was detected in males (Fig. 6D).
Discussion
The genome size of P. rhomboidaria was estimated to 0.646 pg (corresponding to 631 Mb), which is similar to another geometrid species Operophtera brumata (645 Mb44) and falls within the range of known genome sizes in Lepidoptera from 273 Mb in Danaus plexippus (Nymphalidae) to 1.9 Gb in Euchlaena irraria (also Geometridae)45. However, the vast majority of species tend to have smaller genomes, similar to B. mori with 0.53 pg (corresponding to 518 Mb)46. The proportion of repetitive DNA in the genome of P. rhomboidaria was estimated to 44%; however, its actual volume is expected to be higher based on the genome size and repeat proportion in genomes of other Lepidoptera. For comparison, repeats in D. plexippus make up only 10.2% of the genome47, while in B. mori they comprise up to 46.8%48 and in O. brumata even more than half of the genome (53.5%)44. This discrepancy in P. rhomboidaria is likely caused by the RepeatExplorer analysis, which omits microsatellites and ancient diversified copies of mobile elements49. Although most of the repetitive sequences in P. rhomboidaria remain unclassified, similarly to genomes of other Lepidoptera the most abundant class of repeats are transposable elements. For instance, LINEs are particularly abundant in the two species of Geometridae, P. rhomboidaria (this study) and O. brumata44.
High occurrence of retrotransposons is a common feature of the W chromosome in Lepidoptera, although the number of species with available data is still limited7,9,12. In this study, we identified 10 putative W-enriched repeats of P. rhomboidaria, most of them being classified as LINE or LTR retrotransposons. Their identification was done by comparative analysis with the RepeatExplorer software, a novel approach for Lepidoptera as it only has been used for mapping satellite sequences in Crambidae10, and for the identification and characterization of W-enriched repetitive sequences in Diatraea saccharalis (Crambidae)50.
All these findings suggest that transposable elements are the main repeats with a tendency to colonize the W chromosomes of Lepidoptera and play the major role in their diversification. This phenomenon has already been observed in heterogametic sex chromosomes of many other organisms51,52,53. It seems that immediately after recombination around the sex-determining region is stopped, an accumulation of TEs follows54,55. This is one of the first events which may lead to the diversification of sex chromosomes (as seen e.g. in Ty3/Gypsy elements in papaya56), and even temporarily increase the size of the heterogametic chromosome at the early stage of differentiation57. The presence of TEs in the sex-determining region may also facilitate translocation of the sex-determining factor either via transposition or via ectopic recombination, potentially leading to the rise of a new sex chromosome pair58,59, or influence the expression of neighbouring genes. For instance, TE-induced methylation of a transcription factor realizes sex determination in melon60. Further, it is known that abundance of TEs is linked to the formation of heterochromatin, which may lead to the silencing of the whole chromosome (such as LINE in mammalian X chromosome inactivation28).
The expansion of TEs on the non-recombining sex chromosomes such as Y and W probably reflects the strongly reduced efficiency of natural selection on these chromosomes61,62. This is especially true for the W chromosomes of Lepidoptera, since meiotic recombination is completely absent in females63,64,65. Therefore, the abundant occurrence of TEs on the W chromosome in Lepidoptera is not surprising.
Moreover, the colonization of the W chromosomes of Lepidoptera by a particular type of retrotransposon is probably a rapid process with a random outcome, occurring independently even in closely related taxa10. However, it seems that at least for some retrotransposons, the W chromosome becomes the final destination, as they often genetically erode. This happens mainly through their recurrent uncoordinated insertions into already existing elements, which may destroy their functional domains. This, of course, also complicates their annotation.
PRW LINE-like element is a possible example of such an event. The complexity of the graph of the cluster 32 analysis with RepeatExplorer suggests a derived type of repeat with ambiguous annotation, as it contains two RT domains from different LINE elements and one DNA transposon Transib domain. Indeed, it is possible that there were originally several types of transposable elements that were repeatedly inserted into each other during their accumulation in the W chromosome (as in B. mori66), and this new ‘patchwork’ sequence could potentially amplify independently as a novel repeat (as suggested in legumes67).
Regarding the PRW LINE-like relative copy number differences between males and females, the RepeatExplorer analysis showed lower female enrichment than the coverage analysis in Geneious. These discrepancies are most likely due to the fact that RepeatExplorer analysed the entire cluster generated in silico, while only the cloned fragment (annotated as LINE CR1 by RepeatMasker) was used for Geneious read mapping. Since the RepeatExplorer graph may actually compile more types of monomers, it is possible that some of them are not accumulated on the W chromosome.
FISH mapping revealed a significant accumulation of the cloned PRW LINE-like sequence on the W chromosome, and its presence was also confirmed on other chromosomes, as many scattered signals were detected in both males and females. Accordingly, PCR results showed products in both sexes, although the female gDNA products were considerably stronger. These results are consistent with the fact that the PRW LINE-like female enrichment is less prominent in comparison to PRW Bel-Pao (Table 2), since it occurs profusely also on other chromosomes.
In the case of PRW Bel-Pao, it was classified as an LTR Bel-Pao element both by RepeatExplorer automatic annotation and RepeatMasker protein search. Both versions of this element are probably also non-functional and genetically eroded, as they lack the RT domain, long terminal repeats and in case of the less frequent deleted version also the PROT domain. Strong female enrichment of this transposon is very clear in both RepeatExplorer and Geneious coverage analyses. The difference between them is negligible and most likely technical, as the RepeatExplorer analysis is affected by a smaller sample size and a lower required percentage of identity between reads (90% vs. 95% in Geneious). Based on the coverage differences between the full-size fragment and the deleted region, we estimate that about 17% of the copies carry the deletion.
The PCR results strongly support the bioinformatic data. Two PCR products were found in both sexes, differing in volume and length. They correspond to a shorter, less abundant version carrying the deletion and a longer, more abundant full-length sequence. These results, combined with the coverage analysis data, therefore confirm that both deleted and non-deleted versions of this sequence are present on and off the W chromosome. Since it is unlikely that the original non-deleted version would mutate independently on different chromosomes, we hypothesize that the derived deleted version have migrated either to or from the W chromosome via an alternative transposition mechanism, such as ectopic recombination or with another mobile element.
As expected, the PCR products from the female gDNA were considerably stronger than those from the male gDNA. Accordingly, the FISH mapping revealed striking differences between the sexes and provided the final physical evidence for the accumulation of this repeat on the W chromosome. We did not observe hybridization signals on other chromosomes, but the probe highlighted almost the entire length of the W chromosome except for its terminal part, which did not contain heterochromatin. In our previous work, we showed results of comparative genomic hybridization (CGH), which revealed clear differences between the signals of the female gDNA probe in this terminal part and the heterochromatic part, proving that the latter part consists mainly of female-enriched sequences30. Thus, the PRW Bel-Pao repeat is probably one of the key components of the CGH results. Moreover, by colonizing the W chromosome, this repeat significantly contributes to its progressive differentiation.
To conclude, our data support previous findings that retrotransposons are the main cause of differentiation of the W chromosome in Lepidoptera. Moreover, we characterized two types of retrotransposons that accumulated on the W chromosome of P. rhomboidaria and we validated a novel approach to finding differences in repeat content using RepeatExplorer. Bioinformatic analysis corresponded very well with PCR and cytogenetic mapping, demonstrating the connectivity and mutual indispensability of these methods in providing a comprehensive insight. Despite the fact that Lepidoptera is a species-rich taxon, the W chromosome content is at least partly known in few species only. Therefore, the approach used in this work represents an alternative, elegant, and cost-effective tool for unveiling sequences which drive the W chromosome diversification.
Data availability
All examined sequences have been deposited into the GenBank with accession numbers provided in the Table 1 of “Materials and methods” section.
References
Charlesworth, B. The evolution of chromosomal sex determination and dosage compensation. Curr. Biol. 6, 149–162 (1996).
Abbott, J. K., Nordén, A. K. & Hansson, B. Sex chromosome evolution: historical insights and future perspectives. Proc. R. Soc. B 284, 20162806 (2017).
Charlesworth, B. The evolution of sex chromosomes. Science 251, 1030–1033 (1991).
Furman, B. L. S. et al. Sex chromosome evolution: So many exceptions to the rules. Genome Biol. Evol. 12, 750–763 (2020).
Traut, W., Sahara, K. & Marec, F. Sex chromosomes and sex determination in Lepidoptera. Sex. Dev. 1, 332–346. https://doi.org/10.1159/000111765 (2007).
Kiuchi, T. et al. A single female-specific piRNA is the primary determiner of sex in the silkworm. Nature 509, 633–636 (2014).
Abe, H., Mita, K., Yasukochi, Y., Oshiki, T. & Shimada, T. Retrotransposable elements on the W chromosome of the silkworm, Bombyx mori. Cytogenet. Genome Res. 110, 144–151. https://doi.org/10.1159/000084946 (2005).
Sahara, K., Yoshido, A. & Traut, W. Sex chromosome evolution in moths and butterflies. Chromosome Res. 20, 83–94 (2012).
Traut, W., Vogel, H., Glöckner, G., Hartmann, E. & Heckel, D. G. High-throughput sequencing of a single chromosome: A moth W chromosome. Chromosome Res. 21, 491–505 (2013).
Cabral-de-Mello, D. C., Zrzavá, M., Kubíčková, S., Rendón, P. & Marec, F. The role of satellite DNAs in genome architecture and sex chromosome evolution in Crambidae moths. Front. Genet. 12, 661417. https://doi.org/10.3389/fgene.2021.661417 (2021).
Dalíková, M., Zrzavá, M., Kubíčková, S. & Marec, F. W-enriched satellite sequence in the Indian meal moth, Plodia interpunctella (Lepidoptera, Pyralidae). Chromosome Res. 25, 241–252 (2017).
Fuková, I. et al. Probing the W chromosome of the codling moth, Cydia pomonella, with sequences from microdissected sex chromatin. Chromosoma 116, 135–145. https://doi.org/10.1007/s00412-006-0086-0 (2007).
Kejnovský, E., Hobza, R., Čermák, T., Kubat, Z. & Vyskot, B. The role of repetitive DNA in structure and evolution of sex chromosomes in plants. Heredity 102, 533–541 (2009).
Hood, M. E. Repetitive DNA in the automictic fungus Microbotryum violaceum. Genetica 124, 1–10 (2005).
Erlandsson, R., Wilson, J. F. & Pääbo, S. Sex chromosomal transposable element accumulation and male-driven substitutional evolution in humans. Mol. Biol. Evol. 17, 804–812 (2000).
Toder, R., Wakefield, M. J. & Graves, J. A. M. The minimal mammalian Y chromosome—The marsupial Y as a model system. Cytogenet. Cell Genet. 91, 285–292 (2000).
Peona, V. et al. The avian W chromosome is a refugium for endogenous retroviruses with likely effects on female-biased mutational load and genetic incompatibilities. Philos. Trans. R. Soc. B 376, 20200186 (2021).
Junakovic, N., Terrinoni, A., Di Franco, C., Vieira, C. & Loevenbruck, C. Accumulation of transposable elements in the heterochromatin and on the Y chromosome of Drosophila simulans and Drosophila melanogaster. J. Mol. Evol. 46, 661–668 (1998).
Rosolen, L. A. M., Vicari, M. R. & Almeida, M. C. Accumulation of transposable elements in autosomes and giant sex chromosomes of Omophoita (Chrysomelidae: Alticinae). Cytogenet. Genome Res. 156, 215–222 (2019).
Śliwińska, E. B., Martyka, R. & Tryjanowski, P. Evolutionary interaction between W/Y chromosome and transposable elements. Genetica 144, 267–278 (2016).
Zhu, Y., Dai, J., Fuerst, P. G. & Voytas, D. F. Controlling integration specificity of a yeast retrotransposon. Proc. Natl. Acad. Sci. USA 100, 5891–5895 (2003).
Bourque, G. et al. Ten things you should know about transposable elements. Genome Biol. 19, 199. https://doi.org/10.1186/s13059-018-1577-z (2018).
Charlesworth, B. & Langley, C. H. The evolution of self-regulated transposition of transposable elements. Genetics 112, 359–383 (1986).
Ostertag, E. M. & Kazazian, J. Biology of mammalian L1 retrotransposons. Annu. Rev. Genet. 35, 501–538 (2001).
Fugmann, S. D. The origins of the Rag genes—From transposition to V(D)J recombination. Semin. Immunol. 22, 10–16 (2010).
Belfort, M., Curcio, M. J. & Lued, N. F. Telomerase and retrotransposons: Reverse transcriptases that shaped genomes. Proc. Natl. Acad. Sci. USA 108, 20304–20310 (2011).
Kazazian, H. H. Mobile elements: Drivers of genome evolution. Science 303, 1626–1632 (2004).
Lyon, M. F. LINE-1 elements and X chromosome inactivation: A function for ‘junk’ DNA?. Proc. Natl. Acad. Sci. USA 97, 6248–6249 (2000).
Dechaud, C., Volff, J. N., Schartl, M. & Naville, M. Sex and the TEs: Transposable elements in sexual development and function in animals. Mob. DNA 10, 42. https://doi.org/10.1186/s13100-019-0185-0 (2019).
Hejníčková, M. et al. Degenerated, undifferentiated, rearranged, lost: High variability of sex chromosomes in Geometridae (Lepidoptera) identified by sex chromatin. Cells 10, 2230. https://doi.org/10.3390/cells10092230 (2021).
Vítková, M., Fuková, I., Kubíčková, S. & Marec, F. Molecular divergence of the W chromosomes in pyralid moths (Lepidoptera). Chromosome Res. 15, 917–930 (2007).
Zrzavá, M. et al. Sex chromosomes of the iconic moth Abraxas grossulariata (Lepidoptera, Geometridae) and its congener A. sylvata. Genes 9, 279. https://doi.org/10.3390/genes9060279 (2018).
Novák, P., Neumann, P., Pech, J., Steinhaisl, J. & Macas, J. RepeatExplorer: A galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads. Bioinformatics 29, 792–793. https://doi.org/10.1093/bioinformatics/btt054 (2013).
Hejníčková, M. et al. Absence of W chromosome in Psychidae moths and implications for the theory of sex chromosome evolution in Lepidoptera. Genes 10, 1016. https://doi.org/10.3390/genes10121016 (2019).
Winnepenninckx, B., Backeljau, T. & De Wachter, R. Extraction of high molecular weight DNA from molluscs. Trends Genet. 9, 407 (1993).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Andrews, S. FastQC. A Quality Control Tool for High Throughput Sequence Data. (Babraham Bioinformatics, 2010).
Smit, A., Hubley, R. & Grenn, P. RepeatMasker Open-4.0. RepeatMasker Open-4.0.7. (2015).
Palomeque, T., Muñoz-López, M., Carrillo, J. A. & Lorite, P. Characterization and evolutionary dynamics of a complex family of satellite DNA in the leaf beetle Chrysolina carnifex (Coleoptera, Chrysomelidae). Chromosome Res. 13, 795–807 (2005).
Montiel, E. E., Panzera, F., Palomeque, T., Lorite, P. & Pita, S. Satellitome analysis of Rhodnius prolixus, one of the main chagas disease vector species. Int. J. Mol. Sci. 22, 6052. https://doi.org/10.3390/ijms22116052 (2021).
Cabral-de-Mello, D. C. & Marec, F. Universal fluorescence in situ hybridization (FISH) protocol for mapping repetitive DNAs in insects and other arthropods. Mol. Genet. Genomics 296, 513–526 (2021).
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Maddison, D. R., Schulz, K. S. & Maddison, W. P. The tree of life web project. Zootaxa 1668, 19–40. https://doi.org/10.11646/zootaxa.1668.1.4 (2007).
Derks, M. F. L. et al. The genome of winter moth (Operophtera brumata) provides a genomic perspective on sexual dimorphism and phenology. Genome Biol. Evol. 7, 2321–2332 (2015).
Gregory, T. R. et al. Eukaryotic genome size databases. Nucleic Acids Res. 35, D332–D338 (2007).
Gage, L. P. The Bombyx mori genome: Analysis by DNA reassociation kinetics. Chromosoma 45, 27–42 (1974).
Zhan, S. & Reppert, S. M. MonarchBase: The monarch butterfly genome database. Nucleic Acids Res. 41, D758-763 (2013).
Kawamoto, M. et al. High-quality genome assembly of the silkworm, Bombyx mori. Insect Biochem. Mol. Biol. 107, 53–62 (2019).
Novák, P. et al. Repeat-sequence turnover shifts fundamentally in species with large genomes. Nat. Plants 6, 1325–1329 (2020).
Gasparotto, A. E. et al. A step forward in the genome characterization of the sugarcane borer, Diatraea saccharalis: Karyotype analysis, sex chromosome system and repetitive DNAs through a cytogenomic approach. Chromosoma https://doi.org/10.1007/s00412-022-00781-4 (2022).
Dechaud, C. et al. Sex and the TEs: Transposable elements in sexual development and function in animals. Mob. DNA 10, 42. https://doi.org/10.1186/s13100-019-0185-0 (2019).
Schemberger, M. O. et al. DNA transposon invasion and microsatellite accumulation guide W chromosome differentiation in a Neotropical fish genome. Chromosoma 128, 547–560. https://doi.org/10.1007/s00412-019-00721-9 (2019).
Srikulnath, K., Ahmad, S. F., Singchat, W. & Panthum, T. Do Ty3/Gypsy transposable elements play preferential roles in sex chromosome differentiation?. Life 12, 522. https://doi.org/10.3390/life12040522 (2022).
Mawaribuchi, S. et al. Sex chromosome differentiation and the W- and Z-specific loci in Xenopus laevis. Dev Biol. 426, 393–400. https://doi.org/10.1016/j.ydbio.2016.06.015 (2017).
Kudoh, T. et al. Molecular insights into the non-recombining nature of the spinach male-determining region. Mol. Genet. Genomics 293, 557–568 (2018).
Na, J. K., Wang, J. & Ming, R. Accumulation of interspersed and sex-specific repeats in the non-recombining region of papaya sex chromosomes. BMC Genomics 15, 335 (2014).
Conte, M. A. et al. Origin of a giant sex chromosome. Mol. Biol. Evol. 38, 1554–1569. https://doi.org/10.1093/molbev/msaa319 (2021).
Tennessen, J. A. et al. Repeated translocation of a gene cassette drives sex-chromosome turnover in strawberries. PLoS Biol 16, e2006062. https://doi.org/10.1371/journal.pbio.2006062 (2018).
Kabir, A. et al. Repeated translocation of a supergene underlying rapid sex chromosome turnover in Takifugu pufferfish. Proc. Natl. Acad. Sci. 119, e2121469119. https://doi.org/10.1073/pnas.2121469119 (2022).
Martin, A. et al. A transposon-induced epigenetic change leads to sex determination in melon. Nature 461, 1135–1138. https://doi.org/10.1038/nature08498 (2009).
Bachtrog, D. Sex chromosome evolution: Molecular aspects of Y-chromosome degeneration in Drosophila. Genome Res. 15, 1393–1401 (2005).
Bachtrog, D. Y-chromosome evolution: Emerging insights into processes of Y-chromosome degeneration. Nat. Rev. Genet. 14, 113–124 (2013).
Nokkala, S. Cytological characteristics of chromosome behaviour during female meiosis in Sphinx ligustri L. (Sphingidae, Lepidoptera). Hereditas 106, 69–179 (1987).
Traut, W. A study of recombination, formation of chiasmata and synaptonemal complexes in female and male meiosis of Ephestia kuehniella (Lepidoptera). Genetica 47, 135–142 (1977).
Marec, F. Synaptonemal complexes in insects. Int. J. Insect Morphol. Embryol. 25, 205–233 (1996).
Abe, H., Fujii, T., Shimada, T. & Mita, K. Novel non-autonomous transposable elements on W chromosome of the silkworm, Bombyx mori. J. Genet. 89, 375–387 (2010).
Macas, J., Neumann, P. & Požárková, D. Zaba: A novel miniature transposable element present in genomes of legume plants. Mol. Genet. Genomics 269, 624–631 (2003).
Acknowledgements
We wish to thank Petr Koutecký (University of South Bohemia, České Budějovice, Czech Republic) for his expertise in measuring the genome size of Peribatodes rhomboidaria by flow cytometry. We also thank Marie Korchová for technical assistance and Pavel Potocký for insect collection and determination.
Funding
This research was funded by GAJU grant No. 013/2020/P given to M.H., by grant 20-13784S of the Czech Science Foundation (CSF) given to F.M. and by ”Plan Operativo de Apoyo a la Investigación 2021–2022″ from the University of Jaén. M.D. was supported by CSF grant 20-20650Y. Computational resources were supplied by the project "e-Infrastruktura CZ" (e-INFRA CZ LM2018140) supported by the Ministry of Education, Youth and Sports of the Czech Republic, and by the ELIXIR-CZ project (LM2018131), part of the international ELIXIR infrastructure.
Author information
Authors and Affiliations
Contributions
M.H. conceived the study, designed experiments, performed research and wrote the first draft. M.D. analyzed the bioinformatic data. E.M. and M.Z. provided supervision and consulting. F.M. and P.L. contributed expertise in cytogenetics and provided partial funding. All authors contributed to the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hejníčková, M., Dalíková, M., Zrzavá, M. et al. Accumulation of retrotransposons contributes to W chromosome differentiation in the willow beauty Peribatodes rhomboidaria (Lepidoptera: Geometridae). Sci Rep 13, 534 (2023). https://doi.org/10.1038/s41598-023-27757-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-27757-3
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.