Journal home
Advance online publication
Current issue
Archive
Press releases
Free Association (blog)
Supplements
Focuses
Guide to authors
Online submissionOnline submission
For referees
Free online issue
Contact the journal
Subscribe
Advertising
work@npg
Reprints and permissions
About this site
For librarians
 
NPG Resources
Nature
Nature Biotechnology
Nature Cell Biology
Nature Medicine
Nature Methods
Nature Reviews Cancer
Nature Reviews Genetics
Nature Reviews Molecular Cell Biology
news@nature.com
Nature Conferences
NPG Subject areas
Biotechnology
Cancer
Chemistry
Clinical Medicine
Dentistry
Development
Drug Discovery
Earth Sciences
Evolution & Ecology
Genetics
Immunology
Materials Science
Medical Research
Microbiology
Molecular Cell Biology
Neuroscience
Pharmacology
Physics
Browse all publications
Brief Communication
Nature Genetics 33, 123 - 124 (2003)
Published online: 13 January 2003; | doi:10.1038/ng1074

Retroelements containing introns in diverse invertebrate taxa

Irina R. Arkhipova1, 4, Konstantin I. Pyatkov2, 4, Matthew Meselson1 & Michael B. Evgen'ev2, 3

1 Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts 02138, USA.

2 Institute of Cell Biophysics, Pushino, Russia.

3 Institute of Molecular Biology, Moscow, Russia.

4 These two authors contributed equally to this work.

Correspondence should be addressed to Matthew Meselson msm@wjh.harvard.edu

We report that two structurally similar transposable elements containing reverse transcriptase (RT), Penelope in Drosophila virilis and Athena in bdelloid rotifers, have proliferated as copies containing introns. The ability of Penelope-like elements (PLEs) to retain introns, their separate phylogenetic placement and their peculiar structural features make them a novel class of eukaryotic retroelements.

Both known classes of eukaryotic retroelements (LTR and non-LTR retrotransposons) and retropseudogenes are generally devoid of introns because they pass through an RNA intermediate that would be subject to splicing (spliced subgenomic RNAs coding for gag- or env-like proteins do not serve as transposition intermediates). The involvement of an RNA intermediate in transposition is usually indicated by the absence of introns in transposed copies of elements into which they are artificially introduced1, 2, 3.

Unexpectedly, the unusual transposon Penelope contains RT and is a causative agent of hybrid dysgenesis in Drosophila virilis4. Its structure and the amino-acid sequence of its conserved RT domains differ greatly from those of the two known classes of eukaryotic retrotransposons, yet it shares structural features with both classes. As in non-LTR retrotransposons without LTRs, Penelope has frequent 5' truncations and its target-site duplications, when present, are of variable length. Also, as in LTR retrotransposons, Penelope may have LTR-like structures; when present, however, these may be in either direct or inverted orientation. The Penelope RT bears a remote resemblance to the RT of retrons and bacterial or organellar group II introns. Its single open reading frame contains a C-terminal Uri (or GIY-YIG) domain with homology to homing endonucleases of group I introns and to UvrC bacterial DNA-repair endonucleases, which has not previously been encountered in retroelements5.

Comparison of Penelope genomic and cDNA sequences showed that the 5' untranslated region contains a functional intron of 75 bp bounded by GT/AG donor/acceptor splice sites (Fig. 1a and Supplementary Fig. A online). To distinguish between splicing and the existence of a transcribed intronless copy, we examined D. virilis genomic DNA by PCR (see Supplementary Note A online), but detected no signals corresponding to intronless copies (see Supplementary Fig. A online).

Figure 1. Introns in PLEs from diverse species.
Figure 1 thumbnail

a, DNA sequence of the 5' untranslated region from the active p6 copy of D. virilis Penelope4. The initiator sequence immediately following the RNA start site (first nucleotide) and the downstream +30 promoter element of D. virilis14 are overlined; poly(A) signals corresponding to those in the 3' LTR-like structure4 are boxed; the ATG codon is italicized; intron sequences are in lowercase; splice donor (SD) and acceptor (SA) sites are indicated; matches to the branch-point consensus YNYYRAY15 are indicated by a single underline; potential pyrimidine tracts by a double underline. b, Amino-acid sequence alignment of the RT domains B–E (or 4–7; square brackets) in PLEs. Shown in red is Penelope from D. virilis (Dv; ref. 4), the only PLE known to be functional; in blue, defective PLE copies identified in sequence databases5, 6, 7 or assembled from Schistosoma mansoni gss (CercyonSm); in black, sequences deduced from Athena genomic (arbitrarily numbered 1–4) and cDNA-derived (C1–C2) clones from the bdelloids A. vaga (Av), H. constricta (Hc) and P. roseola (Pr). The position of the phase 2 intron within the arginine codon in all bdelloid PLEs is indicated by a triangle. Asterisks at the top denote amino-acid residues conserved across all species located within domains B through E (80% overall consensus); colons denote additional residues conserved in bdelloids (80% bdelloid consensus). Asterisks in the sequences indicate in-frame stop codons and pound signs indicate frame shifts. c, Nucleotide sequences showing heterogeneity of introns in Athena elements with functional motifs underlined as in a.



Full FigureFull Figure and legend (120K)
Recently, it has become apparent that Penelope is a member of a widely distributed group of similar elements with hosts as diverse as flatworms, roundworms, crustaceans, echinoderms, fish and amphibians5, 6, 7. We found PLEs in rotifers of the class Bdelloidea, an anciently asexual taxon in which PCR screens had not detected representatives of either of the two known classes of retroelements, although these were readily detected in all other animal phyla tested8, 9. We identified amplification products corresponding to PLEs in three distantly related bdelloid species, Adineta vaga, Habrotrocha constricta and Philodina roseola (Fig. 1b and Supplementary Fig. A online). Their RT coding sequences could not be aligned to those of other PLEs without removing a variable-length (53–70 bp) region containing in-frame stop codons and frameshifts and surrounded by GT/AG donor/acceptor splice sites. This indicated the presence of a functional intron, which was confirmed by the finding of spliced RNA molecules by RT–PCR (Fig. 1c and Supplementary Fig. A online). As in D. virilis, PCR with primers flanking the intron region showed no intronless copies in the total genomic DNA of A. vaga and H. constricta.

The bdelloid RT sequences, especially the introns, showed a high degree of diversity (Fig. 1b,c). But the exon sequences immediately adjacent to the donor and acceptor splice sites were conserved and, in most introns, a pyrimidine tract could be distinguished between the putative branch point and the 3' splice site (Fig. 1c). Southern-blot analysis and genomic library screens with diverged RT fragments as probes (data not shown) indicated that each RT sequence was present at very low copy number (1–2 copies). Thus, bdelloid PLEs either have been inactive since their expansion in copy-number tens of millions of years ago or have been active but kept at low copy-number. Both possibilities are consistent with the expectation that active deleterious vertically transmitted transposons will not persist in the genomes of ancient asexual organisms9.

Phylogenetic analysis of the seven conserved domains of numerous RT coding sequences (Fig. 2 and Supplementary Fig. B online) indicated that the bdelloid PLEs, which we named Athena, form a clade distinct from the PLEs of other organisms. Moreover, the entire PLE clade is clearly different from LTR and non-LTR retrotransposons and constitutes a third, phylogenetically distinct, group of eukaryotic retroelements. Notably, the high clade credibility value for the branch containing PLEs and telomerase reverse transcriptases indicates that the RT of telomerases (the only known non-selfish RT with introns) may be a sister clade to the PLEs, adding a new twist to discussions regarding the origin of telomerases10, 11.

Figure 2. Phylogenetic relationships of RTs based on the seven most conserved domains.
Figure 2 thumbnail

Shown in the phylogram are 50 RTs representing all major prokaryotic and eukaryotic groups: bacterial retrons, group II introns, non-LTR retrotransposons, LTR retrotransposons (including retroviruses and pararetroviruses), hepadnaviruses, telomerase reverse transcriptases (TERTs) and PLEs. Alignments are shown in Supplementary Fig. B online. The tree was rooted with bacterial retrons, which have the simplest RT-domain structure; including the group II introns in the outgroup did not change the tree topology. Numbers on the branches correspond to the clade credibility values, that is, the percentage of trees containing each bipartition. Bar, 0.1 amino-acid substitutions per site.



Full FigureFull Figure and legend (54K)
The differing placement of introns in D. virilis and bdelloid PLEs and the apparent absence of intron-like interruptions in the open reading frame in PLEs from other organisms indicate that introns are not a necessary feature of PLEs. The presence of an intron at the same site in all genomic copies of Athena suggests that it was already present before the element proliferated, possibly before the bdelloid radiation. As functional RT cannot be translated from the unspliced Athena transcript, proliferation seemingly occurred either by an RNA-independent pathway or by reverse transcription of an unspliced transcript. In the latter case, the existence of an intron in the RT coding region of Athena would rule out the cis-preference in transposition that is characteristic of non-LTR retrotransposons, whereby the RT preferentially acts on the mRNA template that produced it12. If RNA is not an intermediate, one may speculate that the Uri endonuclease domain found in all PLEs enables them to use a DNA-dependent transposition pathway similar to that used by group I and occasionally group II introns13. Regardless of whether transposition occurs by an RNA-dependent or RNA-independent pathway, it is clear that the ability to retain introns after transposition and their distinct placement in the phylogeny of elements containing RT, as well as the variable structure of their termini4, 5, 6, 7 and notable parallels with both group I and group II introns, support assignment of PLEs to a third class of eukaryotic retroelements, distinct from LTR and non-LTR retrotransposons.

GenBank accession numbers.
Penelope cDNA, AF418571; Athena elements, AY179351 through AY179364; Cercyon, TPA BK000685.

Note: Supplementary information is available on the Nature Genetics website.

Received 16 September 2002; Accepted 19 November 2002; Published online: 13 January 2003.

 Top
REFERENCES
  1. Boeke, J. , Garfinkel, D. , Styles, C. & Fink, G. Cell 40, 491–500 (1985). | Article | PubMed | ISI | ChemPort |
  2. Heidmann, O. & Heidmann, T. Cell 64, 159–170 (1991). | Article | PubMed | ISI | ChemPort |
  3. Moran, J. et al. Cell 87, 917–927 (1996). | Article | PubMed | ISI | ChemPort |
  4. Evgen'ev, M. et al. Proc. Natl. Acad. Sci. USA 94, 196–201 (1997). | Article | PubMed | ChemPort |
  5. Lyozin, G. et al. J. Mol. Evol. 52, 445–456 (2001). | PubMed | ISI | ChemPort |
  6. Volff, J. , Hornung, U. & Schartl, M. Mol. Genet. Genomics 265, 711–720 (2001). | Article | PubMed | ISI | ChemPort |
  7. Dalle Nogare, D. , Clark, M. , Elgar, G. , Frame, I. & Poulter, R. Mol. Biol. Evol. 19, 247–255 (2002). | PubMed | ChemPort |
  8. Mark Welch, D. & Meselson, M. Science 288, 1211–1215 (2000). | Article | PubMed |
  9. Arkhipova, I. & Meselson, M. Proc. Natl. Acad. Sci. USA 97, 14473–14477 (2000). | Article | PubMed | ChemPort |
  10. Nakamura, T. & Cech, T. Cell 92, 587–590 (1998). | Article | PubMed | ISI | ChemPort |
  11. Eickbush, T. Science 277, 911–912 (1997). | Article | PubMed | ISI | ChemPort |
  12. Wei, W. et al. Mol. Cell. Biol. 21, 1429–1439 (2001). | Article | PubMed | ISI | ChemPort |
  13. Belfort, M. , Derbyshire, V. , Parker, M. , Cousineau, B. & Lambowitz, A. in Mobile DNA II (eds. N. Craig, R. Craigie, M. Gellert & A. Lambowitz) (ASM Press, Washington, DC, 2002).
  14. Arkhipova, I. Genetics 139, 1359–1369 (1995). | PubMed | ISI | ChemPort |
  15. Moore, M. , Query, C. , & Sharp, P. in The RNA World (eds. R. Gesteland & J. Atkins) (Cold Spring Harbor Laboratory Press, New York, 1993).
 Top
Acknowledgments
We thank M. Cummings and D. Mark Welch for comments. This work was supported by the National Science Foundation (M.M.), the Wellcome Trust and the Russian Foundation for Basic Research (M.B.E.).

Competing interests statement:  The authors declare that they have no competing financial interests.

FULL TEXT
Previous | Next
Table of contents
Download PDFDownload PDF
Send to a friendSend to a friend
rights and permissionsRights and permissions
Order commercial reprintsOrder commercial reprints
CrossRef lists 20 articles citing this articleCrossRef lists 20 articles citing this article
Save this linkSave this link
More articles like this
Figures & Tables
Acknowledgments
References
Supplementary info
Export citation
Export references

naturejobs

natureproducts

Search buyers guide:

 
Nature Genetics
ISSN: 1061-4036
EISSN: 1546-1718
Journal home | Advance online publication | Current issue | Archive | Press releases | Supplements | Focuses | For authors | Online submission | Permissions | For referees | Free online issue | About the journal | Contact the journal | Subscribe | Advertising | work@npg | naturereprints | About this site | For librarians
Nature Publishing Group, publisher of Nature, and other science journals and reference works©2003 Nature Publishing Group | Privacy policy