Transposons: The Jumping Genes

By: Leslie A. Pray, Ph.D. © 2008 Nature Education
Citation: Pray, L. (2008) Transposons: The jumping genes. Nature Education 1(1)

Transposable elements, or "jumping genes", were first identified by Barbara McClintock more than 50 years ago. Why are transposons so common in eukaryotes, and exactly what do they do?

 

Transposable elements (TEs), also known as "jumping genes," are DNA sequences that move from one location on the genome to another. These elements were first identified more than 50 years ago by geneticist Barbara McClintock of Cold Spring Harbor Laboratory in New York. Biologists were initially skeptical of McClintock's discovery. Over the next several decades, however, it became apparent that not only do TEs "jump," but they are also found in almost all organisms (both prokaryotes and eukaryotes) and typically in large numbers. For example, TEs make up approximately 50% of the human genome and up to 90% of the maize genome (SanMiguel, 1996).

Types of Transposons

Today, scientists know that there are many different types of TEs, as well as a number of ways to categorize them. One of the more common divisions is between those TEs that require reverse transcription (i.e., the transcription of RNA into DNA) in order to transpose and those that do not. The former elements are known as retrotransposons or class 1 TEs, whereas the latter are known as DNA transposons or class 2 TEs. The Ac/Ds system that McClintock discovered falls in the latter category. Different classes of transposable elements are found in the genomes of different eukaryotic organisms (Figure 1).

DNA Transposons

All complete or "autonomous" class 2 TEs encode the protein transposase, which they require for insertion and excision (Figure 2). Some of these TEs also encode other proteins. Note that DNA transposons never use RNA intermediaries—they always move on their own, inserting or excising themselves from the genome by means of a so-called "cut and paste" mechanism.

Class 2 TEs are characterized by the presence of terminal inverted repeats, about 9 to 40 base pairs long, on both of their ends (Figure 3). As the name suggests and as Figure 3 shows, terminal inverted repeats are inverted complements of each other; for instance, the complement of ACGCTA (the inverted repeat on the right side of the TE in the figure) is TGCGAT (which is the reverse order of the terminal inverted repeat on the left side of the TE in the figure). One of the roles of terminal inverted repeats is to be recognized by transposase.

In addition, all TEs in both class 1 and class 2 contain flanking direct repeats (Figure 3). Flanking direct repeats are not actually part of the transposable element; rather, they play a role in insertion of the TE. Moreover, after a TE is excised, these repeats are left behind as "footprints." Sometimes, these footprints alter gene expression (i.e., expression of the gene in which they have been left behind) even after their related TE has moved to another location on the genome.

Less than 2% of the human genome is composed of class 2 TEs. This means that the majority of the substantial portion of the human genome that is mobile consists of the other major class of TEs—the retrotransposons (Kazazian & Moran, 1998).

Retrotransposons

Unlike class 2 elements, class 1 elements—also known as retrotransposons—move through the action of RNA intermediaries. In other words, class 1 TEs do not encode transposase; rather, they produce RNA transcripts and then rely upon reverse transcriptase enzymes to reverse transcribe the RNA sequences back into DNA, which is then inserted into the target site.

There are two major types of class 1 TEs: LTR retrotransposons, which are characterized by the presence of long terminal repeats (LTRs) on both ends; and non-LTR TEs, which lack the repeats. Both the LINE1, or L1, and Alu genes represent families of non-LTR TEs. L1 elements average about 6 kilobases in length. In contrast, Alu elements average only a few hundred nucleotides, thus making them a short interspersed transposable element, or SINE. Alu is particularly prolific, having originated in primates and expanding in a relatively short time to about 1 million copies per cell in humans. L1 is also common in humans; although not present in as many copies as Alu, its larger size means that this element makes up an estimated 15%–17% of the human genome (Kazazian & Moran, 1998; Slotkin & Martienssen, 2007). In humans, these non-LTR TEs are the only active class of transposons; LTR retrotransposons and DNA transposons are only ancient genomic relics and are not capable of jumping.

Autonomous and Nonautonomous Transposons

Both class 1 and class 2 TEs can be either autonomous or nonautonomous. Autonomous TEs can move on their own, while nonautonomous elements require the presence of other TEs in order to move. This is because nonautonomous elements lack the gene for the transposase or reverse transcriptase that is needed for their transposition, so they must "borrow" these proteins from another element in order to move. Ac elements, for example, are autonomous because they can move on their own, whereas Ds elements are nonautonomous because they require the presence of Ac in order to transpose.

What Jumping Genes Do (Besides Jump)

The fact that roughly half of the human genome is made up of TEs, with a significant portion of them being L1 and Alu retrotransposons, raises an important question: What do all these jumping genes do, besides jump? Much of what a transposon does depends on where it lands. Landing inside a gene can result in a mutation, as was discovered when insertions of L1 into the factor VIII gene caused hemophilia (Kazazian et al., 1988). Similarly, a few years later, researchers found L1 in the APC genes in colon cancer cells but not in the APC genes in healthy cells in the same individuals. This confirms that L1 transposes in somatic cells in mammals, and that this element might play a causal role in disease development (Miki et al., 1992).

Silencing and Transposons

As opposed to L1, most TEs appear to be silent—in other words, these elements do not produce a phenotypic effect, nor do they actively move around the genome. At least that has been the general scientific consensus. Some silenced TEs are inactive because they have mutations that affect their ability to move from one chromosomal location to another; others are perfectly intact and capable of moving but are kept inactive by epigenetic defense mechanisms such as DNA methylation, chromatin remodeling, and miRNAs. In chromatin remodeling, for example, chemical modifications to the chromatin proteins cause chromatin to become so constricted in certain areas of the genome that the genes and TEs in those areas are silenced because transcription enzymes simply cannot access them.

Another example of transposon silencing involves plants in the genus Arabidopsis. Researchers who study these plants have found they contain more than 20 different mutator transposon sequences (a type of transposon identified in maize). In wild-type plants, these sequences are methylated, or silenced. However, in plants that are defective for one of the enzymes responsible for methylation, these transposons are transcribed. Moreover, several different mutant phenotypes have been explored in the methylation-deficient plants, and these phenotypes have been linked to transposon insertions (Miura et al., 2001).

Based on studies such as these, scientists know that some TEs are epigenetically silenced; in recent years, however, researchers have begun to wonder whether certain TEs might themselves have a role in epigenetic silencing. Interestingly, it was Barbara McClintock who first speculated that TEs might play this kind of regulatory role (McClintock, 1951). It has taken decades for scientists to collect enough evidence to consider that maybe McClintock's speculation had a kernel of truth to it.

Transposons Can Encode siRNAs That Mediate Their Own Silencing

Because transposon movement can be destructive, it is not surprising that most of the transposon sequences in the human genome are silent, thus allowing this genome to remain relatively stable, despite the prevalence of TEs. In fact, investigators think that of the 17% of the human genome that is encoded by L1-related sequences, only about 100 active L1 elements remain. Moreover, research suggests that even these few remaining active transposons are inhibited from jumping in a variety of ways that go beyond epigenetic silencing.

For instance, in human cells, small interfering RNAs (siRNAs), also known as RNAi, can prevent transposition. RNAi is a naturally occurring mechanism that eukaryotes often use to regulate gene expression. What is especially interesting about this situation is that the siRNAs that interfere with L1 activity are derived from the 5′ untranslated region (5′ UTR) of L1 LTRs. Specifically, the 5′ UTR of the L1 promoter encodes a sense promoter that transcribes the L1 genes, as well as an antisense promoter that transcribes an antisense RNA. Yang and Kazazian (2006) demonstrated that this results in homologous sequences that can hybridize, thereby forming a double-stranded RNA molecule that can serve as a substrate for RNAi. Furthermore, when the investigators inhibited endogenous siRNA silencing mechanisms, they saw an increase in L1 transcripts, suggesting that transcription from L1 is indeed inhibited by siRNA.

Transposons Are Not Always Destructive

Not all transposon jumping results in deleterious effects. In fact, transposons can drive the evolution of genomes by facilitating the translocation of genomic sequences, the shuffling of exons, and the repair of double-stranded breaks. Insertions and transposition can also alter gene regulatory regions and phenotypes. In the case of medaka fish, for instance, the Tol2 DNA transposon is directly linked to pigmentation. One highly inbred line of these fish was shown to have a variety of pigmentation patterns. In the members of this line in which the Tol2 transposon hopped out "cleanly" (i.e., without removing other parts of the genomic sequence), the fish were albino. But when Tol2 did not cleanly hop from the regulatory region, the result was a wide range of heritable pigmentation patterns (Figure 4; Koga et al., 2006).

The fact that transposable elements do not always excise perfectly and can take genomic sequences along for the ride has also resulted in a phenomenon scientists call exon shuffling. Exon shuffling results in the juxtaposition of two previously unrelated exons, usually by transposition, thereby potentially creating novel gene products (Moran et al., 1999).

The ability of transposons to increase genetic diversity, together with the ability of the genome to inhibit most TE activity, results in a balance that makes transposable elements an important part of evolution and gene regulation in all organisms that carry these sequences.

References and Recommended Reading


Bailey, J. A., et al. Molecular evidence for a relationship between LINE-1 elements and X chromosome inactivation: The Lyon repeat hypothesis. Proceedings of the National Academy of Sciences 97, 6634–6639 (2000)

Feschotte, C., et al. Plant transposable elements: Where genetics meets genomics. Nature Reviews Genetics 3, 329–341 (2002) (link to article)

Kazazian, H. H. Mobile elements: Drivers of genome evolution. Science 303, 1626–1632 (2004) doi:10.1126/science.1089670

Kazazian, H. H., & Moran, J. V. The impact of L1 retrotransposons on the human genome. Nature Genetics 19, 19–24 (1998) (link to article)

Kazazian, H. H., et al. Haemophilia A resulting from de novo insertion of L1 sequences represents a novel mechanism for mutation in man. Nature 332, 164–166 (1988) (link to article)

Koga, A., et al. Vertebrate DNA transposon as a natural mutator: The medaka fish Tol2 element contributes to genetic variation without recognizable traces. Molecular Biology and Evolution 23, 1414–1419 (2006) doi:10.1093/molbev/msl003

McLean, P. McClintock and the Ac/Ds transposable elements of corn, www.ndsu.nodak.edu/instruct/mcclean/plsc431/transelem/trans1.htm (1997)

McClintock, B. Mutable loci in maize. Carnegie Institution of Washington Yearbook 50, 174–181 (1951) (link to article)

Miki, Y., et al. Disruption of the APC gene by a retrotransposal insertion of L1 sequence in colon cancer. Cancer Research 52, 643–645 (1992)

Miura, A., et al. Mobilization of transposons by a mutation abolishing full DNA methylation in Arabidopsis. Nature 411, 212–214 (2001) (link to article)

Moran, J. V., et al. Exon shuffling by L1 retrotransposition. Science 283, 1530–1534 (1999)

SanMiguel, P., et al. Nested retrotransposons in the intergenic regions of the maize genome. Science 274, 765–768 (1996)

Slotkin, R. K., & Martienssen, R. Transposable elements and the epigenetic regulation of the genome. Nature Reviews Genetics 8, 272–285 (2007) (link to article)

Yang, N., & Kazazian, H. H. L1 retrotransposition is suppressed by endogenously encoded small interfering RNAs in human cultured cells. Nature Structural and Molecular Biology 13, 763–771 (2006) (link to article)


Flag Inappropriate

This content is currently under construction.

This reading is linked to the following Scitable pages:

The sequencing of the platypus genome has received a high amount of misleading press attention. What does this information really tell us about this strangely unique animal and its genetic past?
McClintock’s maize breeding experiments provided the first detailed descriptions of transposable elements. What exactly are these “jumping genes”, and why are they so important?
The formation of new genes is a primary driving force of evolution in all organisms. How exactly do these new genes crop up in an organism’s genome and what must occur in order for them to be passed on?
All Articles Within Nucleic Acid Structure and Function (36)

DNA Replication (6)

  • DNA Replication and Causes of Mutation
    Cells employ an arsenal of editing mechanisms to correct mistakes made during DNA replication. How do they work, and what happens when these systems fail?
  • Major Molecular Events of DNA Replication
    Arthur Kornberg compared DNA to a tape recording of instructions that can be copied over and over. How do cells make these near-perfect copies, and does the process ever vary?
  • Semi-Conservative DNA Replication: Meselson and Stahl
    Watson and Crick's discovery of DNA structure in 1953 revealed a possible mechanism for DNA replication. So why didn't Meselson and Stahl finally explain this mechanism until 1958?
  • Genetic Mutation
    A single base change can create a devastating genetic disorder or a beneficial adaptation, or it might have no effect. How do mutations happen, and how do they influence the future of a species?
  • DNA Damage & Repair: Mechanisms for Maintaining DNA Integrity
    DNA integrity is always under attack from environmental agents like skin cancer-causing UV rays. How do DNA repair mechanisms detect and repair damaged DNA, and what happens when they fail?
  • Genetic Mutation
    Is it possible to have “too many” mutations? What about “too few”? While mutations are necessary for evolution, they can damage existing adaptations as well.

Transcription & Translation (4)

  • Translation: DNA to mRNA to Protein
    How does the cell convert DNA into working proteins? The process of translation can be seen as the decoding of instructions for making proteins, involving mRNA in transcription as well as tRNA.
  • DNA Transcription
    If DNA is a book, then how is it read? Learn more about the DNA transcription process, where DNA is converted to RNA, a more portable set of instructions for the cell.
  • RNA Transcription by RNA Polymerase: Prokaryotes vs Eukaryotes
    Gene expression is linked to RNA transcription, which cannot happen without RNA polymerase. However, this is where the similarities between prokaryote and eukaryote expression end.
  • What is a Gene? Colinearity and Transcription Units
    In 1958, Francis Crick’s sequence hypothesis finally provided an answer to the question: what is a gene? Why is this definition now considered overly simplistic?

Discovery of Genetic Material (4)

RNA (8)

  • RNA Functions
    The central dogma of molecular biology suggests that the primary role of RNA is to convert the information stored in DNA into proteins. In reality, there is much more to the RNA story.
  • RNA Transcription by RNA Polymerase: Prokaryotes vs Eukaryotes
    Gene expression is linked to RNA transcription, which cannot happen without RNA polymerase. However, this is where the similarities between prokaryote and eukaryote expression end.
  • Chemical Structure of RNA
    The more researchers examine RNA, the more surprises they continue to uncover. What have we learned about RNA structure and function so far?
  • RNA Splicing: Introns, Exons and Spliceosome
    What's the difference between mRNA and pre-mRNA? It's all about splicing of introns. See how one RNA sequence can exist in nearly 40,000 different forms.
  • What is a Gene? Colinearity and Transcription Units
    In 1958, Francis Crick’s sequence hypothesis finally provided an answer to the question: what is a gene? Why is this definition now considered overly simplistic?
  • Restriction Enzymes
    Restriction enzymes are one of the most important tools in the recombinant DNA technology toolbox. But how were these enzymes discovered? And what makes them so useful?
  • Genome Packaging in Prokaryotes: the Circular Chromosome of E. coli
    How do bacteria, lacking a nucleus, organize and pack their genome into the cell? Supercoiling enables this but forces a different kind of transcription and translation in prokaryotes.
  • Eukaryotic Genome Complexity
    How many genes are there? This question is surprisingly not very important, and has nothing to do with the organism’s complexity. There is more to genomes than protein-coding genes alone.

Gene Copies (5)

  • Copy Number Variation and Genetic Disease
    Did you know that a large number of your genes exist in variable numbers of copies? While they can overlap with disease-related genes, these variants exist in healthy individuals too.
  • DNA Deletion and Duplication and the Associated Genetic Disorders
    Deletions and duplications of single-base pairs typically arise during homologous recombination and cause diseases. But what happens when a mutation occurs over multiple genes?
  • Tandem Repeats and Morphological Variation
    All mammals have basically the same set of genes, yet there are obviously some significant differences that distinguish the various species. Recent research suggests that one such difference involves tandem repeats, or short lengths of DNA that are repeated multiple times within a gene. But what, if anything, does having a different number of tandem repeats do to an organism?
  • Copy Number Variation
    Copy number variations (CNVs) have been linked to dozens of human diseases, but can they also represent the genetic variation that was so essential to our evolution?
  • Copy Number Variation and Human Disease
    Analysis of individual human genomes has revealed an unexpected amount of variability in human populations. Copy number variation (CNV) has recently been identified as a major cause of structural variation in the genome, involving both duplications and deletions of sequences that typically range in length from 1,000 base pairs to 5 megabases, the cytogenetic level of resolution. Evidence is accumulating that CNVs play important roles in human disease.

Jumping Genes (4)

Applications in Biotechnology (4)

 
Ask an Expert
Post Question



Nature Education Home Learn More About Faculty Page Students Page Feedback



Genetics

Event Reminder