Main

We mapped 86 human pathogenic mutations in the 22 tRNAs encoded by mitochondrial genes, many of which are known to cause serious disease early in life3,4, on the corresponding alignments of orthologous tRNAs from 106 mammalian species. Often a site in the nonhuman tRNA was occupied by a nucleotide that represents a pathogenic mutation at the homologous human site, a pattern called a compensated pathogenic deviation2 (CPD; Fig. 1). An organism carrying a CPD must also have some other, compensatory deviation from humans that suppresses the deleterious effect of the CPD2,5. CPDs are common in proteins encoded by nuclear genes, but the molecular mechanism of compensation has been determined for only a small fraction of them2,5. Here, we systematically analyzed the mechanisms of intramolecular compensation in the evolution of mammalian mitochondrial tRNAs.

Figure 1: Multiple alignments of tRNAs with secondary structure annotation.
figure 1

(a) Genes encoding primate tRNAGly. (b) Selected nonprimate mammalian gene encoding tRNALeu (UUR). Sites identical to the human state are represented by dots; differences are represented by letters. Substitutions that correspond to a human pathogenic mutation (CPD) are capitalized and shown in red; the compensatory substitutions are shown in blue. CPDs with identified compensatory substitutions are in bold. Sequence corresponding to stem structures is in brackets.

We identified 52 CPDs that occurred 1,207 times in 105 mammalian species, including 90 occurrences of 27 CPDs in 12 primate species (Table 1; alignments in Supplementary Sequence Archive online). These CPDs probably do not represent sequencing errors, because different species often carry the same CPD (Fig. 1 and Supplementary Sequence Archive online) and no CPDs were found at polymorphic sites in species with multiple completely sequenced mitochondria (data not shown). The CPDs are also probably not a result of errors in the Mitomap data set. First, pathogenic mutations are found in tRNA sites that are much more conservative than sites that are known to be polymorphic in humans6 (Supplementary Table 1 online). Second, many pathogenic mutations are well characterized, so that errors in Mitomap must be rare3,4, whereas 60% of the mutations in the database were found as a CPD. Last, a subset of the best characterized pathogenic mutations, which are labeled as being confirmed in Mitomap, showed a similar proportion of CPDs (data not shown).

Table 1 Occurrences of CPDs across different mammalian species for different mitochondrial tRNAs

The secondary structure of most mitochondrial tRNAs is that of a cloverleaf with four stems and three loops7. Most CPDs (34) were found in stem structures, and 32 of them disrupted a Watson-Crick pair. Therefore, we assumed that substitutions compensating for the deleterious effect of pathogenic mutations usually reverse the loss of Gibbs free energy in stem structures8. Studies of compensation of substitutions that disrupt Watson-Crick pairs emphasized second-site substitutions that restore the disrupted Watson-Crick interaction9,10,11,12,13,14. This mechanism of compensation is common (Table 2). For example, in the anticodon stem of human tRNAAsn, the mutation C27U (G5703A in mitochondrial genome notation) destroys a GC pair. In chimpanzees (Pan troglodytes), this Watson-Crick interaction is restored by a G43A substitution that creates an AU pair (Fig. 2a). Species that are more distant from humans may have multiple CPDs in the same tRNA (Fig. 1); for example, the Malayan flying lemur (Cynocephalus variegatus) has evolved compensations of three pathogenic mutations in tRNALys, each through the direct restoration of the disrupted Watson-Crick interaction (Fig. 2b).

Table 2 The molecular basis of compensatory evolution in mitochondrial tRNAs
Figure 2: Secondary structures of selected nonhuman tRNAs.
figure 2

(a) P. troglodytes (chimpanzee) tRNAAsn. (b) C. variegatus (Malayan flying lemur) tRNALys. (c) C. simu (white rhinocerus) tRNATrp. (d) U. maritimus (polar bear) tRNASer (UCN). (e) H. ampullatus (northern bottlenose whale) tRNALeu (UUR). (f) T. aculeatus (Australian echidna) tRNALeu (UUR). (g) S. ehrenbergi (Ehrenberg's mole rat) tRNAIle. (h) T. tetradactyla (southern tamandua) tRNAIle. CPDs (nucleotides corresponding to human pathogenic mutations) are shown in red; predicted compensatory substitutions are shown in blue; and other deviations from the human ortholog, those unrelated to the pathogenic mutations or their compensations, are shown in green. Nucleotides found in healthy humans are shown in orange alongside the nonhuman sequence.

We also noted three indirect mechanisms of compensation of CPDs that disrupt Watson-Crick pairs, which involve substitutions at sites that do not pair directly with that of the CPD. In the white rhinoceros (Ceratotherium simum), the pathogenic human mutation G30A (G5540A) that disrupts a GC pair in the anticodon stem of tRNATrp is compensated by a substitution that creates an extra Watson-Crick pair in the same stem (Fig. 2c), thus reversing the loss of free energy8. Compensation can also occur through strengthening of an existing Watson-Crick pair, which requires two substitutions. In the polar bear (Ursus maritimus), replacement of an AU pair with a stronger GC pair abolishes the deleterious effect of the mutation A5G that converts a Watson-Crick pair into a weak GU pair in the acceptor stem of tRNASer (Fig. 2d). The same GC pair is also involved in compensation of the A3G pathogenic mutation, indicating that compensatory evolution may involve complex, epistatic interactions between multiple sites. Finally, multiple changes can compensate for one pathogenic mutation. The loss of a GC pair in the D stem of tRNALeu (UUR) is compensated by the creation of two new, weaker pairs on the other end of the stem in the northern bottlenose whale (Hyperoodon ampullatus; Fig. 2e). The Australian echidna (Tachyglossus aculeatus) compensates for the same mutation directly, by replacing the disrupted GC pair with a weaker AU pair, and the evolution of another AU pair in a different part of the D stem may also aid the compensation (Fig. 2f). Different species may compensate for the same pathogenic mutation by different mechanisms. For example, Ehrenberg's mole rat (Spalax ehrenbergi) has evolved a new GC pair in the D stem of tRNAIle that compensates for the loss of a neighboring AU pair (Fig. 2g). The southern tamandua (Tamandua tetradactyla), in contrast, compensates for the same CPD by direct restoration of Watson-Crick interaction (Fig. 2h).

Another mechanism of compensation involves the interaction of stem and loop structures. For example, the loss of an AU pair in the D stem of rabbit (Oryctolagus cuniculus) tRNACys is compensated for by elongation of the D loop and elimination of the hairpin loop found in human tRNACys (Fig. 3a). Similarly, compensation for a CPD in a loop may involve compensations in the corresponding stem structure. We observed 18 CPDs in loops (Fig. 2e–h), but a comprehensive analysis of the tertiary interactions involved must be deferred until crystal structures of the tRNAs become available15. In one instance we identified the substitutions involved in compensating for the pathogenic mutation A3243G in the D loop of tRNALeu (UUR). tRNAs with an A3243G mutation are prone to forming a six-nucleotide dimer between two tRNAs, which requires a partial denaturation of the D stem15,16, suggesting that strengthening the D stem may prevent the pathogenic effect of the A3243G mutation15. The A3243G mutation is fixed in dogs (Canis familiaris), and they probably retain a normal phenotype because of strengthening of the D stem through the creation of two new Watson-Crick bonds (Fig. 3b).

Figure 3: Secondary structures with interactions between loops and stems.
figure 3

(a) O. cuniculus (rabbit) tRNACys. (b) C. familiaris (dog) tRNALeu (UUR). Known CPDs are shown in red; compensatory substitutions are shown in blue; and nucleotide states found in human are shown in orange. The pathogenic mutation in the tRNACys D stem is compensated in rabbit by the elimination of a hairpin loop found in humans. The human pathogenic mutation in tRNALeu (UUR) completes a palindrome (yellow shading) that is prone to the formation of a dimer between two tRNALeu (UUR) molecules by partial unbinding of the D stem15,16. The compensatory substitutions increase the free energy of the stem, preventing the formation of the dimer16.

For the 32 CPDs that destroy a Watson-Crick pair, we noted 25 instances of direct restoration of the Watson-Crick pairing of the affected site (Table 2), indicating that different Watson-Crick pairs are often functionally interchangeable. We also found 26 cases of indirect compensation of these CPDs by one of the three mechanisms described above (Table 2), which indicates that different organisms often compensate for the same CPD in different ways. In the remaining seven cases, we could not determine what compensates for the CPD (this was also the case for two within-stem CPDs that do not destroy a Watson-Crick pair). Thus, for each pathogenic mutation in a stem, there seem to be more than two potential methods of compensation, one involving restoration of the Watson-Crick interaction (available for almost all mutations) and, on average, more than one indirect change. Thus, most compensatory evolution in tRNAs occurs through complex, indirect mechanisms, indicating that models that consider only compensation that restores Watson-Crick pairs are oversimplified9,10,11,12,13,14,17.

We obtained predictions for the compensatory substitutions by visually selecting a candidate substitution and then predicting its effect on the free energy of the tRNA molecule using mfold8 (Supplementary Methods online). In a few instances the CPD-containing stem structures had both a substitution that restored the disrupted Watson-Crick pair and one or more other substitution(s) that restored the free energy of the stem. In such cases we classified the compensation only as a Watson-Crick restoration event. Thus, our estimate of the involvement of indirect compensation is conservative because these other substitutions may also contribute to compensation, especially when direct compensation involves replacement of an AU pair with a GC pair (that is, a weaker with a stronger interaction).

We observed high rates of convergent evolution in substitution pairs the compensate for CPDs among the mammalian taxa examined (Fig. 4 and Table 1). For the 69 species in a recent mammalian phylogeny18, we found that on average, a CPD evolved independently three times (Table 1). This high incidence is probably not a consequence of fast evolution of mitochondrial DNA, because mammalian tRNAs are generally conservative and even monotreme tRNAs are very similar to their placental orthologs (Fig. 1b and Supplementary Sequence Archive online). CPDs and compensatory substitutions evolve in quick succession in an apparently dependent way such that a CPD is rarely found without the compensatory substitution (Fig. 4). Almost every independent evolutionary event leading to the fixation of a CPD is directly associated with a unique origin of a compensatory change (Table 1).

Figure 4: Convergent evolution of compensation of three pathogenic mutations, A26G in the anticodon stem of tRNAAsp (left), G46A in the TΨC stem of tRNAHis (middle) and G67A in the acceptor stem of tRNALys (right), in 69 mammalian species.
figure 4

CPDs are shown in red; compensatory substitutions are shown as blue lines; unresolved branches are shown in gray.

A pathogenic mutation in a mitochondrial tRNA is unlikely to be fixed without a compensatory change, because the fitness of an organism with such a mutation is nearly zero2. Thus, there are only two scenarios for the evolution of a CPD and the corresponding compensatory substitution. In the simpler scenario, the compensatory substitution is the first to achieve fixation, with the CPD becoming fixed afterwards2. Alternatively (and perhaps less probably), the CPD and the compensatory substitution achieve fixation simultaneously. This may be possible if the pathogenic mutation is maintained in the population at a very low frequency and the compensatory substitution occurs in an individual with the CPD with subsequent fixation of the resulting haplotype19. The latter scenario is perhaps more likely to occur in mitochondrial than in nuclear genes, because mitochondrial pathogenic mutations may persist in a heteroplasmic state.

The high rate of compensatory evolution and the coupling of the evolution of CPDs and their compensations may indicate that positive selection is involved in the evolution of compensatory interactions2,20. Positive selection would reduce the time between the fixations of a compensatory change and the corresponding CPD, explaining the simultaneous convergent evolution of CPDs and the compensatory substitutions. In the absence of doublet mutations, the fraction of CPDs in diverging species is expected to increase if the accumulating differences evolve neutrally by genetic drift21,22. In contrast, positive selection should lead to a more constant proportion of compensatory substitutions on molecular distance2. This prediction holds true for primate species, such that the proportion of CPDs among all deviations is constant across molecular and phylogenetic distances (Supplementary Fig. 1 online). Thus, the fixation of the compensatory substitutions or of the CPD-compensatory haplotype in mitochondrial tRNAs occurs as a result of positive selection.

The list of human mitochondrial pathogenic mutations is far from complete3. Therefore, our observation that 5% of all substitutions are CPDs or, assuming that each CPD interacts with one compensatory substation, 10% of all substitutions participate in a compensatory interaction (Supplementary Fig. 1 online) is a minimum estimate. Assuming that tRNALeu (UUR) is representative of an average tRNA, we estimate that 25% of all substitutions are CPDs, with another 25% corresponding to compensatory substitutions (Supplementary Note online). Thus, substitutions participating in compensatory interactions comprise 50% of all substitutions occurring in the evolution of mammalian tRNAs.

These findings show that the fitness surfaces along which mitochondrial tRNAs evolve consist of multiple highly epistatic ridges2,23. This implies that their fast rate of evolution is not an indication of mitochondrial mutational 'meltdown', or functional degradation, in an asexual genome24. The high frequency of convergent evolution and the observation that individual sites in tRNA genes often evolve nonindependently from each other suggest that RNA genes may be misleading when used for deep phylogenetic reconstruction22. Relating phenotypic, structural and evolutionary data provides invaluable information that can be used to understand the function and structural interactions of complex molecules. We suggest that similar approaches may facilitate the study of structural characteristics of tRNAs and provide further insight into the molecular basis of human disease.

Methods

We obtained data on human disease-causing mutations from Mitomap and curated them manually to eliminate annotation errors in the database. We obtained 106 complete mitochondrial genomes from GenBank with the Entrez retrieval25 system using “mitochondrion AND complete AND genome AND mammal” as the search entry; only one genome for each species was used in our final data set, and the annotation for the genes was corrected when necessary using sequence similarity information with closely related species. We used a multiple alignment of 436 human complete mitochondria genomes to obtain polymorphism data in the human population. We constructed multiple alignments using CLUSTALW26 and manually corrected them using secondary structure information7. We used maximum parsimony to estimate the number of times CPDs and their predicted compensatory substitutions evolved on a mammalian phylogeny18. The use of different variants of the mammalian tree27 did not change our results. We used MacClade28 to draw the phylogenetic trees with CPDs and the RnaViz2 program29 to draw the tRNA secondary structures.

URLs.

Mitomap is available at http://www.mitomap.org/.

Note: Supplementary information is available on the Nature Genetics website.