Main

Oxidative phosphorylation (OXPHOS) disorders affect at least 1 in 8,000 of the general population and belong to the group of most common inherited metabolic diseases.1 Disease manifestations due to OXPHOS defects can be highly variable, but usually involve tissues with a high energy demand like heart, muscle, renal and the endocrine systems.2 Several well-described syndromes are known, like Kearns-Sayre (KSS) and Pearson syndrome, Neuropathy Ataxia Retinitis Pigmentosa (NARP), Mitochondrial Encephalomyopathy, Lactic Acidosis and Stroke-like episodes (MELAS) and Myoclonus Epilepsy and Ragged Red Fibers (MERRF). However, OXPHOS defects can also manifest with more common and less specific symptoms, like type 2 diabetes, deafness, encephalopathy, myopathy, and cardiomyopathy. OXPHOS disorders therefore cause significant morbidity and mortality and have a broad impact on public health.

Approximately 25% of the pediatric patients with OXPHOS disorders have mutations in the mtDNA, but these are difficult to find due to the genetic and clinical heterogeneity.3 As the number of mtDNA mutations has increased to over 250 and the clinical specificity lags behind, symptom-based protocols fall short and screening of the entire mtDNA is preferable.4 The recently introduced MitoChip (Affymetrix) is a new method for mtDNA resequencing.5,6 In this paper we describe our experience of the complete mtDNA screening of 28 patients with OXPHOS disease by MitoChip resequencing.

MATERIALS AND METHODS

Patient and test samples

Genomic DNA was isolated from muscle according to the protocol of Mullenbach et al.7 DNA from 3 conventionally sequenced patients, a patient with a heteroplasmic 5 bp deletion,8 and 28 patients with a clinical and/or biochemical phenotype of OXPHOS disease were analyzed on the MitoChip. These patients were all negative for the MELAS m.3243A>G, MERRF m.8344A>G, and NARP m.8993T>C/G mutations, and for large deletions of the mtDNA.

MitoChip and experimental procedure

The MitoChip contains both sense and antisense probes specific for the forward and reverse sequence of two fragments of the mtDNA. One fragment interrogates 15,446 bases of the mtDNA minus the D-loop sequence. The other fragment is a duplicate of the first fragment, comprising 12,938 bases of the mtDNA coding sequence minus the sequence of the 12S and 16S mitochondrial ribosomal RNA genes. The two fragments are separated on the chip by a control fragment (Affymetrix). Oligonucleotide probes are synthesized in situ on the chip by Affymetrix by standard photolithography and solid-phase DNA synthesis.9,10 For each position of the mtDNA fragments, four 25-mer probes are represented on the chip, each with a different nucleotide in the middle (A, G, C, or T) allowing for the detection of all possible nucleotide substitutions. The probe with the correct corresponding nucleotide in the middle for each mtDNA position will give the highest signal intensity after hybridization and scanning. The entire mtDNA was amplified using the Expand Long Template system (Roche) in 2 fragments (A and B) of 8,466 bp and 7,866 bp in length, with an overlap of 235 bp. The primers for fragment A were forward primer [5′-ccgcttctggccacagcacttaaacacatc-3′] and reverse primer [5′-aggaggttagttgtggcaat-3′], and for fragment B forward primer [5′-gcttcattcattgcccccac-3′] and reverse primer [5′-ggaggatggtggtcaagggacccctatctg-3′]. The 7.5-kb control sequence was amplified using the primers and template from the Customseq™ control kit (Affymetrix). PCR products were purified using the QIAQuick PCR cleanup kit (QIAGEN). Equimolar amounts of the amplified fragments A and B were pooled and fragmented together with the amplified 7.5-kb control sequence, labeled and hybridized on a pre-hybridized MitoChip as described in the Affymetrix CustomSeq Resequencing protocol. Chips were washed and stained on the GeneChip fluidics station 400 (Affymetrix) using the preprogrammed CustumSeq Resequencing wash and stain protocol (DNA ARRAY-WS2). The MitoChips were scanned using the Affymetrix GeneChip scanner 3000 creating .CEL files for subsequent analysis.

Data analysis

Affymetrix GeneChip DNA Analysis Software (GDAS) version 3.0.1.3 beta was used to analyze the .CEL files, using the default program settings. A report file was created by GDAS listing the nucleotide variations for both chip fragments compared to the Revised Cambridge Reference Sequence (RCRS).4,11 Discrepancies between nucleotide calls for both chip fragments were evaluated manually. The final base changes were written to a text file (“affy output file”). A second analysis was performed using R, a free software environment for statistical computing and graphics.12 In the R analysis, each chip (.CEL file) was analyzed separately. For each position the nucleotide with the highest signal intensity was determined, not taking into account the background signal. Nucleotide changes were printed to a text file (“R output file”). Finally the “affy output files” and the “R output files” were compared and combined, resulting in a list of variations for each sample. Discrepancies between the GDAS and R output were evaluated manually by directly looking at the chip image or by conventional sequencing. Unless otherwise mentioned, throughout this paper the phrase “MitoChip analysis” refers to the combined GDAS and R analysis.

Validation of variations and heteroplasmy levels

Both strands of a fragment carrying a variant were cycle sequenced using the BigDye Terminator v3.1 Cycle sequencing kit, an ABI-PRISM 3100 genetic analyser and the Sequence Analysis 3.7 software package. The level of heteroplasmy was determined by mutation specific restriction digestion. In case a restriction site was gained or lost, fragments were amplified using primers surrounding the underlying variation, otherwise a specific mismatch primer was designed in order to create a restriction site. On PCR amplification, a labeled primer was added to the reaction in the last PCR cycle to label the PCR products. Labeled PCR products were digested and the fragments were analyzed on an ABI-PRISM 3100 genetic analyser using the GeneScan Analysis 3.7 software package (Applied Biosystems). The level of heteroplasmy was determined by calculating the ratio of the mutant or wild type peak area (depending on the gain or loss of a restriction site by the nucleotide variation) and the sum of the mutant and wild type peak areas. Primer sequences and reaction conditions are available on request.

RESULTS

MitoChip performance and validation

Based on a total of 32 DNA samples, the average GDAS call rate was 94.0%, ranging from 88.9–96.8%. Of the GDAS “no call” signals, 0.72% (212/29,366) of the chip positions gave a “no call” for each of the 32 chips analyzed, corresponding to 134 mtDNA nucleotide positions. A GDAS call was always given for each of the 32 chips for 12200 mtDNA nucleotide position (20,486 out of 29,366 (70%) chip positions). For 7 CHIPs GDAS and R analysis were compared for the entire sequence and R analysis accounted for an average of 24.2% (ranging from 13–37%) of the total amount of base changes in the output files. Five unclassified variants (UVs) were identified by the R analysis, which were not detected by the standard GDAS analysis. For the three test samples, which were also sequenced conventionally, 34/35 (97%) variations were called correctly on the MitoChip. In two test samples, one and six variations were additionally detected by the MitoChip. By reinterpretation of the conventional sequencing results, one of the variations which were detected additionally by the MitoChip proved to be a low percentage heteroplasmic substitution (false negative), and the other additional variants appeared to be falsely detected as a nucleotide variant by the MitoChip (false positives). A known 5-bp deletion was also tested, but not detected. Two DNA samples differing at 16 positions in their mtDNA sequence were mixed at ratios of 1:1, 1:9, and 1:19. In the mix sample analyses (only GDAS), the Total Quality Threshold Score setting was set at 30 and 75. Fifteen of 16 positions were called correctly at a heteroplasmy level of 50% for both TQT settings. Only one position was called correctly as heteroplasmic in all three mix samples at both TQT settings. At a TQT value of 30, “no call” signals were 3 times more often observed at the 16 investigated heteroplasmy positions than at a TQT value of 75.

mtDNA variations in 28 patients

A total of 520 variations were detected in 28 patients, comprising 197 unique nucleotide substitutions. Fifteen heteroplasmic variations were detected in a total of 11 patients. Four of these variations were confirmed by sequencing and mutation specific restriction digestion (m.15939C>T at 7%, m.13513G>A at 14%, m.3243A>T at 34%, and m.13042G>A at 84%); two proved to be homoplasmic single base pair insertions (m.3229_3230insA and m.3158_3159insT); two appeared to be false positive; one was shown to be a homoplasmic polymorphism (m.15452C>A); the remaining six were not tested because they were considered not to be pathogenic. Of all detected variations, three were known pathogenic mutations (Table 1), 114 have been reported before as polymorphisms (Supplementary Table 1), 41 did not result in an amino acid change and were most likely polymorphisms (Supplementary Table 1), and 39 were unclassified variants (UVs), some of which were likely pathogenic (Table 2). Eight variants were located in tRNA molecules (Fig. 1). In three patients known pathogenic mutations were detected and in 23 patients UVs were detected. In five patients only polymorphisms were detected.

Table 1 Pathogenic mutations
Table 2 Unclassified variants
Fig 1
figure 1

Eight tRNA variations (unclassified variants and mutations) detected in 28 patients with OXPHOS disease. Detected were an m.3243A>T variation in the tRNA-Leu1, an m.4336T>C variation in the tRNA-Gln gene, an m.5558A>G variation in the tRNA-Trp gene, an m.5592A>G variation in tRNA-Ala gene, an m.5850T>C variation in the tRNA-Tyr gene, an m.12308A>G variation in the tRNA-Leu2 gene, and an m.15890C>T variation and m.15939C>T variation in the tRNA-Thr gene. The images are adapted from the website dealing with the compilation of mammalian tRNA genes (http://mamit-trna.u-strasbg.fr/index.html).32

UVs were evaluated for evolutionary conservation, for functional significance by determining the effect on the tRNA or on the protein, and, when available, for segregation in the family. Seven UVs were located in six tRNA genes (Fig. 1), 11 UVs in the 12S and 16S rRNA genes, one UV in the mtDNA transcription terminator site, and 20 UVs in the protein encoding genes. Of the eight tRNA variations (Fig. 1), the m.15939C>T variation in the tRNA-Thr gene was heteroplasmic with a mutation load of 7%. The other tRNA variations were all homoplasmic or close to homoplasmy (>98% mutation load). Two of the variations in the protein encoding genes were heteroplasmic 84% and 70%, respectively, for the m.13042G>A in the ND5 gene and m.14258G>A in the ND6 gene.

Single nucleotide insertions were detected as well. A heteroplasmic m.3158A>T transition on the CHIP proved to be a single nucleotide insertion (m.3158_3159insT) by standard sequencing. Additionally, the homoplasmic m.3229_3230insA insertion was initially detected by the MitoChip analysis as a heteroplasmic m.3229T>A transition.

DISCUSSION

MitoChip performance and validation

MitoChip resequencing is a very rapid method to screen the mtDNA for mutations. Eight samples can be analyzed by one technician over a period of 4 days. The average call rate of 94.0% is within the range described by Affymetrix and others.5 The additional R analysis led to an increase in total base calls and to the identification of 5 UVs (4/7 samples), which would have been missed by GDAS analysis only. Of the 212 chip positions (0.72%) (134 mtDNA nucleotide positions) which consistently gave a “no call,” 197 (124 mtDNA nucleotide positions) were cytosine nucleotides. Previously, it was reported that 1.7% of the bases always gave a “no call” in a dataset of 26 chips.5 About 51% of these were from regions containing two or more successive C bases. R-analysis showed that in these regions especially the forward strand differed from the reference sequence, whereas the reverse sequence gave a call equal to the reference sequence. The reason for this is yet unknown. The mtDNA nucleotide positions of twelve common mtDNA mutations (m.1555A>G, m.3243A>G, m.3460G>A, m.8344A>G, m.8993T>G/C. m.9176T>C, m.10159T>C, m.11778G>A, m.13513G>A, m.14459G>A, m.14484T>C, and m.14487T>C), were not among the nucleotide positions always giving a “no-call.” Within-chip and between-chip reproducibility have already been determined showing a within-chip error rate of 0.0025% and a between-chip error rate of 0.0027%, illustrating that the base calls were reproducible for >99.99%.5 In our experiments 97% of the variations were called correctly in three previously sequenced samples, which is comparable with the 95% reported before in 18 samples.5 In the three test samples, only six variations appeared be false positive after follow-up investigation. It is unlikely that PCR errors are causative for false positive MitoChip calls since there is a large number of templates for the PCR, and the heteroplasmy levels due to sporadic PCR errors will be below the detection limit. As GeneChip Resequencing Arrays only support detection of homozygous and heterozygous single nucleotide substitutions, small deletions (5 bp) could not be detected, although 2 single nucleotide insertions were detected as heteroplasmic “substitutions.” One might expect that the heteroplasmic 5-bp deletion would be detected by the MitoChip as a loss of signal, but available software is insufficient for these quantitative analyses, especially if over 50% of the wild type mtDNA is still present.8

Although the GDAS software was designed to detect homozygous (homoplasmic) and heterozygous (50% heteroplasmy) single base pair substitutions only, it was previously shown that heteroplasmic variations at levels as low as 2% were detectable.5 However, in our dilution experiments only one out of 16 positions could be identified correctly as a heteroplasmic variation in the 5%, 10%, and 50% mix samples. Notably, the number of “no calls” at these 16 heteroplasmic positions was three times higher at the TQT setting of 30 compared to the GDAS default TQT setting of 75, illustrating a difficulty for the algorithm to assign a call for the particular position, probably because of the signal of the heteroplasmic variant. This suggests that further development of the data analysis algorithm and tuning of the data analysis settings can improve the detection level of heteroplasmic mutations. An option would be to customize our R-analysis to make it sensitive for heteroplasmic variations. Nevertheless, we were able to detect a heteroplasmic variation in the tRNA-Thr gene (m.15939C>T) in a patient with a mutation load as low as 7%. Additionally, a pathogenic mutation in the ND5 gene (m.13513G>A) was identified and validated at a heteroplasmic level of 13–15%. Apparently the sensitivity for heteroplasmy detection is different per position and probably depends on the sequence surrounding the variation.

MitoChip compared to other methods

Resequencing Chips are powerful if long-range PCR protocols are available and substitutions are the predominant pathogenic mutations, which is the case for the mtDNA. Other methods for mtDNA mutation analysis are conventional sequence analysis, single strand conformation polymorphism (SSCP) analysis, denaturing gradient gel electrophoresis (DGGE), temperature gradient gel electrophoresis (TGGE), denaturant capillary electrophoresis (DCE) and denaturing high-performance liquid chromatography (dHPLC).1317 These techniques all have their advantages and disadvantages with respect to costs time consumption, high-output possibilities, heteroplasmic detection limit, and type of mutations that can be detected (Table 3). At this moment, heteroduplex-based methods (dHPLC, DGGE, TGGE, and DCE) appear to be more sensitive than the MitoChip for heteroplasmy detection, with detection sensitivities as low as 0.5%.1416,18 SSCP and conventional sequence analysis are not able to detect low percentages of heteroplasmy. MitoChip resequencing is able to detect heteroplasmic variants at low levels, although the data analysis needs to be optimized to reach similar detection levels. Although the costs per nucleotide are cheaper for conventional sequencing,5 considering the overall costs per sample, MitoChip resequencing is preferable over conventional sequencing, mostly due to a tremendous gain in labor costs (8 samples can be analyzed by one technician over a period of 4 days) and automated analysis at comparable bench costs.

Table 3 Comparision of mtDNA mutation screening methods

MtDNA variations in 28 clinical samples

The pathogenic m.3243A>T mutation was found in a patient with proximal myopathy, ptosis and ophthalmoplegia at a mutation level of 34%. Magnetic resonance imaging (MRI) of the brain showed white matter abnormalities and muscle creatine phosphokinase levels were elevated. Muscle biopsy histomorphology revealed ragged red fibers (RRF) and cytochrome c oxidase (COX) negative fibers. This mutation has been previously described in muscle (81.4%), skin (69.3%) and blood (13.8%) of a 9-year-old girl with muscle weakness, encephalopathy and a reduction of the activity of OXPHOS complexes I, III and IV.19 The mutation is located at the same position as the classical MELAS m.A3243A>G mutation at the first position of the D-loop in the tRNA-Leu1 molecule (Fig. 1), leading to a dramatic loss of aminocylation efficiency. Moreover, residue A14 of this tRNA is strongly conserved within all classical tRNA molecules and it is involved in establishing tertiary interactions with other residues of the tRNA.20 The pathogenic m.3697G>A mutation in the ND1 gene was detected at a mutation load higher than 97% in a patient with spastic dystonia, elevated levels of lactate and pyruvate in blood serum and cerebrospinal fluid. The affected amino acid is evolutionary highly conserved (Table 1) and the mutation has been described before in a MELAS patient with 80% of the m.3697G>A mutation in muscle and 79% in skin fibroblasts.21 Pathogenicity of the mutation was confirmed by fusion experiments with ρ0 cell lines (no restoration of complex I activity) combined with blue native polyacrylamide gel electrophoresis (BN-PAGE) (low levels of fully assembled complex I).21 The m.3697G>A mutations was also evaluated according to the scoring system to assess pathogenicity of complex I mutations.22 This score can range from 0 (not pathogenic) to 40 (pathogenic) and is based on several criteria: biochemical defect, functional studies, reports by two or more independent laboratories, heteroplasmy, segregation in the family, and evolutionary conservation. The m.3697G>A mutation reached a score of 32. Our report of this mutation as a second independent laboratory will increase the score to 37 points, which is above the “pathogenic” cutoff of 30. The m.13513A>G mutation was detected in a patient with encephalopathy, strokes, dystonia, signs of Parkinsonism, mental retardation and signs of early fatigued. Biochemical analysis indicated a deficiency of the pyruvate dehydrogenase complex (PDHC) in muscle. The m.13513A>G transition in the ND5 gene changes an amino acid strongly conserved over 11 species (Table 1) and has been related to MELAS and Leigh syndrome in several reports.2328 The pathogenicity score for this mutation is 39.22 The functional importance of this amino acid is further supported by the finding of a pathogenic mutation m.13514A>G, affecting the same codon, that results in a different amino acid replacement (D393G vs. D393N) in two MELAS patients.29

UVs were evaluated with respect to heteroplasmy, evolutionary conservation, functional influence of the effect on tRNA or protein, and segregation in the family.22,30,31 Several UVs were likely to be pathogenic, but the effect of others is still unclear. The m.5850T>C variation had a mutation load of >98% in muscle and 13% in urine, but it could not be detected in blood or hair of the patient. It was not present in blood, hair, or urine from the patient's clinically unaffected mother, brother, and sister. The position of this variation in the tRNA-Tyr gene is evolutionary highly conserved. These data strongly suggest a pathogenic role. A second variation with a high probability of pathogenicity was the homoplasmic m.3229-3230insA. This variation is located in the transcription terminator site and may disturb the function of this site. Moreover, the location of this variation being exactly between the 16S rRNA gene and the tRNA-Leu1 gene might result in a faulty cleavage of the RNA after transcription. A faulty cleavage could have consequences for the function of the tRNA-Leu1 and 16S rRNA genes. The m.13042G>A variation in the ND5 gene was heteroplasmic (84%) and the involved amino acid was also strongly conserved (Table 2), which favors a pathogenic role. The m.14258G>A variation in the ND6 gene was also a heteroplasmic (70%) variation in a protein coding gene. Although this variation is heteroplasmic, the affected amino acid is not well conserved. The m.15939C>T variation in the tRNA-Thr gene was also heteroplasmic (7%) in the patient's muscle. The variation is located in the T-loop of the tRNA molecule and not involved in establishing tertiary interactions in the tRNA molecule, making a pathogenic role less likely. The other homoplasmic variations (Table 2) have to be further evaluated, as it is difficult to determine pathogenicity solely based on criteria like heteroplasmy and conservation alone. Functional and family studies will be performed to provide the definite evidence.

Resequencing does not only detect known pathogenic mutations or polymorphisms, but also UVs and risk factors for unrelated pathology, like cancer, Alzheimer's disease or Parkinson's disease. As the significance of this latter group of variations for individual cases and their families is yet unclear and the risk factors can not explain the primary pathology, it is evident that these data have to be carefully dealt with by the clinicians. Patients should be counseled about the uncertainties of some of the observations and the difficulties to interpret part of the results. However, this is inevitable and does not counterweight the genetic diagnoses made. It will also be a temporary problem, as due to the joint sequencing efforts more and more information will become available about neutral polymorphisms, genuine risk factors of disease, and actual pathogenic mutations.

CONCLUSION

MitoChip resequencing is a fast, cost-effective and reliable method with high-output capabilities for complete mtDNA screening. A quarter of the patients with OXPHOS disease can be genetically diagnosed by this technique, based on the detection of three pathogenic and three or four likely pathogenic mutations in 28 patients, which confirms the percentage previously described.3 Because of the growing number of mtDNA mutations combined with the increasing clinical heterogeneity, MitoChip resequencing is the preferred method to screen the mtDNA for mutations, especially when symptom specific screening and screening of only the most common mtDNA mutations falls short.