Mito-SiPE is a sequence-independent and PCR-free mtDNA enrichment method for accurate ultra-deep mitochondrial sequencing

Walsh, Darren J.; Bernard, David J.; Pangilinan, Faith; Esposito, Madison; Harold, Denise; Parle-McDermott, Anne; Brody, Lawrence C.

doi:10.1038/s42003-022-04182-2

Download PDF

Article
Open access
Published: 19 November 2022

Mito-SiPE is a sequence-independent and PCR-free mtDNA enrichment method for accurate ultra-deep mitochondrial sequencing

Communications Biology volume 5, Article number: 1269 (2022) Cite this article

4737 Accesses
6 Citations
26 Altmetric
Metrics details

Subjects

Abstract

The analysis of somatic variation in the mitochondrial genome requires deep sequencing of mitochondrial DNA. This is ordinarily achieved by selective enrichment methods, such as PCR amplification or probe hybridization. These methods can introduce bias and are prone to contamination by nuclear-mitochondrial sequences (NUMTs), elements that can introduce artefacts into heteroplasmy analysis. We isolated intact mitochondria using differential centrifugation and alkaline lysis and subjected purified mitochondrial DNA to a sequence-independent and PCR-free method to obtain ultra-deep (>80,000X) sequencing coverage of the mitochondrial genome. This methodology avoids false-heteroplasmy calls that occur when long-range PCR amplification is used for mitochondrial DNA enrichment. Previously published methods employing mitochondrial DNA purification did not measure mitochondrial DNA enrichment or utilise high coverage short-read sequencing. Here, we describe a protocol that yields mitochondrial DNA and have quantified the increased level of mitochondrial DNA post-enrichment in 7 different mouse tissues. This method will enable researchers to identify changes in low frequency heteroplasmy without introducing PCR biases or NUMT contamination that are incorrectly identified as heteroplasmy when long-range PCR is used.

A method for multiplexed full-length single-molecule sequencing of the human mitochondrial genome

Article Open access 06 October 2022

Mitochondrial single-cell ATAC-seq for high-throughput multi-omic detection of mitochondrial genotypes and chromatin accessibility

Article 15 February 2023

Prediction of mitochondrial genome-wide variation through sequencing of mitochondrion-enriched extracts

Article Open access 05 November 2020

Introduction

Decades of research have established a link between mitochondrial DNA variation and human health. Recent advances in DNA sequencing technologies have led to an increased ability to interrogate the mitochondrial genome for low-frequency mutations associated with various disease states. Mitochondrial DNA mutations have been associated with ageing¹ and a myriad of disease phenotypes². Disorders caused by inherited and acquired mitochondrial DNA variants affect ~1 in 4300 of the population³. These variants were initially thought to solely originate from the matrilineal inheritance of mitochondrial DNA molecules, however more recent studies have shown that somatic mutations also occur in mtDNA over time in a tissue-specific manner^4,5,6. There are hundreds to thousands of mitochondrial DNA molecules in every human cell. This number is dependent on the tissue, cell type and energy state of the mitochondria⁷. The fluctuating, multi-copy nature of mitochondrial DNA means that mutations can be present at any frequency within a cell, unlike the diploid nuclear genome. The presence and frequency of mitochondrial DNA mutations is referred to as mitochondrial heteroplasmy.

Mitochondrial heteroplasmy has been increasingly investigated as a contributor to human disease. To date, it has been linked to various diseases, including cardiomyopathy, hypertension, epilepsy, Parkinson’s disease and optic neuropathy^8,9,10,11,12. Elevated heteroplasmy levels have also been linked to tumour aggressiveness and poor cancer prognoses^13,14. These studies demonstrate that mitochondrial heteroplasmy analysis could help to identify unknown molecular mechanisms that drive some disease states. Additionally, evidence suggests that mitochondrial DNA heteroplasmy could be used as a potential target for diagnosis/prognosis of particular conditions and even perhaps as a therapeutic target¹⁵. Given these findings, it is important that mitochondrial heteroplasmy, particularly low-frequency heteroplasmy, can be identified and quantified as a part of disease-related studies. Such investigations require high to ultra-high sequencing coverage (>1000X–>10,000X) of the mitochondrial genome to reliably quantitate low-frequency heteroplasmy with a high degree of sensitivity and specificity. Currently, this is typically achieved by using probe hybridisation or long-range polymerase chain reaction (PCR) to enrich mitochondrial DNA.

Probe Hybridisation¹⁶ uses complementary probes that bind mitochondrial sequences to separate mtDNA from nuclear DNA. Another approach is long-range PCR¹⁷ which amplifies the mitochondrial genome, typically in two, overlapping fragments. Both probe hybridisation and long-range PCR amplification require complementary binding of probes/primers to enrich mtDNA from whole DNA extracts. The widespread use of these methodologies in many heteroplasmy studies is due to their ease, amenability to high-throughput processes and efficacy in producing mtDNA appropriate for ultra-deep sequencing^4,5,18,19. These sequence-dependent methods are imperfect. Probes and primers designed to match reference alleles may select against rare heteroplasmic variants that are of interest. Additionally, PCR amplification is known to introduce errors that may appear as false positive heteroplasmic variants. Arguably, the most problematic issue for these techniques is the contamination of nuclear-mitochondrial elements (NUMTs)²⁰.

NUMTs are nuclear sequences that share high levels of sequence identity with mtDNA. They had arisen from the somatic translocation of mitochondrial DNA into the nuclear genome. The number, size and sequence of NUMTs varies within species²¹, including between human populations and indivduals²². The entire mitochondrial genome is represented in the human nuclear genome²². As a result of this, it is extremely difficult to design primers or hybridisation probes that will selectively enrich mitochondrial DNA without also enriching NUMT sequences. Multiple studies have found that NUMT contamination was present in mtDNA sequencing data that used either probe hybridisation or PCR amplification to enrich mtDNA^23,24,25,26. Most notably, NUMT contamination is thought to explain the apparent paternal inheritance of mitochondrial DNA in humans that was reported in PNAS in 2018^27,28,29,30. The difficulty posed by deciphering NUMT contamination from true mitochondrial DNA may require the use a sequence-independent methods for mitochondrial enrichment.

Here, we have adapted a previously described³¹ sequence-independent and PCR-free technique which relies on differential centrifugation and alkaline lysis to separate mitochondria from other tissue/cellular debris. We have called this method Mito-SiPE (a sequence-independent, PCR-free mitochondrial DNA enrichment). We provide evidence that this method can be effectively used to isolate mitochondrial DNA from different tissues for subsequent mtDNA sequencing, achieving ultra-deep coverage of the mitochondrial genome when combined with an appropriate NGS data pipeline.

A sequence-independent method for mitochondrial enrichment carried out on blood and cell culture samples was previously described using an exonuclease digest³². Its use on samples with a modest amount of starting material offers a unique approach for measuring heteroplasmy in scarce samples such as cultured cells. However, compared to Mito-SiPE, the method yielded a reduced mapping efficiency and sequencing depth on average, limiting its use in low-frequency heteroplasmy analysis.

Polg mutator mice are used as positive controls to compare Mito-SiPE with long-range PCR amplification enrichment of mtDNA. These mice lack the ability to ‘proof-read’ their mitochondrial DNA and, as a result, gather single nucleotide mutations at a higher frequency than their wild-type counterparts³³. We propose that this method can be applied to a range of species to allow researchers to reliably investigate mitochondrial heteroplasmy and expand our current knowledge of the contribution of somatic mitochondrial mutations to human ageing and disease. This methodology negates the impact of NUMT contamination and PCR error introduction when assessing heteroplasmy and is, therefore, more sensitive and accurate than long-range PCR amplification.

Results

Mito-SiPE produces highly enriched mitochondrial DNA

This technique for enriching mtDNA for sequencing is rooted in classic cell biology and biochemistry methods for subcellular fractionation. It utilises a combination of differential centrifugation and alkaline lysis to separate the mitochondria from nuclear and cytoplasmic cellular components. The preparation is then used to purify mitochondrial DNA with minimal contamination from nuclear DNA (Fig. 1a). To test this method, we used it on seven different mouse tissues; brain, heart, lung, kidney, liver, spleen and muscle. Mitochondrial copy number was assessed via quantitative polymerase chain reaction (qPCR) in enriched samples from the seven tissues and compared to unenriched, whole DNA extracts (Fig. 1b). qPCR was used to calculate the ratio of mitochondrial DNA to nuclear DNA, which provides an estimate of the average mtDNA copy number per cell. A significant increase (100–1000-fold, P < 0.0005) in mitochondrial DNA copy number was observed in samples that underwent enrichment. This effect was present across the seven tissues of interest (Fig. 1c).

**Fig. 1: PCR-free enrichment of mitochondrial DNA using differential centrifugation and alkaline lysis.**

The performance of this technique to produce pure, high-quality mtDNA for next-generation DNA sequencing was assessed by applying the method to 163 samples across seven different mouse tissues. These samples were then sequenced across four lanes of the Illumina NovaSeq platform. After quality control, alignment and removal of duplicates, the number of mitochondrial, nuclear and unmapped reads were assessed for each sample (Fig. 1d and Table 1). Of the mapped reads, an average of 75 ± 20% were mapped to the mitochondrial genome and 25 ± 20% to the nuclear genome. Unmappable reads made up 0.26 ± 0.63% (Table 2). The level of nuclear contamination in samples originating from the brain, heart, kidney and liver was extremely low, with higher levels found in the lung, muscle and spleen (Fig. 1e).

Table 1 The number of mitochondrial, nuclear and unmapped reads per tissue.

Full size table

Table 2 The percentage of reads aligned to the mitochondrial genome, nuclear genome and unmappable reads.

Full size table

Mito-SiPE requires an alternative alignment strategy

Two alignment strategies were used to assess the coverage of the mitochondrial genome and the distribution of nuclear contamination (Fig. 2a). The first method aligned filtered reads to the whole genome and then isolated mitochondrial-aligned reads. The second method aligned all the reads to the mitochondrial genome first, then mapped any remaining unaligned reads to the nuclear genome. Coverage across the mitochondrial genome and distribution of nuclear contamination were assessed after duplicate removal. Average coverage across the mitochondrial genome exceeded 50,000X at each base pair and was dependent on the tissue of origin (Fig. 2b). Lung, muscle and spleen had lower levels of coverage due to the higher levels of reads mapped to nuclear DNA in these samples. Mapping to the whole genome caused a loss in coverage between nucleotide positions 7500–11,000 (Fig. 2b, top). This dip in coverage was not observed when reads were first aligned to the mitochondrial genome using the second alignment strategy (Fig. 2b, bottom). The latter method did not lead to an overall increase in sequencing coverage across the rest of the mitochondrial genome, indicating that nuclear reads were not misaligned to the mitochondrial genome when this method was used.

**Fig. 2: Alignment of sequencing data to whole-genome reference leads to misalignment of mitochondrial reads.**

As this mtDNA enrichment method is sequence-independent, any reads that originate from nuclear DNA should have been randomly distributed throughout the whole genome. To test this hypothesis, the distribution of nuclear contamination was assessed using both alignment strategies (Fig. 2c). When reads were aligned to the whole genome, nuclear contamination appeared to be evenly distributed across the nuclear genome as expected, except for chromosome 1 (Fig. 2c, top). Aligning the reads to the mitochondrial genome first did not produce this same anomaly (Fig. 2c, bottom and Supplementary Fig. 1). This is due to a NUMT found on chromosome 1 of the reference genome that shares 99.94% sequence identity of its homologous sequence in the mitochondrial genome (Supplementary Fig. 2). As a result, the alignment tool (bwa) mapped ~50% of reads originating from this region to the mitochondrial genome and ~50% incorrectly, to the homologous NUMT on chromosome 1 when reads were mapped to the whole genome. This artefact was eliminated when reads were mapped to the mitochondrial genome first. Chromosomes 2 and 9 also had slightly elevated levels of mapped reads when compared to the rest of the genome. This effect was produced by both alignment strategies. However, further investigation showed that this was caused by the alignment of highly repetitive reads from across the genome to the same loci on chromosomes 2 and 9 (Supplementary Figs. 3–6). Mapping to the mitochondrial genome first appeared to be more effective at mapping true mitochondrial reads correctly, without increasing levels of spurious alignments. This alignment methodology was used to assess heteroplasmy levels in the tissues of Polg^D257A/D257A and Polg^wt/wt mice.

Heteroplasmy analysis comparison between Mito-SiPE and lrPCR in Polg mutator mice

There was an average of 7.15 × 10⁷ reads produced per sample across all studied groups (Table 3; n = 48). A higher average number of reads was observed in the lrPCR amplification samples (7.22 and 7.19 × 10⁷, Polg^D257A/D257A (n = 12) and Polg^wt/wt (n = 12), respectively) compared to the mtDNA prep samples (7.15 and 7.02 × 10⁷ Polg^D257A/D257A and Polg^wt/wt, respectively). The proportion of reads mapped to the mitochondrial genome was also higher in the lrPCR amplification samples than in the mtDNA preparation (Mito-SiPE) samples (Table 3). The average coverage across the mitochondrial genome was higher in the long-range PCR amplification samples (Average depth 137,000X compared to mtDNA prep average of 123,000X). However, these samples had a loss in coverage towards the end of the two overlapping fragments (Fig. 3a, left). The mitochondrial DNA prep samples, broadly, had uniform coverage across the whole mitochondrial genome compared to the lrPCR samples. Polg^D257A/D257A samples displayed minor region-specific fluctuations in coverage (Fig. 3a, right). This effect was not observed in the long-range PCR amplification assay (Fig. 3a, middle).

Table 3 The average total number of reads, mapped reads and sequencing depth for each methodology and genotype.

Full size table

**Fig. 3: Mitochondrial DNA preparations outperform long-range PCR amplification and reduce the impact of PCR errors and NUMT contamination on mitochondrial heteroplasmy.**

Three metrics were measured to assess heteroplasmy levels in Polg^D257A/D257A and Polg^wt/wt tissues: the number of heteroplasmic sites, the average alternative allele frequency (average heteroplasmy) and cumulative heteroplasmic burden. These three metrics have previously been reported in the literature^{4,14,34,35,36,37}. The number of heteroplasmic sites is the number of nucleotide positions at which an alternative allele was identified above the threshold frequency (0.2%) in a sample. Alternative allelic calls caused by sequencing error are present below this frequency and are indistinguishable from low-frequency heteroplasmy. Average heteroplasmy is the mean frequency of all variants observed in a sample above the threshold frequency. Finally, the cumulative heteroplasmic burden is the sum of all variant frequencies that were identified above the threshold frequency in a sample.

More heteroplasmic sites were observed in the Polg^D257A/D257A mice than wild-type, in the brain, liver and kidney (Fig. 3b, top). No difference was observed between Polg^D257A/D257A males and females, however, there was a significant difference between sexes of Polg^wt/wt (Fig. 3b, top, left panels vs right panels). There were significantly fewer heteroplasmic sites found in mtDNA preps of both male and female Polg^wt/wt as well as Polg^D257A/D257A males compared to lrPCR enrichment. This effect was not statistically significant in female Polg^D257A/D257A tissues. The difference between enrichment assays was larger in Polg^D257A/D257A samples than in Polg^wt/wt (Supplementary Fig. 7). When examining cumulative heteroplasmic burden, similar overall results were observed (Fig. 3b, middle). A lower burden was detected in mtDNA prep enriched samples compared to lrPCR in the same three of four comparisons. Average heteroplasmy, however, displayed contrasting results (Fig. 3b, bottom). Polg^D257A/D257A mice had lower average heteroplasmy levels than Polg^wt/wt mice. In Polg^D257A/D257A mice, the mean alternative allele frequency was significantly lower in mtDNA preps than in lrPCR enrichment. Polg^wt/wt mice appeared to have higher average heteroplasmy levels, which were elevated in mtDNA preps compared to lrPCR amplification enrichment.

Polg^wt/wt mice have a baseline of mitochondrial DNA mutation much higher than that of true wild-type C57BL6 mice (169 ± 29.4 and 0.06 ± 0.24 heteroplasmic sites, respectively). This is because heterozygous female breeders have an intermediate phenotype. Long-range PCR amplification enrichment was performed on total DNA extracted from the brain, kidney and liver of two wild-type C57BL6/J males. These enrichments were compared to the mtDNA preparation of six control C57BL6 males. These mice were age-matched. There was a significant and substantial increase in the number of heteroplasmic sites, cumulative heteroplasmic burden and average heteroplasmy in the lrPCR samples compared to samples that underwent mtDNA preparation (Fig. 3c).

Mitochondrial DNA prep enrichment was performed on seven tissues of Polg^wt/wt and Polg^D257A/D257A mice. Mutant mice displayed significantly higher levels of mitochondrial DNA mutation than wild-type across all tissues (Supplementary Fig. 8). Kidney, liver, colon, heart and lung had a similar number of heteroplasmic sites in Polg^D257A/D257A mice with higher levels observed in spleen and lower levels found in the brain (Supplementary Fig. 9a). Colon and spleen had higher levels of average heteroplasmy than the other five tissues in Polg^D257A/D257A mice (Supplementary Fig. 9b). Cumulative heteroplasmic burden was higher in Polg^D257A/D257A colon and spleen tissues and lower in the brain than in heart, kidney, liver and lung (Supplementary Fig. 9c). There was no significant difference between the tissues of Polg^wt/wt mice across any of the three heteroplasmy metrics that were assessed.

Analysis of the mutation profile observed using both lrPCR and Mito-SiPE showed similar results in Polg^wt/wt and Polg^D257A/D257A (Supplementary Fig 10a). Polg^D257A/D257A mice had higher levels of mutations occurring at ‘C’ nucleotide positions in the reference genome (light-strand). Interestingly, Polg^wt/wt had more mutations at the ‘A’ nucleotide position, indicating that perhaps there is some selection that occurs when mutations are passed from Polg^D257A/wt to their progeny.

Where lrPCR and Mito-SiPE widely diverged in results was in the mutation spectrum of wild-type C57BL6 mice. Long-range PCR amplification causes an increase in mutations occurring at the ‘T’ nucleotide position, whereas no mutations are identified using Mito-SiPE. This is also in contrast to both groups with a Polg ^D257A/wt background. Due to the large number of mutations that were identified in Polg^D257A/D257A mouse tissues (1000–7500), high-frequency variants (≥10% MAF) were selected for further sequence analysis (Supplementary Fig. 10b). There was no difference between lrPCR and Mito-SiPE in the mutational spectrum in these high-frequency mutations in terms of profile or proportion of transitions to transversions. Additionally, we observed no difference in the loci/genes in which mutations were identified. The number of mutations found in each gene is available in Supplementary Data 1. The difference in heteroplasmic variants (MAF ≥0.2%) across the mitochondrial genome between both methodologies was calculated (Supplementary Fig. 11a). False positive heteroplasmic variants occur across the mitochondrial genome with a marked increase at the locations where lrPCR has reduced coverage and decrease in the D-loop region. Annotation of these differences did not show a notable contrast between low, medium and high-impact variants. However, more variants were identified in mt-Nd4 and mt-Nd1, which coincides with the lrPCR region of reduced coverage.

Finally, we attempted to use alternative methods of sequence-independent enrichment for human cell culture samples (Supplementary Fig. 12). Exonuclease digest, Qiagen’s QProteome kit and QIAprep Miniprep kit were all used to isolate mitochondrial DNA from HepG2 cells. When the mtDNA copy number was assessed from these samples, a modest increase was observed. This increase is orders of magnitude lower than the levels achieved using Mito-SiPE on fresh mouse tissue and would not be adequate as a substrate for ultra-deep sequencing.

Discussion

In this study, we have demonstrated that a sequence-independent technique for mitochondrial DNA enrichment is highly effective and can produce ultra-deep sequencing coverage required for heteroplasmy analysis. We conclude that this methodology works most effectively with brain, heart, liver and kidney samples, however, sufficient results can also be obtained in samples originating from the lung, muscle and spleen. The cause of the disparity between tissues is unknown, however, we hypothesise that it may be related to the amount of starting material, the mechanical properties of the tissues and the mitochondrial copy number before enrichment. There are three main advantages of this method over the current standard: (1) It does not require complementary binding of reference probes/primers to mitochondrial DNA, (2) PCR amplification is not required and therefore generates no polymerase errors during enrichment and (3) Any nuclear contamination present is randomly distributed across the nuclear genome and therefore does not result in NUMT enrichment.

The sequencing depth achieved using this methodology exceeds that of many heteroplasmy studies to-date, even in the tissues where enrichment was less effective^5,19,38,39. High coverage of the mitochondrial genome allows for a more sensitive assessment of low-frequency mutations and changes in heteroplasmy frequency. The number of reads (average 22.4 million per sample, Table 1) produced through sequencing in this study could be reduced by up to fivefold and would still be sufficient to achieve coverage levels in-line with previous studies (typically less than 10,000X). This would reduce the sequencing costs, increase throughput and enable the study of mitochondrial heteroplasmy across many samples by taking advantage of high-throughput sequencing. Our results also suggest that sufficient sequencing coverage could be achieved even with lower-capacity sequencing platforms.

An alternative bioinformatics pipeline is required when this technique is utilised. Typically, it is advised to align all sequencing data to the whole reference genome before isolating mitochondrial reads to avoid spurious alignment of nuclear reads to the mitochondrial genome⁴⁰. However, due to the lack of sequence-specific enrichment when using this technique, a different approach is optimal. When reads are aligned to the whole reference genome first, mitochondrial reads are incorrectly aligned to homologous regions in the nuclear genome, most prominently, mouse chromosome 1. This effect in this study, it should be noted, is specific to the mouse reference genome; however, future studies may assess whether the same effect is observed in human alignments or, indeed, in other species. As we demonstrate here, mapping reads to the mitochondrial genome first did not appear to lead to an increase in the spurious alignment of nuclear reads to the mitochondrial genome and produced uniform sequencing coverage.

Nuclear-mitochondrial sequences have been identified as an important source of artefacts in heteroplasmy analysis. By employing sequence-independent enrichment of mitochondrial DNA, we find that any nuclear contamination present in the resultant sequence data is randomly distributed across the nuclear genome. It is pertinent to note, however, that although NUMT contamination is not enriched in this study, it does not mean that NUMT contamination is entirely absent. This is an important point as the number and size of all NUMTs have not been fully elucidated for most species and varies within species, even between individuals. The effect of NUMT contamination when using this technique will be minimised in the tissues that contain less nuclear contamination e.g., the brain, heart, kidney and liver.

Mitochondrial DNA heteroplasmy has been assessed in different ways across a number of studies. The number of heteroplasmic sites, average alternative allele frequency and cumulative heteroplasmic burden are all metrics that have been considered^{4,14,34,35,36,37}. The number of heteroplasmic sites and cumulative heteroplasmic burden was higher in Polg^D257A/D257A mouse tissues than in that of Polg^wt/wt, using both lrPCR amplification and mtDNA preparation methods. These findings are in-line with previous studies, however, the number of mutations detected appear to be higher in the data presented here. This may be due to the high levels of coverage achieved and a lower minimum threshold of alternative allele frequency (0.2%) utilised in this study. Interestingly, Polg^wt/wt mice displayed lower average alternative allele frequencies than Polg^D257A/D257A. This effect, at first observation, is in stark contrast to what has been recorded previously and is not what one would expect based on the literature. However, the high number of low-frequency heteroplasmies identified in Polg^D257A/D257A tissues have caused a decrease in the overall average heteroplasmy levels (average alternative allele frequency) compared to Polg^wt/wt as a result of the high sequencing depth achieved across all samples. This result highlights the importance of looking at multiple measures of heteroplasmy to get an accurate assessment of the levels of mitochondrial DNA mutation that are present in a sample.

Mitochondrial DNA enriched via lrPCR amplification had a higher number of heteroplasmic sites and cumulative heteroplasmic burden than DNA enriched using the mtDNA prep method. This difference was larger in Polg^D257A/D257A mice than in Polg^wt/wt. There are two likely explanations for this observation. First, that rounds of PCR amplification cause PCR errors that are subsequently identified as mitochondrial heteroplasmy, although a high-fidelity polymerase is used to negate this impact as much as possible. The second mechanism is that NUMT regions in the nuclear genome are being co-amplified and thus are mistaken for heteroplasmic, mitochondrial reads. If the first mechanism was the leading cause, it is unexpected that the difference would be higher in Polg^D257A/D257A tissues than in Polg^wt/wt, as the error rate of the polymerase used for amplification should remain constant. Co-amplification of NUMT regions, however, could be affected by changes in mtDNA copy number – a feature that has previously been identified in Polg mutator mice^41,42,43. Changes in mtDNA copy number may directly affect the amount of mispriming of NUMTs that occurs during lrPCR. Whilst the results described here cannot rule out either mechanism, this evidence suggests that the co-amplification of NUMT regions is a more likely/influential candidate mechanism to explain the false positive heteroplasmic variants identified in PCR amplified samples. The metric of average alternative allele frequency did not display this same pattern, but as explained previously, it is influenced by the presence of many apparent, low-frequency heteroplasmies that are identified when using lrPCR amplification at these sequencing depths and heteroplasmy thresholds. This effect was even more dramatic when lrPCR was compared to mtDNA enrichment prep of C57BL6 wild-type tissues. This is largely due to the high baseline of mutation that is present in Polg^wt/wt due to the intermediate phenotype of female breeder mice.

Analysis of the variants identified using lrPCR and Mito-SiPE showed reproducible results across Polg^wt/wt and Polg^D257A/D257A mouse tissues. Mito-SiPE showed much improvement in samples that have low levels of mitochondrial heteroplasmy. Long-range PCR amplification causes artificially elevated levels of mutation, either through PCR errors or amplification of NUMTs. Mito-SiPE offers researchers a more sensitive approach to detect smaller changes in low-frequency mutations than methods reliant on PCR.

Finally, a tissue-specific effect on the levels of heteroplasmy was observed using the mtDNA prep method in Polg^D257A/D257A mice. Spleen and colon samples had higher levels of heteroplasmy compared to heart, kidney, liver and lung, whereas, brain samples had lower levels. Heart, kidney, liver and lung were indistinguishable from one another. This is an interesting finding, as previous studies have identified strong, tissue-specific effects of mitochondrial heteroplasmy in more tissues than reported here^4,5,44. It is possible that enrichment methods used in previous studies have identified false heteroplasmic variants due to NUMT co-enrichment, and thus the differences observed are due to mtDNA copy number changes, rather than true mtDNA mutations. It is also possible, however, that the levels of coverage and low thresholds used in this study have led to this difference.

One limitation of this methodology is that it requires the availability of tissue for mitochondrial enrichment, and as such, it may not be feasible for archived DNA samples. DNA purified from intact cells and tissues using standard methods predominantly consists of nuclear DNA. These preparations and already existing NGS data are not compatible with this method. We attempted to use alternative methods of mtDNA enrichment, such as exonuclease digest³², however, the level of enrichment and total yield of DNA would not be compatible with ultra-deep sequencing of mitochondrial DNA.

With the explosion of interest in mtDNA and increasing levels of heteroplasmy-focused research, Mito-SiPE will provide an important tool for future explorations of mtDNA variation and its role in aging and disease. Our methodology is limited by the sequencing error rate that produces false positive heteroplasmy calls. The use of unique molecular identifiers (UMIs) have been shown to reduce the impact of errors and thus increase one’s ability to detect rare variants^45,46. UMI’s are nucleotide ‘barcodes’ which are ligated to DNA before library preparation. After sequencing, consensus reads are generated from reads that possess the same UMI, and thus PCR errors and sequencing errors can be negated. Mito-SiPE, in combination with the use of UMIs, may enable researchers to detect variants at even lower levels than what is documented here.

In conclusion, differential centrifugation and alkaline lysis may be used to enrich mitochondrial DNA free from PCR amplification or probe hybridisation. Avoiding sequence-dependent techniques greatly reduces the effect of NUMT contamination, a problem which has been identified in the previous studies^23,24,28,30. This technique, in addition to a modified bioinformatics pipeline, can be applied to different tissues and achieves ultra-deep sequencing coverage. It provides a straightforward and robust workflow to assess heteroplasmy in mitochondrial DNA. This technique outperforms long-range PCR amplification and negates the potential impact of PCR errors and NUMT contamination on heteroplasmy analysis.

Methods

Breeding and tissue harvesting

Mice were housed in shoe box cages and fed ProLab RMH 1800 diet (PMI Nutrition International) containing 50 μg vitamin B12/kg of diet and 3.3 mg folic acid/kg of diet. Breeding mice were fed Picolab Mouse Diet 20, containing 51 μg vitamin B12/kg diet and 2.9 mg folic acid/kg of diet. Heterozygous Polg^wt/D257A males and Polg^wt/D257A females were mated. Homozygous mutant and wild-type progeny were aged to 6 months, at which point they were sacrificed. There were 4 Polg^wt/wt mice (two male and two female) and 4 Polg^D257A/D257A mice (two male and two female) used in the Polg experiments. Brain, heart, lung, liver, spleen, kidney and muscle tissue was isolated and mitochondrial DNA enrichment was performed on all tissues.

Tissue homogenisation

Harvested tissue was placed in a homogenisation tube with 10X volume per gram of fresh homogenisation buffer i.e. 5 ml buffer for 500 mg tissue. Tissues were homogenised until no discernible whole tissue was present. The homogenate was then transferred to 1.5 ml microcentrifuge tubes and spun at 1000 × g for 1 min at 4 °C. The supernatant was transferred to a new microcentrifuge tube and spun at 12,000 × g for 10 min at 4 °C to pellet mitochondria. The mitochondrial pellet was resuspended with 100 µl of resuspension buffer for storage or for immediate DNA extraction.

Mitochondrial DNA isolation from fresh mouse tissue

The mitochondria resuspension was added to 200 µl alkaline lysis buffer, vortexed, and placed on ice for 5 min. Potassium Acetate Buffer (150 µl) was then added and the mixture was vortexed slowly and placed on ice for 5 min. The mixture was centrifuged at 12,000 × g for 5 min at 4 °C to pellet proteins and the supernatant was decanted to a new tube. RNase (1 µg) was added to the mixture and left at room temperature for 15 min. Phenol-chloroform (500 µl) was added to each tube, inverted and placed on a shaker/rotator for 20 min. Afterwards, centrifugation at 12,000 × g for 2 min at room temperature was carried out. The aqueous (top) layer was decanted to a new tube (~450 µl from this phase was retrieved) and 40 ul sodium acetate, 1 µl glycogen (20 mg/ml) and 1200 µl 100% EtOH were added. The mixture was inverted and mixed well, then left on dry ice for 60 min. The mixture underwent centrifugation at 12,000 × g and the supernatant was removed. The pellet was finally washed twice using 70% ethanol, air-dried, and resuspended in a low-TE buffer for sequencing or regular TE buffer for (q)PCR.

Library preparation and next-generation DNA sequencing

Libraries were generated from approximately 50 ng genomic DNA using the Accel-NGS 2 S Plus DNA Library Kit (Swift Biosciences) using five cycles of PCR to minimise PCR bias. The DNA samples were sheared by sonication (Covaris Inc., Woburn, MA) to a mean of 300 bp. Libraries were tagged with unique dual index DNA barcodes to allow the pooling of libraries and minimise the impact of barcode hopping. Libraries were pooled for sequencing on the NovaSeq 6000 (Illumina) to obtain at least 7.6 million 151-base read pairs per individual library. Sequencing data was processed using RTA version 3.4.4. DNA sequencing was carried out at the NIH Intramural Sequencing Center.

Data processing and alignment

Fastq files were aligned to the mouse reference genome, GRCm38, using bwa mem using the default parameters⁴⁷. Picard tools were used to add read groups, and to mark and remove duplicates⁴⁸. Samtools was used to calculate the coverage across the nuclear and mitochondrial genome for each sample. Finally, R (v3.5.0) and ggplot2 (v3.3.0) were used for statistical analysis and subsequent visualisation of graphs^49,50. Library complexity and fragment sizes were calculated using Picard tools v1.4.2 on 15 randomly-selected samples (Supplementary Data 2).

Variant calling and mutation analysis

Variant calling was performed using bcftools v1.9 with ‘bcftools mpileup -f -Q 30 –skip-indels reference_fasta bam_file | bcftools call -mv’ to identify single nucleotide variants only. Filtering was performed by removing any SNVs that had a QUAL score lower than 20. The code used for alignment and variant calling is available on github (https://github.com/walshd59/mtDNAhetScripts.git). Of 66,738 variants identified across all samples in our study at an alternative allele frequency ≥0.2%, only 137 of these had an ln(Strand Odds Ratio) value ≥3. Analysis of the mutation spectrum and further characterisation/annotation of heteroplasmic variants was performed using SnpEff (v 5.1)⁵¹.

Quantification of mtDNA copy number

Mitochondrial DNA copy number was assessed via qPCR targeting both mitochondrial and nuclear loci, as previously described^52,53. Briefly, 2.5 µl LightCycler® 480 SYBR Green I Master (Roche, Molecular Systems, Inc, Germany), 2 µl of DNA (20 ng/µl) and 0.5 µl primer mix were added in triplicate to a 384-well plate and the reactions were carried out by the QuanStudio 6 Flex (Applied Biosystems, Foster City, CA, USA). The conditions were as follows: 95 °C for 5 min, 45 cycles of 95 °C for 10 s, 60 °C for 10 s and 72 °C for 20 s. A melting curve was performed using 95 °C for 5 s, 66 °C for 1 min and gradual increase to 97 °C. The mitochondrial DNA copy number was assessed using the following equation:

$$2\times {2}^{\varDelta {{{{{\rm{Ct}}}}}}}({{{{{\rm{where}}}}}}\,\varDelta {{{{{\rm{Ct}}}}}}={{{{{\rm{Ct}}}}}}({{{{{\rm{mtDNA}}}}}}\,{{{{{\rm{gene}}}}}})-{{{{{\rm{Ct}}}}}}({{{{{\rm{nDNA}}}}}}\,{{{{{\rm{gene}}}}}}))$$

(1)

The following primers were used for human mtDNA copy number: mtDNA tRNA (forward: CACCCAAGAACAGGGTTTGT, reverse: TGGCCATGGGTATGTTGTTA) nuclear DNA β2-microglobulin (forward: TGCTGTCTCCATGTTTGATGTATCT), reverse: TCTCTGCTCCCCACCTCTAAGT). Mouse mtDNA copy number was assessed using the following primers: mtDNA ND1 (forward: CTAGCAGAAACAAACCGGGC, reverse: CCGGCTGCGTATTCTACGTT) nuclear DNA HK2 (forward: GCCAGCCTCTCCTGATTTTAGTGT, reverse: GGGAACACAAAAGACCTCTTCTGG). These primers are available in the supplementary data file (Supplementary data 3).

Long-range PCR enrichment of mtDNA

This technique was used to amplify human and mouse mitochondrial DNA in two fragments from a whole DNA extract. DNA was quantified via Nanodrop (Methods 2.2.7) unless otherwise stated. Each PCR reaction consisted of Q5 High-fidelity Polymerase (0.02 U/μl), 5X Q5 reaction buffer (1X), 10 mM dNTPs (300 μM), 5 μM Forward and Reverse primers (0.25 μM). Template DNA (100 ng) was added to each reaction except for the no-template control (NTC) but an equivalent volume of molecular biology-grade water was added instead. The temperature cycles were as follows: 1 × 30 s denature 98 °C, 25 × 10 s denature 98 °C, 30 s annealing 66 °C, 4 min 30 s elongation 72 °C, 1 × 10 min elongation 72 °C on a thermocycler. Both fragments were quantified using Qubit and mixed in equimolar ratios. The following primers were used for each fragment: lrPCR fragment 1 (forward: GGATCCTACTCTCTACAAAC, reverse: TAGTTTGCCGCGTTGGGTGG) and lrPCR fragment 2 (forward: CTACCCCCTTCAATCAATCT, reverse: CCGGTTTGTTTCTGCTAGGG). These primers are also available in the supplementary data file (Supplementary Data 3).

Mitochondrial isolation via Qiagen QProteome™ kit

HepG2 cells (2 × 10⁶) were collected, counted and pelleted when 80% confluency was reached. The supernatant was removed and the pellet was resuspended in the lysis buffer provided in the kit. Homogenisation and mitochondrial isolation were then carried out as per the manufacturers’ protocol. Briefly, the cell pellet was resuspended in 1.5 ml of ice-cold Disruption Buffer by pipetting up and down using a 1 ml pipet tip. The cell disruption was completed using a blunt-ended needle and a syringe. The mixture was centrifuged at 1000 × g for 10 min at 4 °C to pellet proteins and the supernatant was decanted to a new tube. Afterwards, centrifugation at 6000 × g for 2 min at room temperature was carried out. This pellet was then resuspended in 200 μl PBS and 20 μl proteinase K. DNA extraction was then performed on the mitochondrial isolate via Qiagen DNeasy™ Blood and Tissue kit using the manufacturer’s protocol.

Plasmid-Safe™ digest for mtDNA enrichment

Whole DNA extractions were treated with Plasmid-Safe ATP-dependent exonuclease (Lucigen) as per the manufacturer’s protocol. Briefly, a plasmid-safe solution was created using 42 μl sterile water, 2 μl 25 mM ATP, 5 μl 10X reaction buffer and 1 μl Plasmid-Safe DNase. This DNase targets linear molecules and, as such, does not degrade intact mitochondrial DNA as it is circular. The solution was added to the DNA extracted using the QIAprep miniprep and incubated at 37 °C for 1 h. The DNase was then deactivated with a 70 °C incubation for 30 min.

HepG2 culture conditions

HepG2 cells (Merck; 85011430-1VL) were cultured in Dulbecco’s modified Eagle medium (DMEM) with 10% supplementation of foetal bovine serum in a 5% CO₂ incubator. Cells were cultured in 10 ml of media in T75 flasks. The cells were passaged by washing with 5 ml PBS (1X) followed by incubation in 2 ml of 0.25% Trypsin-EDTA at 37 °C for 5 min. Trypsinisation was inhibited by adding 4 ml DMEM. Cells were then collected via centrifugation at 500×g for 5 min before being counted.

Qiagen QIAprep miniprep

HepG2 cells (2 × 10⁶) were collected, counted and pelleted when 80% confluency was reached. The supernatant was removed and the pellet was resuspended in lysis buffer provided in the kit. DNA isolation was performed as per the manufacturer’s protocol, using a silica membrane to capture the mtDNA, which was subsequently collected in 100 μl the provided elution buffer.

Statistics and reproducibility

All statistical analyses included in this paper were carried out in R (version 4.1.1) and the software package rstatix (version 0.7.0). Sample sizes are described within each experimental figure. Wilcoxon signed-rank tests were performed to compare mtDNA copy number from unenriched and enriched samples from different tissues (n = 14 for each group). Wilcoxon signed-rank tests were also performed to compare the effect of lrPCR amplification and Mito-SiPE on mitochondrial heteroplasmy in Polg^wt/wt and Polg^D257A/D257A (n = 24 for each genotype). A Student’s t-test was used for comparing the effect of lrPCR and Mito-SiPE on heteroplasmy in C57BL6 wild-type mice (n = 6 for the lrPCR group, n = 12 for the Mito-SiPE group).

Solutions

The following solutions were used: Homogenisation Buffer (0.25 M Sucrose, 10 mM EDTA, 30 mM Tris-HCl, pH = 7.5), Resuspension Buffer (10 mM Tris, 0.15 M NaCl, 10 mM EDTA, pH = 8.0), Alkaline Lysis Buffer (0.18 N NaOH, 1% SDS, prepared fresh), Potassium Acetate Buffer (3 M potassium, 5 M acetate), and Low-TE buffer (10 mM Tris-HCl, 0.1 mM EDTA, pH = 8.0).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All sequencing data and associated metadata is available on Sequence Read Archive (PRJNA881035). The raw fastq files may be downloaded from this repository using prefetch or similar https://www.ncbi.nlm.nih.gov/bioproject/PRJNA881035. All source data used to make graphs are included in the supplementary data file (Supplementary Data 4).

Code availability

The code used for alignment and variant calling is available on GitHub (DOI: 10.5281/zenodo.7238550).

References

Kauppila, T. E. S., Kauppila, J. H. K. & Larsson, N.-G. Mammalian mitochondria and aging: an update. Cell Metab. 25, 57–71 (2017).
Article CAS PubMed Google Scholar
Stewart, J. B. & Chinnery, P. F. The dynamics of mitochondrial DNA heteroplasmy: implications for human health and disease. Nat. Rev. Genet. 16, 530–542 (2015).
Article CAS PubMed Google Scholar
Gorman, G. S. et al. Prevalence of nuclear and mitochondrial DNA mutations related to adult mitochondrial disease. Ann. Neurol. 77, 753–759 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, M., Schröder, R., Ni, S., Madea, B. & Stoneking, M. Extensive tissue-related and allele-related mtDNA heteroplasmy suggests positive selection for somatic mutations. Proc. Natl Acad. Sci. USA 112, 2491–2496 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ma, H. et al. Germline and somatic mtDNA mutations in mouse aging. PLoS ONE 13, e0201304 (2018).
Zhang, R., Wang, Y., Ye, K., Picard, M. & Gu, Z. Independent impacts of aging on mitochondrial DNA quantity and quality in humans. BMC Genomics 18, 890 (2017).
Kelly, R. D. W., Mahmud, A., McKenzie, M., Trounce, I. A. & St John, J. C. Mitochondrial DNA copy number is regulated in a tissue specific manner by DNA methylation of the nuclear-encoded DNA polymerase gamma A. Nucleic Acids Res. 40, 10124–10138 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zhang, D. et al. Mitochondrial DNA mutations activate the mitochondrial apoptotic pathway and cause dilated cardiomyopathy. Cardiovasc. Res. 57, 147–157 (2003).
Article CAS PubMed Google Scholar
Golob, M. J. et al. Mitochondria DNA mutations cause sex-dependent development of hypertension and alterations in cardiovascular function. J. Biomech. 48, 405–412 (2015).
Article PubMed Google Scholar
Lee, S., Na, J.-H. & Lee, Y.-M. Epilepsy in Leigh syndrome with mitochondrial DNA mutations. Front. Neurol. 10, 496 (2019).
Parker, W. D. & Parks, J. K. Mitochondrial ND5 mutations in idiopathic Parkinson’s disease. Biochem. Biophys. Res. Commun. 326, 667–669 (2005).
Article CAS PubMed Google Scholar
Starikovskaya, E. et al. Mitochondrial DNA variation of Leber’s hereditary optic neuropathy in Western Siberia. Cells 8, 1574 (2019).
Hopkins, J. F. et al. Mitochondrial mutations drive prostate cancer aggression. Nat. Commun. 8, 656 (2017).
Kalsbeek, A. M. F. et al. Mutational load of the mitochondrial genome predicts pathological features and biochemical recurrence in prostate cancer. Aging 8, 2702–2711 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bacman, S. R. et al. MitoTALEN reduces mutant mtDNA load and restores tRNA Ala levels in a mouse model of heteroplasmic mtDNA mutation. Nat. Med. 24, 1696–1700 (2018).
Article CAS PubMed PubMed Central Google Scholar
He, Y. et al. Heteroplasmic mitochondrial DNA mutations in normal and tumour cells. Nature 464, 610–614 (2010).
Article CAS PubMed PubMed Central Google Scholar
Cui, H. et al. Comprehensive next-generation sequence analyses of the entire mitochondrial genome reveal new insights into the molecular diagnosis of mitochondrial DNA disorders. Genet. Med. J. Am. Coll. Med. Genet. 15, 388–394 (2013).
CAS Google Scholar
Ye, K., Lu, J., Ma, F., Keinan, A. & Gu, Z. Extensive pathogenicity of mitochondrial heteroplasmy in healthy human individuals. Proc. Natl Acad. Sci. USA 111, 10654–10659 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kelly, P. S. et al. Ultra-deep next generation mitochondrial genome sequencing reveals widespread heteroplasmy in Chinese hamster ovary cells. Metab. Eng. 41, 11–22 (2017).
Article CAS PubMed Google Scholar
Hazkani-Covo, E., Zeller, R. M. & Martin, W. Molecular poltergeists: mitochondrial DNA copies (numts) in sequenced nuclear genomes. PLoS Genet. 6, e1000834 (2010).
Article PubMed PubMed Central Google Scholar
Richly, E. & Leister, D. NUMTs in sequenced eukaryotic genomes. Mol. Biol. Evol. 21, 1081–1084 (2004).
Article CAS PubMed Google Scholar
Dayama, G., Emery, S. B., Kidd, J. M. & Mills, R. E. The genomic landscape of polymorphic human nuclear mitochondrial insertions. Nucleic Acids Res. 42, 12640–12649 (2014).
Article CAS PubMed PubMed Central Google Scholar
Goios, A., Carvalho, A. & Amorim, A. Identifying NUMT contamination in mtDNA analyses. Forensic Sci. Int. Genet. Suppl. Ser. 2, 278–280 (2009).
Article Google Scholar
Wang, D. et al. Mitochondrial DNA enrichment reduced NUMT contamination in porcine NGS analyses. Brief. Bioinform. https://doi.org/10.1093/bib/bbz060 (2019).
Article PubMed PubMed Central Google Scholar
Calvignac, S., Konecny, L., Malard, F. & Douady, C. J. Preventing the pollution of mitochondrial datasets with nuclear mitochondrial paralogs (numts). Mitochondrion 11, 246–254 (2011).
Article CAS PubMed Google Scholar
Goios, A., Prieto, L., Amorim, A. & Pereira, L. Specificity of mtDNA-directed PCR—influence of NUclear MTDNA insertion (NUMT) contamination in routine samples and techniques. Int. J. Leg. Med. 122, 341–345 (2008).
Article Google Scholar
Luo, S. et al. Biparental Inheritance of Mitochondrial DNA in Humans. Proc. Natl Acad. Sci. USA. 115, 13039–13044 (2018).
Article CAS PubMed PubMed Central Google Scholar
Balciuniene, J. & Balciunas, D. A nuclear mtDNA concatemer (Mega-NUMT) could mimic paternal inheritance of mitochondrial genome. Front. Genet. 10, 518 (2019).
Wei, W. et al. Nuclear-mitochondrial DNA segments resemble paternally inherited mitochondrial DNA in humans. Nat. Commun. 11, 1740 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bai, R. et al. Interference of nuclear mitochondrial DNA segments in mitochondrial DNA testing resembles biparental transmission of mitochondrial DNA in humans. Genet. Med. 23, 1514–1521 (2021).
Tamura, K. & Aotsuka, T. Rapid isolation method of animal mitochondrial DNA by the alkaline lysis procedure. Biochem. Genet. 26, 815–819 (1988).
Article CAS PubMed Google Scholar
Gould, M. P. et al. PCR-free enrichment of mitochondrial DNA from human blood and cell lines for high quality next-generation DNA sequencing. PLoS ONE 10, e0139253 (2015).
Article PubMed PubMed Central Google Scholar
Williams, S. L. et al. The mtDNA mutation spectrum of the progeroid Polg mutator mouse includes abundant control region multimers. Cell Metab. 12, 675–682 (2010).
Article CAS PubMed PubMed Central Google Scholar
Avital, G. et al. Mitochondrial DNA heteroplasmy in diabetes and normal adults: role of acquired and inherited mutational patterns in twins. Hum. Mol. Genet. 21, 4214–4224 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hefti, E. & Blanco, J. G. Mitochondrial DNA heteroplasmy in cardiac tissue from individuals with and without coronary artery disease. Mitochondrial DNA Part DNA Mapp. Seq. Anal. 29, 587–593 (2018).
Article CAS Google Scholar
Kloss-Brandstätter, A. et al. Somatic mutations throughout the entire mitochondrial genome are associated with elevated PSA levels in prostate cancer patients. Am. J. Hum. Genet. 87, 802–812 (2010).
Article PubMed PubMed Central Google Scholar
Tranah, G. J., Katzman, S. & Cummings, S. R. Mitochondrial DNA heteroplasmy leads to age-related functional decline and increased mortality risk. Innov. Aging 1, 858–859 (2017).
Article PubMed Central Google Scholar
Amer, W. et al. Evolution analysis of heterogeneous non-small cell lung carcinoma by ultra-deep sequencing of the mitochondrial genome. Sci. Rep. 7, 11069 (2017).
Article PubMed PubMed Central Google Scholar
Schubert, A. D. et al. Somatic mitochondrial mutation discovery using ultra-deep sequencing of the mitochondrial genome reveals spatial tumor heterogeneity in head and neck squamous cell carcinoma. Cancer Lett. 471, 49–60 (2020).
Article CAS PubMed Google Scholar
Santibanez-Koref, M. et al. Assessing mitochondrial heteroplasmy using next generation sequencing: A note of caution. Mitochondrion https://doi.org/10.1016/j.mito.2018.08.003 (2018).
Article PubMed Google Scholar
Dai, Y. et al. Behavioral and metabolic characterization of heterozygous and homozygous POLG mutator mice. Mitochondrion 13, 282–291 (2013).
Article CAS PubMed PubMed Central Google Scholar
Perier, C. et al. Accumulation of mitochondrial DNA deletions within dopaminergic neurons triggers neuroprotective mechanisms. Brain J. Neurol. 136, 2369–2378 (2013).
Article Google Scholar
Safdar, A. et al. Endurance exercise rescues progeroid aging and induces systemic mitochondrial rejuvenation in mtDNA mutator mice. Proc. Natl Acad. Sci. USA 108, 4135–4140 (2011).
Article CAS PubMed PubMed Central Google Scholar
Jenuth, J. P., Peterson, A. C. & Shoubridge, E. A. Tissue-specific selection for different mtDNA genotypes in heteroplasmic mice. Nat. Genet. 16, 93–95 (1997).
Article CAS PubMed Google Scholar
Liggett, L. A., Sharma, A., De, S. & DeGregori, J. FERMI: a novel method for sensitive detection of rare mutations in somatic tissue. G3 GenesGenomesGenetics 9, 2977–2987 (2019).
Article CAS Google Scholar
Dai, P. et al. Calibration-free NGS quantitation of mutations below 0.01% VAF. Nat. Commun. 12, 6123 (2021).
Article CAS PubMed PubMed Central Google Scholar
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at arXiv:13033997 Q-Bio (2013).
Picard Tools - By Broad Institute. https://broadinstitute.github.io/picard/ (2021).
RStudio Team. RStudio: Integrated Development Environment for R. Studio, PBC Boston, MA http://www.rstudio.com/ (2021).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. (Springer-Verlag, 2009).
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly. 6, 80–92 (2012).
Article CAS PubMed PubMed Central Google Scholar
Quiros, P. M., Goyal, A., Jha, P. & Auwerx, J. Analysis of mtDNA/nDNA ratio in mice. Curr. Protoc. Mouse Biol. 7, 47–54 (2017).
Article CAS PubMed PubMed Central Google Scholar
Venegas, V. & Halberg, M. C. Measurement of mitochondrial DNA copy number. Methods Mol. Biol. 837, 327–335 (2012).

Download references

Acknowledgements

This work was funded by the Wellcome Trust, the Intramural Program of the National Human Genome Research Institute and the NIH Intramural Sequencing Center.

Funding

Open Access funding provided by the National Institutes of Health (NIH).

Author information

Authors and Affiliations

Gene and Environment Interaction Section, National Human Genome Research Institute, NIH, Bethesda, MD, USA
Darren J. Walsh, David J. Bernard, Faith Pangilinan, Madison Esposito & Lawrence C. Brody
School of Biotechnology, Dublin City University, Dublin, Ireland
Darren J. Walsh, Denise Harold & Anne Parle-McDermott

Authors

Darren J. Walsh
View author publications
You can also search for this author in PubMed Google Scholar
David J. Bernard
View author publications
You can also search for this author in PubMed Google Scholar
Faith Pangilinan
View author publications
You can also search for this author in PubMed Google Scholar
Madison Esposito
View author publications
You can also search for this author in PubMed Google Scholar
Denise Harold
View author publications
You can also search for this author in PubMed Google Scholar
Anne Parle-McDermott
View author publications
You can also search for this author in PubMed Google Scholar
Lawrence C. Brody
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.J.W., M.E. and D.J.B. performed the sample collections and enrichment. D.J.W. and D.H. performed bioinformatics and subsequent analysis. D.J.W., D.J.B., F.P., D.H., A.P.McD. and L.C.B. prepared the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lawrence C. Brody.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

All animal protocols were reviewed and approved by the National Human Genome Research Institute (NHGRI) Animal Care and Use Committee prior to animal experiments.

Peer review

Peer review information

Communications Biology thanks Caleb Lareau and Chadi A. El Farran for their contribution to the peer review of this work. Primary Handling Editor: George Inglis. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1–4

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Walsh, D.J., Bernard, D.J., Pangilinan, F. et al. Mito-SiPE is a sequence-independent and PCR-free mtDNA enrichment method for accurate ultra-deep mitochondrial sequencing. Commun Biol 5, 1269 (2022). https://doi.org/10.1038/s42003-022-04182-2

Download citation

Received: 15 June 2021
Accepted: 27 October 2022
Published: 19 November 2022
DOI: https://doi.org/10.1038/s42003-022-04182-2

This article is cited by

A PCR-independent approach for mtDNA enrichment and next-generation sequencing: comprehensive evaluation and clinical application
- Dong Liang
- Lin Zhu
- Zhengfeng Xu
Journal of Translational Medicine (2024)
A foundation for comparative genomics and evolutionary studies in Nucella lapillus based on complete mitogenome assembly
- Daniel García-Souto
- Jonathan Fernández-Rodríguez
- Juan Galindo
Marine Biology (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Mito-SiPE produces highly enriched mitochondrial DNA

Mito-SiPE requires an alternative alignment strategy

Heteroplasmy analysis comparison between Mito-SiPE and lrPCR in Polg mutator mice

Discussion

Methods

Breeding and tissue harvesting

Tissue homogenisation

Mitochondrial DNA isolation from fresh mouse tissue

Library preparation and next-generation DNA sequencing

Data processing and alignment

Variant calling and mutation analysis

Quantification of mtDNA copy number

Long-range PCR enrichment of mtDNA

Mitochondrial isolation via Qiagen QProteome™ kit

Plasmid-Safe™ digest for mtDNA enrichment

HepG2 culture conditions

Qiagen QIAprep miniprep

Statistics and reproducibility

Solutions

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links