Introduction

Duchenne muscular dystrophy (DMD; OMIM no.: 310200) is one of the most common neuromuscular diseases of childhood. It is an X-linked recessive disease that affects 1 in 3600–6000 live male births.1, 2 Becker muscular dystrophy (BMD; OMIM no.: 300376) is milder than DMD. DMD and BMD are caused by mutations in DMD (MIM no. 300377). The dystrophin gene located at Xp21.2 is large and contains 79 exons forming a 14-kb mRNA script with lengthy introns that spans more than 2.5 Mb of the genomic DNA.3 The mutational spectrum was reported to be as follows: deletions in ~60% of patients, duplications in ~10% of patients and small mutations in ~30% of patients, including small insertions or deletions within an exon (~7%) and nonsense mutations (~15%).4, 5, 6 Multiplex ligation-dependent probe amplification (MLPA) is a widely used method, and it is the initial diagnostic test of choice in many hospitals. Although MLPA can only diagnose patients with deletions/duplications, another 30% of patients with point mutations need direct sequencing of all coding regions. Therefore, diagnostic methods have been conducted separately, which eventually increase medical costs and delay diagnosis.7, 8 At present, promising mutation-specific therapies have been developed. For example, exon skipping is applicable to patients that have a deletion in DMD. In contrast, induction of read-through of nonsense codons is expected to produce full-length dystrophin in DMD patients with nonsense mutations, which comprise ~15% of DMD cases.9, 10, 11, 12 Likewise, in Japan, genetic tests are covered by medical insurance only once in a lifetime. Therefore, a single genetic test that enables a genetic diagnosis in more patients is required.

Recently, next-generation sequencing (NGS) approaches have become major tools for finding the causative genes for diseases. Lim et al.13 reported a mutational search platform for the genetic diagnosis of DMD/BMD. Wei et al.14 also reported a mutational search for DMD/BMD and female carriers using targeted NGS on HiSeq2000 (Illumina, San Diego, CA, USA). To the best of our knowledge, this is the first study that investigated the clinical availability of Ion Torrent Personal Genome Machine (Ion PGM; Thermo Fisher Scientific, Waltham, MA, USA) for the detection of large deletions, duplications and small mutations in DMD. We selected 67 cases previously diagnosed by MLPA and direct sequencing, and developed a novel computational framework for detecting deletions/duplications.

Materials and methods

Patients

We chose 67 cases with different mutations to test the clinical utility of our method: 17 cases with common deletions, 20 cases with common duplications (2 of them with discontinuous double duplications) and 30 cases with point mutations, including nonsense mutations, deletions, insertions, splice regions and missense mutations (Table 1). All deletion/duplication patterns are frequently diagnosed by MLPA in our laboratory, as shown in Supplementary Tables S1 and S2 and Supplementary Figures S1 and S2. Among 30 small mutations, 10 mutations were located within five or more consecutive bases (P16, 18, 19, 22, 25, 26, 27, 28, 29 and 30) and 5 mutations were deletion/insertion of more than nine bases (P12, 17, 21, 23 and 24). These mutations were predicted to be difficult to detect by Ion Torrent sequencer. All clinical information and materials used in this study were obtained for diagnostic purposes with written informed consent. This study was approved by the ethics committee of the National Center of Neurology and Psychiatry.

Table 1 67 cases with different mutations diagnosed by MLPA or Sanger sequencing

Targeted NGS sequencing

Targeted NGS covering the dystrophin gene was performed on genomic DNA extracted from blood lymphocytes of patients. Multiplex primer pools were designed using Ion AmpliSeq Designer software (Thermo Fisher Scientific, MA, USA). This custom gene panel covers 99.6% of the exonic region, including the flanking region (30 base pairs from the exon–intron boundary), and it covers 100% of the coding region. Enrichment of exonic sequences was performed with an Ion AmpliSeq Library Kit 2.0 (Thermo Fisher Scientific, MA, USA) and sequenced on an Ion PGM (Thermo Fisher Scientific, MA, USA) using 318 Chip (Thermo Fisher Scientific, MA, USA) according to the manufacturer’s protocol.

Deletion/duplication detection

The number of reads mapped on each amplicon was counted by bedtools.15 The read that intersects with multiple amplicons was assigned to the amplicon of the largest overlap. The normalized read count was calculated as follows: the raw read count was divided by the total reads on the C-terminal region, which covers exons 70–79, and then multiplied by one million. We used exons 70–79 as we rarely observed mutations in the patients.4, 5, 6 We used the average value of the 19 control cases. For each amplicon, the mean value and sample s.d. were calculated. It was considered to be duplicated when both of the following conditions were met: if the normalized read count was >1.5-fold of the mean value in control samples and if the normalized read count deviated from the mean value by over 2-fold of sample s.d., the amplicon (Supplementary Figure S3).

Results

Most small mutations were detected by targeted NGS (P1–30)

We chose 15 cases with different small mutations, together with 15 mutations predicted to be difficult to detect by Ion Torrent sequencing, such as single-nucleotide variants (SNVs) in the same consecutive bases or relatively large deletions. We failed to detect five small insertions. Among these five, there were four small insertions (P17, 18, 26 and 27) that were located in the same consecutive bases.

Control cases for copy number analysis (P1–19)

To set a normalized read count, we performed amplicon coverage on 19 cases (P1–19) that were negative for deletion/duplication mutation in DMD by MLPA analysis (Figure 1). The coverage ratio for all amplicons was 1±0.5. The amplicon for exon 12 had low amplification on 19 cases, leading to a ratio of 1.5 that was not usable. The cutoff values for the 19 controls were 2 times the s.d. and 1.5 times for the duplication (Supplementary Figure S3).

Figure 1
figure 1

The representative amplicon coverage plot of a control case (P3). The horizontal axis shows exon number and the vertical axis shows the ratio, which is the patient ‘r.p.m.’ divided by control r.p.m. (r.p.m.=106 × (read number for each amplicon)/(read number for each pool exons 70–79)).

All large deletions were detected by targeted NGS (P31–47)

The coverage plot of the 17 large deletion cases is shown in Figure 2. For deletion cases (P31–47), the deleted exons detected by NGS were completely consistent with the results obtained using MLPA. There was a variation in the ratio of each amplicon for P45.

Figure 2
figure 2figure 2

The amplicon coverage plot of 17 patients with large deletion mutation. The horizontal axis shows exon number and the vertical axis shows the ratio, which is the patient r.p.m. divided by control r.p.m. (rpm=106 × (read number for each amplicon)/(read number for each pool exons 70–79)). Each exon is covered by 1–3 amplicons, only exon 79 is covered by 13 amplicons. If the ratio is zero, the patient has a deleted exon.

Most large duplications were detected by targeted NGS (P48–67)

The coverage plot of the 20 large duplication cases is shown in Figure 3. In the cases with large duplications (P48–67), 10 of the 20 duplicated exon cases were accurately detected (P48, 51, 54, 55, 56, 58, 63, 64, 66 and 67). Ten cases (P49, 50, 52, 53, 57, 59, 60, 61, 62 and 65) had false negative or positive amplicons. However, seven cases (P49, 50, 57, 59, 60, 62 and 65) had only one false negative or positive amplicon. Therefore, overall, we can detect the precise duplicated exons in these seven cases. P52 had several false positive regions due to marked variation in read counts; thus we could not identify the duplicated exon.

Figure 3
figure 3figure 3

The amplicon coverage plot of 20 patients. The horizontal axis shows the exon number and the vertical axis shows the ratio, which is the patient r.p.m. divided by control r.p.m. (rpm=106 × (read number for each amplicon)/(read number for each pool exons 70–79)). If the value of the ratio is 1.5 or more, the patient has a supplicated exon. We have drawn a line at the ratio of 1.5.

P53 was reported to have exons 8–11 duplication by MLPA, but the Ion PGM result showed that exons 8, 10 and 11 were duplicated and exon 9 was deleted. We suspected that Ion PGM primers failed to amplify the exon 9 amplicon due to a nucleotide variant in the primer-binding site. Thus, we sequenced exon 9 and identified point mutations at a primer site in the exon 9-flanking region in intron 8 (c.832-53C>T).

MLPA showed that P61 has an exons 18–48 duplication, but the Ion PGM result showed that exons 18–47 were duplicated. We also suspected that there are variants that may interfere with the MLPA probe or Ion PGM primers. We sequenced exon 48 by Sanger sequencing but found no small mutations in exon 48.

Discussion

In this study, we established the clinical utility of Ion Torrent sequencing technology for the detection of deletions/duplications and small mutations in DMD by comparison with the conventional method, MLPA and Sanger sequencing in the large number of the DMD/BMD patients in the Japanese cohort. As we expected, all large deletions and 50% of the large duplications are correctly detected. We also aimed to clarify the percentage of the detection yield of the small mutations predicted to be difficult to read by Ion PGM. Three out of 10 mutations located with >5 consecutive bases and one out of five small deletions/insertions of more than 10 bases were not detected. Out of 255 patients with point mutations in our in-house data (Supplementary Figure S4), we estimate that Ion PGM could not identify 1% of DMD/BMD cases with such small insertions (Figure 4).

Figure 4
figure 4

The mutational spectrum from our in-house data and reported data.

Previous reports have demonstrated the ability of NGS, Illumina sequencer to detect SNVs and deletions/duplications in DMD13 and copy number variations (CNVs) in female carriers.14 Gene panels that contained many genes (Lim et al.13 with 25 genes and Wei et al.14 with 222 genes), including DMD, were used in these studies. In this report, we targeted single DMD gene using Ion PGM for the first time that effectively enables us to perform the test at a lower cost compared that with other studies. Because of the small sample size in the previous study and different cohort of the mutation tested, we do not know which sequencer has more advantage on DMD gene testing. Further comparison study will be necessary.

We used multiplex primer pools covering 100% of the coding region, comprising 161 amplicons for 79 exons. One to four amplicons covered each exon except exon 79, which was covered by 13 amplicons. When an exon is covered only by one amplicon, it would be difficult to determine when there is a single-nucleotide polymorphism at the primer site. For example, P53 showed an exon 9 deletion by Ion PGM, but we could amplify exon 9 by PCR and Sanger sequencing identified with a point mutation at the primer site of Ion PGM. In P48 and P56, we detected duplication of exon 2 and 49–50, respectively, but these exons are covered by one amplicon. Therefore, it should be noted that if there were a point mutation at the primer site, we could have not correctly diagnosed these cases. This problem can also occur with MLPA when it detects a single exon deletion. If there is a sequence variation close to the probe ligation site deletion/duplication will not be identified correctly by MLPA.16 As for the Ion PGM, we recommend designing two or more amplicons for each exon to increase the diagnostic accuracy.

Detecting the large duplications has some difficulty in both MLPA and Ion PGM. For example, MLPA showed that P61 has an exons 18–48 duplication, but the Ion PGM result showed that exons 18–47 was duplicated. In this study, exon 48 was covered by two amplicons. If there is a breakpoint within the exon 48 that disrupts the binding of either PCR primer, but preserves the MLPA probe site, it is possible to have a confounding result between MLPA and coverage analysis (Supplementary Figure S5). This could occur in detecting duplications by both MLPA and Ion PGM. Thus, we require careful interpretations for detecting the end regions of exons for putative duplication events.

Given the difficulty in interpreting certain large duplications and insertions, MLPA or Sanger sequencing may be required to complement Ion PGM in some cases. Nonetheless, a single genetic test that could give a precise genetic diagnosis to more patients would be preferable, saving time and money for the patients. Accordingly, we estimate that 92% of DMD/BMD patients are accurately diagnosed using the Ion PGM (Figure 4). In addition, we assume that sequencing DMD gene even in the patients with copy-number variation mutation will be necessary to exclude the possibility of a rare single-nucleotide polymorphism that might impact exon skipping therapy or possible double mutation, which can blemish the effect of gene therapy. Therefore, we believe NGS will be a standard for the diagnosis of DMD/BMD in the future.

Here, we suggest a new diagnostic flowchart for DMD/BMD patients in the future. First, if the patient is suspected to have DMD/BMD, we recommend performing Ion PGM for DMD (92% are estimated to be diagnosed). In addition, MLPA can work as a complimentary test to confirm the duplication that is difficult to interpret from Ion PGM result (5%). If the mutation is not detected, the next step is to examine dystrophin expression on the sarcolemma and exclude a possibility of loss of other proteins such as sarcoglycans in the muscle biopsy specimen, and to perform direct Sanger sequencing for all coding regions to detect mutations that are difficult to read by Ion PGM. This proposed diagnostic flowchart will finally give a genetic diagnosis for 98% of patients. Our in-house data showed there are 2% of the patients in whom we could not detect any mutations by MLPA or Sanger sequencing of all the exons and flanking regions, but show abnormal dystrophin staining. These patients might have mutations in deep introns or dystrophin expression abnormality might be secondary. Therefore, we demonstrate that NGS can be used for an initial genetic diagnosis test of DMD/BMD.

This is the first study to show the utility of Ion PGM to detect both deletions/duplications and small mutations in DMD/BMD patients.