Introduction

Leber congenital amaurosis (LCA) (OMIM no. 204000) represents the earliest and most severe autosomal recessive retinal dystrophy, causing profound visual deficiency or blindness from birth. LCA has a worldwide incidence of ~1/30,000 and is the most frequent cause of childhood blindness. Patients are diagnosed in their first year of life based on the presence of severe visual loss, a nondetectable or strongly impaired electroretinogram, and nystagmus.1,2

As for all retinal dystrophies, the progressive retinal degeneration leading to blindness has been irreversible thus far. However, in 2008, three independent phase I clinical trials achieved a major breakthrough following gene-replacement therapy in three young adults with RPE65-related LCA, which was shown to be safe and resulted in visual improvement in a subset of patients.3,4,5 A follow-up study including children demonstrated even more beneficial effects at a younger age.6 The efficacy and safety persisted through up to 2 years.6,7,8

Despite this tremendous step forward, a major obstacle remains in identifying LCA patients eligible for gene-specific treatment due to a massive genetic heterogeneity. Sixteen disease genes are known to account for ~70% of LCA cases, leaving the remaining 30% unexplained (RetNet, http://www.sph.uth.tmc.edu/RetNet).9 Moreover, some of these genes are also involved in syndromic diseases.10,11 Consequently, an early molecular diagnosis offers the prospect of specific and adequate medical follow-up.

To date, the most commonly used genetic test for LCA is a microarray evaluating 641 known variants in 13 genes.12 Although this technique represents a good first-pass screening, it fails in detecting new mutations, is expensive in a routine context, and has a variable, population-dependent detection rate.13,14,15 Secondary genetic tests comprise denaturing high-performance liquid chromatography and Sanger sequencing, mostly of a subset of genes because of excessive costs and workload.1,16,17 Hence, an urgent need exists for an effective approach able to identify mutations in all currently known LCA genes.

Massively parallel sequencing (MPS) technologies are highly suitable for molecular diagnosis of genetically heterogeneous disorders, given their high throughput and decreasing sequencing cost.18 However, a major challenge remains to enrich regions of interest, such as exons, from the patient’s genome. Although hybridization-based DNA capture is often used, this approach has several limitations, including a high cost, suboptimal capturing efficiency at repetitive regions, interference from homologous sequences, a large variation in coverage, and lack of flexibility if new regions need to be captured.19,20,21 Only a few groups have designed PCR-based enrichment protocols, mostly using long-range PCR.22,23,24,25 Following amplification, all products are quantified and normalized to equimolar amounts, which is time-consuming when handling large numbers of samples.

The goal of this study was to design an accurate, fast, and cost-efficient tool for molecular testing of all known LCA genes using MPS. For this purpose, we applied a novel quantitative PCR (qPCR)-based enrichment strategy that overcomes the aforementioned issues. We tested this LCA panel in 22 LCA patients, performed a thorough variant validation as proof of concept, and identified causal mutations in a subset of prescreened patients.

Materials and Methods

Patients

This study was conducted in accordance with the Declaration of Helsinki. Twenty-two sporadic LCA patients were included. The first part of this study was mainly intended for validation and comprised 10 patients (patients 1–10); the second part of this study was an application of the workflow in 12 additional patients without previously identified mutations (patients 11–22).

For the first part, the causal genetic defect was known for five patients (so-called positive controls; patients 4, 5, 6, 9, and 10), whereas no mutations had yet been identified in the remaining five (patients 1, 2, 3, 7, and 8). All patients previously underwent microarray testing (LCA chip versions 2004–2010; Asper Ophthalmics). In the positive control patients, Sanger sequencing was performed of CEP290, CRB1, RPE65, GUCY2D, AIPL1, and CRX.16 In addition, patients 4 and 5, originating from a consanguineous marriage, were sequenced for IQCB1 and RDH12, respectively, following identification of these genes in homozygous regions (Affymetrix GeneChip Human Mapping 250K arrays; Affymetrix, Santa Clara, CA).

Prescreening of the 12 patients included in the second part of this study consisted of LCA chip analysis in all patients and sequencing of CEP290, CRB1, RPE65, GUCY2D, AIPL1, and CRX in patients 18–22.16

More than 160 unrelated healthy individuals were used as a control panel. Genomic DNA was extracted from leukocytes using the Puregene DNA isolation kit (Gentra, Minneapolis, MN).

Enrichment

Primers were designed using the in-house primerXL pipeline to cover all 252 exons of RD3, RPE65, CRB1, MERTK, IQCB1, LRAT, LCA5, TULP1, IMPDH1, CEP290, RPGRIP1, RDH12, SPATA7, AIPL1, GUCY2D, and CRX and the deep intronic CEP290 mutation c.2991+1655A>G (Ensembl, Release 55, GRCh37) (S. Lefever, F. Pattyn, B. De Wilde et al., unpublished data). Two primer designs were performed (maximal amplicon length of 400 and 600 bp). From these designs, a selection was made to obtain an overall coverage of all target bases as efficiently as possible.

qPCRs were prepared in 384-well plates with a 96-well head pipetting robot (Tecan Freedom Evo 100; Tecan, Männedorf, Switzerland) using the SsoFast EvaGreen Supermix (Bio-Rad, Nazareth Eke, Belgium). Subsequently, amplicons from one patient were pooled, ligated, and sheared. Parameters of the primer design and conditions of the qPCR, ligation, and shearing reactions are specified in Supplementary Methods and Procedures online.

Sequencing on the Illumina Genome Analyzer IIx

Library preparations and sequencing were performed by the Ghent University NXTGNT consortium following manufacturer’s protocols. All patients were uniquely tagged using the Multiplexing Sample Preparation Oligonucleotide kit (Illumina, Eindhoven, the Netherlands). Of the 12 LCA patients pooled in the first lane, 10 were used for validation of the protocol (lane 1, patients 1–10). Libraries were prepared from a 300-bp size-selected fraction, subjected to 18 cycles of PCR and evaluated for their size (2100 Bioanalyzer, DNA 1000 kit; Agilent, Diegem, Belgium) as well as concentration (Quant-iT PicoGreen dsDNA assay; Invitrogen, Ghent, Belgium). Finally, 120 µl of a pool of the 12 normalized libraries (concentration of 7 pmolar) was subjected to single-end sequencing of 100 cycles.

For the second lane, libraries were prepared from a 200–300-bp size-selected fraction, and library quantification was performed using qPCR following manufacturer’s protocols (lane 2, patients 11–22). In total, 120 µl of a pool of the 12 normalized libraries (concentration of 10 pmolar) was subjected to paired-end sequencing of 2 × 45 cycles.

Read mapping and variant analysis

Image analysis and base calling was performed using the Genome Analyzer Pipeline Software (Illumina). Sequence analysis was carried out with the NextGENe software v2.00 (SoftGenetics, LLC, State College, PA). Reads with a median quality score <20 or with ambiguous nucleotide calls (N ≥ 3) were removed. Subsequently, reads were aligned against GenBank reference sequences ( Supplementary Table S1 online).

Two distinct condensation methods were used: error correction and consolidation (differences specified in Supplementary Methods and Procedures online). The latter merges overlapping sequences and uses consensus sequences instead of the original reads. Default alignment settings were applied, with two exceptions: the minimal coverage for a mutation to be called was set to 1 and the “detect structural variations” option was selected. For lane 2, paired reads were taken into account during error correction mapping.

Coverage data were obtained for all exons with 20-bp up- and downstream intronic sequence. Variant nomenclature uses numbering with the A of the initiation codon ATG as +1 (http://www.hgvs.org/mutnomen), based on the RefSeqs listed in Supplementary Table S1 online.

The pathogenic potential of novel variants was assessed using the Alamut mutation interpretation software v1.54 (Interactive Biosoftware, Rouen, France), complemented by the NetGene2 Server26 and Berkeley Drosophila Genome Project splice-site prediction tools.27

Results

We present a high-throughput molecular test for all currently known LCA genes using a novel enrichment strategy based on qPCR, ligation, and fragmentation, followed by sequencing on the Illumina Genome Analyzer IIx. The first part of this study was mainly aimed at validation of the enrichment protocol (proof of concept, patients 1–10), whereas the second part of this study consisted of a blind screening of 12 prescreened mutation-negative patients with LCA (patients 11–22).

Enrichment of LCA disease genes

qPCR was used to target all exons from 16 LCA disease genes. In total, 375 qPCR amplicons were designed, which together amplify ~152 kb. As a validation, qPCR parameters were evaluated in patients 1–10 ( Figure 1 and Supplementary Figure S1 online). More than 80% of amplicons generated a quantification cycle (Cq) value between 23 and 27 ( Figure 1a ). In addition, end point fluorescence values ranged from 40 to 65 in more than 86% of amplicons, pointing at very similar end concentrations ( Figure 1b ).

Figure 1
figure 1

Workflow overview. For each patient, all exons and intron–exon boundaries of the 16 known LCA genes are amplified using qPCR, followed by random ligation and shearing. Subsequently, 12 amplicon pools are indexed and sequenced in one lane of the Illumina Genome Analyzer IIx. (a) Cumulative distribution plot of mean Cq values of both replicates for each patient of lane 1. (b) Cumulative distribution plot of mean end-point fluorescence values of both replicates for each patient of lane 1. (c) Merged Agilent Bioanalyzer electropherograms (DNA 7500) of purified pooled PCR-product (blue, 1/4 dilution), purified end-repaired DNA (green, 1/4 dilution), and ligated DNA (red, 1/3 dilution) from patient 7. (d) Agilent Bioanalyzer electropherogram (DNA 1000) of sheared product from patient 7. (e) Screenshot of the NextGENe alignment viewer showing the homozygous AIPL1 mutation c.885del, identified in patient 17. Cq, quantification cycle; LCA, Leber congenital amaurosis; qPCR, quantitative PCR.

Cycling conditions were adjusted to allow amplification of fragments with a length ranging from 118 to 783 bp. No effect of amplicon size was observed ( Supplementary Figure S1a,b ). However, qPCR amplification was influenced by the amplicon GC content and secondary structure Gibbs free energy (dG) ( Supplementary Figure S1c–f ).

Supplementary Figure S2 shows ligation and shearing results. As only a fraction of unligated PCR product was present in comparison with the amount of ligated DNA, no additional purification was performed ( Figure 1c ).

Sequencing output and coverage

Supplementary Tables S2 and S3 online summarize the results of quality filtering, condensation, and alignment steps performed by NextGENe for lanes 1 and 2. The wide variability in the number of reads between patients included in lane 1 can be attributed to the picogreen-based library quantification method, which was replaced by qPCR quantification for lane 2.

Further coverage and variant analysis was based on the National Center for Biotechnology Information RefSeq sequences and the 236 (224 coding) exons defined herein ( Supplementary Table S1 ). Figure 2 shows cumulative percentage plots of the average and minimal coverage for the original as well as condensed reads for all exons, including 20-bp up- and downstream intronic sequences.

Figure 2
figure 2

Distribution of average and minimal coverage for both original and condensed reads. Coverage data were obtained using .bed files containing all exons with 20-bp up- and downstream intronic sequence.

The expected average coverage of lane 1 was calculated to be 918-fold per patient, in case of 200,000 clusters/tile with a 70% pass filter rate. Due to a lower number of clusters and wide distribution of reads between patients ( Figure 2 ), however, this expected coverage was achieved only for patients 1, 2, 7, and 8. For these patients, more than 90% of exons were covered 40 times or more ( Supplementary Table S2 ).

Improved specifications for lane 2 resulted in an expected average coverage of 1,239 (300,000 clusters/tile with a 70% pass filter rate), which was exceeded in all patients. More than 90% of exons had a minimal coverage of at least 40 in patients 11 and 16–22 ( Supplementary Table S3 ).

Overall, 17 exons lacked sufficient coverage depth in all patients. These exons correspond with 23 distinct amplicons, of which 18 amplicons (5%) resulted in zero coverage for the whole or part of the exon, presumably due to high GC content and/or low dG ( Supplementary Table S4 ).

Variant analysis and interpretation

Validation of the protocol using previously identified variants. Validation of our protocol consisted of the assessment of all sequence variants, both mutations and polymorphisms, that were previously identified in patients 1–10.

First, the presence of seven mutations was evaluated (patients 4–6, 9, and 10). All of these known mutations were detected by NextGENe ( Table 1 ). The frequency of the homozygous mutations varied between 93% and 100%. The frequency of the heterozygous changes approximated 50% for patient 9 but was quite low for both mutations present in patient 6, despite sufficient coverage ( Table 1 ). Strikingly, the frequency of both these heterozygous variants was higher when considering mapping with error correction as the condensation method ( Table 1 ). For c.3713_3716dup, this could partially be explained by the presence of reads starting or ending at the duplicated GCCT site in the consolidation mapping project ( Supplementary Figure S3 online). Importantly, the frequency of the other mutations was not increased in the error-corrected mapping project in comparison with the consolidation mapping project ( Table 1 ).

Table 1 Evaluation of seven mutations previously identified by Sanger sequencing, using consolidation as well as error-correction methods

Second, the protocol was validated by the evaluation of 100 (39 distinct) polymorphisms previously identified, including 42 heterozygous and 58 homozygous variants located within the exons and 200-bp up- and downstream sequence ( Supplementary Table S5 ). All variants were identified in the consolidation project, except for three representing two distinct polymorphisms. The first one is c.907-16_907-14del (RPGRIP1), which was present in patients 2 (heterozygous), 5 (homozygous), 6 (heterozygous), and 10 (heterozygous) according to the LCA chip. However, NextGENe identified this variant only in patients 2 and 10 ( Supplementary Table S5 ). For patient 5, the variant was present in one read, but not included in the mutation report. Again, when considering the original data in the error-corrected mapping project, the c.907-16_907-14del variant was visible in both patients, albeit in only one read in patient 6 ( Supplementary Figure S4a ). The frequency of 26% of this variant in the error-correction project of patient 5 questioned the homozygous call of the LCA chip. Indeed, Sanger sequencing confirmed a heterozygous c.907-16_907-14del variant. Although the repetitive nature of the region might hamper correct detection, an important discrepancy for the variant frequency remains for patient 5 between the two condensation methods. A similar observation was made for c.2818-50G>C (CEP290), which was previously identified by Sanger sequencing in a heterozygous state in patient 5. Although this variant was present in 20% of reads in the error-corrected mapping project, it was not reported in the consolidation mapping project ( Supplementary Figure S4b ).

In addition to c.907-16_907-14del (RPGRIP1), two polymorphisms displayed a frequency <25% ( Figure 3 ). They were both detected in a heterozygous state in patient 5, having the second lowest number of reads, and were located in a homopolymeric tract of 7 (c.4704+46del) and 8 (c.3574-9del) thymines in CEP290 ( Supplementary Table S5 ).

Figure 3
figure 3

Distribution of variant allele frequencies of 7 mutations and 97 polymorphisms previously identified. Taking into account all 104 variants reported by NextGENe, the mean frequencies for heterozygous and homozygous changes were 46.4 (s.d. = 12.9) and 97.6 (s.d. = 5.2), respectively. Het, heterozygous; Hom, homozygous.

The mutation nomenclature handled by NextGENe approximated Human Genome Variation Society guidelines (http://www.hgvs.org), as shown in Tables 1 and 2 . One important inconsistency was noted. For the c.2441_2442del mutation, NextGENe reported two variants: one at position 2441 (c.2441_2442delTA) and one at position 2442 (no mutation call). At position 2441, a single-nucleotide polymorphism has been described, corresponding to a missense variant (rs62636268). NextGENe included this polymorphism in the information for the c.2441_2442delTA call. Strikingly, the c.2441_2442delTA was not listed anymore when reported variants were excluded from the mutation report. However, NextGENe still included a c.2442delA variant, occurring at the second position of the deletion.

Table 2 Overview of mutations and unclassified variants identified in the mutation-negative patients

Identification of mutations in mutation-negative patients. This study included 17 patients in which previous Sanger sequencing of six genes and/or LCA chip analysis did not identify causal mutations (lane 1: patients 1–3, 7, and 8; lane 2: patients 11–22). Given the exclusion of 3 of 107 known variants by the consolidation method of condensation, variant analysis was performed using error correction, in which the original reads are maintained. To search for potential mutations in these patients, variants were first selected based on their coverage and variant allele frequency. Subsequently, variants were evaluated for their potential pathogenic effect using Alamut, their frequency in dbSNP (version 132) and the other patients, and their presence in literature and/or locus-specific mutation databases (http://www.retina-international.org/sci-news/mutation.htm). All potential mutations were confirmed through Sanger sequencing of the involved amplicon ( Supplementary Table S6 ).

This approach allowed the identification of the causal genetic defect in three patients ( Table 2 ). First, patient 7 was found to be heterozygous for c.2302C>T (p.Arg768Trp) and c.2182G>A (p.Asp728Asn) in GUCY2D. Both variants affect a strongly conserved nucleotide and amino acid, are predicted to affect protein function, and are described as disease-causing variations (https://www.carverlab.org/database). Moreover, p.Arg768Trp is presumably a founder mutation in the northwest of Europe.15 This mutation was actually a missed call on the LCA chip. Second, a novel homozygous missense mutation in RDH12, c.176T>G (p.Leu59Arg), was identified in patient 13. This mutation affects a highly conserved nucleotide and amino acid, is predicted to affect protein function, and is located in the dehydrogenase/reductase domain. Third, a novel homozygous frameshift mutation in AIPL1, c.885del (p.Ser296ProfsX10), was found in patient 17. This patient is also a heterozygous carrier of c.472G>A (p.Ala158Thr) in CRX, which has previously been described in cone-rod dystrophy and LCA, as well as in 2/110 control individuals.28,29 The presence of the A allele in 12.7% of the Yoruba population (1000Genomes Project, pilot_1_YRI_low_coverage_panel) points to a polymorphism rather than a mutation (rs61748445). In addition, patient 2 carries two AIPL1 missense variants of which the pathogenic effect is currently uncertain: c.140C>G (p.Thr47Arg) and c.937G>T (p.Ala313Ser) ( Table 2 ). Both variants were absent in >160 control individuals.

Moreover, six patients were heterozygous carriers of a single potential pathogenic variant, without a second mutation identified following Sanger sequencing of all exons lacking sufficient coverage. First, a heterozygous c.2577G>T (P.=) variant in GUCY2D was identified in patient 1. This substitution affects the first nucleotide of exon 14 and is predicted to cause loss of the wild-type acceptor splice site by Alamut, NetGene2,26 and the Berkeley Drosophila Genome Project.27 The T allele was found in 1.7% of the Yoruba population (1000Genomes project, pilot_1_YRI_low_coverage_panel) and in 0.6% of 4,548 chromosomes of the National Heart, Lung, and Blood Institute’s Exome Sequencing Project (rs112372281). In addition, this variant was absent in >160 control individuals (own data). Interestingly, patient 13 is also heterozygous for this variant. Second, patient 14 is heterozygous for the novel c.1441G>A (p.Glu481Lys) mutation in IQCB1, which affects a highly and moderately conserved nucleotide and amino acid, respectively, and is predicted to affect protein function. Third, two novel heterozygous missense variants in MERTK were identified in patients 11 (c.1893C>G, p.Ile631Met) and 21 (c.2237A>G, p.Lys746Arg), respectively. Both variants affect a highly conserved nucleotide and amino acid, are predicted to affect protein function, and are located in the tyrosine kinase domain. Finally, a single novel heterozygous unclassified variant was found in patient 18 (c.3773C>T, p.Thr1258Ile; RPGRIP1) and patient 20 (c.461_463del, p.Glu154del; CEP290) ( Table 2 ).

Discussion

The goal of this study was to design and validate a comprehensive, accurate, and affordable molecular test for LCA. To this end, we developed an innovative workflow for fast and flexible enrichment of a large number of genes before MPS. Our straightforward approach is based on high-throughput qPCR amplification, ligation, and shearing, thus enabling sequencing of regions of interest with variable length, such as exons, on a short-read sequencer ( Figure 1 ).

Our workflow holds a number of technical advantages as compared with current molecular tests for retinal dystrophies based on large-scale resequencing. So far, five groups have developed a molecular test for retinal disease genes using custom-made resequencing chips30,31 or MPS.32,33,34 Three of these either employ PCR-based enrichment followed by product quantification30,34 or use a PCR efficiency MPS run32 to obtain equimolar products prior to sequencing. However, the combination of our in-house primerXL pipeline and highly efficient qPCR amplification eliminates the need for time-consuming amplicon normalization, as shown by the uniform coverage distribution within patients ( Figure 2 ). In addition, the use of a single qPCR protocol enabled high-throughput amplification in a small volume, thereby overcoming the need for complex PCR multiplexing ( Figure 1 ).31 Moreover, qPCR enrichment provides an additional internal quality control. The majority of the 18 failed amplicons (5%) were characterized by secondary structures and a high GC content ( Supplementary Table S4 online). The additional optimization required will, however, be more straightforward and flexible in comparison with extra design rounds necessary in case of hybridization-based capturing.20 In general, the amplification of GC-rich regions could be improved by the addition of qPCR enhancers.35 In addition, base-composition bias during the Illumina library preparation has recently been addressed.36 As for now, these 18 amplicons are included in the Sanger confirmation step, which is currently indispensable for the confirmation of variants identified by MPS.

Other differences lie in the sequencing technology. In contrast to resequencing chips,30,31 MPS is able to detect small deletions, as illustrated here ( Tables 1 and 2 and Supplementary Table S5 ). The applied workflow of reproducible ligation and shearing enabled sequencing of amplicons with a variable length of up to 650 bp by 1 × 100 or 2 × 45 cycles of the Illumina Genome Analyzer IIx. In addition to its large capacity, this platform was chosen because of a lower number of false-positives in homopolymeric regions, characteristic for 454 sequencing.22,32 The use of indexing allowed pooling of 12 patients in one Genome Analyzer IIx lane, reducing false-negatives common to untagged pooling strategies.34

Finally, one of the major advantages of our workflow is its flexibility, which is crucial for genetically heterogeneous disorders for which a number of disease genes remain to be identified (i.e., 30% in LCA). This feature is often lacking in both resequencing arrays and hybridization-based capturing. Our workflow enables not only easy addition of new genes but also expansion of regions of interest from coding regions to, for instance, all exons, as applied here. Sequencing of untranslated regions on a large scale has not yet been performed for retinal disease genes and might provide novel insights into the regulation of retinal gene expression.

Overall, the lack of time-consuming optimization of amplification assays, as well as amplicon normalization, the decreasing cost of qPCR with an increasing number of samples to be analyzed, and the use of sample pooling on a short-read sequencer with high capacity, make this approach very cost-effective and thus highly suitable to a clinical context. Of note, the cost and turn-around time of MPS of all LCA genes are lower and shorter, respectively, in comparison with our current routine workflow consisting of LCA chip analysis followed by tedious gene-by-gene Sanger sequencing of a limited gene set.16

The workflow was subjected to a thorough validation of 107 previously identified variants using NextGENe. This software provides a consolidation method of condensation that merges and elongates reads containing the same anchor sequence. Using this tool, we observed a more evenly distributed coverage ( Figure 2 ) and a lower number of sequencing errors. However, three heterozygous polymorphisms (2.8%), each present in the original reads, were eliminated ( Supplementary Figure S4 online). Importantly, the variants were present following mapping using error-correction condensation, a method that corrects low-frequency errors but does not merge reads. The differences between the two condensation methods stress the importance of dedicated evaluation of data analysis tools for MPS projects. In addition, one should be careful with the exclusion of reported variations from the mutation report, as this could lead to exclusion or incorrect nomenclature of mutations located at the same nucleotide position.

This study included 17 patients for whom the genetic defect was not yet identified following Sanger sequencing of six genes and/or LCA chip analysis. Using the current workflow, the causal genetic defect was found in 3 of 17 patients, in addition to the involvement of two AIPL1 unclassified variants in a fourth patient ( Table 2 ). Taking into account the contribution of all 16 genes to LCA (~70%)1 and the detection rate of the LCA chip in the Belgian population (~41%),16 mutations were expected in 29% or 5/17 individuals. The lower detection rate is presumably due to the additional prescreening of six genes by Sanger sequencing performed in 5 of 17 patients.

Patient 7 is heterozygous for two known GUCY2D mutations (p.Arg768Trp and p.Asp728Asn). Of these, p.Arg768Trp was a missed call on the LCA chip, for which ~3.5% of interrogations fail.13 Moreover, MPS revealed a second inconsistency of the LCA chip, namely a homozygous instead of heterozygous call for c.907-16_907-14del. Patients 13 and 17 are homozygous for the novel mutations p.Leu59Arg (RDH12) and p.Ser296ProfsX10 (AIPL1), respectively. Interestingly, patient 13 is also heterozygous for an unclassified variant with potential splicing effect ( Table 2 ). The occurrence of variants in multiple LCA genes has previously been reported.12,37 So far, it is unclear whether they influence disease penetrance or represent modifying alleles. Undoubtedly, application of MPS-based strategies on a large scale will reveal more insights into the understanding of intra- and interfamilial variability, individual phenotypes, and a patient’s visual prognosis.

In line with this, the MPS panel identified a single heterozygous mutation or unclassified variant in six additional patients ( Table 2 ). First, these might represent modifier alleles influencing mutated alleles of yet to be identified LCA genes. Second, the mutation on the second allele might have been missed, as our current approach is not able to detect deep-intronic mutations, regulatory variants (other than in the 5′ and 3′ untranslated regions), or large copy number variations. A recently reported 9-kb deletion in MERTK, in which we found two novel heterozygous mutations ( Table 2 ), serves as an example of this.38 Ideally, the described MPS approach should be complemented with accurate copy number variation detection, using, for instance, multiplex ligation-dependent probe amplification,39 high-resolution array comparative genomic hybridization,40 or qPCR.41 Of note, qPCR for copy number variation detection should meet specific assay requirements that are not included in the current workflow.41

The MPS panel presented here is the most comprehensive molecular test currently available for LCA. Identification of a molecular diagnosis is highly important for several reasons. First, it unequivocally confirms the clinical diagnosis, which sometimes offers not only a visual but also a systemic prognosis. Indeed, our panel includes genes associated with LCA as part of a syndrome (CEP290, IQCB1) in addition to the nonsyndromic LCA genes. The identification of mutations in such genes contributes importantly to general clinical management. Second, a molecular diagnosis opens options for family planning such as prenatal diagnosis or preimplantation genetic diagnosis and provides the basis for recurrence risk assessment. Last but not least, the knowledge of the molecular defect is a prerequisite for gene-specific therapy, as recently established for LCA.3,4,5,6,7,8 This study appointed mutations in AIPL1, GUCY2D, and RDH12 as the molecular cause of LCA in three patients. For both AIPL1 and GUCY2D, proof-of-concept studies in animal models have shown beneficial effects of subretinal delivery of adeno-associated vectors containing the wild-type coding sequence.42,43,44 The young age of these patients (10, 2, and 5 years) and the knowledge of their molecular defect identified them as potentially eligible for future gene therapy trials.

In conclusion, we present the first comprehensive molecular test for LCA, based on a novel workflow that allows accurate, fast, flexible, and straightforward enrichment of exons with a variable size range, followed by MPS on a short-read sequencer. The thorough validation and low costs characteristic of our approach argue for its implementation in a clinical context. The strategy used here could easily be applied to other genetically heterogeneous disorders such as other retinal dystrophies, deafness, ataxia, and arrhythmia.

Disclosure

The authors declare no conflict of interest.