Introduction

Inherited retinal dystrophy (IRD) is a class of monogenic, vision-impairing disorders that are caused by retinal degeneration. IRD includes retinitis pigmentosa (RP) and allied diseases. The common manifestation is a pathological change in rod and/or cone photoreceptor cells, which are the specialized and light-sensitive neurons in the retina.1 More than 200 disease-causing genes associated with IRD have been identified (RetNet, https://sph.uth.edu/retnet/), demonstrating the extreme genetic heterogeneity of the disease and therefore making molecular diagnosis technically challenging, because it entails the commitment of substantial time and resources.2

Next-generation sequencing (NGS) is a powerful new approach that enables the simultaneous screening of multiple genes; it has quickly come to be used extensively in molecular diagnosis by sequencing the exome.3,4,5 Notably, this approach has been used for the development of next-generation molecular diagnosis of hereditary eye disorders.6,7,8 Abu-Safieh and colleagues6 designed an autozygome-guided screening panel in which they were able to successfully identify IRD in patients with six novel candidate genes. In another study, 163 known IRD genes were screened for by exome sequencing in 179 patients with Leber congenital amaurosis and juvenile RP, identifying a genetic predisposition in 40.2% (72/179) of the cases.8 Further verifying this approach, Chen et al. used targeted sequencing of 189 genes in members of 41 Chinese families with IRD and identified mutations occurring in 14 families.9 In pilot studies, we successfully performed molecular diagnoses in patients with Usher syndrome and Bardet–Biedl syndrome by assessing a panel of 144 known genes.7,10 Despite success in previous studies, there is still a large proportion of patients with IRD who have unidentified genetic mutations.2 In addition, there are few thorough studies of the relationship between specific genotypes and phenotypes.

Because NGS allows for the screening of limited networked genes in addition to genes known to be associated with the onset of IRD, targeted exome sequencing of additional candidate genes is a rapid and cost-effective method of revealing novel disease-causing genes.9,10 Many novel or unrecognized genes interact with known targets of IRD.12,13 In addition, variations in microRNA (miRNA) expression could lead to a significant abnormality in the retina.14 Taken together, this information suggests that networked genes and miRNAs potentially interact with key IRD genes, possibly inducing a disease state.

In this study, we developed a panel of 164 known genes causing retinal disease, 88 possible candidate genes, and 32 retinal miRNAs for genomic DNA capture. This panel was used in NGS of the exomes of 179 unrelated patients with IRD. We discovered 124 pathological mutations in 99 patients, including 79 novel mutations in previously identified IRD genes. Moreover, several genotype–phenotype correlations were uncovered, including identification of AHI1 as a novel candidate gene for nonsyndromic RP. The identification of novel genotype–phenotype correlations and the mutation spectrum not only greatly enhance knowledge of IRD phenotypic and genotypic heterogeneity but also will assist in both the clinical diagnosis and the development of personalized treatments for IRD.

Materials and Methods

Study subjects

The study protocol was designed in accordance with the tenets of the Declaration of Helsinki and approved by the Ethics Committee of the Eye Hospital of Wenzhou Medical University. A group of 179 Chinese patients diagnosed with IRD by retina specialists were recruited for this study. Informed consent was obtained from all participants. Patients’ initial symptoms and complaints included poor vision or night blindness during their first or second decade of life. All patients received a clinical diagnosis based on clinical eye examinations. A detailed family history was obtained from the probands and/or their relatives. Peripheral blood samples were collected from the affected patients as well as from unaffected relatives.

Design of targeted capture panel

We developed a panel for targeted DNA capture and sequencing by selecting 164 known IRD-related genes from RetNet and OMIM (Supplementary Table S1 online). The panel also included a cluster of 88 possible candidate genes with expression patterns that were spatiotemporally related to retinal development15,16 (Supplementary Table S2 online) and 32 abundant retinal miRNAs17,18 (Supplementary Table S3 online). The targeted capture panel included a total of 252 genes, which comprised 3,376 exons and 615,029-nt coding regions. The panel was constructed using the GenCap custom enrichment kit (MyGenostics, Beijing, China) according to the manufacturer’s instructions.

Paneled exome sequencing

A minimum of 3 μg of genomic DNA was extracted from each blood sample and was broken into fragments to construct the DNA library. Biotinylated single-strand DNA capture probes were hybridized in solution with the target library. Streptavidin-coated magnetic beads were used to wash, elute, and amplify the DNA, which was then subjected to NGS using an Illumina Solexa HiSeq 2000 sequencer.

In-depth bioinformatics analysis

Sequence reads acquired from raw data were aligned to the reference human genome (hg19) using the SOAPaligner program (http://soap.genomics.org.cn/). Single-nucleotide polymorphisms were initially identified by the SOAPsnp program (http://soap.genomics.org.cn/) after filtering polymerase chain reaction duplicates using SAMtools (http://samtools.sourceforge.net/). Insertions or deletions were analyzed using BWA (http://bio-bwa.sourceforge.net/) and GATK programs (https://www.broadinstitute.org/gatk/) and then were extracted using the 1000 Genome, dbSNP135, and HapMap databases, as well as an in-house database of 500 exomes. Sequence variants were standardized according to Human Genome Variant Society variation nomenclature (http://www.hgvs.org/mutnomen/) and checked using the Mutalyzer program (http://www.lovd.nl/mutalyzer/). Missense variants were assessed with PolyPhen-2 and SIFT, as described previously.19 The splice-site effect was predicted using the BDGP (http://www.fruitfly.org/seq_tools/splice.html) and ASSP programs (http://wangcomputing.com/assp/index.html).

Sanger sequencing validation and cosegregation analysis

All of the candidate mutations were confirmed by direct Sanger sequencing using an ABI 3500 Genetic Analyzer (Applied Biosystems, Carlsbad, CA). Variants, including nonsense or frameshift mutations, were considered mutation candidates, expanding the validation that was performed within kinships to analyze cosegregation.

Results

Patients with IRD

The study group included 127 isolated cases (70.9%) and 34 autosomal recessive cases (19.0%); the remaining 18 cases (10.1%) were either autosomal dominant (n = 12) or X-linked (n = 6). Diagnoses of participating subjects included RP, cone–rod dystrophy, Leber congenital amaurosis, Stargardt disease, congenital stationary night blindness, Bietti crystalline dystrophy, and Usher syndrome (Supplementary Table S4 online).

Exome sequencing of paneled genes

The genomic DNA derived from the 179 patients with IRD was subjected to massive parallel NGS, which covered a panel of 252 genes and 32 miRNA regions. On average, the mean coverage depth was 191.5× and included 98.2% of the targeted regions ( Figure 1 ). These results suggested that exome sequencing combined with direct sequencing is sufficient to yield high-quality data for further analysis.

Figure 1
figure 1

Primary data statistics of deep exome sequencing. The black histogram represents the coverage of the target region in each sample. The red line indicates the average sequencing depth of the target region.

Identification of pathogenic mutations in the IRD cohort

Bioinformatics analysis, direct sequencing confirmation, and familiar validation from 179 patients enrolled in this study revealed pathological mutations in 99 cases (55.3%) ( Table 1 ). A total of 124 distinct mutations were identified; 79 novel occurrences had not previously been reported. The other 45 mutations have been reported in previous studies (Supplementary Table S5 online). Notably, among the list of 79 novel mutations, 42 (53.2%) were nonsense, frameshift, or splice-site mutations that severely affected protein function; the other 37 novel missense mutations (46.8%) were predicted to have deleterious protein functional effects in silico. All of these mutations were either absent from the control database or their minor allele frequencies were less than 0.01. Furthermore, cosegregation testing among family members indicated that the mutations cosegregated with disease phenotype. Compound heterozygous mutations or homozygous mutations were confirmed by cosegregation in parents or siblings of affected patients. Dominant mutations either caused multigeneration pedigrees or were confirmed to be de novo mutations. Taken together, these results demonstrated that NGS is a powerful approach for molecular diagnosis of IRD.

Table 1 Distribution and detection rates in patients with inherited retinal dystrophy

Mutation spectrum in Chinese patients with IRD

Among the 179 probands in this study, 161 of the cases (90%) were sporadic or recessive, and causative mutations were successfully identified as genetic defects in 99 unrelated families (55.3%) ( Table 1 ; Supplementary Tables S4 and S5 online). The genotyping data obtained from these patients also indicated a mutation spectrum ( Figure 2 ). Among the 127 patients with sporadic IRD, 63 cases (50%) harbored mutations, whereas, intriguingly, 50 others (79% of the total solved sporadic cases) carried mutations in their autosomal recessive genes, with 10 autosomal dominant (16%) and 3 X-chromosome linked (5%) mutations. These results could be helpful in genetic counseling for patients with sporadic IRD, who are frequently seen in clinics. In this study, 34 families were definite autosomal recessives, whereas 24 were successfully determined to have genetic predispositions, with a detection rate as high as 71% for autosomal recessive IRD ( Table 1 ; Supplementary Tables S4 and S5 online). Among the 26 consanguineous families, mutations were identified in 19 (73%). Interestingly, all mutations were in recessive genes ( Table 1 ; Supplementary Tables S4 and S5 online). In our mutation pool, the top three genes contributing to IRD were USH2A, EYS, and CRB1 ( Figure 2 ). These genes with a high frequency of mutations should be a priority during genetic screening of Chinese patients. These results bring to light, for the first time, a mutational spectrum in Chinese patients with IRD, and they provide valuable insights not only for genetic counseling but also for the diagnosis and therapy of patients with IRD.

Figure 2
figure 2

Mutation spectrum in Chinese patients with inherited retinal dystrophy (IRD). The mutation spectrum indicates the range of mutations observed in the Chinese IRD cohort. Each colored box represents a mutation type: red, missense; green, nonsense; blue, frameshift; gray, splice-site. The most frequently mutated genes were USH2A, EYS, and CRB1.

Screening of possible candidate genes and miRNAs

To reveal any underlying genetic defect in novel genes, we screened 88 possible candidate genes (Supplementary Table S2 online) and 32 miRNAs (Supplementary Table S3 online). However, no pathological mutations were found in any of these genes or miRNA regions. This result indicated that the selected genes and miRNAs either do not confer a genetic predisposition to IRD or that the incidence is very low in these patients.

Evaluation of genotype–phenotype correlations

To investigate the relationship between mutated genes and IRD symptoms, we performed genotype–phenotype analyses of all patients with IRD with disease-causing mutations. Clinical data for all 99 patients, including the most recent best-corrected visual acuity, disease duration, and age at onset, are shown in a scatterplot based on 35 different genes ( Figures 3 and 4 ). The results of assessment of visual acuity are represented by different colors. Disease duration was calculated as the difference between current age and age at onset. Herein, the total 35 causative genes that were responsible for IRD in 99 families were ranked according to the percentage of best-corrected visual acuity less than 0.3 ( Figure 3 ). The genes exhibited on the left of the scatterplot caused severely decreased vision. In this group, best-corrected visual acuity was reduced to less than 0.03 in a high proportion of patients ( Figure 3 ). By contrast, the genes in the right side of the plot correlated with a mild phenotype; almost all patients with these genes had a visual acuity higher than 0.3 ( Figure 3 ). The genes in the middle showed a moderate phenotype, including the two genes with highest frequency of mutations:USH2A and EYS. We were also able to clarify the average age at onset and distribution ( Figure 4 ). The red line represents the average age at onset, and the blue histogram exhibits the number distribution of five age groups. The genes to the left caused more early onset disease compared with those on the right ( Figure 4 ). Despite the high genetic and clinical heterogeneity of IRD, phenotypic trends such as visual acuities and age at onset were correlated with genotypes. Collectively, the results of this study of a large sample bring to light valuable and significant genotype–phenotype correlations.

Figure 3
figure 3

Relationship between visual acuity and specific causative genes. The phenotypic severity of visual acuity is classified into six groups, which are plotted in different colors, according to the causative genes and disease duration. The disease duration is calculated as the difference between the current age and age at onset. The blue line represents the percentage of patients with best-corrected visual acuity (BCVA) less than 0.3.

Figure 4
figure 4

Relationship between average age at onset and specific causative genes. The red line represents the average age at onset, and the blue histogram displays the number distribution of five age groups.

Revision of the initial clinical diagnosis

We observed an inconsistency between the initial clinical diagnosis and genetic defects. In a consanguineous family (Supplementary Figure S1 online), patient W29-1 was diagnosed as having sporadic RP. However, a pathogenic homozygous mutation in the NR2E3 gene (c.290G>A, p.R97H) was identified (Supplementary Figure S1 online) that was reported to cause enhanced S cone syndrome (ESCS) in previous studies.20,21 To explain this inconsistency, proband W29-1 was sent for a clinical reassessment. Full-field electroretinography showed extinguished rod responses (Supplementary Figure S1 online). The maximal dark-adapted responses had amplitudes significantly below the normal minimum and an unusually broad waveform, and oscillatory potentials were almost absent. After adaptation to light, the cone responses presented abnormal cone waveforms and were similar to the dark-adapted maximal responses. The 30-Hz flicker responses were strikingly weak and delayed (Supplementary Figure S1 online). Electroretinography demonstrated that the phenotype was consistent with typical ESCS.20,21 Consequently, we successfully revised the clinical diagnosis of patient W29-1. To the best of our knowledge, this is the first recorded case of ESCS in the Chinese population.

Observation of a novel genotype–phenotype correlation

In a 67-year-old patient (W83-1) with sporadic RP (Supplementary Figure S2 online), a de novo mutation (c.628G>T, p.V210F) in the RHO gene was identified, which was reported in a previous study22 (Supplementary Figure S2 online). The fundus showed typical bone spicule pigmentation. Interestingly, the macula of the right eye was severely abnormal, showing prominent and deep macular coloboma (Supplementary Figure S2 online). As far as we know, unilateral macular coloboma was first observed in patients with RP with a RHO gene mutation.

Identification of AHI1 as a novel candidate gene for nonsyndromic RP

AHI1 was previously reported to cause Joubert syndrome (JBS),23 a congenital systemic disease associated with the involvement of multiple organs, including the retina, kidneys, skeleton, and liver. In this study we identified compound heterozygous mutations (c.653A>G, p.Y218C; c.3257A>G, p.E1086G) in the AHI1 gene in proband W196-1, who had sporadic RP (Supplementary Figure S3 online). Cosegregation analysis confirmed that the Y218C mutation was inherited paternally and that the E1086G mutation was inherited maternally (Supplementary Figure S3 online). The missense mutation Y218C was novel, and another mutation was reported previously as one causing JBS.24 The novel mutation was predicted in silico to be strongly pathogenic, with an amino acid change in a highly conserved region (Supplementary Figure S3 online). Interestingly, the proband showed only a typical RP phenotype (Supplementary Figure S3 online) without systemic manifestation. These results suggest that the AHI1 is a novel candidate gene for nonsyndromic RP.

Discussion

In the past 2 decades, almost 200 genes have been linked to IRD. The significant genetic heterogeneity of this disease is a major obstacle for molecular diagnosis in clinical practice. Traditional methods of individual gene screening are difficult and unlikely to reveal the full mutation spectrum in the patient population. Tremendous efforts to develop comprehensive screening strategies have been made, including genotyping microarray,25 denaturing high-performance liquid chromatography,19 and high-resolution melting analysis.26 However, issues such as prolonged processing time and an inability to identify novel mutations persist. NGS is a high-throughput approach capable of efficiently sequencing large gene pools, and it provides large data sets. It has become a powerful tool for elucidating complete mutation profiles for heterogeneous diseases.27,28 In this study we developed a new capture panel to comprehensively screen 164 known IRD genes. Using a group of 179 unrelated patients with IRD, we successfully identified pathological mutations in 99 patients (55.3%). Of note, among the total 124 mutation described in this study, 79 mutations were previously reported worldwide. We previously screened five patients with Bardet–Biedl syndrome for BBS genes10 and five patients with Usher syndrome for USH genes7 using the same panel; these were considered pilot studies to test the methodology and have been excluded from this study. Summarizing all these results, the detection rate was 57.7% (109/189). To the best of our knowledge, this is the largest investigation of genetic screening in Chinese patients with IRD. The previous studies focused only on several specific genes or screened a small cohort of patients; therefore, the contributions of causative genes to IRD in patients remain unclear. The results of this study outline a significant mutation spectrum in Chinese patients with IRD. Among the 35 genes identified in the cohort of 179 patients, the genes with the highest frequency of mutations were USH2A, EYS, and CRB1, indicating that these genes should be a priority when screening Chinese patients. This improved knowledgebase provides valuable insights and aids in genetic counseling, diagnosis, and potential therapy for patients with IRD.

Epidemiological studies assessing different populations have indicated that half of all mutations in patients with IRD are sporadic or multiplex, and, unfortunately, those cases are the ones most prevalent in the clinic.19,29 In our cohort of patients with IRD, 90% of probands displayed sporadic or autosomal recessive mutations, which makes molecular diagnosis more challenging.30 Several groups, including ours, have attempted to challenge the molecular diagnosis of simplex or autosomal recessive IRD19,31,32; however, the detection rates remained lower than 20%. In this study, however, our results showed that 50% of patients with sporadic and 71% of patients with recessive IRD were successfully determined to have genetic predispositions, suggesting that targeted exome sequencing is an efficient and applicable approach for the molecular diagnosis of such patients. In addition, the mutation spectrum delineated in our cohort provides essential information for molecular diagnosis as well as for future gene therapy in Chinese patients with IRD. Notably, two unrelated patients in our cohort were identified as having RPE65 mutations, indicating a possibility of treatment with gene therapy, as several clinical trials have shown significant effects for RPE65.33 By contrast, 80 patients had no mutations identified among the screened genes, suggesting a high possibility of novel undiscovered causes, which warrants further investigation. Most IRD-related genes are critical for physiological or developmental function of the retina.1 For instance, the dysfunction of genes involved in either of the phototransduction cascades, such as the CRX and NRL genes, leads to retinal dystrophy.34,35 With the aim of detecting mutations in possible novel candidate genes in a cost-effective way, a capture panel including 88 possible candidate genes and 32 retina-abundant miRNAs was designed. Most recently, mutations in the SIX6 gene, included in our panel, were linked to an abnormal proliferation–differentiation balance and optic nerve degeneration.36 However, we observed no such pathogenic mutation in the predicted candidate genes or miRNAs regions. This result might be because of true nonpathogenicity or a simply small proportion of these genes in our panel.

In general, IRD presents with high genetic and clinical heterogeneity. So far, few studies have described a genotype–phenotype correlation, mainly because of the limited effectiveness of genetic screening approaches. In this study we successfully identified disease-causing mutations in 99 patients, providing the opportunity to evaluate genotype–phenotype correlations. We investigated the relationships between a lack of visual acuity and age at disease onset, which have different causative mutations. Notably, the known IRD genes could be loosely classified into three distinct groups, showing severe, moderate, or mild visual acuity phenotypes. Trends within genotypes for patients’ age at onset also were noted. However, examination of a larger cohort and identification of more mutations causing IRD are needed to help further orient our findings as genetic associations. These results demonstrated that disease severity might be gene specific, knowledge that will be of value in prognosis and future therapeutic interventions.

Interestingly, the first case of ESCS in the Chinese population and a rare case of unilateral macular coloboma with RHO gene mutation were identified in this study. Patient W29-1 was diagnosed as having RP. However, a previously reported NR2E3 gene mutation was discovered in this proband; this mutation also cosegregated with phenotype in the patient’s family. Therefore, we revisited the proband and observed clinical features that were in accordance with ESCS (Supplementary Figure S1 online), resulting in a revised clinical diagnosis of ESCS. Together, these results clearly illustrate the power and importance of the combination of genotype and phenotype in the molecular diagnosis of heterogeneous diseases.

For the first time, our study links mutations in the AHI1 gene to nonsyndromic RP. AHI1 mutations were first observed in patients with JBS, a developmental brain disorder with diverse symptoms in several organs, including the retina.37 The diagnostic hallmark of retinal JBS is a unique cerebellar and brainstem malformation. In our study patient W196-1, who had sporadic RP, was identified as having compound heterozygous mutations (c.653A>G (p.Y218C) and c.3257A>G (p.E1086G)) in AHI1 (Supplementary Figure S3 online). The mutations completely cosegregated with phenotype in all family members tested. The missense mutation Y218C was reported initially, whereas another mutation was identified as a causative mutation of JBS in previous studies.24 However, the proband showed typical RP phenotypes, without any additional manifestation of JBS. This result indicates that AHI is a novel candidate gene for nonsyndromic RP. Moreover, several previous studies demonstrated that an affected gene from syndromic RP could also cause isolated retinal disease with no additional symptoms. Examples include CEP290, mutations in which can cause either JBS or Leber congenital amaurosis,38 and CLN3, which is associated with Batten disease and nonsyndromic retinal degeneration.39 The novel genotype–phenotype correlations described in this study enhance our knowledge of the molecular mechanisms of retinal dystrophies.

In summary, we have developed a paneled exome sequencing methodology for the molecular diagnosis of IRD and identified 124 mutations, including 79 novel mutations, in a group of 179 Chinese patients. Furthermore, novel genotype–phenotype correlations of IRD were uncovered, and we established a novel candidate gene for nonsyndromic RP. To the best of our knowledge, this is the largest comprehensive genetic screening of Chinese IRD to date. This study expands the field of genotype–phenotype correlations and the mutational spectrum of the Chinese population with IRD, increasing our understanding of molecular mechanisms of the disease and aiding the clinical diagnosis and personalized treatment of patients with IRD.

Disclosure

The authors declare no conflict of interest.