Introduction

Massive parallel sequencing is rapidly being applied to identify disease-causing variants in disorders with locus and allelic heterogeneity. Both targeted gene panel sequencing1 and exome or genome sequencing approaches2, 3, 4 are being used in diagnostic neurogenetics. Both approaches allow for parallel analyses of multiple genes (panels) associated with a given disorder in a single test. However, where targeted gene panel approaches are limited to this, exome (or genome) sequencing enables variant detection beyond a predefined gene panel or in ‘broader’ gene panels. The unbiased nature of exome sequencing can thus result in the discovery of novel gene–disease associations, in the expansion of disease spectra, or in the recognition of very rare and unanticipated disorders. On the other hand, exome sequencing bears the risk of finding pathogenic variants in genes not related to the original request. To minimize the chance of having such ‘unsolicited’ findings, bioinformatic gene panels may be used for initial filtering.5

Here, we have used clinical exome sequencing with bioinformatic gene panel filtering for variant detection in cerebellar ataxia (CA) and hereditary spastic paraplegia (HSP). The prevalence rates of CA and HSP are 1–5 per 100 0006 and 1–10 per 100 000,7 respectively. CA and HSP are genetically heterogeneous, with over 70 loci/genes involved in each, often with little or no clues for genetic differentiation. In this study, the exome sequencing data were managed using a bioinformatic gene panel filter containing 200 genes for central motor disorders, including not only CA and HSP, but also Parkinson’s disease and various hyperkinetic movement disorders.

Subjects and methods

Patients

In total, 76 patients with CA (n=28) or HSP (n=48) were included for clinical exome sequencing. The cohort was largely nonconsecutive. Patients with a visiting history from our center were selected from corresponding databases, contacted, and then referred to and counseled by a clinical geneticist. New patients were recruited at their first visit. No restrictions on age at onset or inheritance pattern were made, but there had to be a clinical suspicion of a genetic etiology, based on the combination of a CA and/or HSP phenotype plus either an onset age below 45 years or a positive family history. The latter was liberally defined as any affected family member suggestive of dominant, recessive, or X-linked inheritance. Moreover, mutations in the most likely gene(s) had to be excluded. All patients were counseled by a clinical geneticist and informed consent was obtained. From each family, only one patient (proband) was included for exome sequencing. Candidate mutations were verified by Sanger sequencing in affected family members whenever possible. All but three patients were adults at the time of testing, but in many the disease had started in childhood (see Tables 1 and 2). Most patients had already undergone extensive genetic testing, with a mean of 4.5 genes for each group. Autosomal dominant cerebellar ataxias caused by repeat expansions were excluded in all CA patients with dominant or sporadic modes of inheritance. In addition, SPG types 4 and 7 were excluded in ~75 and 50% of the HSP patients before exome sequencing. The set of genes excluded per patient was quite variable, but this reflects the variation of clinical practice in case of different requesting and referring physicians. Clinical exome sequencing was approved by the Medical Review Ethics Committee, Region Arnhem-Nijmegen, Number 2011/188.

Exome sequencing and data analysis

Exome sequencing was essentially performed as previously described.5 Briefly, capture of exons was done using an Agilent SureSelect Human All Exon 50 Mb Kit (Santa Clara, CA, USA). Sequencing was performed using a Life Technologies 5500XL machine (Thermo Fisher, Waltham, MA, USA) or an Illumina Hiseq 2000 (San Diego, CA, USA). Read mapping and variant calling were done using LifeScope Life Technologies (Thermo Fisher) for the 5500XL data or BWA (mapping) and GATK (calling) for the Illumina data. A filter for a ‘movement disorders’ gene panel was applied. This panel consists of 200 genes implicated in various forms of CA, HSP, Parkinson’s disease, genetic choreas, and other hyperkinetic movement disorders. Only genes with substantial evidence (multiple families, functional evidence, and/or literature reports) were included in this panel. Genes with repeat expansions as molecular mechanism only were not included either. The genes in this panel and coverage statistics are provided as Supplementary File 1) and can also be found at www.genomediagnosticsnijmegen.nl/exome. Variants were prioritized based on the following criteria: frequency (<5% dbSNP, <1% in-house database of >5000 exomes), nucleotide and amino acid conservation (based on alignments), relation of the gene to disease (per family), and inheritance pattern.5 All reported variants were confirmed by Sanger sequencing.

All possible causative variants have been deposited into the appropriate gene-specific databases (https://grenada.lumc.nl/LOVD2/mendelian_genes (Patient ID 0064817), PMM2 (Patient ID 0064818) or http://databases.lovd.nl/shared/variants (SPG7), 00054866 (GOSR2), 00054867 (SCN8A), 00054869 (C10orf2), 00054870 (KCNC3), 00054871& 00054875 (OPA1), 00054872 (GBA2), 00054873 (FA2H), 00054874 (KIAA0196), 00054876 (TH), 00054877 and 00054878 (KIF1A), and 00054879 (LRRK2).

Results

Diagnostic yield of exome sequencing for CA and HSP

Initially, variants were identified by exome sequencing and an in-house designed ‘movement disorder’ filter in 33 of 76 patients. Some of those were directly considered causative, whereas others needed follow-up by segregation analysis, clinical re-evaluation, or biochemical tests. Pathogenicity was based on nature of the variants and previous reports (see Tables 1 and 2) and frequency in controls and prediction algorithms (Table 3). Of the 28 CA patients, eventually 9 had causative variants, 1 a possibly causative variant, and another 4 likely noncausative variants (Table 4). Of the 48 HSP patients, 8 had causative, 8 possibly causative, and 3 had likely noncausative variants. Thus, the diagnostic yield of clinical exome sequencing in this cohort is somewhere between 17 and 26 of 76 (1 in 4).

Table 1 Cerebellar ataxia patients with (a) causative variants and (b) possible causative variants identified by exome sequencing
Table 2 Hereditary spastic paraplegia patients with (a) causative variants and (b) possible causative variants identified by exome sequencing
Table 3 Frequency and pathogenicity scores of identified causative and possible causative variants
Table 4 Diagnostic yield of gene panel analysis after exome sequencing

Molecular diagnoses in cerebellar ataxia

Three patients with autosomal recessive or sporadic CA had biallelic SPG7 variants (Table 1a). Spastic paraplegia type 7 (SPG7) accounts for 5–12% of autosomal recessive forms of HSP, depending on the population tested.20 In approximately half of the SPG7 patients, CA is an additional clinical feature.21 However, two of the three patients described here had no pyramidal signs, but rather a pure CA; the third did have pyramidal features at clinical re-evaluation. Two of these patients are compound heterozygous for two common variants in spastic paraplegia type 7, p.(Arg485_Glu487del) and p.(Ala510Val), suggesting that pure ataxia caused by SPG7 variants is not caused by specific variants, but part of the SPG7 disease spectrum.

Three other patients (4–6; Table 1a) had ataxia complicated by seizures, and exome sequencing revealed variants in genes involved in rare disorders. Patient 4 carried a de novo variant in SCN8A. Variants in this gene are associated with early epileptic infantile encephalopathy type 13.22 Patient 5 was homozygous for the only GOSR2 variant known to date to cause progressive myoclonic epilepsy type 6,23 whereas patient 6 has an earlier described nonsense variant in CACNA1A, a gene involved in various disorders with ataxia and/or epilepsy.

The remainder of the variants (Table 1a) was considered to be pathogenic in CA for several reasons. Patient 7 has ataxia complicated by ovarian dysgenesis, deafness, neuropathy, and a mild intellectual disability that is compatible with mitochondrial DNA depletion syndrome caused by C10orf2 (Twinkle) variants. In patient 8, two well-characterized PMM2 variants with reduced enzymatic activity24 were identified. Despite the often severe CDG1a phenotype associated with PMM2 variants, a relative mild neurological phenotype with ataxia as the main feature has been described before.25, 26 The missense variant in KCNC3 (patient 9) was considered causative, as it has been described earlier in ataxia and, in the present case, was found in an affected sibling and in a mosaic state in the unaffected mother.

A heterozygous variant in OPA1, the gene for dominant optic atrophy type 1 and optic atrophy-plus syndrome, was considered to be possibly causative (patient 10, Table 1b). However, despite segregation of this variant with the disease in two siblings and despite the fact that this variant affects an evolutionary conserved residue, it is quite uncertain whether this variant is the cause of the patient’s ataxia. The optic atrophy plus syndrome is mostly a recessive disorder, and no second variant was identified using this test. The test has several limitations, including incomplete coverage of the gene, no detection of deep-intronic variants (as have been found in OPA127), or the poor detection of exon deletions/duplications. In addition, the OPA1 gene seems to be quite a polymorphic gene, with many nonpathogenic sequence variants. Analysis of the OPA1 gene from our anonymized cohort of healthy parents from trio-based studies (n=2224) revealed 7 infrequent (<1% carrier frequency) sequence variants that lead to substitutions of moderately or well conserved amino acids. Thus, further testing of the OPA1 gene and its involvement in this family’s phenotype is warranted. Unfortunately, further segregation analysis was not possible as the parents were not available for testing.

Other variants were thought to be possibly causative initially, but eventually considered likely noncausative, for the following reasons: (1) they did not segregate with the disease or (2) no second pathogenic variant in the same gene was detected by Sanger sequencing in recessive disorders (Supplementary File 2). Of note, AP5Z1 (KIAA0415) variants have been described in autosomal recessive spastic paraplegia type 48.28 However, only two families have been described in the original paper, and in a replication study no further pathogenic AP5Z1 variants were identified in a cohort of 127 ataxia patients.29 Two affected siblings from the original study were homozygous for a frameshift variant, whereas a third unrelated patient had only one heterozygous frameshift variant. These findings are inconclusive concerning the putative inheritance pattern proposed for AP5Z1 variants. The AP5Z1 nonsense variant in patient 28 was not present in his affected sibling, making it an unlikely cause of the CA in both siblings. Furthermore, using exome sequencing, we recently identified another AP5Z1 heterozygous null variant (c.1312-2A>G) that did not segregate with spastic ataxia in another family (not part of this cohort), whereas a homozygous AP5Z1 nonsense variant was revealed in a patient with retinitis pigmentosa but no motor symptoms. Therefore, it seems unlikely that AP5Z1 variants are related to disease and we consider SPG48 for now to be unconfirmed.

Molecular diagnoses in hereditary spastic paraplegia

In eight HSP patients, clinical exome sequencing revealed variants that were considered causative because they had been described in other HSP patients before, they were null variants, or they segregated with the disease (patients 11–18, Table 2a). Three of those were in KIF5A, indicating that SPG10 is relatively frequent in this cohort of HSP patients.

Surprisingly, one of the KIF5A variants (detected in patient 13 and her affected father) leads to intron retention in the mature mRNA in lymphocytes (Supplementary File 3). This is predicted to result in a premature termination codon. Despite the suggestion of gain-of-function or dominant-negative effects of variants in SPG10,30 the variant spectrum consists of missense, splice site, and frameshift variants, indicating that KIF5A haploinsufficiency may also be a mechanism underlying SPG10.

Patient 11 has a heterozygous frameshift variant in RTN2. As two of the three SPG12-causing variants in this gene described to date are null variants,31 haploinsufficiency of RTN2 seems to cause SPG12. Patient 15 has a homozygous nonsense variant in GBA2. This patient has HSP complicated with a mild intellectual disability, cataract, and neuropathy that are major characteristics of SPG46 caused by GBA2 variants.

Genetic and clinical follow-up of two other patients also confirmed the role of the identified variants in their spastic syndrome. In patient 16, one heterozygous FA2H variant was complemented with a second variant identified by Sanger sequencing. The latter was missed by exome sequencing because of coverage issues. The MRI in this patient did not show iron accumulation, a variable finding in SPG35 due to FA2H variants. ‘Reverse phenotyping’ was convincing for Kufor–Rakeb syndrome in patient 17 with ATP13A2 variants, because of the additional presence of mild Parkinsonism, upgaze limitation, and cognitive decline (see Supplementary File 4). A pure HSP in patient 18 and her affected father is likely to be due to a KIAA0196 variant.

A relatively large group of HSP patients had variants of uncertain pathogenicity (Table 2b). Variants in VCP can cause amyotrophic lateral sclerosis type 14 (ALS14) and inclusion body myopathy with early-onset Paget disease and frontotemporal dementia type 1 (IBMPFD1), a very heterogeneous disorder with reduced penetrance of each feature. Recently, we reported on a VCP variant as a new cause of HSP.32 The p.(Arg93His) variant in VCP in patient 19 is novel. However, another variant of the same codon, p.(Arg93Cys), has been described in IBMPFD1,33 suggesting that patient 19 is the second case of HSP caused by a VCP variant. Similarly, patient 20 and his sibling have HSP and a homozygous variant in TH that is normally involved in autosomal recessive dopamine-responsive dystonia. The p.(Glu258Lys) variant in TH in this family is likely to be causative, as the same change, but of the adjacent and equally conserved codon (p.(Glu259Lys)), has been described in patients with dopamine-responsive dystonia.34 Unfortunately, a lumbar punction to study dopamine metabolites and a trial with L-DOPA was refused by both siblings. Interestingly, a variant in the GCH1 gene, causing autosomal dominant dopamine-responsive dystonia, has recently been associated with dopamine-responsive, autosomal dominant HSP,35 suggesting a link between HSP and dystonia.

In the same group of possibly causative variants in HSP, several variants in genes implicated in HSP (KIAA0196 and KIF1A) or in other central motor disorders (LRRK2, AFG3L2, OPA1) were identified, and some of the variants have been published before (but not in HSP). Segregation analyses or functional studies were not possible to confirm or exclude the role of these variants in HSP because of various reasons. However, an interesting observation is the finding of two heterozygous KIF1A variants segregating in two families with autosomal dominant HSP. KIF1A variants have been associated with autosomal recessive HSP in two families,36 and also in de novo autosomal dominant intellectual disability with motor symptoms.37, 38 All variants described in KIF1A in these conditions affect the kinesin motor domain. The mechanism underlying these two different disorders has not been established, but it is tempting to speculate that the KIF1A variant that segregates with HSP in the family of patient 22, which is also in its kinesin motor domain, may cause autosomal dominant HSP. An apparent dominant inheritance of HSP with normal cognition was recently suggested in one family.39 Interestingly, similar variants in the kinesin motor domain of the KIF1A paralog KIF5A are involved in pure autosomal dominant SPG10. The nonsense variant in patient 23, which is not in the kinesin domain, is more difficult to understand.

Three other variants identified in the cohort of HSP patients (Supplementary File 5) were considered to be likely noncausative, as they did not segregate with the disease (variant in KCNC3), were not supported by biochemical analyses (GALC), or because no second pathogenic variant in the same gene was identified (PNPLA6).

Discussion

Clinical exome sequencing in CA and HSP has a high diagnostic yield

Sequencing of single genes in monogenic disorders with high locus and allelic heterogeneity, such as CA and HSP, has a low diagnostic yield per gene. The most common dominant forms of CA are caused by polyglutamine-coding repeat expansions. Spinocerebellar ataxia 3 is the most prevalent CA in this group, accounting for 21% of all SCA patients worldwide, though large differences between populations have been reported.6 The most common recessive type is Friedreich ataxia, with a prevalence of 2–4 in 100 000. As the total number of patients with recessive CA is unknown, it is as yet unclear which fraction is affected with Friedreich ataxia. The detection rate for other genotypes declines steeply, with many forms being extremely rare with prevalences below 1%. Once the ataxias with repeat expansion have been excluded, 70 other genes associated with various other forms of ataxia remain candidates. Similarly, the most common form of HSP is autosomal dominant (AD) SPG type 4,40 explaining 30 to 40% of AD HSP. Many other HSP types are very rare, and sometimes have been described in only one or a few families. The number of loci involved in HSP has now exceeded 70. Thus, a generic test for parallel analyses of all the nonrepeat CA and HSP genes is expected to have a better overall yield than single gene tests. Considering the phenotypic overlap between CA and HSP, a combined analysis of genes involved in both groups of disorders is justified.41 Indeed, clinical exome sequencing in this cohort showed a diagnostic yield (1 in 4), comparable to earlier exome sequencing studies for CA.4 The reasons for not detecting causative variants in approximately three-fourths of the patients in the cohort may be heterogeneous, including that the variants reside outside the tested regions (such as mitochondrial DNA, introns, and promoters), that the variants are in ‘novel’ genes yet to be identified as involved in a central motor disorder, that the variants are not being picked up by this test (exonic regions that are not targeted, poor coverage, or repeat expansions), or a nongenetic basis of the disorder. In addition, it is worthwhile noting that this was a selection of patients negative for conventional gene testing (4.5 genes on average), decreasing the a priori chance to find a variant by exome sequencing. Of note, a limitation of our study was the lack of systematicity in which genes were excluded before exome sequencing, preventing making reliable frequency estimates.

However, our selection of cases for this testing strategy, that is, a genetic etiology had to be suspected, might have led to a higher yield than when a less selected cohort of CA and HSP patients would have been tested. Future re-evaluation of the data (to detect variants in newly identified genes) or future retesting (ie, genome sequencing or mitochondrial DNA sequencing) may eventually provide a molecular cause for some of the patients in the ‘unsolved’ group.

Of note, before exome sequencing was undertaken, repeat expansion disorders were excluded in most CA patients (both here and the study from Nemeth et al1), and similarly SPAST and SPG7 variants were excluded in most of the HSP patients. As it is still not possible to reliably detect repeat expansions by exome sequencing in CA, it remains necessary to test for these first. For HSP, however, clinical exome sequencing could be the first tier testing strategy that would increase its diagnostic yield.

Uncertainty of the pathogenicity of a subset of variants

One of the challenges in clinical genetics is the determination of the pathogenicity of a given variant. In part, variants are clearly or highly likely to be pathogenic based on functional studies, segregation analyses, frequencies of variants in patient and control sets, or simply on the nature of the variant (such as nonsense variant in haploinsufficiency diseases). Others may seem pathogenic, but proof is lacking, and their role in disease should be considered with great caution, and this has been stressed for variants with uncertain pathogenicity in ataxia by a discussion in the recent literature.42, 43 Therefore, such variants with uncertain pathogenicity (as in Tables 1b and 2b) should be considered carefully and not be used in clinical decision making. However, sharing these variants either in databases or via scientific papers is key to build the publically available data allowing scrutinization of the putative role of these variants in disease.

Clinical exome sequencing reveals ‘old’ gene–‘new’ disease associations

Clinical exome sequencing and filtering for a broader panel of genes involved in motor disorders has revealed some novel gene-to-disease associations. SPG7 is the most common form of autosomal recessive HSP and is in half of the cases associated with ataxia. Here, two patients (1 and 2) with a pure ataxia were found to be compound heterozygous for two common SPG7 gene variants. Similarly, other exome sequencing studies had also revealed one patient with pure CA due to SPG7 variants.4, 44 Thus, the clinical spectrum of SPG7 variants is expanded to pure CA. Similarly, a homozygous variant in the TH gene in two siblings with HSP strongly suggests a role for this gene not only in dopamine-responsive dystonia, but also in HSP. A third observation is that of a dominant inheritance pattern of pure HSP in two independent families with KIF1A variants that were until now implicated in autosomal recessive HSP in two families or as de novo variants in severe intellectual disability and a motor phenotype.37, 38 For SPG7, this new association seems to be established with five families (this and other reports). For TH and KIF1A, the identification of other families and/or functional studies are warranted to establish their role in a wider disease spectrum.

Other observations in our cohort substantiate previously suggested ‘allelic’ disorders. These include the role of PMM2 not only in a severe congenital disorder of glycosylation, but also in late-onset ataxia (patient 8), a second VCP variant (patient 18) that likely explains HSP and that suggest this form of HSP is allelic to ALS14 and IBMPFD1, and a possible second case of autosomal recessive spastic-ataxia-neuropathy syndrome (SPAX5)45 caused by variants in AFG3L2 (patient 24).

Some of our other findings cautiously hint toward other gene–disease associations, but clearly require confirmation by other cases or functional tests. These are the possible role of LRRK2 (involved in Parkinson’s disease) variants in HSP (patient 20), or that of OPA1 (the gene for dominant optic atrophy plus syndrome) in neurological disorders without optic atrophy (patients 10 and 25).

Clinical exome sequencing uncovers unanticipated disorders

Other genetic disorders may escape from being recognized clinically, simply because they are extremely rare, or because the disease may not have progressed to the recognizable full-blown phenotype. For example, patient 5 with myoclonic epilepsy type 6 due to GOSR2 variants had indeed gradually developed the classical phenotype of ataxia with myoclonic epilepsy and other seizure types, although at age 13 there is still no scoliosis. Because of the locus heterogeneity in ataxia–epilepsy syndromes and the rare occurrence of this rather new type, myoclonic epilepsy type 6 was not considered clinically. Another example in our cohort was that of Kufor–Rakeb syndrome (patient 17), who was initially diagnosed with a complex form of HSP because spastic tetraplegia was predominant. Only after establishing a molecular diagnosis, clinical re-examination (‘reverse phenotyping’) confirmed Kufor–Rakeb syndrome, leading to a favorable levodopa trial. Moreover, this case illustrates that unexpected exome sequencing findings can affect therapeutic management.