Introduction

Hereditary spastic paraplegias (HSP) and hereditary ataxias (HA) represent the extremes on the spectrum of spinocerebellar neurodegenerative disorders. They are often correlated and impose diagnostic difficulty, especially with other overlapping neurodegenerative disorders.1 They can be transmitted by all modes of inheritance.1, 2, 3, 4, 5 The core defining clinical features of HSP are insidiously progressive weakness and spasticity of the lower extremities. HSP has a prevalence of 3–10 per 100 000 in most populations.6 Prevalence of autosomal dominant (AD) HSP ranges between 0.5 and 5.5 per 100 000 individuals and that of autosomal recessive (AR) HSP between 0.3 and 5.3 per 100 000 individuals.7 HSP can be pure (uncomplicated) or complex according to the absence or presence of additional neurological and non-neurological manifestations. There are >67 known HSP genes.1 AD HSP forms are the most frequent in western populations. SPG4, SPG3A, SPG10 and SPG31 account for up to 50% of AD HSP. SPG3A (ATL1) alone accounts for ~10% of AD HSP.8 On the contrary, AR HSP predominates in highly consanguineous communities.7, 9, 10 Thin corpus callosum-associated HSP (TCC-HSP) represents a distinct subgroup accounting for approximately one-third of all AR HSP.11 At least nine genes have been identified to be responsible for TCC-HSP. They include SPG1, SPG11, SPG15, SPG18, SPG21, SPG32, SPG46 and SPG49/56 and in rare cases, SPG7. Disease causing variants in SPG11 (KIAA1840) (OMIM phenotype #604360) constitute the most frequent cause of TCC-associated HSP (41–77%) and up to for 10–20% of all AR HSP, particularly in the Mediterranean basin.6, 10, 12, 13, 14, 15, 16 Next-generation sequencing aided in the identification of many rare HSP genes.1, 17 TFG/SPG57 stands as an example of this genetic surge. TFG is a highly conserved regulator of protein secretion and functions at the interface between the endoplasmic reticulum (ER) and ER–golgi intermediate compartments. Variants in TFG were implicated in SPOAN-like HSP (spastic paraplegia, optic atrophy, neuropathy)18 and hereditary sensory motor neuropathy with proximal predominant involvement (HSMN-P).19 Spastic ataxias now stand as a distinct disease category. Autosomal Recessive Spastic Ataxia of Charlevoix Saguenay (ARSACS) (OMIM phenotype #270550) shows a challenging phenotypic overlap with HSP and HA, with a remarkable clinical diversity.20, 21, 22 ALS2 (Alsin) (OMIM gene *606352) is another prominent example of a gene implicated in pyramidal syndromes, with overlapping roles in other neurodegenerative conditions including amyotrophic lateral sclerosis.23

Relatively little is known about the genetics of spinocerebellar degenerations in the Sudanese population.24 In this article, we studied 25 families with progressive spastic neurodegenerative disorders in an effort to determine the genes underlying the disease. We report the first seven Sudanese families carrying novel and rare variants in KIAA1840 (SPG11), TFG/SPG57, ATL1 (SPG3A), ALS2 and SACS. Using in vitro biochemical experiments, we demonstrate the pathological effect of a novel TFG variant. Further genetic heterogeneity of HSP is also highlighted.

Materials and methods

Ethical approval

This study was prospectively reviewed and approved by the Paris Necker Ethics Committee (France) and the Ethical Committee of the University of Khartoum Medical Campus (Sudan). It was conducted in accordance with the recommendation of the Declaration of Helsinki. A written informed consent was obtained from all participants.

Subjects recruitment

We investigated 25 consanguineous Sudanese families (of various ethnic backgrounds and with multiple consanguinity loops). The total number of patients sampled in these families was 65 patients. In each family, we recruited the proband along with all accessible diseased family members and available unaffected first-, second- and third-degree relatives when possible. Inclusion criteria for patients were clinical presentation suggestive of spastic neurodegeneration plus familial consanguinity and/or positive family history of neurodegenerative disorders. Alternative diagnoses were excluded.

Clinical phenotyping

Patients were examined and diagnosed by the referring consultant neurologists/neuro-pediatricians, followed by a second standardized phenotyping by the research team. Healthy related controls were examined to exclude subtle disorders. Patients were assessed for motor disability using a Seven-Stage Disability Score (Supplementary Tables 1 and 2). Magnetic resonance imaging (MRI) of the brain was performed for at least one patient per family in most cases.

Sampling and DNA purification

Two milliliters of saliva were collected using Oragene.Discover DNA collection kits (DNA Genotek Inc., Ottawa, ON, Canada). DNA purification was done according to the prepIT.L2P manual protocol provided by the manufacturer. DNA quality and quantity control were performed using standard Agarose gel electrophoresis, NanoDrop spectrophotometer (Thermo Scientific, Wilmington, DE, USA) and Qubit fluorometer (Promega, Madison, WI, USA).

Genetic linkage analysis

We performed linkage analysis to screen two families (10 patients and 31 healthy relatives) for candidate loci based on patients’ clinical presentation, brain imaging and previously known relative frequencies of HSP genetic mutations. We used microsatellite markers (list available upon request) flanking the CYP7B1 (SPG5), KIAA1840 (SPG11), ZFYVE26 (SPG15), GBA2 (SPG46), CYP2U1 (SPG49/SPG56) and SACS genes. Classical procedures were used for their PCR amplification and allelic resolution in an ABI3730 sequencer (Applied Biosystems, Foster City, CA, USA). Peak Scanner software (Applied Biosystems) was utilized for analysis of product length. Haplotypes were then manually reconstructed for each locus and pedigree in order to obtain the minimum number of recombination events.

Targeted next-generation sequencing screening panel

We used a multi-gene targeted next-generation exome-sequencing panel to screen the remaining 23 (out of 25) HSP families. The total number of patients in these families was 55 patients. However, an index patient(s) in each family underwent exome panel screening, whereas samples from the remaining patients were used to test the segregation of candidate variants. The panel (Supplementary 3) covered a total of 74 genes including 68 genes previously implicated in a variety of spastic disorders1 and six unpublished candidate genes. In total, 1042 exonic regions with total length of 220 095 bp were targeted. Twenty base pairs of flanking intronic sequences were included, whereas promotors and UTRs were excluded. NimbleGen/SeqCap Ez enrichment system was used (Roche, Madison, WI, USA). Sequencing was performed using MiSeq platform (Illumina, San Diego, CA, USA) on 250 bp paired-end reads. Variant calling and quality control were performed using Genomics Workbench (CLC Bio, Aarhus, Denmark). Short variations (SNVs and Indels) were detected using probability-based and quality-based algorithms. Large variations (duplications and large deletions) and zero-coverage exons were examined as well through the analysis of the coverage results. For gene/variant prioritization, nonsense, frameshift, splice site and missense variants with a minimum depth of 30 × were selected. Minor allele frequency cutoff of 1.5% was used first then decreased to 0.1% for more stringency. Five pathogenicity prediction algorithms were used to predict the effect of missense variants (SIFT, Polyphen-2, Mutation Taster, Align GVGD, KD4v). PhastCons and PhyloP conservation scores were used to indicate conservation. For splicing sites prioritization, five tools (Human Splicing Finder, SpliceSiteFinder–Like, Gene Splicer, NNSPLICE, MaxEntScan) were used. All tools were collectively provided by AlamutVisual software (Interactive Biosoftware, Rouen, France). Using these criteria, heterozygous variants in AD families and homozygous variants in families with AR inheritance were considered as candidate variants. Heterozygous variants in AR pedigrees were examined to exclude the presence of compound heterozygous inheritance. Exons with one (or more) base pair(s) with less 30 × coverage were sequenced (Sanger sequencing) in all genes in which a single heterozygous variant was found to exclude missing variants due to low coverage.

Sanger sequencing

We performed Sanger sequencing using the BIGDYE chemistry on an ABI3730 sequencer (Applied Biosystems) to validate candidate variants in index patients and to confirm their co-segregation with the disease distribution within each family. Bioinformatic sequence analysis was performed using Seqscape (Applied Biosystems) and Chromas lite software (Technelysium, South Brisbane, QLD, Australia).

Submission to locus specific databases

Variants were submitted to Leiden Open Variation Database available at: http://databases.lovd.nl/shared/variants/ with the following variant IDs: 0000090056 (SPG11, c.6709del), 0000090058 (SPG11, c.3568A>T), 0000090050 (SPG11, c.6349G>T), 0000090062 (TFG, c.64C>T), 0000090063 (SACS, c.7739G>A), 0000090069 (ALS2, c.368G>A), 0000090064 (ATL1, c.452T>C).

Biochemistry

We conducted a series of size-exclusion chromatography studies, coupled to multi-angle light scattering (to accurately determine absolute protein molecular mass) to determine whether the novel TFG c.64C>T variant impacts the homo-oligomeric properties of the TFG PB1 domain. Recombinant TFG proteins (amino acids 1–96) were expressed as fusions to glutathione-S-transferase (GST) using Escherichia coli BL21 (DE3). Purifications were conducted using glutathione agarose beads. Proteins were cleaved from resin with GST-HRV3C protease overnight at 4 °C and purified further using ion exchange chromatography. Light-scattering data were collected as described previously.18 In brief, purified proteins were applied onto a high-resolution size-exclusion chromatography column (WTC-030S5, Wyatt Technology, Santa Barbara, CA, USA), which was coupled to a three-angle light-scattering detector (miniDAWN TREOS, Wyatt Technology). Data were collected at a flow rate of 0.5 ml/min and analyzed (ASTRA software, Wyatt Technology) to determine the molecular mass of TFG proteins.

Results

Description of the cohort

All investigated 25 families were consanguineous. The majority of these showed a probable AR inheritance with multiple affected individuals (15 families; 60%), whereas some pedigrees featured only one patient despite the consanguinity (seven families; 28%). Few were consistent with a dominant inheritance pattern, although pseudo-dominant autosomal inheritance cannot be excluded (three families; 12%). In these 25 families, a complex phenotype was predominant (21/25 families; 84%).

In total, genetic diagnosis was established in seven families (28%). The genetics and phenotype–genotype correlations of these families with a genetic diagnosis are presented in the following sections. In the remaining 18 families yet to have a confirmed genetic diagnosis, two families presented with pure HSP, six families showed complex HSP, whereas 10 families presented with spastic ataxia. HSP cases were complicated by stereotypic movement (2/18), bulbar signs and sensory motor polyneuropathy (1/18), cognitive decline (1/18), dystonia (1/18) or severe cortical atrophy (1/18). Spastic ataxia was complicated with dysmorphic features (2/18), chorea (1/18) and visual impairment (3/18).

Genetic results

In two families, homozygous microsatellite haplotypes surrounding the SPG11 locus segregated with the disease (data not shown). Sanger sequencing confirmed the segregation of two homozygous SPG11 variants, one in each family: c.6349G>T and c.6709del (Figure 1). Using a multi-gene NGS panel, causative variants were identified in other five families: SPG11 c.3568A>T, TFG/SPG57 c.64C>T, ATL1 c.452T>C, ALS2 c.368G>A and SACS c.7739G>A (Figures 13). Table 1 shows the details of these variants. All variants were novel except the SPG11 c.6709del variant, which was reported previously in a Somalian patient.25 All variants co-segregated with the disease in the families and were absent from large sets of databases including 1000 Genomes, EVS, Broad institute ExAC and our local institutional databases. Missense variants were predicted to be deleterious by at least three pathogenicity prediction algorithms and affected highly conserved amino acids (Table 1). No pathological variant in an unexpected zygosity, zero-coverage exons or large rearrangements (for example, deletions/duplications) were identified. We found rare deleterious heterozygous variants in our cases that did not conform to the mode of inheritance in the families and were not considered as causative mutations (Supplementary Table 4).

Figure 1
figure 1

Families F1, F6 and F16 with autosomal recessive hereditary spastic paraplegia with thin corpus callosum (TCC) caused by SPG11 variants. (a (I)) Pedigree of family F1 in which a nonsense variant segregates with the disease in subfamily I. Electrophoregrams show the variant in a homozygous patient and a heterozygous carrier. Subfamily II showed dissimilarities in HSP phenotypes. (a (II)) Brain MRI (sagittal T1W section) of an SPG11 patient showing severe TCC. (b (I)) Pedigree of family F6 showing the segregation of a frameshift variant. Electropherograms show the variant in a patient (homozygous) and in a heterozygous carrier. (b (II)) Brain MRI (sagittal T1W section) of the index patient showing severe TCC and cortical atrophy. (c (I)) Pedigree of family F16 showing the segregation of a nonsense variant. Electropherograms show the variant in a patient (homozygous mutant) and in a heterozygous carrier. (c (II)) Brain MRI (axial T2 FLAIR) of the index patient showing TCC and periventricular hyperintensities. (d) SPG11 gene model showing the position of detected variants as red lollipops. Pedigree symbols: full shading indicates affected individuals, half horizontal shading indicates other diseases, numbers between brackets are multiple siblings, diamonds indicate multiple siblings of different sex. Individual ID numbers and genotypes are indicated. Genotypes: ++ homozygous reference; M+ heterozygous; MM homozygous mutant. Others are standard medical pedigree symbols.

Figure 3
figure 3

Families F14, F34 and FM5 with spastic ataxia and hereditary spastic paraplegia caused by SACS, SPG3A and ALS2 variants. (a (I)) Pedigree of family F14 with recessive spastic ataxia of Charlevoix Saguenay (ARSACS) caused by a nonsense variant in SACS. Other diseases are seen in the family. Electropherograms show the variant in a patient (homozygous mutant) and a heterozygous carrier. (a (II)) Brain MRI (Sagittal section T1W) of individual 14133 showing thinning of the posterior half of the body of corpus callosum and moderate thinning of cerebellum. (b) Pedigree of Family F34 with autosomal dominant HSP caused by a missense variant in ATL1. Electropherograms show the variant in a patient (heterozygous) and a control (reference alleles). (c) Pedigree of family FM5 with autosomal recessive HSP caused by missense variant in ALS2. Electropherograms are shown for a patient (homozygous mutant) and a heterozygous carrier. Pedigree symbols: shading indicates affected individuals, numbers between brackets are multiple siblings, diamonds indicate multiple siblings of different sex. Individual ID numbers and genotypes are indicated. Gene models show the position of detected variant in each gene. For missense variants (b and c) conservation of the (predicted) affected amino-acid is shown.

Table 1 Hereditary spastic paraplegia variants identified in seven studied families

Phenotype–genotype correlations

Clinical data from seven families (23 patients) with established genetic diagnosis is presented (Tables 2 and 3). A summary of the MRI and electrophysiological investigations is also provided (Table 4).

Table 2 Clinical data of 12 patients from three families with SPG11 genetic diagnosis
Table 3 Clinical data of 11 patients from four families with SPG57, ATL1, ALS2 and SACS genetic diagnosis
Table 4 Available radiological (MRI brain) and electrophysiological (NCS) data of 11 patients (from five of the seven families with genetic diagnoses)

Patients from three families with variants in SPG11 presented with a complex HSP associated frequently with cognitive deficit. MRI and electrophysiological findings matched previous reports.26, 27 MRI of the brain showed atrophy of the corpus callosum in six tested cases, often associated with periventricular white matter abnormalities (Figure 1). Signs of motor demyelination frequently mixed with axonal degeneration were found in electrophysiological examination.

In family F14, the phenotypic presentation varied between patients from different branches. The two affected individuals in this family (cousins) had the same variant (SACS c.7739G>A) but showed interesting clinical differences. Both had spastic ataxia but one showed predominant ataxia and peripheral neuropathy whereas the other had predominant spastic paraparesis. Age at onset was late childhood in agreement with Baets et al.28 and higher than what was reported in Tunisian patients.29 Both patients had absent fundal striations in accordance with previous reports of Italian patients30, 31 and Japanese sibship.32 The family history revealed three brothers who were clinically diagnosed with Laurence Moon Biedl syndrome and died from renal failure years ago (suggesting a second segregating disease gene within the same family, although this was not genetically analyzed).

In three families with variants in ATL1, ALS2 and TFG (F34, FM5 and F19, respectively), we observed homogeneous clinical patterns in all patients within each family. Family F34 had rather unusual characteristics in our cohort with pure HSP and dominant inheritance (heterozygous variant) despite the high consanguinity in this family. The presentation (Table 3) matched the classical clinical pattern and the early age of onset associated with SPG3A.33 In family FM5, four patients from two branches presented with infantile onset progressive pyramidal syndrome with pseudobulbar palsy. This is in conformity with what was reported in Infantile Onset Ascending Spastic Paralysis associated with Alsin variants (OMIM #607225) (Table 3).34, 35, 36

Interestingly, disease phenotypes in family F19 extended the clinical spectrum associated with TFG/SPG57 variants. As in first SPG57 family (c.316C>T), the mean age at onset in the three affected sibs was around 1 year. They presented with signs of progressive pyramidal tract involvement. In contrast to the SPOAN-like HSP reported by Beetz et al.,18 patients in this family with a second novel variant (c.64C>T) had intact optic nerve and normal fundi. The electrophysiological studies confirmed variable degrees of motor neuronal demyelination in addition to motor axonal degeneration matching the previous report. Although brain MRI revealed no significant abnormalities in the first family, we report thinning of the corpus callosum, variable degrees of cerebellar atrophy and mild white matter hyperintensities, as part of the SPG57 clinical spectrum.

Biochemical impact of the novel TFG c.64C>T variant

This variant is the second TFG/SPG57 to be reported in HSP families and is predicted to result in a missense change c.64C>T (p.(Arg22Trp)) that lies within the Phox and Bem1p (PB1) domain of the TFG protein. The PB1 domain has been shown previously to self-associate (forming homo-oligomers) in yeast two-hybrid studies.37 Consistent with a previous structural characterization of TFG,38 we found that the wild-type PB1 domain, at concentrations ranging from 18–45 μ M, formed octamers in solution (~92 kD). In contrast, the c.64C>T (p.(Arg22Trp)) mutant of the PB1 domain exhibited a significantly reduced molecular mass of ~24 kD (around one-fourth of the wild-type protein mass) throughout the same range of protein concentrations, indicating it was only capable of forming dimers rather than octamers. In addition, its elution time was extended relative to the wild-type protein, consistent with the idea that the mutant failed to self-assemble properly (Figure 2). Together, these studies demonstrate that the c.64C>T (p.(Arg22Trp)) missense variant disrupts the ability of the PB1 domain to oligomerize in octamers thus extending what was previously reported38 about the effect of c.316C>T (p.(Arg106Cys)) variant (located in the coiled-coil domain) on TFG self-assembly.

Figure 2
figure 2

Family F19 with autosomal recessive hereditary spastic paraplegia caused by TFG/SPG57. (a) Pedigree of family F19 in which a missense variant segregates with the disease. Electropherograms show homozygous and heterozygous genotypes. Conservation of the amino-acid sequence is shown as well as a gene model showing the position of detected variant. (b) Purified, untagged forms of TFG (amino acids 1–96; wild-type or mutated to include the c.64C>T (p.(Arg22Trp)) substitution), were separated over a gel filtration column (Wyatt TWC-030S5) that was coupled to a multi-angle light-scattering device. Representative light-scattering profiles (wild-type TFG, blue; TFG c.64C>T (p.(Arg22Trp)), red) are plotted (b (I)). Eluted fractions were separated by SDS-PAGE and stained using Coomassie to highlight the elution profiles of the both forms of TFG (b (II)). Based on three independent experiments for each protein, wild-type TFG (amino acids 1–96) exhibited a molecular mass of 92.4 kD ± 3.7 kD, and c.64C>T (p.(Arg22Trp)) TFG (amino acids 1–96) exhibited a molecular mass of 24.4 kD ± 1.0 kD. (c) Brain MRIs of individuals 19172 (sagittal T1W section) and 19171 (T2 FLAIR axial section) showing thinning of the body of the corpus callosum (c (I)) and TCC with mild periventricular signal hyperintensity lesions at the occipital pole (c (II)). Pedigree symbols: full shading indicates affected individuals, half horizontal shading indicates other diseases, numbers between brackets are multiple siblings, diamonds indicate multiple siblings of different sex. Individual ID numbers and genotypes are indicated. Genotypes: ++ homozygous reference; M+ heterozygous; MM homozygous mutant. Others are standard medical pedigree symbols.

Discussion

Similar to other communities with high consanguinity,9, 10 we found that AR HSP inheritance is predominant in Sudan. However, in our series, the complex phenotype is predominant (84%), which is in disagreement with Coutinho et al.9 who elicited a predominant pure phenotype, but in agreement with a study in South Tunisia10 that reported 69% of complex cases. Sudanese families with AR HSP showed a high frequency of SPG11 variants (~13%) in conformity with other populations.15 Other genes were identified in single families. Of note, SPG7 and SPG4 were not identified in this report. Commonly, phenotypes were relatively heterogeneous within families and between different families. These differences ranged between subtle and overt, including onset, progression, and motor and cognitive aspects. This represented a diagnostic challenge especially when symptoms overlapped with other diseases. The use of a targeted NGS exome panel provided an efficient and reliable screening method to test HSP and HSP-related genes. Here, it was possible to identify causative variants in 28% of screened families, a success rate similar to other reports using candidate gene or exome sequencing in heterogeneous diseases.27

In the remaining families with yet unidentified causative variants, no pathological short variants (homozygous/compound heterozygous) or deletions/duplications in known HSP genes were identified. The complex heterogeneous phenotypes seen in these undiagnosed families imply a high possibility to expand the molecular basis of HSP and carry a prospect of unraveling new candidate genes with monogenic or digenic/oligogenic inheritance, non-coding variations, undetected large deletions/rearrangements or epigenetic modifications.

Correlations between HSP phenotypes and variants' positions (in the same gene) are rarely noted, reported previously only in PNPLA6 by Schabhüttl et al.39 In TFG/SPG57, variants in the proline and glutamine-rich domain (carboxyl terminus) of TFG/SPG57 result in AD hereditary sensory motor neuropathy with proximal predominant involvement.19 On the contrary, variants in the amino terminus result in AR HSP.18 In this study, an amino terminus variant resulted in AR HSP as well. Significant clinical differences were observed between the first reported SPG57 c.316C>T variant (p.(Arg106Cys)) and the variant c.64C>T (p.(Arg22Trp)) reported here. SPG57 may thus represent another exception regarding phenotype vs variant position correlations, although other variants are required to study this phenomenon with accuracy. These observed differences might be attributed to the different domains affected in the two families (coiled-coil and PB1 domains) and the functional impact of each. Although both variants resulted in perturbed oligomerization of the protein, the mechanism and consequent functional effect are likely to be distinct based on our biochemical analysis. Structural studies indicate that full length TFG self-assembles into octameric ring-like complexes in solution.18 Our analysis of the PB1 domain indicates that it is likely to be responsible for the octameric self-assembly, as it exhibits a similar propensity to form octamers. Thus, the coiled-coil domain may play another distinct role in promoting TFG complex assembly, potentially facilitating its ring-like conformation, which is likely under considerable strain.

Conclusions

We identified the first Sudanese families carrying novel variants in SPG11, ATL1, ALS2, SACS and TFG/SPG57. A novel TFG variant c.64C>T (p.(Arg22Trp)) – the second SPG57 variant to be reported – was found to disrupt TFG protein oligomerization. The difficulty to reach a genetic diagnosis in the majority of studied families suggests the possibility of new genes, unusual models of inheritance or non-coding variations underlying spinocerebellar degeneration. Adoption of novel diagnostic approaches like whole-exome or -genome sequencing is therefore highly recommended to explore these families.