Introduction

The spinocerebellar ataxias (SCAs) are autosomal dominant progressive neurodegenerative disorders displaying clinical and genetic heterogeneity. The loci and mutations associated with at least eight subtypes of SCAs have been cloned: SCA11 (MIM 164400), SCA22,3,4 (MIM 183090), SCA35 (MIM 109150), SCA66 (MIM 183086), SCA77 (MIM 164500), SCA88 (MIM 603680), SCA109 (MIM 603516), SCA1210 (MIM 604326). Furthermore, mutations in the DRPLA gene11,12 (MIM 125370) and the TBP/SCA17 gene13,14,15 (MIM 600075) produce SCA-like phenotypes. In these disorders, expansions of unstable nucleotide repeats in the respective genes are implicated in the pathogenesis of the disease. Unstable expansions of trinucleotide repeats have been shown to cause at least 14 neurological diseases (for review).16

Spinocerebellar ataxia type 1 (SCA1) primarily affects the brainstem, spinocerebellar tracts and cerebellar Purkinje cells. Patients with SCA1 develop progressive ataxia of limb and gait, as well as bulbar and pyramidal symptoms. In most cases, there is evidence for fronto-executive dysfunction while general intellectual impairment is restricted to advanced stages of the disease.17,18

The neurodegenerative process in SCA1 is thought to be contingent on transcription and translation of expanded CAG trinucleotide repeats within the coding region of the gene. This results in the synthesis of a pathologically altered gene product harbouring an expanded polyglutamine stretch.1 Normal SCA1 alleles range from 6–39 while clinical symptoms have been reported in individuals carrying 39–81 CAG repeats.1,19,20,21 The age of onset is inversely correlated to the repeat length with large alleles being predominantly found in juvenile cases. Sequence analysis of normal alleles revealed an interrupted repeat configuration with one to three CAT in 98% of the normal chromosomes.22 In contrast, expanded SCA1 alleles are characterised by continuous CAG stretches. Only in rare cases, expanded alleles are interrupted as shown for a patient with 58 repeats and late onset of the disease.23 CAT interruptions are therefore thought to be of critical importance in maintaining the stability of normal SCA1 alleles during DNA replication.

Molecular genetic analysis is performed for clinical diagnosis, predictive or prenatal testing in families with SCA1. Among CAG trinucleotide diseases, Huntington's disease (HD) is characterised by the greatest number of individuals with intermediate alleles and reduced penetrance.24 In SCA1, where normal and disease allele ranges are continuous, reduced penetrance is poorly documented but would be expected in the lower range of expanded alleles. Intermediate alleles and an age-dependent gene penetrance in SCA1 may therefore cause difficulties in interpreting molecular genetic results.25 The aim of the present study was to perform genotype-phenotype correlations in intermediate alleles from 36–41 CAG repeats in the SCA1 gene with respect to the presence of interrupting CAT trinucleotides.

Subjects and methods

Molecular genetic testing

Searching for pathogenic causes in patients with symptoms of ataxia or severe cognitive impairment (differential diagnostic requests) molecular genetic analyses for repeat expansions in the SCA1, SCA2, SCA3, SCA6, SCA7 and SCA12 gene were performed. After having obtained informed consent for genetical analyses, total genomic DNA was extracted from peripheral blood leukocytes by standard protocols.26 After exclusion of known repeat expansions in the genes for SCA2, 3, 6, 7 and 12 the SCA1 CAG repeat was analysed. In 45 unrelated patients, repeat sizes from 42–65 triplets were found. For 15 individuals with (CAG)36 to (CAG)41 detailed molecular genetic analyses of the SCA1 gene were performed.

PCR, restriction and sequence analysis of the CAG repeats in the SCA1 gene

For analysis of repeat length and composition, PCR was performed in a volume of 25 μl containing 50 ng genomic DNA, 10 pmol of each primer, 5 pmol dNTP and 1 units Taq polymerase (Q•BIOgene) with an annealing temperature of 62°C using primers as described by Orr et al1 Products were separated on 6% denaturing polyacrylamide gels and visualised by silver staining. The presence of a SfaNI restriction site was used to test for loss of CAT interruptions. Repetitive CAG sequences interrupted by at least one CAT triplet are digested by SfaNI. After restriction, PCR products were separated on a 2% agarose gel to assess the presence of a CAT interruption. Precise allele sizes and repeat configurations were determined by sequencing. To determine the nucleotide sequence of different alleles, PCR products were cloned into the vector pCR 2.1-TOPO (Invitrogen) and sequenced using the dideoxy chain termination method on double stranded DNA templates in presence of IRD800 labeled universal M13 primers on a Licor 4200 automated sequencer. For each template, up to ten single colonies have been analysed to detect the most representative repeat length. Due to somatic mosaicism in leukocytes, inaccuracy in amplification or sequencing reactions the repeat length may vary by ±1 unit between the subclones. Using the GeneScan methodology the number of repeats in a single allele varied usually between three and seven peaks on the scan.21

In addition, the instability of repetitive sequences in bacteria results in divergent subclones. Variability is frequent in expanded alleles as shown for the CAG50 allele in sample A but is the exception within the normal range.

Haplotype analysis

Microsatellite markers F13A1, D6S2434, D6S274, D6S288, D6S260, D6S285, D6S298, D6S2439, D6S1641 and D6S1017 closely linked to the SCA1 locus were selected for linkage analysis in two families. In pedigree I four living members are affected by SCA1. Three individuals of this family (L, M, N) could be examined. Pedigree II with seven individuals affected originating from the same geographigal region as pedigree I. Six patients could be examined presenting characteristic SCA1 features such as pale discs, intention tremor, pyramidal signs (data not shown in detail). Genotypes were determined by PCR amplification (35 cycles: 95°C 30 s, 58°C 30 s, 72°C 30 s). PCR products were resolved on 6% polyacrylamide gels and visualised by silver staining.

Results

The SCA1 repeat sizes and compositions in DNA samples of alleles larger than 35 triplets in the glutamine coding region were determined by PCR, restriction and sequence analyses. Interestingly, using silver staining after polyacrylamide gel electrophoresis in our hands the pattern of interrupted repeats may be differentiated from expanded alleles of identical length with pure CAG sequences (Figure 1). We suppose that the higher number of visible signals for pure repeat expansions is correlated with loss of CAT interruptions and may be produced either in consequence of mitotic instability or by the PCR reaction itself. Presence of CAT triplets within the CAG repeat was verified by SfaNI digestion.

Figure 1
figure 1

PCR analysis of the SCA1 CAG repeat of normal and expanded alleles. In the upper part, PCR products were resolved on a denaturing polyacrylamide gel and detected by silver staining. Numbers of triplets as determined by sequencing are given. In the lower part, the agarose gel analysis before (−) and after SfaNI digestion (+) of the PCR products is shown. Expanded alleles with loss of CAT interruptions lack a SfaNI recognition site (marked by white arrows). SCA1 control samples: A, P.

Sequence analysis revealed that the 14 normal SCA1 alleles with 29 and 30 triplets as well as the alleles with 35–38 triplets are interrupted by the sequence CAT CAG CAT. In alleles with 39–41 triplets the interruption is missing in six of seven cases. DNA sequences are summarised in Table 1.

Table 1 Clinical data, repeat sequences and length of SCA1 alleles

Alleles with 35–38 triplets are present in patients with ataxia but without additional characteristic features of SCA1 (B, C, D, F, G, H, I). SCA1 phenotypes were found for individuals Q and R with (CAG)41. For clinical data see Table 1.

We found the allele with 39 triplets to be either formed by an interrupted or a pure CAG stretch. The perfect glutamine domain causes symptoms of SCA1 in family I (patients L+M+N) as well as in individual O. To investigate the genetic background of the 39 repeat allele in family I we performed haplotype analyses. We compared the allele distribution of 10 polymorphic markers tightly linked to the SCA1 locus between family I and a second pedigree (family II) originating from the same geographical region. The six affected persons of family II carry SCA1 alleles with repeat lengths between 48 and 51 CAG copies without interruptions as determined by restriction analyses. We found a common haplotype for the expanded SCA1 allele (marker distance between F13A1 and D6S285 about 16 cM) differing from normal alleles (Figure 2) indicating a common founder for both kindreds.

Figure 2
figure 2

Pedigree and haplotype analyses of families I and II. Haplotype analyses with 10 polymorphic markers showed identical alleles around the SCA1 locus (dark bars) for family I (on the left) and family II (on the right side) both originating from Northern Germany. The age of onset is given.

Characteristic features of SCA1 such as dysphagia, pale discs, intention tremor and pyramidal signs were present in the four SCA1 individuals with short expansions (mean age was 71.0±13.1 years). Their mean age of onset was 62.8±15.5 years with a mean disease duration of 8.3±3.2 years.

In the patient sample studied, the interrupted allele with 39 triplets did not cause characteristic features of SCA1: Individual K presented with prominent extrapyramidal symptoms such as choreoathetosis and dystonia and a very early disease onset. Basal ganglia dysfunction is rather uncommon in SCA1.27 In addition, early disease onset is generally thought to be associated with higher repeat numbers. Therefore, it is very unlikely that neurological symptoms in individual K are related to the interrupted 39 triplet allele.

Discussion

Here, we present the first familial case of stable inheritance of (CAG)39 SCA1 alleles missing CAT interruptions and correlating with symptoms characteristic for SCA1. Recently, Goldfarb et al21 found the (CAG)39 allele in affected family members of a large Siberian kindred (1484 individuals) but did not report stability of this SCA1 allele during transmission.

Our results give further evidence that the pathogenic development in SCA1 is caused by pure glutamine repeats without histidine residues that may change or stabilise the native protein.

The analysis of SCA1 alleles in individuals with 36–41 triplets suggests a change from normal to pathological alleles at 39 trinucleotides depending on the presence of CAT interruptions in the SCA1 repeat. These interruptions seem to modify the phenotype in critical alleles. Normal alleles with 39 repeats have been described in a Spanish kindred19 and are correlated with somatic stability for CAG sequences interrupted by a CAT trinucleotide.28

The clinical data in individuals with 39 triplets in the SCA1 repeat indicate that a perfect glutamine domain is necessary for the development of the SCA1 phenotype resulting in an altered function or processing of ataxin 1 at least in this intermediate range. Interruption of the domain by two histidine residues prevents or renders the aggregation or pathological effects more difficult. This hypothesis is also corroborated by a patient with an interrupted repeat of 58 triplets and a late onset at 50 years.23

Contrasting findings are described by Kennedy et al29 who reported a 39 triplet allele with CAT CAG CAT interruption in mother and son in one family with early age of onset. The brother of the female, who carries the same allele, is still asymptomatic at the age of 42. In this family, the development of pathological features is likely to be contingent upon additional factors.

Variability in disease expression or influence of unknown factors have also been reported in a second family30 where the 44 triplet SCA1 allele is interrupted twice by CAT CAG CAT. In contrast to the daughter who had mild developmental delay and truncal ataxia starting at the age of 2 years, her father carrying the identical allele failed to reveal signs of neurological impairment. In conclusion, the early and highly variable age at onset associated with interrupted sequences in the SCA1 gene is more likely to be caused by additional factors or other mutations in unrelated genes than on the interrupted allele itself.

For the individuals described here, SCA1 alleles with interruptions and shorter than 40 triplets are not associated with specific SCA1 features. In contrast, disease alleles are characterised by loss of interruptions. The 39 triplet allele with a pure CAG repeat in family I may be an expansion prone sequence. CAG repeats without CAT interruptions may be unstable during transmission as seen in family II. The corresponding haplotype in the SCA1 region suggests a common ancestor for these two pedigrees.

Careful molecular genetic investigation of intermediate alleles is highly required to support the clinical diagnosis and prognosis as well as for genetic counselling. Furthermore, the risk of expansion during transmission is estimated rather low for interrupted CAG stretches, but is considerably higher for pure CAG sequences resulting in anticipation in the next generation. Therefore, it is not sufficient to measure the repeat lengths in the intermediate range. The test for presence of CAT triplets within the CAG repeat should be included in molecular genetic diagnostics.