Introduction

Idiopathic generalised epilepsy (IGE) is a common disorder, with a cumulative incidence in the general population of 0.7%.1 IGE usually presents in childhood or adolescence. Seizure types include generalised tonic-clonic seizures (GTCS), myoclonic seizures and/or absences. Generalised spike waves or polyspike waves are observed on electroencephalography. A photoparoxysmal response may be observed. There are no associated neurological abnormalities or structural brain damage. Twin studies demonstrate the importance of genetic factors in IGE susceptibility, with concordance rates for monozygotic twin pairs far higher than for dizygotic twin pairs.2,3 Common forms of IGE show a complex pattern of inheritance, which, in keeping with other common genetic disorders, is expected to result from the action of a few or many genes of small to moderate effect. IGE includes many common syndromes such as juvenile myoclonic epilepsy (JME), childhood absence epilepsy (CAE) and juvenile absence epilepsy (JAE).4 There is considerable overlap between these syndromes and it is not always possible to subclassify cases. More than one IGE syndrome is frequently observed within families.5,6 In a twin study, out of 20 monozygotic pairs all 13 that were clinically concordant had the same syndrome of IGE in both twins.7 By comparison, there were five out of 21 clinically concordant dizygotic pairs, of which only one had the same syndrome. These observations suggest that several combinations of mutations cause IGE but only certain combinations are syndrome specific.

Linkage studies have proved very successful in identifying causative mutations in Mendelian disorders. However, they have been much less useful in complex disorders where possession of a susceptibility mutation is neither necessary nor sufficient to cause the disease. Very large sample sizes are required to achieve the necessary power to detect linkage with genes of such disorders, and as a result linkage studies are often underpowered. Many susceptibility mutations are likely to be common and association studies provide far more power to detect common mutations of small to moderate effect with sample sizes of a few hundred unrelated affected individuals.8,9 Although case–control studies are vulnerable to population stratification artefacts, within-family tests of association can be used which are robust to such stratification.

Mouse mutants, such as tottering, lethargic, and ducky, share a number of features with human IGE and have served as useful models for absence epilepsy. These include age-related presentations, the presence of generalised spike wave discharges, involvement of thalamo-cortical circuits and responses to pharmacological agents.10,11,12 Interestingly, these models are now known to be due to mutations in voltage-gated calcium channel subunits, all of which appear to be involved in P/Q-type channels: tottering (Cacna1a), lethargic (Cacnb4) and ducky (Cacna2d2).13,14,15 To date, four spontaneous recessive mutations in the mouse gene Cacna1a encoding the α1A subunit in P/Q-type channels have been described: tottering, rolling Nagoya, leaner and rocker.13,16,17 All include ataxia, while tottering and the more severe phenotype, leaner, also exhibit motor seizures and absence type seizures. Mutations in the human gene CACNA1A result in episodic ataxia 2 (EA-2), spinocerebellar ataxia type 6 (SCA6) and familial hemiplegic migraine (FHM).18,19 One ataxic child with absence seizures was shown to have a mutation in CACNA1A.20 Most of CACNA1A has been sequenced and has 47 exons extending over approximately 300 kb of DNA.18

Recently we reported an association study of IGE using five polymorphisms within CACNA1A in over 200 cases with a broad IGE phenotype.21 Comparison of the allele frequencies between probands and controls for a single nucleotide polymorphism (SNP) in exon 8 provided evidence for significant association with IGE (P=0.0003, OR=1.7). Haplotype-based haplotype relative risk (HHRR) and transmission disequilibrium test (TDT), robust to possible population stratification, confirmed this result and supporting evidence came from an association with a simple sequence repeat polymorphism in intron 7, D19S1150 (P=0.032). Other SNPs in exons 6, 16 and 20 were not associated with IGE. In a previous independent study of IGE families, a CAG repeat polymorphism in exon 47 was not associated with IGE.22 Here, using an enlargement of the same sample used in our earlier study,21 we report on an investigation of the region flanking the associated SNP in exon 8, the first step in delineating the region of CACNA1A containing a common functional variant(s) associated with IGE.

Materials and methods

Sample

The study included 232 probands, 234 controls and both parents from a subset of approximately half the probands. This is the same sample used for our previous study21 but with increased numbers. IGE probands were identified via the EEG Department and epilepsy clinics at Kent and Canterbury Hospital. DNA was obtained from Caucasian subjects via cheek swabs or blood. Probands with epilepsy were selected with generalised spike/wave (GSW) on EEG, normal background rhythm and a consistent clinical history. Control samples were collected from the same hospital from unrelated individuals in the same ethnic group but with no known history of epilepsy/blackouts. At the time of sample collection, the mean age of the probands was 23 years with 60% female and the mean age of the controls was 41 years with 59% female. Detailed clinical characterisation has been compiled by examination of case notes and EEG reports by A Osei-Lah and L Nashef and, in some cases, clinical assessment by L Nashef. Information on seizure types, pattern and age of onset, EEG results, imaging data where performed, family history and response to treatment was extracted. Classification by ILAE IGE syndrome was made according to strict criteria. Where syndrome classification was not possible, broad or overlap categories were used. Mean age of onset of the epilepsy was 11.7, with 85% presenting between age 3 and 20. Information was available on seizure types in over 92% of probands: 58% had absences, 77% generalised tonic-clonic seizures (GTCS), and 23% myoclonus. A previous history of febrile convulsions was present in 14% and 34% had photoparoxysmal response reported on EEG. Classification by ILAE syndrome 23 on 78% of the sample was as follows: idiopathic absence epilepsy (IAE–absences with or without GTCS but without myoclonus) 50% (CAE 21%, JAE 16%, other 13%), JME 17% (including 7% with prominent absences) and Epilepsy with GTCS only 26%. Other syndromes or overlap syndromes accounted for the remaining 7%.

Denaturing high performance liquid chromatography (DHPLC) analysis

DHPLC analysis was performed using the Transgenomic WAVE DNA Analysis System. Chromosomal regions investigated for SNPs were amplified by PCR to give overlapping amplicons of approximately 500 bp in length, the optimum size for both DHPLC and automated sequencing. The amplicon sequence was entered into the WAVEMAKER software, which predicts temperature and gradient conditions that resolve heteroduplexes on the WAVE system. Between 5–15 μl of crude PCR product was loaded on a DNASep column (Transgenomic, Santa Clara, CA, USA). The mobile phase of the column consisted of 0.1 M triethylamine (TEAA) pH 7.0 with (Buffer B) or without (Buffer A) 25% acetonitrile. The gradient was created by mixing buffers A and B, which, were delivered to the column at a constant rate of 0.9 ml/min for 4 min. Chromatograms were interpreted by looking for differences in the elution profiles of the samples. Typically the presence of a SNP, where present as a heterozygote, was indicated by the presence of an extra peak.

Sequencing

All amplicons, where differences in DHPLC traces were evident between samples, were sequenced after purification (QIAquick PCR Purification Kit; QIAGEN). The samples were prepared for sequencing using a ABI PRISM® BigDyeT v3.0 Terminator Cycle Sequencing Ready Reaction Kit, following the manufacturers' instructions. The samples were separated and analysed on an ABI 377 or ABI 3100 DNA sequencer (Perkin Elmer).

NCBI database

Four SNPs were found in the SNP database (http://www.ncbi.nlm.nih.gov/SNP/index.html) under the following IDs: Int6b, rs993464; Int6c, rs1742; Int10a, rs1502018; Int10b, rs150217. All 10 SNPs were identified on genomic contig NT_031915 and the marker–marker distances calculated as follows (see Figure 1): Int6a–Int6b=7347 bp; Int6b–Int6c=1522 bp; Int6c–Int7a=15 005 bp; Int7a–Int7b=614 bp; Int7b–Ex8=634 bp; Ex8–Int8a=63 bp; Int8a–Int8b=1373 bp; Int8b–Int10a=13 352 bp; Int10a–Int10b=94 bp; Int10b–Int11=2581 bp.

Figure 1
figure 1

Map of CACNA1A between exons 6 and 11. Exons are shown by solid blocks. Distances between SNPs as indicated on the map. LD relationships between pairs of SNPs are shown immediately below the map with haplotypes between SNPs at the bottom. Significant P-values for case/control association studies are given for single marker allele frequencies (see Table 2 for details) and haplotype frequencies.

Genotyping

Genomic DNA was isolated from cheek swabs or blood using standard laboratory protocols.24 Genotyping of the SNPs required the design of primers to introduce a restriction enzyme (REN) or fluorescently labelled allele specific PCR. The primers, annealing temperatures, and RENs used are listed in Table 1. For PCR 25 ng of genomic DNA was used in 25 μl with 0.3 μM of each primer, 1.5 mM MgCl2, 100 μM deoxynucleotide triphosphate (dNTP), 20 mM ammonium sulphate, 75 mM Tris-HCl (pH 8.8), 0.01% (v/v) Tween®20 and 0.5 U Taq DNA polymerase (Applied Biotechnologies, Epsom, UK). Reactions were performed in a 96-well microtiter plate, with an initial denaturation step at 96°C for 85 s and 35 cycles of denaturation at 96°C (35 s), annealing (1 min), and extension at 72°C (1 min) on a DNA Engine Tetrad thermocycler (MJ Research, Watertown, MA, USA). The final extension step was 10 min at 72°C. A total of 12 μl of the SNP reaction mixture was digested with 3 U of the appropriate REN. The DNA fragments were separated by agarose gel electrophoresis and visualised by ethidium bromide fluorescence. Where partial digestion by the restriction enzyme was suspected, the digestion was repeated with three-fold more enzyme. Fluorescently labelled PCRs were visualised on either an ABI PRISM 377 Genetic Analyzer or an ABI PRISM 3100 Genetic Analyzer.

Table 1 Positions, primers and detection methods of CACNA1A polymorphisms

Statistical Analysis

Allele and genotype frequencies in the control and proband populations were compared by means of the χ2 test. The family-based analyses: HHRR25 and TDT26 were performed as described. Haplotype frequencies were estimated using the EM algorithm implemented in EHPLUS using the computed likelihood statistics for haplotype frequency comparisons between probands and controls.25,27 Linkage disequilibrium (LD) parameters R and D' were calculated as described by Klitz et al28 using the computer program 2LD (www.iop.kcl.ac.uk/IoP/Departments/PsychMed/GEpiBSt/software.stm). Empirical P values29 were calculated by FASTEHPLUS. Logistic regression according to the method described by Cordell and Clayton30 was performed using the Stata statistical package.

Results

Each region to be screened for polymorphic variation was amplified from 30 unrelated individuals (60 chromosomes), which has 95% power to detect mutations of frequency 5%. After confirmation by sequencing, ten novel SNPs of frequency >5% were identified and genotyped in the case–control sample. All genotype frequencies were in Hardy–Weinberg equilibrium in both case and control samples (data not shown).

As shown in Table 2, four novel SNPs gave significant results by allele: Int6a, P=0.0057; Int7b, P=0.00029; Int8a, P=0.0015 and Int8b, P=0.010 with ORs between 1.5 and 2.4. Significant results were also obtained by genotype with P values of 0.025, 0.0015, 0.0057 and 0.029 respectively (data not shown). For comparison, results with an increased sample size for Ex8, the SNP in the original study,21 has been included in Table 2. We further investigated SNPs Int6a, Int7b and Int8a in family trios. All three novel SNPs showed a distribution between transmitted and untransmitted alleles of similar magnitude and in the same direction as that seen between cases and controls, although, unlike Ex8, none reached statistical significance. Importantly, as with Ex8, there was no significant difference between the untransmitted and control allele frequencies, consistent with cases and controls having similar genetic backgrounds (data not shown). Similar results were obtained by TDT analysis (data not shown).

Table 2 Allele frequencies in association studies with novel SNPs

For the remaining six novel SNPs there were no significant differences between cases and controls either by allele frequency (Table 2) or genotype frequency (data not shown). Power calculations were made on the basis of detection of an association with odds ratio of 1.6, the mean odds ratio of the four SNPs around exon 8 associated with IGE. These estimates show that of the SNPs not associated with IGE, Int10a and Int11 had >80% power, Int10b >70% power, Int7a >60% power and Int6b and Int6c >50% power (data not shown).

For Ex8 and the four novel SNPs showing significant evidence of association, the odds ratios are consistent with either a small to moderate gene effect throughout the entire sample of IGE probands, or a larger effect limited to a subset. To address this issue we examined the association of the five SNPs in relation to the clinical data and found no increased association with any of the variables or ILAE syndromes. Table 3 shows the results for Int7b and pairs of mutually exclusive subsets and very similar allele frequencies are seen throughout. To test for heterogeneity among syndromes, we analysed these data as a multiple logistic regression in which the SNP allele was the dependent variable and the different phenotypes were binary predictor variables. Equating the phenotype-specific regression coefficients did not result in any significant reduction in fit (likelihood ratio χ2=2.48, 3 d.f., P=0.48), indicating that there was no evidence that the specific syndrome was relevant to the association with Int7b. Similar results were obtained with the other four associated SNPs and by using the presence of each major seizure type as phenotype (data not shown).

Table 3 Allele frequencies of Int7b in different clinical subgroups

We investigated LD between pairs of SNPs by analysis of the combined genotype data for cases and controls. We generated two measures of LD: the global statistic R and the statistic D' that takes account of the limits on R imposed by different allele frequencies of the marker pair. The four SNPs around exon 8 associated with IGE (Int7b, Ex8, Int8a and Int8b) showed very strong LD with each other (Figure 1), having both high R and D' values. This represents a region or block of associated markers, that we will refer to as Exon 8 Block (X8B). Extension of X8B in both directions showed reduced levels of LD. The other SNP that was significantly associated with IGE (Int6a) exhibited no LD with X8B (Figure 1).

A haplotype analysis of all the markers was performed using haplotype frequencies predicted by the EH programme and tested for association with IGE. Of the pairwise haplotypes within X8B, haplotypes of Int7b, Ex8 and Int8a each with Int8b were more significantly associated than either SNP alone (Figure 1). All pairwise haplotypes of Int6a and each SNP in X8B were similarly more significantly associated with IGE (Figure 1). Highly significant associations (P<10−6) were found for both the 3-marker and 5-marker haplotypes shown in Figure 1. (Because of uncertainty over degrees of freedom in the 5-marker haplotype, we used empirical P-value calculated by a permutation procedure.29) Significant association with IGE (χ2=15.60, 3 d.f., P=0.0014) was also found for the predicted haplotypes of SNPs Int10a and Int11 (data not shown).

The contributions of the 5 SNPs associated with IGE were analysed by logistic regression.30 When five separate models each involving just one SNP were specified each one was significantly different from the null model and each SNP within the model had a significant odds ratio (data not shown). However, when the data were fitted to a single model involving all five SNPs (χ2=22.57, 5 d.f., P=0.0004), only Int6a and Int8a had odds ratios much greater than 1, of which that for Int6a reached significance (Table 4). When a stepwise procedure was adopted the optimum model required only the same two SNPs, Int6a and Int8a (χ2=21.98, 2 d.f., P=0.000017). An identical optimum was achieved irrespective of whether increasing numbers of SNPs were progressively added or removed from a starting configuration with all five SNPs.

Table 4 Logistic regression using 5 SNP and stepwise models

Discussion

We have previously reported a strong association between IGE and a SNP in exon 8 of CACNA1A both by case-control and within-family association studies.21 We have now identified several novel SNPs and, using an expanded version of the same sample, have shown that a total of five SNPs within a 30 kb region between exons 6 and 9 are associated with IGE. None of the associated SNPs have an obvious functional effect. Each association with IGE is therefore likely to be the result of linkage disequilibrium (LD) between the SNP and a putative functionally significant variant (FSV).

Recent data suggest that high levels of LD occur within blocks of SNP markers, that typically extend from a few kb to several hundred kb, with a mean of around 60 kb31,32,33 Within these blocks, SNPs are closely associated with each other and it is in general impossible to differentiate the functional variant from the other SNPs using association data alone. These high LD regions appear to be separated by shorter regions of around 1–2 kb, in which higher recombination rates occur (recombination hotspots). This means that LD between blocks is reduced or even absent.

The data presented here identify a group of four SNPs in such a block spanning a region of 2 kb around exon 8 (X8B), which are all in strong LD with each other and all associated with IGE. Another SNP (Ex6a), approximately 25 kb distant on the 5′ side, is not in LD with X8B and yet is strongly associated with IGE. This finding is likely to result from either two FSVs in the same gene that are independently associated with these two groups of SNPs, or a single FSV in LD both with X8B and with Int6a.

If a single FSV were responsible for all five associations with IGE, it would have to be in LD with all five SNPs. This could occur if the FSV were located between Int6a and Int7b such that some LD between the FSV and each SNP has survived despite the loss of LD between the two SNPs. Some support for this scenario was obtained by examination of the six most common 5-marker haplotypes which account for around 95% of the chromosomes in both cases and controls (data not shown). These haplotypes are consistent with a parsimonious model, where each haplotype is derived from two ancestral haplotypes, conferring increased risk of IGE (GTAAG) or protection (ACGGA). For all five SNPs IGE is associated with the major allele, suggesting that the risk ancestral haplotype was much more common than the protective ancestral haplotype. This could equally be due to a susceptibility FSV arising on the common ancestral haplotype or a protective FSV arising on the rarer ancestral haplotype.

Haplotype analysis of the five SNPs showed that pairwise haplotypes of Int6a with each of the four SNPs of X8B are more significantly associated with IGE than is either SNP alone (Figure 1) and highly significant associations were also found for three- and five-marker haplotypes. Furthermore, logistic regression demonstrated that no single SNP was able to explain the association of epilepsy with this gene, suggesting the possible involvement of two FSVs. However, none of these results can exclude the involvement of only one FSV in the association. Discrimination between these two alternatives will only be possible after identification and analysis of all SNPs within or close to this region of the gene.

Haplotype analysis of SNPs Int10a (G/A) and Int11 (T/G), which individually showed no association with IGE, showed significant differences between cases and controls. However, analysis of the four predicted haplotypes showed that GT and AG were apparently protective while GG and AT were both associated with increased risk (data not shown). As neither pair of risk or protective haplotypes share an allele in common, this result is inconsistent with an FSV occurring on a particular ancestral haplotype. This association was viewed as a likely false positive and not analysed further.

Some caution in the interpretation of all these data is necessary following a recent report by Sander et al,34 which failed to replicate our original association with the SNP Ex8.21 Their method of recruitment clearly differed from ours. Ours is an unselected broad-based IGE cohort with generalised spike/polyspike wave on EEG which was classified post recruitment. Theirs appears to consist of a grouping together of homogenous syndromes. JME, for example, constitutes 39% of their total population based sample compared to 17% in ours, although our finding was of significant association irrespective of clinical subtype of IGE. Their report does not mention how many patients had a family history for epilepsy but it is likely to be higher than in our sample as the same group has been part of a consortium for ascertaining cases with homogenous phenotypes for linkage studies. The presence in one cohort of a significant number of probands with a positive family history might be expected to result in a different range of variants being represented. Another possible explanation is that the two populations, collected from different European locations, differ in their genetic substructure (distribution of haplotypes) or in the balance of genetic risk factors (genetic heterogeneity). We have observed significant allele frequency differences between the two centres for other SNPs.35 While it remains possible that the findings reported here result from mere chance alone, this is highly unlikely given the lack of LD between one of the SNPs associated with IGE and the other four.

The work reported here and in our previous paper21 has largely excluded the region to the 3′ of exon 10. We have not yet excluded the region 5′ of exon 6, but as discussed above the region of greatest interest lies between exons 6 and 9. We have screened exons 6–9 by DHPLC analysis of genomic DNA from at least 30 IGE patients and 30 controls without finding any novel variant. We have also screened exons 1–13 of CACNA1A in cDNA derived from 16 controls with similar lack of success (data not shown). The putative susceptibility variant(s) in CACNA1A are therefore likely to be located in an intron and probably function either by affecting expression or by influencing splicing.

An alternatively spliced transcript of the rat homologue Cacna1a has been reported with differences at three sites. It expresses a channel with distinct properties that may be relevant to the different properties of P- and Q-type channels.36 One of these changes is at the exon 10 acceptor site which is only 2.6 kb from X8B associated with IGE. If similar alternative splicing occurs in humans it is conceivable that it might be modulated by an FSV. It is also possible that there is more alternative splicing yet to be discovered.

In humans several mutations in CACNA1A have been described which give rise to familial hemiplegic migraine (FHM), episodic ataxia type 2 (EA-2) and spinocerebellar ataxia type 6 (SCA6).19,37 Fifteen different amino acid substitutions at positions throughout the calcium channel subunit have been identified in FHM families,38 one of which is in the P loop of the second repeat and very close to the position of the tottering mouse mutation.18 Patients with this mutation also have progressive ataxia associated with FHM, as have patients with some of the other amino acid substitutions.38,39 Fourteen mutations in EA-2 families have also been described.37

Recently, an EA-2 patient was described who also has an absence epilepsy.20 The mutation in this patient results in a premature stop in the S6 transmembrane domain of the fourth repeat preventing translation of the C-terminal intracellular region. We have investigated this mutation in our cohort of IGE patients but failed to detect it.40 Since this mutation is rare it is unlikely to influence risk for common forms of IGE and cannot explain the data presented here. It is interesting that this mutation and the mutation in the tottering mouse are both implicated in absences, yet we could find no evidence for the association between CACNA1A SNPs and IGE being specific for absence epilepsies. This may be due to a similarly altered function in the two absence-causing mutations, whereas the FSV(s) responsible for the association with IGE may affect expression of the gene.

In conclusion, our data support one or two intronic FSV(s), close to the region between exons 6 and 9 of CACNA1A, that predispose to IGE but are not specific to any syndrome. We are currently screening for more SNPs in this critical region of the gene for fine mapping association studies. We are also investigating CACNA1A mRNA from the IOP brain bank to identify possible functional effects of the FSV.