Introduction

Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system (CNS), characterised by relapsing or progressive demyelination of the CNS. The aetiology of MS is poorly understood, although both genetic and environmental factors are considered to be of importance. Epidemiological analyses of familial clustering, twin concordance and adoptee studies1,2,3 supports the notion that several genetic factors are likely to confer susceptibility to MS as a complex trait.

Studies show that the HLA class II region on 6p21 confers susceptibility to MS, probably as a major genetic determinant,4,5 and also gives indirect support for the autoimmune mechanisms thought to be involved in the pathogenesis of MS. Non-HLA genes, however, have not been conclusively identified, in spite of rather large scale analyses of candidate genes, mostly focusing on components involved in immune regulation and possible autoimmune targets.6,7,8 In addition, four published genomic screens9,10,11,12 have indicated the possible importance of a few different loci, in particular, in 5p and 17q. A recent meta-analysis of published genomic screens identified several regions with nonparametric linkage scores (NPL) higher than 2, including 17q11 and 6p21.13 However, neither of these regions reached genome-wide significance for linkage, and most other potential loci differed between screens. These findings could result from a true but modest contribution of each involved gene, which thus are inefficiently detected by conventional linkage methods.

Searching for haplotype sharing among affected individuals in an isolated population is a potentially powerful alternative method for mapping disease genes.14,15 This method identifies regions shared identical-by-descent (IBD) by affected individuals through searching for conserved haplotypes. Therefore, it does not require knowledge of mode of inheritance and it is independent of accurate pedigree information. Usually, pedigree structures in isolated populations are too complicated to enable accurate evaluation of all possible routes of transmission of a phenotype and marker alleles.15,16,17 To some extent, this method is known to overcome certain drawbacks of classic lod score analysis, which is sometimes not capable of producing adequate proof for linkage. Recently, a kindred of 22 MS patients, originating from a single common ancestral couple in the 18th century, was reported from the community of Överkalix in the northern-most part of Sweden.18 Due to the geographic location, the population in this rural area is thought to have developed in relative isolation for some 300 years. Such a population is theoretically ideal for the identification of conserved disease-associated haplotypes, which includes gene mutations introduced by a few founders. In this report, we present a genetic analysis in this population by using a conserved haplotype sharing method.

Materials and methods

Pedigrees

A high incidence of multiple sclerosis (33 cases/about 4744 population) in the community of Överkalix northern Sweden previously has been described in detail.18 This isolated population is ethnically homogeneous and was probably founded in the 17th century by a few couples. In the present study, we collected DNA samples from 79 indivduals, including 31 persons with MS and 48 healthy relatives, from the Överkalix area. Using public archives, a genealogical analysis established that 19 of the MS patients and their relatives originate from a single common ancestral couple. Among them, 15 affected individuals provided samples from nuclear family members that permitted the construction of haplotypes. Twelve patients have so far not been identified as members of the pedigree.

Genotyping

Genomic DNA was purified from peripheral blood by a conventional method. The microsatellite markers used for genotyping were the human linkage mapping set from the Cooperative Human Genetic Center (CHLC)19 and published maps of Genethon.20 The genotyping procedures, based on the ABI 377 system for semiautomated genotying, were as previously described.7 Most markers from the CHLC database used for the initial genome screen were tetranucleotide repeats, thus minimising potential errors in allele determination. In addition, all individuals were genotyped for the HLA-DRB, -DQA and -DQB genes by the restriction fragment length polymorphism (RFLP) analysis.21

Haplotype construction

Haplotypes defined by alleles of adjacent marker pairs were constructed by tracing the parental origin of the alleles. When parents were not available for genotyping, haplotypes were, when possible, inferred from siblings or children.

Statistical analysis

In order to test for linkage diseqilibrium, the transmission/disequlibrium test (TDT) was used to determine whether alleles of a conserved haplotype demonstrated a higher frequency on disease-bearing chromosomes vs presumed normal, nontransmitted chromosomes.22 Within each nuclear family, the parental chromosomes transmitted to affected individuals were assumed to harbour a susceptibility allele, while those nontransmitted were assumed to harbour a normal allele. If no linkage disequilibrium exists, the expected frequency of transmitted and nontransmitted marker alleles should be 50%. The allele frequency differences between affected individuals and nontrasmitted parental allelels at each locus were compared to the expected 1 : 1 ratio using the χ2 test with one degree of freedom (1 df). Subsequently, haplotypes were constructed and haplotype frequencies were compared between affected individuals and nontransmitted haplotypes from their parents.

Results

Five affected individuals and their parents or siblings in four nuclear families (Figure 1) were selected for genome-wide genotyping with all 390 markers, spanning the genome at an average of 10 cM intervals. In this group, one sib-pair was included, since we considered regions unlikely to harbour susceptibility genes unless they were shared IBD in a sibling pair. Approximately 380 markers generated genotyping data. Using each pair of flanking markers, totally 358 haplotypes transmitted to patients were constructed. Next, we searched for segments shared by at least four out of eight chromosomes (50% or more) and also shared in the affected sibling pair. Seven segments on six different chromosomes were identified (Table 1). Surprisingly, haplotypes covering chromosome 6p21, harbouring the HLA gene complex, did not reach the selected threshold for sharing, although HLA class II genes have been identified as a major locus for MS in the Swedish population.4,23

Figure 1
figure 1

The pedigree of five MS patients selected for initial genomic screen and identified haplotypes expressed as length in base pairs of marker amplification product.

Table 1 Constructed flanking marker haplotypes shared by at least four of eight ‘disease’ chromosomes in the genomic screen. Only one of the suggested haplotypes was confirmed to be identical-by-descent (IBD) after analysis of additional markers

Subsequently, we genotyped all the affected individuals and healthy relatives for these seven segments, also using additional markers, mainly dinucleotide repeats from the Genethon database. In one chromosomal region, in 17p11, 12 out of 15 patients belonging to the large kindred shared some portion of the conserved haplotype. Assuming that the chromosomal region defined by the shared marker alleles might harbour a founder mutation, we subjected this region to testing for LD.

In order to evaluate the statistical evidence for this observation, we used TDT to detect LD for the markers based on the separate nuclear families within the large pedigree. The identified region at 17p11 consisted of five markers including two from the original set. Three of these five markers (D17S921, D17S1843 and D17S953) showed a significant deviation of transmission (P<0.05) in favour of transmission to affected individuals (Table 2). Furthermore, an identical four-marker haplotype (D17S921-D17S122-D17S1843-D17S953) was transmitted to six of 15 affected individuals, (40%) (χ2=7.0; P<0.01) (Table 3).

Table 2 Transmission analysis in 14 MS patients with informative parental alleles (in base-pairs) of a conserved, shared haplotype in 17p11. The number of transmitted (T) (to MS-patients) and nontransmitted (N) parental chromosomes are given, as well as results of the χ2 analysis
Table 3 Transmission analysis of related haplotypes on chromosome 17p11 defined by alleles of four adjacent microsatellite markers in 15 MS patients, where T signifies transmission and N nontransmission of a parental haplotype to an individual with MS

In addition, we genotyped all patients and healthy relatives for HLA DR and DQ. No increased sharing of haplotypes was observed. Likewise, a comparison of transmitted and non-transmitted DR15-containing haplotypes, failed to show statistically significant transmission distortion (χ2=0.7, P>0.05). 56% of the patients were carriers of DR15 allele in comparison with 50% of controls.

Discussion

We have used a haplotype sharing method to study a genetically isolated kindred in a community with a high prevalence of multiple sclerosis. Our results suggest the presence of a susceptibility gene for MS in 17p11. This conserved haplotype overlaps with a modest linkage peak observed in the British genome-wide screen.10 In addition, the MS genomic screen meta-analysis indicated an NPL score higher then 2 in this approximate region although the highest peak was found close to the neurofibromatosis-1 gene (17q11).13 The importance of this chromosomal segment was supported by several lines of evidence: (1) some portion of the conserved haplotypes, defined by four markers spanning 10 cM, was shared by 12 of the 15 patients; (2) particular alleles of the three markers showed linkage disequilibrium with MS by the TDT analysis; (3) six of 15 patients shared an identical four-marker haplotype (40%) with significant excess of transmission to patients.

The first conclusion of the present findings is that even in this large pedigree, in fact the largest MS pedigree to be described so far,18 there seems to be no single genetic factor that is both necessary and sufficient for the development of MS. Thus, even in this case, MS seems to fulfil the criteria of a genetically complex disorder.

Polygenes, and even other factors, such as environmental or stochastic events, are likely to influence the susceptibility to MS, which, as a consequence, results in a non-mendelian inheritance in MS. The contribution of each involved gene is likely to be small, as Ebers et al.9 have excluded 88% of the genome for a locus having an effect corresponding to λs>3, while Sawcer et al.10 have excluded 93% of the genome for a locus that contributes a λs of >5. The modest effect of each gene limits genetic mapping by conventional methods.24 The haplotype sharing analysis may have greater power for gene localisation, as this method is based on the principle of LD mapping which differs from classic lod score and affected sibpair analysis.15 LD mapping was first proposed by Landers and Bostein in 1986, and it has been applied to map genes for simple and even complex disorders in general populations.25,26,27 However, it was seriously questioned whether sufficient LD exists in the general population to allow gene identification at the genomic level.28 In contrast, genome-wide LD mapping in isolated populations has been proven efficient for rare diseases,15,16,17 but remains to be shown useful for complex traits such as MS.

The population of Överkalix has remained relatively isolated until recently, and the majority of the population is likely to be descendants of a small number of founders.18 We can assume that the chromosomal segments containing the disease genes brought into the population by a few ancestors are likely to be shared by most patients. The sizes of these segments are likely to remain large enough to be detected by the sharing search and further LD analysis, because of the low rate of recombination due to a limited number of meioses, and rare foreign genome interference. As MS is presumably polygenic, it is not to be expected that a possible single gene at 17p11 should confer the full genetic susceptibility to MS. The failure to detect other haplotypes could depend on insufficient density of the present marker map, or on the use of a limited number of affected individuals in the first screen in which only five patients were genotyped extensively. Durham and Freingold,29 using a simulation analysis, proposed that the median size of a segment shared IBD by five chromosomes, with a separation distance of six meiosis, is only 5.5 cM. Therefore, an optimal analysis of the present material might require an increased number of nuclear families in an initial screen and a denser marker map.

The chance of a false positive finding in a genome-wide analysis such as the present must be taken into account, although the theoretical risk has been evaluated using simulation analysis.15,29 In general, the shared haplotype method is considered to give a low rate of false positives. To minimise false positive findings in the initial screen, we set up the strict criteria outlined above. Indeed, a haplotype (D17S921D17S953) at 17p11 was shared by five of eight chromosomes among four patients and was shared by two affected sibs. In the continuation of this project, additional markers will be included to allow a denser marker map, which may permit the identification of smaller shared chromosomal segments. Further analysis of the identified segment in independent patient materials could help to delineate the region of interest.

The chromosome 17p11 region is rich in genes and a number of potential candidates may be selected. Possible candidate genes include peripheral myelin protein 22 and neurofibromin-1, both well known to cause neurological disorders.

In contrast, the transmission of DR15 to MS patients in this material was not more common than nontransmission. This implies that the HLA class II locus is less important in this kindred, compared to the general Swedish population of unrelated MS patients where its importance has been confirmed in several case–controls studies.30 In conclusion, our findings suggest the existence of a susceptibility gene for MS on chromosome 17p11. Further work is necessary to fine-map the 10 cM shared chromosomal segment.