Introduction

Missense single-nucleotide substitutions in the transthyretin (TTR) gene define autosomal dominant lethal hereditary types of systemic amyloidosis characterized by the extracellular deposition of protein fibrils as amyloid (ATTR) (MIM 176300). In Europe and Asia, autosomal dominant familial amyloid polyneuropathy (FAP), with peripheral sensorimotor, autonomic neuropathy and cardiomyopathy, is the most common syndrome.1 The most frequent mutation associated with FAP – V30M – is highly prevalent in Portugal, but is also found in Sweden, Japan, Spain, Italy, France, the United States and other countries. FAP V30M penetrance and age of disease onset vary among and within populations. In Portugal, the penetrance is high (at least 80%) and symptoms typically develop before the age of 40 years.2 Subsets of families with late-onset patients and aged asymptomatic carriers have also been identified.3,4 Some kindreds display genetic anticipation, that is, progressively earlier onset and more severe disease in the succeeding generations.5 Anticipation in many neurodegenerative diseases has been related to the expansion of trinucleotide repeats in specific genes; however, this form of dynamic mutation does not appear to be the case in Portuguese FAP.6

Swedish carriers of TTR V30M display lower penetrance (approximately 5%) and later onset, with symptoms usually developing after age 50 years.7,8 Disease onset in Japanese patients may be early or late.9

The first approach to genetic variation in FAP V30M involved the characterization of intragenic haplotypes defined by six intronic single-nucleotide polymorphisms (SNPs).10,11 The distribution of haplotypes associated with V30M disease in various populations has been established, and is not consistent with genetic drift arising from one ancient founder.10,11,12 Although some of the five haplotypes identified (out of 64 possible arrangements; named I–V) could have arisen by simple recombination events, others (e.g. haplotype III) would require complex crossovers and multiple recombinations within several hundred base pairs, or interchromosomal gene conversion, both statistically less likely than the occurrence of multiple founders, especially in view of the fact that the site, a CpG dinucleotide, constitutes a mutational hotspot.13,14

In Portugal and Sweden, only haplotype I had previously been found associated with the mutation. The second most common haplotype – III – segregates with V30M in French, Italian, British and Japanese kindreds. Thus, the evidence for recurring mutation is strong and descent from a single founder could only be explained by a rare combination of events. However, a single founder could be responsible for all of the cases in the Portuguese and Swedish populations. These conclusions have been based on analyses restricted to the exon and intervening intronic sequences and limited to a few markers. There are no data regarding genetic variation in regulatory sequences that might influence the clinical phenotype. Most importantly, the establishment of genotype–phenotype correlations, namely a genetic basis for the variability in clinical presentation and age of onset, if present in the vicinity of the gene, but not within it, is difficult.

To overcome these limitations and further characterize genetic variation within and around the TTR gene, we performed nucleotide sequence analysis and extended haplotyping in Portuguese patients with variable disease onset, and Swedish subjects.

Material and methods

Subjects and DNA samples

The present study included 127 FAP families with a total of 191 individuals (170 carriers and 21 noncarriers), and 92 control subjects (Table 1). Of the V30M carriers, 140 were affected and 30 were asymptomatic. The term Familial is used to identify individuals from families with several members studied here; the term Singleton refers to single representatives of distinct FAP families. The male-to-female ratio among the 152 Portuguese subjects was 1:1 (78 men and 74 women). Of the affected individuals, 61 were men and 61 were women, with a sex ratio of 1:1, while the 30 asymptomatic carriers showed a male-to-female ratio of 1.3:1. Among the Swedes, the sex ratio was 1.5:1 in the 18 patients. All individuals referred to as carriers, whether affected or not, had the TTR V30M as assessed by a restriction essay described elsewhere.15 The affected subjects had been diagnosed with FAP, according to the presence of characteristic clinical features of the disorder.1 In all, 58 Portuguese and 34 Swedish unrelated individuals, who did not carry the V30M mutation, served as controls. Blood samples were obtained with informed consent and DNA was extracted by standard techniques.

Table 1 Study subjects

Age of onset

Among the 122 affected carriers from Portugal, the age of onset was known for 113 patients and averaged 41.4±14.4 (SD) years, ranging from 21 to 80 years. In the classical onset subset (below age 40, n=71), the mean age of onset was 31.3±4.6 years, whereas the late-onset subset (above age 50, n=39) presented an average onset of 59.5±6.6 years. Of the 30 asymptomatic carriers, 18 were above 50 years of age at the time of the study, ranging from 53 to 82 years with an average of 66.6±10.8 years. Eight V30M carriers were asymptomatic over 70 years of age. All the Swedish patients were of late onset (above 50 years).

Genomic DNA sequencing

Genomic DNA from 10 Portuguese V30M carriers, five with onset at ages 22–24 years (early onset), three with onset between ages 63 and 72 years (late onset), and two asymptomatic carriers aged 71 and 72 years at the time the study was initiated (classified as late onset) was sequenced. A Portuguese noncarrier was used for the 3′ UTR portion of the target interval. A region of 19 888 bp (comprising the 7620 bp of TTR complete cds (GenBank M11518), plus 6,559 bp upstream (5′ UTR) and 5709 bp downstream (3′ UTR)) was amplified from each genomic DNA sample in 45 overlapping segments and sequenced using the same primers employed for PCR amplification (Table 2a).

Table 2 Primers employed for PCR amplification and cycle sequencing

PCR amplification was performed in a MWG Biotech Primus 96 HPL thermal cycler with AmpliTaq Gold (PE Applied Biosystems, Foster City, CA, USA) in 25-μl reactions containing 10–50 ng of genomic DNA. The samples were denatured at 96°C for 6 min and cycled 35 times with denaturation at 94°C for 30 s, annealing at temperatures specified above for 30 s and extension at 72°C for 30 s, followed by a final extension at 72°C for 10 min. Following column purification with a PCR-purification kit (Qiagen, Valencia, CA, USA), sequencing of both strands was performed with dye terminator chemistry (PE Applied Biosystems) according to the manufacturer's instructions. Sequencing reactions were electrophoresed on an ABI Prism 377 Sequencer (PE Applied Biosystems); data were analyzed with Sequencher 3.0 (Gene Codes, Ann Arbor, MI, USA).

Microsatellite analysis

The microsatellite study included 18 Portuguese families with 54 V30M carriers and 20 noncarriers, three Swedish kindreds with nine carriers and one noncarrier, 17 singletons (eight Portuguese and nine Swedes) and six Portuguese controls. Of the Portuguese kindreds, eight had classical onset, six late onset and four exhibited anticipation; six Portuguese singletons had classical onset and the remaining two had late onset. The Swedish subjects, either familial or singletons, had late onset.

Genotyping in the chromosomal region of TTR on proximal 18q was performed with markers D18S457 (GDB:239675), D18S456 (GDB:239623) and D18S1133 (GDB:659622) (centromere to telomere). The Genebridge 4 (GB4) map (The International RH Mapping Consortium) places the TTR gene immediately centromeric of a 2 cM interval defined by D18S457 (microsatellite anchor marker AFM238vb6) and D18S456 (microsatellite anchor marker AFM234zb6). This interval is located approximately 700 kb downstream of TTR cds and spans 1.3 Mb. Marker D18S1133, telomeric to D18S456, was used to facilitate the inference of linkage phase. Genotyping for D18S457, D18S456 and D18S1133 was performed using the PCR conditions and oligonucleotide primers available through the Genome database. Additionally, we designed primers to amplify two CA-repeat elements previously identified in TTR 5′ UTR,16 in order to determine if the sites were polymorphic and could be used as microsatellite markers (Table 2b). All primers were assembled using standard fluorophore chemistry (PE Applied Biosystems); amplification was followed by electrophoresis on an ABI Prism 377 DNA Sequencer with the GeneScan 3.1 software (PE Applied Biosystems). The data were analyzed with Genotyper 2.5, and haplotypes were constructed minimizing the number of intrafamilial recombinations. For each pedigree, the parental alleles associated with V30M were determined and counted only once.

SNP analysis

Four of the eight SNPs identified by genomic sequencing in TTR flanking regions were characterized in 117 V30M carriers and 87 controls. In all, 97 Portuguese carriers (90 affected, seven asymptomatic) were separated into two classes according to the age of onset: 58 had classical onset (below age 40 years) and 39 had late onset (above age 50 years). The asymptomatic subjects were included in the latter group since they were already over 60 years of age at the time the study was initiated. The analysis also included 54 Swedish subjects (20 carriers and 34 controls) and 53 Portuguese controls.

Genotyping was performed by PCR amplification of the target sequences using primers with mismatched nucleotides to create artificial restriction sites for allele detection. The SNPs, designated 5′III (A/G, nt 2970, 5′ UTR), 3′I (C/T, nt 21, 3′ UTR), 3′II (C/T, nt 162, 3′ UTR) and 3′V (T/C, nt 4934, 3′ UTR), were amplified using the primers shown in Table 2C, in which the lowercase letters are the mismatched nucleotides. The samples were denatured at 96°C for 6 min and cycled 35 times with denaturation at 94°C for 30 s, annealing at temperatures specified above for 30 s and extension at 72°C for 30 s, followed by a final extension at 72°C for 10 min. The amplicons were digested for 14 h with 10 U of NdeI (5′III, for which the A-to-G transition destroys the restriction site), BsmI (3′I, for which the C-to-T transition creates a natural restriction site), NcoI (3′II, for which the C-to-T transition destroys the restriction site) and BsrGI (3′V, for which the T-to-C transition creates a restriction site). Digested products were electrophoresed in 4–20% TBE polyacrylamide gels (Invitrogen, Carlsbad, CA, USA), stained with ethidium bromide and visualized under ultraviolet (UV) light. Genotyping of TTR intronic SNPs was performed as described elsewhere.12

Haplotype determination

The linkage phase of haplotypes was inferred by pedigree analysis in familial cases. For singleton subjects, haplotypes were assigned unambiguously in case of complete homozygozity or heterozygozity at a single site. Complex cases with more than one heterozygous site were resolved by identification of a set of putative haplotypes by ‘subtracting’ the unambiguous haplotypes.17

Statistical analysis

Statistical analyses were performed using the GraphPad Instat 3.01 package (GraphPad Software, San Diego, CA, USA) and StatXact, Version 5 (Cytel Software, Cambridge, MA, USA). The null hypothesis of no linkage disequilibrium was calculated for biallelic and multiallelic loci using a Fisher exact test (FET) through an exact calculation of the hypergeometric distribution, as implemented by StatXact.18

Portuguese and Swedish populations were evaluated independently. Carrier and control chromosomes were compared with respect to microsatellite alleles (13) and their respective haplotypes (14) (Tables 3 and 4) 5′ CA repeat alleles (2) (Table 6), flanking SNPs (8) and their haplotypes (6) (Tables 8 and 9). When both haplotypes and SNPs were available, only the haplotype comparisons were used, since they subsumed the individual SNPs. To account for the performance of 22 comparisons (14 microsatellite haplotypes, 2 CA repeat alleles, 6 SNP-determined haplotypes), we utilized a very conservative estimate of significance, that is, P<0.0023 (Bonferroni). Significant values are indicated in bold type.

Table 3 Allelic associations for FAP families
Table 4 Distribution of microsatellite haplotypes among Portuguese and Swedish kindreds
Table 6 Allele frequencies of a polymorphic CA-repeat in transthyretin 5′ untranslated region in the Portuguese and Swedish populations
Table 8 Allele frequencies of four newly identified SNPs in transthyretin flanking regions in the Portuguese and Swedish populations
Table 9 Frequencies of haplotypes defined by three TTR flanking SNPs among V30M carriers and controls in the Portuguese and Swedish populations

Classical and late-onset FAP patients were compared with respect to the haplotype distribution on the mutation carrier and noncarrier chromosomes. For the V30M carrier subsets, the FET was conducted using the ‘combined allele’ and the ‘multiallele’ methods. In the ‘combined allele’ method, the haplotype that is over-represented among V30M chromosomes was compared to all other haplotypes combined, whereas the in ‘multiallele’ method the haplotypes were not combined.

Results

Microsatellite allelic and haplotypic associations

The allelic distributions of the polymorphic microsatellite markers D18S457 and D18S456 are shown in Table 3. The 118-bp allele (3) of D18S457 was associated with V30M in 100% of the cases. However, due to the high frequency of allele 3, which is in accord with the estimated frequency in Caucasians (0.75) (GDB:2396675), it did not differ significantly among carrier and noncarrier chromosomes. The 235-bp allele (2) of D18S456 was significantly associated with the mutation on 86.8% of carrier chromosomes versus 45.8% of noncarrier chromosomes. When the hypothesis of linkage disequilibrium was tested for the Portuguese and Swedish populations independently, the associations remained significant for D18S456, but nonsignificant for D18S457 due to the high frequency of allele 3 in both populations and the relatively small sample size for the Swedes.

In all, 14 of a possible 42 haplotypes for D18S457–D18S456 were observed in our study population. Table 4 summarizes the microsatellite haplotype distribution in carrier and noncarrier chromosomes, respectively. Haplotypes 3-2 and 3-1 were associated with V30M in the Portuguese, whereas, among the Swedes, 3-2 was the only haplotype found in carrier chromosomes. Among the remaining 12 haplotypes identified, 11 were observed in the Portuguese and four in the Swedes. The infrequent haplotypes were only found in one or two chromosomes (not shown).

The most frequent haplotype associated with V30M in Portuguese families was equally distributed among individuals with classical and late onset (Table 5). In contrast, the analysis of noncarrier chromosomes in 37 V30M patients revealed a possible significant association between haplotype 3-2 and late onset (P=0.0128 by FET; odds ratio: 8.50; 95% CI: 1.503–48.07, when compared to the frequencies of all other haplotypes combined), suggesting a possible modulatory effect on the age of onset by a factor encoded in the interval on the noncarrier chromosome. The data were also analyzed without combining haplotypes, using an exact calculation of the hypergeometric distribution (FET)18 as implemented by StatXact. Although an exact P-value of 0.0419 was obtained, the differences were not significant after correction for multiple comparisons. A similar analysis without haplotype 3-2 resulted in a lack of statistical significance (P=0.4778), suggesting that the possible association is wholly due to this most common haplotype.

Table 5 Microsatellite haplotypes in carrier and noncarrier chromosomes of Portuguese FAP patients by group of disease onset

The 5′ untranslated region CA dinucleotide repeat (nt −643, relative to TTR cds) was found to be polymorphic with two alleles identified in 92 Portuguese and 18 Swedish chromosomes (Table 6). The (CA)10 allele was associated with V30M in both populations, but it did not approach statistical significance for the Swedes, presumably because of the smaller size of this subset. However, the association with the mutation was not exclusive as 31.0% of the noncarrier chromosomes bore the allele. The (CA)9 allele was not found on any of the 39 carrier chromosomes and accounted for an average of 32% of the noncarrier chromosomes in the two populations.

TTR sequence variation survey

Human TTR, located on chromosome 18q11.2–q12.1,19 has four exons distributed across approximately 7.6 kb of genomic DNA. A region of 19 888 bp was sequenced in 10 Portuguese V30M subjects (see Material and methods). A contiguous baseline reference sequence for the target region was obtained from GenBank. DNA variants were identified by matching each individual's DNA sequence to the reference sequence and to that of the other subjects. To minimize ambiguities, the sequence was obtained from both strands. In all, 16 variable sites were identified among 11 individuals (Figure 1), of which the six intronic sites had previously been described.10 Of the 10 newly identified polymorphisms, eight result from single-base substitutions (seven transitions, one transversion), and two from insertion/deletions reflecting copy number changes in dinucleotide sequences. The variable site designated 5′I was observed in a single chromosome among the 20 chromosomes. Variable sites 5′II, 3′IV and 3′VI were identified in three of the 10 subjects (3/20 chromosomes), suggesting that the allelic variants are common. Polymorphism 3′IV represents the only transversion (T-G) among the eight single-base substitutions identified here, while variable site 3′VI constitutes an insertion/deletion with two alleles resulting in a qualitative rather than quantitative change, as the insertion of a CT dinucleotide in a CT-repeat tract accompanies the deletion of a CA from an adjacent CA-repeat sequence.

Figure 1
figure 1

Schematic representation of the genomic interval surveyed by direct sequencing and polymorphisms newly identified with respective DNA sequence electropherograms. The numbering in Italic refers to the position of the polymorphisms relative to TTR cds (GenBank M11518); TTR exons are numbered from 1 to 4; the intronic SNPs identified by Yoshioka et al10 are numbered from I to VI; (CA)n: CA-dinucleotide repeats. From Sakaki et al16 – TATA: TATA box; CAAT box; H-1, H3, H-4: binding sites for hepatocyte nuclear factors HNF-1, HNF-3 and HNF-4, respectively; Tf: common motif to Tf-LF-1, Tf-LF-2 and LF-A1; enhancer: a region highly homologous to a tissue-specific enhancer of the mouse TTR gene; AFP: binding site for AFP-1 factor. Hom in blue font: homozygous for the upper allele; Het: heterozygous; Hom in red font; homozygous for the bottom allele.

Three sites were identified only on single chromosomes. Site 5′I, previously referred to, probably represents a rare polymorphism. Site 3′I is a confirmed rare polymorphism (shown below), while site 3′III, not present in any of the 20 chromosomes initially surveyed, was identified in a single kindred as a consequence of extended studies (see below).

To investigate a possible association with disease onset, the polymorphisms were organized in haplotypes. Six haplotypes resulted from these variations (Table 7). Three subjects were homozygous for the most common haplotype and all had developed the first clinical symptoms at an early age (22, 23 and 24 years of age). The two other early-onset individuals (onset at 21 and 22 years of age) were heterozygotes with haplotypes i–vi, and i–ii, respectively. Interestingly, the individuals bearing the single-substitution haplotypes (ii, iii and iv) were late onset with disease symptoms developing at ages 63 and 64 years and an asymptomatic carrier aged 74 years. The remaining late-onset individuals (onset at 72 years of age and an asymptomatic carrier aged 72 years) were heterozygotes bearing haplotypes i and ii. Although these results could be indicative of an association between particular haplotypes (or alleles) and the age of disease onset, they could not be statistically validated since the sample was too small and the distributions were complex. No mutations other than V30M were identified in the screened subjects, except for G6S, a nonsynonymous substitution described as a neutral polymorphism,20 identified in a compound heterozygote. The individual heterozygous for G6S and a subject homozygous for V30M both displayed classical onset and were homozygous for haplotype i, the most frequent haplotype.

Table 7 Frequencies of TTR haplotypes defined by the newly discovered SNPs among 20 FAP individuals surveyed by genomic sequencing

SNP haplotype analysis

We extended the analysis of four SNPs to 97 singletons (i.e. single representatives of distinct FAP families, V30M carriers), 20 Swedish carriers (including familial cases), 53 Portuguese and 34 Swedish controls. A total of 234 chromosomes from V30M carriers and 174 control chromosomes were genotyped by restriction analysis for polymorphisms 5′III, 3′I, 3′II and 3′V. Data on the allelic frequencies are shown in Table 8. Alleles A, C and T (sites 5′III, 3′II and 3′V, respectively) were strongly associated with V30M in the Portuguese. Among Swedes, the allele frequencies did not differ much between carriers and controls, suggesting that the alleles associated with the mutation are also very common in the general population. Overall, the alleles specified above were present in all V30M individuals, either as homo- or heterozygotes, except for a single subject who was homozygous for the alternative nucleotides (discussed below). Site 3′I was a rare polymorphism among the Portuguese (not determined for the Swedes), with the most common allele accounting for approximately 99% of the total number of chromosomes.

Haplotypes for 5′III–3′II–3′V were constructed as described in Material and methods. Of the eight possible combinations, six haplotypes were identified (Table 9). Fisher's exact analysis was used to test the null hypothesis of no linkage disequilibrium. It confirmed the strong association between haplotype A-C-T and V30M, as suggested by the sequencing results, the allelic distributions and the frequency of homozygote carriers (42/117). Although no such association was found for the Swedish population, A-C-T was the haplotype bearing the V30M mutation. Homozygote carriers and the use of familial cases established the linkage phase and validated the association of the haplotype – also common in the general population – to the mutation. Of the four least frequent haplotypes, only two were identified in the Swedes, occurring at a frequency similar to that seen in the Portuguese.

To investigate a possible effect of the noncarrier haplotype in disease onset, the Portuguese subjects were divided into the following groups: classical onset (below age 40 years, n=58) and late onset (above age 50 years, n=39). The distribution of haplotypes between the two groups was not significantly different (P=0.251; χ2=6.612, 5 d.f.), suggesting that the potential modulatory effect described above would not be located this close to TTR.

A new V30M-associated haplotype in the Portuguese population

The analysis of the nucleotide substitutions reported above resulted in the identification of a different set of alleles in association with the V30M mutation. Among the 117 V30M carriers surveyed, we identified one subject homozygous for alleles G, T and C (sites 5′III, 3′II and 3′V, respectively), bearing an unambiguous haplotype that would require several recombination events to arise from intragenic haplotype I. Using direct genomic sequencing, the individual (an asymptomatic carrier aged 53 years, designated N) was shown to carry the extended haplotype ii: G-G-G-(CA)9-C-T-G-G-C-(CT)9(CA)9, which includes intragenic haplotype III, a haplotype found for the first time in association with V30M in a Portuguese. It must be noted however, that the proband's father, even though of Portuguese nationality, had been abandoned in childhood. Hence, the individual may not be of long-standing Portuguese descent.

In the same kindred, a new SNP (not observed among the 10 subjects surveyed) was identified, consisting of a G-A transition and designated 3′III. Two noncarrier sibs were also genotyped for polymorphisms 5′III, 3′II and 5′V. The pedigree and respective haplotypes are shown in Figure 2. Subject N and his affected sib were homozygous for extended haplotype ii (intragenic haplotype III), whereas the remaining relatives were heterozygous for i and ii (intragenic haplotypes I and III), with the mutation associated with the latter. Thus, the mutation had been inherited from the father, but it could not be determined whether it had arisen in this individual or in his ancestors.

Figure 2
figure 2

Pedigree of the kindred bearing a new disease-associated haplotype and respective RFLP assays for polymorphisms 5′III, 3′II and 3′V. The lanes numbered from 1 to 5 refer to the RFLP analyses of the corresponding individuals for each of the specified polymorphisms. Genotypes for polymorphisms 5′III, 3′II, 3′III (exclusively identified in this family), 3′V and the classical intragenic haplotypes are shown below each subject, 5′ to 3′. Affected individuals are represented by black symbols; asymptomatic V30M carriers are identified by dot-containing symbols; noncarriers are indicated by N-containing symbols. Boxed haplotypes carry the V30M mutation. Dates indicate the year of birth and, when applicable, year of death. Arrow: proband, individual N; o: age of onset; Intr Hap: intragenic haplotype; L: 25-bp DNA ladder.

Interestingly, subject N and his affected sib were homozygous for allele A of SNP 3′III, implying that the nonaffected mother was also a carrier of this apparently rare polymorphism. The screening of TTR exons did not show additional mutations in the carrier siblings, but revealed the presence of a G6S variant in the father lying on the allele opposite to the mutation and associated with haplotype I.

Discussion

Our analysis of TTR and neighboring regions by means of microsatellite- and SNP-based haplotype analysis coupled with genomic sequencing in Portuguese and Swedish FAP kindreds has provided more extensive knowledge of the TTR gene and its surroundings. We have characterized the TTR UTRs and described a V30M-bearing haplotype not previously seen in the Portuguese population.

The observation that haplotype 3-2 was common in late-onset (n=9/18) but not in early-onset cases (n=2/19), relative to all other haplotypes together (p=0.0127) (Table 5), suggested that there may be a modulating effect on the age of onset in FAP V30M patients exerted by a locus within or closely linked to the interval defined by D18S457 and D18S456, and associated with haplotype 3-2 in the accompanying chromosome. Since the large number of possible haplotypes required multiple comparisons, when each haplotype was evaluated separately, the observation did not achieve statistical significance in that analysis; nonetheless it was suggestive of an influence of the noncarrier chromosome. If it can be independently validated, the effect suggested here conforms to a model in which a specific allele at a locus close to the wild-type TTR on that chromosome modulates the initiation of the first symptoms of disease through a trans-acting mechanism. The interval defined by microsatellite markers D18S457 and D18S456 on 18q contains two genes with protein products of unknown function and six potential genes based on observed ESTs. Beta-1,4-galactosyltransferase 6 (B4GAL-T6) and meprin A beta (MEP1B), as well as four genes with protein products of unknown function and seven possible genes related to known ESTs, map to the 700 kb region separating the marker interval from TTR (Public Genome Database).

Within the interval of 20 kb containing the TTR coding sequence, all the known regulatory regions and additional contiguous 3′ and 5′ sequences, we identified 10 new polymorphisms. Among the six haplotypes that could be constructed, the two most frequent intragenic haplotypes were found: haplotype I, on 17 chromosomes, and haplotype III, on the remaining three and not in association with V30M. No relationship could be established between the variable sites (and resulting haplotypes) and a particular mode of disease onset given the small sample; nonetheless, three of the five late-onset subjects were carriers of distinct haplotypes with a single heterozygous site, an occurrence not observed among the early-onset cases. Overall, the amount of variation found in the 20 chromosomes surveyed in this study (including coding sequence mutations) constitutes the expected level of variation, with a nucleotide diversity of 0.001 (or about one variable site per 1000 bp), which falls within the range of reported values estimated from a conservative analysis of allelic variation obtained from a scan of the human sequence database (0.001–0.003).21,22

Variable sites 5′III, 3′II and 3′V were typed in singleton and familial subjects from Portuguese and Swedish kindreds. All carriers were found to bear the haplotype associated with V30M, formed by alleles A, C and T, respectively. Linkage phase was established by familial cases and was in accordance to the phasing suggested by the genomic sequencing survey. The association of V30M with a single haplotype in this study is consistent with the reports of a sole intragenic disease-associated haplotype in these populations.

We identified one V30M asymptomatic carrier of possible Portuguese origin, who was homozygous for haplotype G-T-C, comprising intragenic haplotype III. Haplotype III has not previously been seen in association with V30M in Portuguese carriers, but had been found associated with the mutant allele in kindreds from Japan, France, Britain and Italy.10,23,24 In the Portuguese and Japanese populations, intragenic haplotype III has been reported to account for 47.4 and 35.0% of the noncarrier chromosomes, respectively, thus representing the second most common haplotype in these countries.10,23 Hence, the occurrence of V30M in the context of this haplotype could have been the result of migration of a founder from another European population or a new V30M mutation in the setting of a haplotype common in Portugal. The latter would be consistent with the growing evidence for recurring mutation in TTR. In either case, our data constitute the first report of a haplotype other than haplotype I in association with FAP in Portugal.

Among the six TTR-flanking haplotypes identified in the Portuguese, haplotypes A-C-T (i) and G-T-C (ii) were observed to encompass intragenic haplotypes I and III, respectively. The correspondence, unequivocally established for several subjects who were also genotyped for the intronic SNPs, was corroborated by the haplotypic frequencies obtained here (Table 9). The remaining haplotypes (A-T-C; G-T-T, G-C-T and A-T-T) also included different intragenic haplotypes (as observed in some subjects). They may arise by recombination between flanking haplotypes i and ii, as did intragenic haplotype II from I and III and intragenic haplotype IV from I and V. The relationship between the TTR-flanking haplotypes described in this paper and the intragenic haplotypes described elsewhere is proposed in Table 10. The correspondence of the polymorphic CA stretch and the microsatellite markers, based on direct evidence (from individuals genotyped for the distinct types of polymorphisms) and inferential procedures, is also established.

Table 10 Composite of the haplotype results obtained in the studies presented here: possible correspondence between the haplotypes based in TTR flanking SNPs and microsatellites (this study) and the intragenic haplotypes described elsewhere

Founder effects have clearly played a role in the population genetics of FAP. In spite of the growing evidence supporting recurring mutation in TTR to generate pathogenic variants in new chromosomal settings, the modest variation between Portuguese and Swedish carrier chromosomes within the surveyed intervals is not sufficient to exclude a single founder hypothesis in these two populations. Conversely, the high frequency of the disease-associated haplotype among noncarrier chromosomes, particularly in the Swedes (68% for the SNP- and 37.5% for the microsatellite-defined haplotypes) may mask the occurrence of de novo mutations within the same haplotype. The study of microsatellite haplotypes defining a 2 cM interval (1.3 Mb) downstream of (and very close to) TTR has broadened the survey for variation beyond the limits of the TTR intronic boundaries imposed by the earlier intragenic haplotype analyses. The identification of a single microsatellite haplotype (3-2) linked to V30M in Swedish patients (versus six haplotypes in noncarrier chromosomes) is supportive of the earlier studies, assigning a single intragenic haplotype to the Swedes. In contrast, the association of V30M with two microsatellite haplotypes (3-2 and 3-1) in the Portuguese is not in agreement with the earlier data for this population; however, the former haplotype could be derived from the latter by recombination rather than by a new mutation arising in association with haplotype 3-1. Two lines of evidence support this notion. The haplotype frequencies among FAP patients and the general population (19.2 and 21.3%, respectively) are very similar. Furthermore, the SNP haplotype analysis revealed a single haplotype (A-C-T) (with a sole exception) bearing the V30M mutation. The extended analyses may be more conclusive than those carried out previously, in that they consider the background variation in a broader interval around and in the vicinity of TTR. The identification of 13 microsatellite and six SNP haplotypes in Portuguese subjects, which, based on the total number of chromosomes examined, are significantly different from the six microsatellite and four SNP haplotypes in the Swedes illustrates the genetic diversity between these populations and is consistent with the older age of the Portuguese population.25 It suggests, from a genetic standpoint, that the Portuguese could have introduced the V30M mutation to Sweden with a common ancestor explaining the high frequency of a particular TTR flanking and intragenic haplotype linked to V30M in both countries.