Introduction

Since 1991, more than 12 diseases have been identified that are caused by amplifications of trinucleotide repeats. Although we have learned a lot about the genetics of the ‘trinucleotide repeat diseases’, not much is known about the mechanism of repeat expansion. The fragile X syndrome, one of these trinucleotide repeat diseases, is an X-linked disorder affecting 1 in 4,000 males and 1 in 6,000 females [1]. The main characteristics of the syndrome are mental retardation and macroorchidism [2]. The trinucleotide repeat causing the disease is a CGG repeat present in the 5′ untranslated region (UTR) of the fragile X mental retardation 1 gene (FMR1) [3]. Expansions of this repeat above 200 triplets, full mutations, cause methylation of the CpG island including the promoter region and the repeat itself [4, 5]. This methylation blocks the transcription and therefore results in the lack of the fragile X mental retardation protein (FMRP) [6, 7].

In nonaffected individuals, the highly polymorphic CGG repeat varies between 6 and 200 triplets with 29–32 triplets being the most common. These alleles are divided into two classes. Normal alleles, repeats between 6 and 54 triplets, inherited stably, whereas so-called premutation alleles of 43 to 200 repeat units show instability upon transmission [8, 9]. These premutations do not cause the disease, but they are prone to expand to full mutations in next generations.

Instable inheritance of the CGG repeat is observed via paternal and maternal transmission, but full mutations only arise upon female transmission. Inheritance via the paternal line never gives rise to full mutation alleles. Because of this parent of origin effect it has been suggested that expansions occur during gametogenesis [10]. However, it is also possible that expansions occur during early embryonic development. For obvious reasons, data about oocytes and early embryogenesis are limited. Recently, female fetuses showing a full mutation in somatic tissues were described whereas full mutations were already present in the oocytes [10]. Still the exact timing of the repeat expansion is not known, but it shows that the female germline is not protected against full mutation expansion.

In sperm of patients only permutations were found [11], suggesting contraction of full mutations in the immature testis. A hypothesis is that during spermatogenesis a kind of selection mechanism prevents germline precursors with a full mutation to arise or to develop into mature spermatocytes [10]. It has been suggested that FMRP is necessary for gametogenesis although the knock-out mice for Fmr1, lacking Fmrp completely, are fertile [12]. Previously, transmission of a deletion causing fragile X syndrome through a family could be traced back to the deceased grandfather. This grandfather transmitted the deletion to 3 daughters indicating that the deletion was already present in his sperm cells [13]. These findings make it less likely that FMRP is necessary to produce spermatocytes.

Moreover, somatic instability does also occur. Most patients show different lengths of full mutations often present as a smear on Southern blot analysis [14]. This shows that repeat length changes still occur in the early embryo. It has been estimated that in 40% of the fragile X patients a premutation is also present [15]. These mosaics show always one premutation allele and a smear for the full mutation. From previous studies, it is clear that this mosaic pattern is not caused by permanent mitotic instability. It is thought that mitotic instability can arise during a fixed window in early embryogenesis [1618].

At this moment it is not known which mechanisms are involved in expansions, mitotic or meiotic. To study these mechanisms, an animal model is required. Only in an animal model it will be possible to study gametogenesis and early embryogenesis at specific time points. To study the behavior of a premutation allele upon next generations, we generated transgenic mice with a premutation allele, (CGG)81 in the FMR1 promoter. The FMR1 promoter was fused to the reporter gene LacZ. A similar construct with a (CGG)16 repeat showed a similar expression in the complete FMR1 gene [19]. Assuming that the mouse model resembles the human situation regarding repeat instability, expansion of the repeat from a premutation to a full mutation is expected to be followed by methylation of the promoter region of FMR1 [5, 20]. It is therefore expected that if the repeat in these transgenic mice expands to a full mutation, expression of LacZ will be absent due to methylation of the FmRl promoter. Using this fusion gene as a transgene, the instability of the repeat upon transmission was studied.

Materials and Methods

Construct with the Expanded CGG Repeat

The construct of the FMR1 promoter fused to the LacZ reporter (FMR1/LacZ) was kindly provided by Hergersberg et al. [19]. The NruI/XhoI fragment containing a repeat of 16 CGGs in this construct was replaced by a NruI/XhoI fragment containing a premutation allele, present in pRN2 [21]. The repeat present in this fragment was amplified from a premutation carrier (CGG)81, [(CGG)11AGG (CGG)60CAG(CGG)8]. This repeat contains a pure (CGG)60 tract. This length is sufficient to cause instability in humans. Downstream of the pure (CGG)60 repeat, there is a CAG interruption and a (CGG)8.

Transgenic Mice

The transgene was isolated from agarose gel after a NotI digest. For microinjection, a transgene concentration of 5 ng/µl was used. The transgene was injected in the pronuclei of fertilized oocytes of FVB mice. After microinjection, the oocytes were transferred to a BCBA foster.

Transgenic animals were identified with PCR performed at tail DNA. PCR conditions were 20″ 94°C, 20″ 65°C and 45″ 72°C and primers E (5′ TGGGCCTCGAGCGCCCGCAGCCCACCTCTC 3′) and AGβ (5′ GGGATGTGCTGCAAGGCGATTTAAG 3′) were used. The product was visualized on a 2% agarose gel.

Repeat Stability

To determine the repeat length in the transgenic animals, radioactive PCR was performed with primers C (5′ GCTCAGCTCCGTTTCGGTTTCACTTCCGGT 3′) and ACβ (5′ GTACCCGGGGATCCTCTAGCGCCGGGA 3′). PCR conditions were as described by Deelen et al. [22]. The PCR products were run on a Polyacrylamide gel. Small changes in repeat length could be detected in this way. To determine whether there were large expansions of the repeat, Southern blot analysis was performed for a number of transgenic animals.

Results

Transgenic Animals with Enlarged CGG Repeat

Unstable CGG repeats in humans of a premutation size have a length between 43 and 200 repeat units. To introduce a repeat length in this range into mouse, we have replaced the normal CGG repeat in the FMR1 promoter by a repeat of 81 repeat units. The FMR1 promoter was located upstream a LacZ reporter gene (fig. 1).

Fig. 1
figure 1

Schematic representation of pENPL2 containing the FMR1 promoter region with (CGG)81 repeat upstream of the LacZ reporter gene.

Transgenic mice were obtained by microinjection of the construct pENPL2 into fertilized eggs. Tail DNA from pups was tested for the presence of the transgene and the number of CGG repeats. Three independent transgenic lines, B6, B7 and B29, were obtained. The copy number of the transgene was one for B7 and B29 and two for B6. This was determined by Southern blot analysis (data not shown). The repeat length in the founder mice was exactly 81 repeat units as in the original DNA construct used for microinjection. In order to study the stability of the repeat upon germline transmission, the transgenic founders were crossed with wild-type FVB mice. Transgenics of the first generation were crossed with transgenic litter mates if possible. When there were no transgenic litter mates available, transgenic animals were crossed with wild-type FVBs. Line B6 and B7 were crossed till the seventh generation, line B29 was crossed till the sixth generation. For each line, the transgene is situated on one of the autosomes as was concluded from the ratio of male and female transgenic progeny.

In total, 263 transgenic animals were identified as shown in table 1. Because of the parent of origin effect for the repeat instability seen in fragile X syndrome, the sex of the transmitting parent is depicted. In total, 25 times the repeat was transmitted by a male and 25 times by a female. When transgenic litter mates were crossed, we scored them for both male and female transmission. In this way 101 transgenic males and 98 transgenic females were generated.

Table 1 Transgenic offspring of transgenic mouse strains

In addition, the homozygous transgenic mice from the last generation were crossed with wild-type mice. For each line, 1 female and 1 male mouse is crossed with a wild-type animal. The next generation shows 64 heterozygous mice, 35 males and 29 females.

Repeat Stability

For all transgenic animals, radioactive PCR was performed to score for possible (small) differences in repeat length. All transgenic animals showed exactly the same size of PCR product, indicating that the repeat inherited stably (fig. 2). Southern blot analysis was performed on a number of samples to detect possible larger expansions especially when the radioactive PCR failed or gave a weak signal. Also by Southern blot analysis, no different repeat length could be identified indicating that the (CGG)81 repeat inherited stably in all mitosis and meiosis tested. LacZ expression in testes was tested on the three different strains. LacZ expression was found in testes of the B7 line. Strain B29 failed to show expression due to a deletion in the LacZ gene in the transgenic mice. Strain B6 did not show LacZ expression for unknown reasons. No methylation of the FMR1 promoter region was detected in this transgenic strain (data not shown).

Fig. 2
figure 2

The size of the (CGG) in the three independent transgenic lines was determined by PCR analysis. P shows the size of the (CGG) repeat in the original plasmid: (CGG)81.

Discussion

In this study we describe the inheritance of a CGG repeat of 81 repeat units in the human FMR1 promoter when present as a transgene. Expanded repeats of this size show instable inheritance in humans upon transmission. To our knowledge, instability of the CGG repeat in mice has not been observed. We used a repeat length of 81 repeat units. The longest pure CGG tract within this repeat is 60 CGGs. This number has shown to be sufficient to cause instability in humans [23]. The transgenic mice for the CGG repeat showed stable inheritance in all mitosis and meiosis studied, regardless of the sex of the transmitting parent. There might be a number of explanations for this.

In the murine Fmr1 CGG repeats of 9–12 repeat units have been identified in different mice strains. The size of the repeat necessary to cause instability in mice is not known. It is not known whether CGG repeat instability in mice exists. It might be a human-specific phenomenon. We assumed that underlying mechanisms causing instability would be present in humans and mice. Transgenic mice containing an expanded repeat in the androgen receptor cDNA, the Huntington cDNA or the SCA1 cDNA showed stable inheritance through many meiosis [2426], but recently, studies for other trinucleotide diseases, such as Huntington and myotonic dystrophy, have shown that small changes in repeat length can occur [2729].

Expansions of the CGG repeat in humans are only found at the 3′ end of the repeat [23, 30]. The observation of AGG triplets interrupting the CGG tract has led to the suggestion that these interruptions provide stability to the repeat [23, 31]. Likewise, instability might be the result from loss of AGG interruptions or from the growth of the repeat at the 3′ end. It has been proposed that repeats with an uninterrupted tract of 34 triplets are becoming unstable [23]. The CGG repeat in our construct contains a pure CGG tract of 60 repeat units, and therefore this length should be sufficient to cause instability. Downstream of the pure (CGG)60 tract a CAG triplet is found followed by a tract of 8 CGG repeats. This CAG interruption, although present in some mammals [32], has never been detected in humans. It is therefore likely that this CAG triplet originated from the cloning procedures, either in the Escherichia coli bacteria or as a PCR artifact. It is possible that this CAG interruption, like AGG interruptions in humans, has a stabilizing effect on the repeat.

The site of integration of the transgene might determine the stability of the repeat. Sequences in the interrupted chromosomal region might influence the behavior of the transgene. The FMR1 gene is normally present at the X-chromosome, while the transgene was found to be present at one of the autosomes. Since we do not know which mechanisms are involved in CGG expansions, the X chromosome might play an important role. Possibly mechanisms involved in X-inactivation might play a role in the stability of the repeat. Futhermore, flanking sequences of the FMR1 promoter might be important in generating instability. The region we used as promoter is sufficient to allow normal gene expression as described by Hergersberg et al. [19] for the fusion gene, but the behavior of the repeat upon replication might be influenced by additional flanking sequences. A way to circumvent these influences would be to generate a mouse model with an expanded CGG repeat in the endogenous Fmr1 gene. However, this is for technical reasons extremely difficult. Homologous recombination by itself is not difficult [12], but it is hard to generate a large amount of a long CGG repeat by PCR; a repeat up to 120 units can be amplified with the technique used, but the product can only be visualized by radioactive PCR. To achieve homologous recombination, the repeat has to be cloned in a fragment of DNA containing the promoter and propagated in E. coli. The repeat in E. coli is very unstable, and it is almost impossible to isolate enough DNA with a long repeat necessary for homologous recombination.

The variation in the CGG repeat is polar, and it has been found that the repeat is unstable at the 3′ end [23]. This might be influenced by the direction of replication of the repeat sequence. The direction of replication has shown to be important for inheritance of CGG repeats in E. coli. When the CGG strand is in the leading strand, it tends to behave more stable. The repeat is highly instable, showing deletions and expansions when the CCG strand is in the leading strand [21, 33]. This might also play a role in mice and humans as well. Although we describe three independent transgenic strains, the stable inheritance can still be a consequence of the site of integration and the direction of replication in that region. It has been suggested that slippage events during replication are causing the instability of the CGG repeat. Misalignment of the repeat during replication might lead to continued polymerization or skipping of template sequences. If this is followed by an incorrect repair, repeat expansion can occur. Crossing of transgenic animals carrying CGG repeats and mice deficient in mismatch repair systems might reveal pathways that play an essential role in generating repeat instability.

In this study, we were not able to generate a mouse model to study the instable behavior of the CGG repeat present in the FMR1 promoter. It is known that repeat expansions occur somewhere between gametogenesis and early embryogenesis. To study this, an animal model is required. Only then it will be possible to look at specific time points during development.

Furthermore, the differences in repeat stability upon paternal and maternal transmission and the importance of the methylation of the repeat remain to be elucidated. These differences can only be studied when a lot of material at specific points of development is available. The only way to obtain this material is the presence of an animal model which will enable us to study the behavior of the repeat and possibly to identify factors involved in the mechanism of CGG repeat expansion.