Introduction

Fibrillin-1 is a 350-kDa extracellular protein, a principal constituent of 10 nm microfibrils,1 and it has a variety of functions, including anchoring epithelial cells to the interstitial matrix and maintenance of elastic fibres.2, 3 Fibrillin-1 is encoded by the fibrillin-1 (FBN1; MIM# 134797) gene, which is located on human chromosome 15q21, with an estimated size of 237 kb.4, 5 There are more than 600 mutations that have been identified so far (http://www.umd.be:2030/) in association with Marfan syndrome (MFS; MIM# 154700) and other so-called fibrillinopathies, including Weill–Marchesani syndrome (MIM# 608328), Marfanoid skeletal syndrome6 and ectopia lentis (MIM# 129600).

At its 5′ end, the human FBN1 gene carries three in-frame and alternatively spliced exons, called B, A and C, as well as a common exon M.7 Exon M contains the putative initiating methionine, a Kozak consensus sequence (GGCATCATGCGG),8 which is currently being referred as the beginning of exon 1 of the FBN1 gene (http://www.umd.be:2030/). However, the presence of three alternatively spliced and in-frame exons and the absence of the peptide sequence from the extreme N-terminus of the protein generate some ambiguity regarding the site of translation initiation.7 Furthermore, a functional role of these upstream exons/introns is indicated by the association of sequence variations therein with systemic sclerosis, scleroderma and MFS.9, 10, 11

We present here the data on the extent of sequence conservation of 1.7 kb of the FBN1 5′-upstream region in humans and non-human primates, including rhesus monkey (Macaca mulatta), gorilla (Gorilla gorilla) and chimpanzee (Pan troglodytes). Our data demonstrate that there is a higher degree of conservation in the FBN1 5′-upstream exons/introns of humans and non-human primates.

Materials and methods

For analysing 1.7 kb of the 5′-upstream region of the FBN1 gene in humans and non-human primates (five rhesus monkeys, two chimpanzees and two gorillas), genomic DNA was extracted from venous blood, collected from humans (healthy blood donors), rhesus monkeys (Deutsches Primatenzentrum, Gottingen, Germany), chimpanzees (own DNA facility) and gorillas (own DNA facility), using standard protocols. The primers used in this study were taken from Singh et al.11 Standard PCR conditions were as follows: initial denaturation at 95°C for 10 min followed by 33 cycles of 96°C for 1 min, 58–62°C for 1 min and 72°C for 1 min with final elongation at 72°C for 10 min in a 50-μl reaction mixture containing 1 × buffer (Qiagen, Germany), 1 × Q solution (Qiagen), 20 pM each primer and 2.5 U Taq polymerase (Qiagen). PCR products were purified with ExoSAP-IT (USB, USA), and both strands were sequenced with BigDye Terminator chemistry version 1.1 by standard protocol (ABI, USA). Sequencing reactions were carried out at 96°C for 10 s, 50°C for 5 s and 60°C for 4 min (25 cycles) (Biometra, Germany). The reaction mixtures were further purified using DyeEx™ 2.0 Spin kit (Qiagen) and analysed on the 3100-Avant Genetic Analyzer, according to the supplier's instructions, using the sequence analysis software (ABI).

A search for transcription factor binding site was performed for human 1.7 kb of the 5′-upstream region containing alternatively spliced exons/introns using TFSEARCH (http://www.cbrc.jp/research/db/TFSEARCH.html).

Results

Homology between humans and non-human primates

A high degree of homology was seen in the 5′-upstream region of the FBN1 gene of humans and non-human primates (Figure 1 and Supplementary Figure 1). Striking features seen in the FBN1 5′-upstream exons/introns of humans and non-human primates were as follows: (1) presence of GC at splice donor site for exon B in rhesus monkeys, with all other known splice sites being conserved in humans and non-human primates (Figure 1 and Supplementary Figure 1); (2) insertion of a nucleotide C in exon A of non-human primates in comparison to humans (Supplementary Figure 1); (3) variations in the number of CGCCG repeats in intron A of humans ((CGCCG)2), chimpanzees ((CGCCG)2), gorillas ((CGCCG)5) and rhesus monkeys ((CGCCG)3) (Supplementary Figure 1); (4) an indel (indel A) polymorphism in exon C of rhesus monkeys and chimpanzees (Supplementary Figure 1); (5) conservation of Kozak consensus sequence in humans and non-human primates (Supplementary Figure 1); (6) two SNPs in the intron A +42C>G and intron C +58T>A of rhesus monkey; and (7) presence of intron C +48C in chimpanzees and gorillas and intron C +48T in the rhesus monkey sequence (Supplementary Figure 1).

Figure 1
figure 1

Schematic representation of human FBN1 upstream exons/introns. Exons B, A and C are alternatively spliced exons.5 The presumptive initiation codon is indicated by ATG in exon M. The sequence homology among humans,7 non-human primates (gorillas, chimpanzees and rhesus monkeys) and porcine7 is given above each exon/intron and splice donor sites are shown below.

Putative transcription factor binding sites identified within 5′-upstream region of human FBN1 gene

The search for transcription factor binding site in the 5′-upstream region of the human FBN1 gene revealed the presence of several putative transcription factor binding sites for MZF1, GATA-1/2/3, deltaE, Sp1, SRY, AP-4, c-Rel and Nkx-2.5 transcription factors (Table 1). The majority of the putative transcription factor binding sites identified were in intron regions of the alternatively spliced exons (Table 1).

Table 1 Putative transcription factor binding sites identified by using TFSEARCH for vertebrate transcription factors (threshold score 90) within alternatively spliced exons and associated introns of human FBN1 gene

Discussion

Direct sequencing revealed a high degree of conservation in the 5′-upstream exons/introns of the FBN1 gene in humans and non-human primates, which supports the functional relevance of this gene region. However, the presence of GG at the splice donor site of exon C of porcine (Figure 1), an indel (indel A) polymorphism in exon C of rhesus monkeys and chimpanzees, the insertion of a nucleotide C in exon A of non-human primates and the presence of GC at the splice donor site for exon B in rhesus monkeys (Figure 1) create some ambiguity about translation of these exons in these species. It is, however, noteworthy that all other confirmed human splice donor sites were conserved in non-human primates (Figure 1). Variations in the number of CGCCG repeats in intron A of humans and non-human primates show the polymorphic nature of these repeats (Supplementary Figure 1). Two SNPs seen in rhesus monkey were intron A +42C>G and intron C +58T>A; intron A +42C>G appears to be a result of evolutionary divergence, as G is present in humans and other non-human primates at the respective position, whereas the other SNP intron C +58T>A appears to be specific for rhesus monkey, and both alleles, were seen in the same frequency (allele frequency 0.50) (Supplementary Figure 1). Known human intronic variation, intron C +48 C>T, also appears to be a result of the evolutionary divergence, as intron C +48C was found in the wild-type sequence of the chimpanzees and gorillas and intron C +48T was found in the wild-type sequence of rhesus monkeys (Supplementary Figure 1). Our data on P. troglodytes were in full consensus with the available sequence from NCBI (Ptr15_WGA16806_2), except a variation where nucleotide A at position 3775016 was replaced by nucleotide G in our sequence.

The search for transcription factor-binding sites revealed the presence of several putative vertebrate transcription factor binding sites within the 5′-upstream region of the human FBN1 gene. These findings further indicate that 5′-upstream alternatively spliced exons/introns may control the FBN1 expression in a tissue-specific manner. This, however, remains speculative, as the data need to be confirmed functionally (Table 1).

In summary, the presence of 5′ alternatively spliced exons in humans, a higher degree of homology and conservation of the splicing sites across species borders indicate that this gene region may play a functional role. We were unable to define the 5′ end of the FBN1 gene; therefore, the presence of more upstream and in-frame exons cannot be ruled out. We are planning to study the expression of different 5′ alternatively spliced exons in different tissues, investigate their role in transcription and translation, identify the 5′-upstream end and confirm the translation initiation site of the FBN1 gene in humans.