Background & Summary

Williams-Beuren Syndrome (WBS) is a neurodevelopmental disorder caused by a hemizygous deletion of 1.5 Mb segment occurring in approximately 95% of cases and a larger 1.84 Mb deletion observed in about 1 of 20 cases1,2. Clinical main features comprise, distinctive facial features (elfin face)3,4, supravalvular aortic stenosis, connective tissue anomalies, hypertension, infantile hypercalcemia5, dental, kidney and thyroid abnormalities, premature ageing of the skin6, impaired glucose tolerance and silent diabetes2,7. The cognitive hallmark includes mental retardation, hypersensitivity to sound due to the absence of acoustic reflexes and hypersociability8,9. While the primary cause of WBS is well understood10, we still know little about the molecular basis of the phenotype. The first genome-wide transcription study performed in primary fibroblasts from eight individuals with WBS resulted in set of candidate pathways mis-regulated in WBS possibly involved in associated phenotypes2.

To facilitate the study of genes involved in WBS, we generated and transcriptionally profiled of mouse embryonic stem (ES) cells11,12 with inducible expression of the three GTF-transcription factors (GTF2IRD1, GTF2IRD2 and GTF2I) together with the translation initiation factor Eif4h (the human homolog is known as WBSCR113,14. The ES properties to self-renew15 and to differentiate in the three germ layers16,17 have made these cells a unique in vitro system for studying the molecular mechanisms that regulate lineage specification. The three GTF-family members are all highly expressed in the brain. Mouse hemizygote models for GTF2I and GTF2IRD1 present cognitive and behavioural phenotypes associated with WBS3,4, moreover GTF2I deletion is known to be associated with increased sociability while the GTF2I duplication results in increased separation anxiety18,19. Targeted Gtf2IRD1 knockout mouse is known to cause the up-regulation of growth factors and other genes involved in brain development and cellular proliferation which may be linked with the extreme thickening of the epidermis observed in the mouse model20. Moreover it has been reported that the transgenic expression of each of the three family members in skeletal muscle causes significant fiber type shifts21. Finally, WBSCR1, the human homolog of Eif4h, is known to contribute to neuroanatomical WBS deficits22: in vivo studies on knockout mice displayed growth retardation, a smaller brain volume, a reduction in both the number and complexity of neurons and severe impairments of fear-related associative learning and memory formation22.

In a previous study on Down Syndrome, we generated a collection of mouse ES clones capable of the inducible expression of 32 mouse genes (orthologs of human chromosome 21 genes) under the control of the tetracycline-response element (tetO)14. Here we used the same approach exploiting the ROSA-TET system23 to generate 12 mouse ES clones carrying the 4 Open Reading Frames (ORFs) of the GTF-transcription factors (GTF2IRD1, GTF2IRD2 and GTF2I) together with the translation initiation factor Eif4h (Fig. 1). Three positive clones (Supplementary Fig. 1) for each gene were selected and grown in medium deprived of tetracycline (Tc) to perform an induction time course. RNA was extracted (Supplementary Fig. 2) from each clone at the time-point of maximal expression (24 hrs, Supplementary Fig. 3) and total RNA extracted from un-induced clones used as control. Total RNA was profiled by Affimetrix microarrays (the whole set of results is available in the GEO database [GSE9670124])25,26. This analysis was performed to detect differentially expressed genes (that is, in induced versus non-induced cells, Supplementary Fig. 4) in ES cells modeling the WBS.

Methods

Generation of recombinant WBS-ES clones

The generation of recombinant WBS-ES clones started with the modification of the cell line EBRTcH3 (EB3) as described in14. The cells were cultured in ES media supplemented with the leukemia inhibitory factor (LIF), at 37 °C in 5% CO2. The ES media contained DMEM high glucose (Invitrogen, Catalog No. 11995–065) supplemented with 15% fetal bovine serum defined (hyClone, Catalog No. SH30070.03), 0.1 mM non-essential aminoacids (Gibco-Brl, Catalog No. 11140–050), 0.1 mM 2-mercaptoethanol (Sigma, Catalog No. M6250) and 1,000 U/ml ESGRO-LIF (Millipore, Catalog No. ESG1107). The basal expression of the transgenes in each stable clone was assured by the growth of the cells in this ES media +LIF supplemented with 1 μg/ml Tetracyclin (Tc) (Sigma, Catalog No. T7660). The selection of positive recombinant clones was assured by the growth of the cells the ES media (+LIF and +Tc) supplemented with 1.5 μg/ml of Puromycin (Puro, Sigma, Catalog No. P9620). After trypsinization (Trypsin-EDTA solution 10x, Sigma, Catalog No. T4174) the cells were plated 1 day before the nucleofection on a layer of 0.1% Gelatin (Gelatin Type I from porcine skin, Sigma) in 100-mm dishes (Nunc, Catalog No. 150350) in ES media (+LIF and +Tc). For Nucleofection protocol 2 × 106 cells were counted for each sample. Plasmids were prepared using Qiagen plasmid Midi-kit (Catalog No. 12145): 5–6 μg of pPthC vector in which the ORF of interest were cloned were incubated with 3 μg of pCAGGS-Cre vector27 and 100 μl of Mouse ES Cell Nucleofector Kit (Amaxa, Catalog No. VPH-1001) was added to the plasmid mix as described in14. Cells were then incubated for 15 minutes at room temperature in complete medium and then plated. The day after, the cells were washed twice with PBS (Dulbecco Phosphate buffered Saline 1x, Gibco, Catalog No. 14190), and switched to selection media (+LIF +Tc +1.5 μg/ml Puro). The colonies were grown for one week before they were individually trypsinized and transferred to 96-well U-bottom plates (Nunc, Catalog No. 163320), then each clone was equally distributed among two gelatin-coated 48-well plates for selection in “selection media” (ES media +LIF and +150 μg/ml Hygromicin B in PBS, (Invitrogen, Catalog No. 10687–010)): the clones resistant to selection media and in parallel dead in selection media were isolated, replicated in 12-well plates (Nunc, Catalog No. 150628) and then in 6-well plates (Nunc, Catalog No. 140675) to extract the genomic DNA using standard conditions.

Cloning strategy

Each human coding sequence was cloned from the ATG to the stop codon without the 5′ and 3′ UTRs. For the 4 WBS ORFs, we cloned the longest annotated coding sequence (NM_001368300 for GTF2IRD2; NM_001199207 for GTF2IRD1; NM_032999 for GTF2I; NM_022170 for WBSCR1). The exchange vector pPthC-Oct-3/4was modified as described in14 and the epitope 3xFLAG was designed to be in frame with the stop codon of each ORF. The cDNAs were amplified using the plasmids as templates by PCR in standard conditions: the forward and reverse primers were designed to include in the sequence the restriction sites recognized by the enzymes AscI and PacI at the 5′ and 3′ ends, respectively (Supplementary File 1). After digestion with specific restriction enzymes, the cDNA fragments were cloned into pTOPO-bluntII (Invitrogen, Catalog No. K2875J10), and then the cDNAs was cleaved by AscI-PacI. The fragments obtained by digestion were separated from pTOPO-bluntII as described in14, the purified cDNA fragments were then inserted into the appropriately digested and purified pPthC vector23. The Escherichia coli positive clones were selected by enzymatic digestions and then sequenced by using the universal M13Fw primer and, for longer sequences, internal forward primers specific to the gene of interest.

Induction of transgene expression

The induction of the 4 transgenes’ expression to Tc was verified on three positive clones for each WBS gene of interest. The complete removal of Tc results in sufficient induction of the Tet-off system as decribed in28. Cells to be induced were grown in medium deprived of Tc to perform a time course of induction (17, 24, 39 and 48 hours), by using the growth in Tc as control, time 0. Total RNA was extract at each time point of the time course and at time 0 and then 1 μg of each reverse-transcribed as described in14. The levels of each transcript was measured by Real-time RT-PCR experiments by using Light Cycler 480 Syber Green I Mastermix (Roche, Catalog No. 04887352001) for cDNA amplification and in LightCycler 480 II (Roche) for signal detection. RT-PCR results were analyzed using the comparative Ct method normalized against the housekeeping gene Actin B (refer to Supplementary File 2). All primer pair sequences used for RT-PCR are available in Supplementary File 2. For the time course of induction of the GTF2I clones (named C6, B3, A1), of the GTF2IRD1 clones (named D1, D2, D4), of the GTF2IRD2 clones (named A3, A4, A5) and of the WBSCR1 clones (named A3, A4, A1) refer to Supplementary File 3, Supplementary Fig. 3 and Online-only Table 1.

Microarray hybridization, data processing and statistical analysis

The preparation of the RNA’ samples for the microarray hybridization on the Affymetrix GeneChip Mouse Genome 430_2 array was described in14. Low-level analysis was performed by robust multiarray average (RMA) implemented using the RMA function of the Affymetrix package of the Bioconductor project29,30 in the R programming language31. The low-level analysis for the BAMarray tool (v3.0) was performed using the MAS5 method as described in14 and implemented using the corresponding function of the same Bioconductor package. For each gene, a t-test was used on RMA normalized data to determine the differentially expressed genes (induced versus uninduced). P-value adjustment for multiple comparisons was done with the FDR of Benjamini-Hochberg32 (threshold FDR <0.05, refer to Supplementary File 4 and Supplementary Fig. 4).

Accession codes

The whole set of results is available in the GEO database25,26 as “A transcriptomic study of Williams-Beuren syndrome associated genes in mouse embryonic stem”, SuperSerie code GSE9670124 (Supplementary File 4, Supplementary Fig. 4 and Online-only Table 1). The title of the SuprSeries is “Expression data from inducible ES stable cell line overexpressing the human GTF2IRD1, GTF2IRD2, WBSCR1, or GTF2I”. In details: 1) GSE95267 refers to expression data from inducible ES stable cell line overexpressing specifically the human gene GTF2IRD1; 2) GSE95268 refers to expression data from inducible ES stable cell line overexpressing specifically the human gene GTF2IRD2; 3) GSE95269 refers to expression data from inducible ES stable cell line overexpressing specifically the human gene WBSCR1; 4) GSE95270 refers to expression data from inducible ES stable cell line overexpressing specifically the human gene GTF2I Fig. 1.

Fig. 1
figure 1

Flowchart of experimental design of this study.

Data Records

The whole set of results is available in the GEO database25,26 as “A transcriptomic study of Williams-Beuren syndrome associated genes in mouse embryonic stem”, SuperSerie code GSE9670124.

Technical Validation

The overexpression of the 4 selected WBS genes was based on the inducible expression by means of a tetracycline-repressible promoter (tet-off system). The first validation of the system was based on the cloning of the luciferase (Luc) into the exchange vector as described in14, the second was the establishment of the expression of the YFP reporter gene, which is separated from the Luc gene in the recombinant locus by an IRES sequence, by detecting a comparable level of the YFP expression and protein accumulation following induction14. The study of the growth properties of our mES line (EB3) compared to the parental line (E14) (data not shown) and the ability of these cells to differentiate in the three main germ layers was also performed in14: in details the down-regulation of the pluripotens’ marker Oct3/4 was also confermed in the EB3 as well as a farther induction of the mesodermal (Brachyury), ectodermal (Gfap) and endodermal (Afp) markers during mES differentiation. Collectively these data suggest that the system we chose allows the efficient and long-term overexpression of the transgene in a dose and time-dependent manner. It is therefore suitable for systematic expression of WBS cDNAs. The positive clones overexpressing the 4 selected WBS genes were identified by PCR using the primer pair used in previous studies13,14: 5′-GCATCAAGTCGCTAAAGAAGAAAG-3′ and 5′-GAGTGCTGGGGCGTCGGTTTCC-3′ (Supplementary Fig. 1).