Introduction

Cleft lip and/or palate (CL/P) is a common congenital malformation affecting approximately 2 per 1000 newborns worldwide (Murray, 2002) and causing a major psychosocial and economic burden for the families. Gaining insight into the etiology of CL/P is important as it may lead to improved diagnosis, counselling and perhaps preventive measures in the future. Although CL/P is a complex disease caused by both environmental and genetic factors, most studies point to genetic factors as the major determinants (Christensen and Mitchell, 1996). Recently, several genes implicated in CL/P have been identified (Jugessur and Murray, 2005), but they can only account for a fraction of the etiology.

Moreover, epidemiological studies point to that CL/P subgroups may have different etiologies, as isolated cleft palate (CP) and combined cleft lip and palate (CLP) seldom occur in the same families (Fogh-Andersen, 1942), and isolated cleft lip (CL) and CLP display different sex ratios and prevalences of associated malformations (Harville et al., 2005).

To gain more insight into the genetic etiology of CL/P we studied expression profiles in lip and palate tissue from patients with different types of orofacial cleft: CL, CP and CLP, rationalizing that the epidemiological differences may reflect differences in the underlying molecular processes.

Results

Affymetrix GeneChip analysis

We identified 11 genes which were differentially expressed in the palate tissue from CP and CLP respectively: CD14 antigen, Serglycin (PRG1), Immunoglobulin lambda constant 2, Osteopontin/secreted phosphoprotein 1 (SPP1), Chemokine receptor 4 (CXCR4), Adenosine monophosphate deaminase 2 (isoform L) (AMPD2), Regeneration associated muscle protease, Hypothetical protein MGC27165, Immunoglobulin kappa constant, Likely ortholog of rat vacuole membrane protein 1, Hypothetical protein LOC124773 (Table 1).

Table 1 Differently expressed genes (P < .05) on Affymetrix GeneChip in palate tissue from patients with cleft palate only (CP) versus palate tissue from patients with cleft lip and palate (CLP), selected genes validated by QPCR. Accessions numbers are from the Affymetrix GeneChip analysis. Functions are from the functions of the human proteins listed in Gene Ontology (GO), http://www.geneontology.org/. The COGENE database was searched for expression in the relevant tissues (see Methods section) in Affymetrix chip analyses and SAGE libraries, A, absent; P, present; N.A., not available. A SAGE tag had to be identified twice or more to be considered present

We identified 19 genes differentially expressed in the lip tissue from CL and CLP respectively: Bone marrow stromal cell antigen 2 (BST2), SMART/HDAC1 associated repressor protein (SPEN), Deiodinase, iodothyronine type II (DIO2), Cytochrome c oxidase subunit VIIa polypeptide 1 (COX7A1), Butyrophilin, subfamily 3, member A3 (BTN3A3), Crystallin, alpha B (CRYAB), gb:M27487, gb:AA770596, Major histocompatibility complex, class II, DQ alpha 2 (HLA-DQA1), Heat shock protein, alpha-crystallin-related, B6 (HSPB6), Protein kinase C and casein kinase substrate in neurons 3 (PACN3), EGF-like-domain, multiple 6 (EGFL6), Calsequestrin 1 (CASQ1), gb:AF116709, Nuclear factor I/X (CCAAT-binding transcription factor) (NFIX), Kinase interacting stathmin (UHMK1), Hypothetical protein (CMYA1), Hypothetical protein MGC34032, EH domain binding protein 1-like 1 (EHBP1L1) (Table 2).

Table 2 Differently expressed genes (P < .05) on Affymetrix GeneChip in lip tissue, from patients with cleft lip only (CL) versus lip tissue from patients with cleft lip and palate (CLP), selected genes validated by QPCR (ns is non-significant). Accessions numbers are from the Affymetrix GeneChip analysis. Functions are from the functions of the human proteins listed in Gene Ontology (GO), http://www.geneontology.org/. The COGENE database was searched for expression in the relevant tissues (see Methods section) in Affymetrix chip analyses and SAGE libraries, A, absent; P, present; N.A., not available. A SAGE tag had to be identified twice or more to be considered present

QPCR

While the GeneChip results from the palate tissue were verified by QPCR (Table 1), the results from the lip tissue could not be verified as it only showed a non-significant trend (Table 2), and therefore we focus on the results from the palate tissue in the following discussion.

Immunohistochemistry

One important bias in our study is that the tissue samples investigated are not from the foetal period where the development of the lip and palate takes place but from fully developed children aged 4-16 months. To investigate if the genes identified in the GeneChip analysis were also expressed during human development, we performed immunohistochemical staining in craniofacial tissue from human embryos (6th to 11th week of gestation). Immunostainings against the selected proteins osteopontin (SPP1), chemokine receptor 4 (CXCR4) and serglycin (PRG1) in frontal sections showed a characteristic, and in many respects complementary expression during initial growth and elevation of the palatal shelves (corresponding to 6-8 gestational weeks, Figure 1A and B), and during fusion of the palatal processes and removal of the medial edge epithelium at the site of fusion (corresponding to 9-11 gestational weeks, Figure 1C and D). PRG1 protein expression was only present during the palatal fusion and medial edge epithelium breakdown (Figure 1E). Together, this indicates that all the three chosen genes were present in the human palate during fusion and therefore may play a role in normal palate development.

Figure 1
figure 1

Immunohistochemical staining of human embryonic craniofacial sections. (A) During elevation (6th-8th week embryos) of the palatal shelves osteopontin (SPP1) protein is present in the palatal mesenchyme. (B) Chemokine receptor 4 (CXCR4) was present in the epithelial lining of the oronasal cavity and the palatal shelves. (C) During fusion (9th-11th week fetuses) of the palatal shelves SPP1 is present in the palatal mesenchyme and (D) CXCR4 is present in the epithelium of the oronasal cavity, including the medial edge epithelium, where the palatal shelves meet and fuse. (E) Serglycin (PRG1) protein expression was not present in the early gestational age (6-8 weeks), but became apparent during palatal fusion in the epithelial lining of the oronasal cavity and in the medial edge epithelium. Black color indicates expression. (Hematoxylin-eosin or toluidine blue staining, primary magnification ×1.25).

In silico analysis

CD14, SPP1, CXCR4, AMPD2 and the immunoglobulin kappa and lambda constants found in the palate tissue were also identified in COGENEs Affymetrix and/or SAGE profiles, and our human protein interaction network identified a putative interaction between SPP1 and PRG1.

The palate tissue from the two CL/P subgroups CP and CLP are also from two different age groups (16 and 4 months, respectively). This may introduce a bias to our study, as the differences in gene expression may be related to either differences in age, CL/P type or both. To circumvent this problem, we studied the expression of spp1 by QPCR in wild type mouse palates (three palate samples from each of the four age groups; newborn, 1 week, 1 month and 2 months old, for information on cDNA synthesis and primer sequences, see Methods), and found an increase in spp1 expression in 1 week old mouse palates compared to the other age groups (results not shown).

Discussion

Expression analysis in human lip and palate tissue from patients with different types of cleft lip and palate has not been performed before, and epidemiological studies suggest different etiologies in the three CL/P subgroups; CL, CLP and CP (Fogh-Andersen, 1942; Harville et al., 2005). Hence, we performed the present study in order to obtain more information on the genetic etiology of CL/P in general and more information on the genetic differences in the CL/P subgroups.

Ideally, we would have preferred to compare expression profiles from patients with CL/P with samples from normal children at comparable ages, but this was unfortunately not available to us. Thus, it is not possible from the present study to determine which cleft types have up- or downregulated gene expression patterns compared to normal children, merely expression differences were detected.

Our GeneChip analysis identified 11 genes differentially expressed in palate tissue from CP and CLP (CD14 antigen, PRG1, Immunoglobulin lambda constant 2, SPP1, CXCR4, AMPD2, Regeneration associated muscle protease, Hypothetical protein MGC27165, Immunoglobulin kappa constant, Likely ortholog of rat vacuole membrane protein 1, Hypothetical protein LOC124773) (Table 1) and 19 genes differentially expressed in the lip tissue from CL and CLP (BST2, SPEN, DIO2, COX7A1, BTN3A3, CRYAB, gb:M27487, gb:AA770596, HLA-DQA1, HSPB6, PACN3, EGFL6, CASQ1, gb:AF116709, NFIX, UHMK1, CMYA1, Hypothetical protein MGC34032, EHBP1L1) (Table 2).

In the palate tissue all genes, except Regeneration associated muscle protease, were higher expressed in patients with CLP than CP and in the lip tissue all genes, except Hypothetical protein MGC34032, were higher expressed in patients with CL than CLP. The reason for these skewed results is unclear, but a possible explanation may be that the genes or some of them are working in a mutual biological pathway.

While QPCR validated the GeneChip results in palate tissue, QPCR gene expression analysis of lip tissue only showed non-significant changes. This may indicate that the differences in the molecular processes underlying CL and CLP are small or at least undetectable by the techniques used in our study. Consequently, the following discussion will focus on the genes identified as differentially expressed in the palate tissue in the two CL/P subgroups CLP and CP.

Secreted phosphoprotein 1, also referred to as osteopontin (SPP1) was expressed at significantly higher levels in the palate tissue from the CLP patients compared to the CP patients with GeneChip analysis and QPCR, and it was also identified in human embryonic palate by immunohistochemistry. Although spp1 transgenic mice do not display CL/P (Rittling et al., 1998), SPP1 appears to be a promising candidate as SPP1 is involved in ossification (Jono et al., 2000), and hence potentially involved in the formation of bone in the palate. Moreover, SPP1 has been emphasized as important for palate formation in two recent micro array studies on foetal murine palates, which demonstrated significant changes in expression of SPP1 during craniofacial development (Brown et al., 2003; Mukhopadhyay et al., 2004).

Our protein interaction network identified a putative interaction between SPP1 and Serglycin (PRG1) (Rual et al., 2005), another gene differentially expressed in palate tissue. PRG1 was also identified in immunostainings of human embryonic palate. Our demonstration of PRG1 and the recent finding that PRG1 plays a role in the ossification process by inhibiting bone mineralization in vitro (Theocharis et al., 2006), implicates PRG1 as a candidate gene for the regulation of normal palatal development.

Interestingly, several genes with a function in the immune system were differentially expressed in palate tissue from CLP and CP: CD14 antigen, CXCR4, and immunoglobulin kappa and lambda constants were all expressed at significantly higher levels in palate tissue from CLP patients compared to CP (see Table 1 for further information on biological functions of the proteins). Immunostainings showed that CXCR4 was present in the medial edge epithelium in sections of human embryonic palate, supporting a role for CXCR4 in normal palate development.

The involvement of the immune system in relation to CL/P is presently unknown, but animal studies support a role for immune related factors in craniofacial development: Immune stimulation following teratogenic exposure in pregnant mice decreased the frequency of foetal abnormalities, including cleft palate (Holladay et al., 2002). This observed "rescue" of foetal abnormalities was considered mediated by TGFβ3, IGFs and ILs (Sharova et al., 2002). TGFβ3 is a known immunomodulatory cytokine (Wahl et al., 2004) and is considered a CL/P candidate gene based on association- and linkage studies and mouse models (Murray, 2002). Moreover, previously recognized CL/P genes have functions related to the immune response, most notably, interferon regulating factor 6 (IRF6) (Taniguchi and Takaoka, 2002; Zucchero et al., 2004) and poliovirus receptor-like 1 (PVRL1) (Suzuki et al., 2000; Avila et al., 2006).

The reason why TGFβ3, IRF6 and PVRL1 were not detected in the present study may be that we identified differences in expression in the CL/P subgroups, so if TGFβ3, IRF6 and PVRL1 have similar expression profiles in both CP and CLP, we would not identify it.

Furthermore, an epidemiological study has pointed to an association between maternal infection during pregnancy and increased incidence of CL/P (Metneki et al., 2005), and together these studies suggest a link between the immune system and craniofacial development.

A cross regulation between bone metabolism and the immune system has been identified (Walsh et al., 2006), and our findings are suggestive of an interaction between the immune response and bone metabolism in palate formation, as SPP1 and PRG1 are involved in both processes (Jono et al., 2000; Theocharis et al., 2006; Kolset and Tveit, 2008).

A potential weakness of this study is that the expression profiles in the tissue collected and analyzed postnatally may not reflect embryonic expression, as the cleft-causing processes may have ended by the time of analysis. We tried to circumvent this problem by performing immunostainings in human embryonic craniofacial tissue sections, and detected all the selected proteins (SPP1, PRG1 and CXCR4), which support a role for these genes in at least normal palate formation. However, this does not necessarily indicate that these genes play a different role in CP compared to CLP, as the palate tissue samples from the CP and CLP patients were from different age groups (approximately 16 months and 4 months of age, respectively), because of differences in the surgical protocols.

In order to address this age-related problem, we studied the expression of spp1 by QPCR in wild type mouse palates and found a surge in spp1 expression in 1 week old mouse palates compared to newborn mouse, 1 month and 2 months old mouse palates. As CLP is operated earlier than CP, the presence of a similar increase in palatal SPP1 in humans during the early neonatal period could explain the higher levels of SPP1 in CLP compared to CP.

In conclusion, by Gene chip analysis and staining with selected antibodies in human embryonic palates, we found supportive evidence that osteopontin (SPP1), chemokine receptor 4 (CXCR4), serglycin (PRG1) and immune related factors may play a role in the development of the normal palate, and perhaps different roles in the development of the cleft palates in CP compared to CLP, although this is uncertain due to age differences in the CP and CLP subgroups. Our data pointed to an interesting cross regulation between immune related factors and bone metabolism, a finding which should be explored further.

A valuable lesson learned from this study is that future expression studies in cleft lip and palate should preferentially use human foetal tissue with a fixed gestational age in order to reduce the potential for the biases discussed here.

Methods

Patients

The study was approved by the local scientific ethics committee. Patients with CL, CLP (only the unilateral type) or CP only of Caucasian origin were included in the study. The patients were examined by the first author (LPJ) and were excluded from the study if other visible malformations than the CL/P were present. Blood tests or ultrasound examinations were not performed. Tissue samples were collected during the primary cleft operation, if the parents had given their informed written consent. The patients were operated by one of three surgeons at the same hospital. According to the surgical protocols the children with CL and CLP are operated at approximately four months of age and CP at approximately 16 months of age.

To achieve optimal healing and hence cosmetic and functional results, a small piece of mucosa is routinely removed during the operation from the borders of the cleft region (either lip and/or palate tissue depending on the CL/P type). In each of the three CL/P subgroups, samples were obtained from three patients. As six tissue samples were collected from the CLP patients (three lip and three palate tissue samples), the following two types of comparisons were possible in the total of 12 samples: Expression in lip tissue from patients with CL compared to lip tissue from CLP, and expression in palate tissue from CP compared to CLP. The mucosa samples from the lips and palates are primarily composed of non-keratinized epithelium and sub-epithelial stroma.

In embryological terms the lip samples were taken from mucosa developed from the maxillary (lateral) prominence on the cleft side, not from the frontonasal (median) prominence and samples from the palate were taken from the soft palate mucosa at the borders of the cleft. Only one sample from either lip and/or palate tissue depending on the CL/P type was obtained from each patient.

Affymetrix GeneChip analysis

Total RNA was extracted from the tissue using the mirVana miRNA Isolation Kit (Ambion Inc., Austin, TX). This kit provides high quality total RNA and can be used both to yield total RNA or RNA enriched for small RNAs, such as miRNAs. Initially, we planned to carry out both gene expression and miRNA expression in the tissue, but as we realized that miRNAs are expressed during embryonic development, we declined to perform the miRNA chip analysis. The quality and quantity of the extracted total RNAs were assessed by agarose gel electrophoresis, spectrophotometric ultraviolet (UV) absorbance at 260/280 nm, and Agilent Bioanalyzer analysis (Agilent Technologies, Inc., Santa Clara, CA). Gene expression was performed on Affymetrix GeneChip Human Genome U133A Plus 2.0 Array (Affymetrix, Inc., Santa Clara, CA). Labelling, hybridization, washing and image scanning were performed according to the Affymetrix standard protocol: Two µg of total RNA was used to synthesise double-stranded cDNA with the Superscript Choice system (Invitrogen) using an oligo(dT) primer containing a T7 RNA polymerase promoter (GenSet). The cDNA was used as the template for an in vitro transcription reaction to synthesise biotin-labeled antisense cRNA (IVT labelling Kit, Affymetrix Inc.). After fragmentation at 94℃ for 35 min in fragmentation buffer (40 mM Tris, 30 mM magnesium acetate, 10 mM potassium acetate), the labeled cRNA was hybridised for 16 h to Affymetrix HG-U133 Plus 2.0 arrays (Affymetrix Inc.), that contain 54,613 probe sets. The arrays were washed and stained with phycoerythrin-streptavidin (SAPE) using the Affymetrix Fluidics Station 450, and the arrays were scanned in the Affymetrix GeneArray 3000 scanner to produce raw data image files.

Normalization of gene expression values and hierachical clustering were performed using dCHIP software (Li and Wong, 2001). Transcripts were identified as being differentially expressed if the comparison of groups yielded a fold change higher than 1.5 (using the 90% lower confidence interval of the fold change), a difference of means higher than 50 and a P < .05 (two-tailed, two-sample unequal variance t-test, Welch test).

Validation by quantitative RT-PCR (QPCR)

QPCR was performed in nine selected genes (based on their molecular functions according to Gene Ontology Consortium (www.geneontology.org)) in two samples from each of the four groups of tissue; lip tissue from CL and CLP and palate tissue from CP and CLP: cDNA was synthesised with superscript II (RNase H-) reverse transcriptase (Invitrogen, Carlsbad, CA) and QPCR analysis was performed in a DNA Engine Opticon 2 (Biorad, Hercules, CA) using LightCycler FastStart DNA master SYBR GreenI kit (Roche, Indianapolis, IN). Primers were purchased from TAG Copenhagen (Copenhagen, DK). All primers are intron-spanning and designed using Oligo6 software (Cascade, CO). The identity of PCR products and primer sequence specificity was confirmed by BLAST analysis (NCBI, NIH, http://www.ncbi.nlm.nih.gov/BLAST/) and melting curve analyses. Gene expression levels were normalised to the expression of β-2-microglobulin precursor (B2M). Primer sequences (from 5' to 3') were: B2M, forward (F)-TGTGCTCGCGCTACTCTCTC, reverse (R)-CTGAATGCTCCACTTTTTCAATTCT; BST2, F-GATGGAGTGTCGCAATGTCA, R-GCCCAGCAGCACAATCA; SPEN, F-GTCCTGGCCCATCGGT, R-TTGGCCTGCAGGTAAGTGAT; DIO2, F-AGAGGGACTGCGCTGCGTCT, R-CTGGCAGCTGGCTCGTGAAA; CRYAB, F-GTTGGGAGATGTGATTGAGG, R-GCTTCTCTTCACGGGTGAT; PACN3, F-GGGTGAGGGCACTCTATGAC, R-GCCCTCAGCCTAGACTCGT; EGFL6, F-AGACATTGGCCGATTGAA, R-ATGCCAGAGGTCCTATGATG; SPP1 (human), F-CTAAGAAGTTTCGCAGACCT, R-CTATCAATCACATCGGAATG; Spp1 (mouse), F-TCTGATGAGACCGTCACTGC, R-TCGGAATTTCAGATACCTATCATCT; CXCR4, F-TCCAGTAGCCACCGCATCT, R-TGCCCACAATGCCAGTTAAG; AMPD2, F-CTCAGCTATCACCGGAATC, R-TCTTTACCTTGTGCGAGAAG.

Validation by immunohistochemistry

A total of seven human embryos (6th to 8th week) and five human fetuses (9th to 11th week) were examined. The human embryos and fetuses, which ranged from 7-60 mm crown-rump length (CRL), were obtained from spontaneous and legal abortions or ectopic pregnancies. Informed consent was obtained. Immediately following the operation, the embryos and fetuses were dissected into appropriate blocks and fixed for 12 to 24 h at 4℃ in one of the following fixatives: 10% neutral buffered formalin, 4% Formol-Calcium, Lillie's or Bouin's fixatives. The specimens were dehydrated with graded alcohols, cleared in xylene and paraffin embedded. Serial sections, 3-5 µm thick, were cut in transverse, sagittal or horisontal planes and placed on silanized slides. Representative sections of each series were stained with hematoxylin and eosin or with toluidine blue.

The paraffin sections from tissue blocks containing the craniofacial region (including the palatal shelves) were dewaxed, rehydrated and washed in Tris-buffer saline (TBS: 0.05 M Tris, 0.15 M NaCl, pH 7.6) with 0.01% Nonidet P-40 (TBS/Nonidet)). All sections were incubated in 1.5% H2O2 in TBS/Nonidet for 15 min to block endogenous peroxidase activity, and then in 10% normal goat serum in TBS/Nonidet for 30 min to block non-specific binding. Sections were incubated overnight at 4℃ with the primary antibody, diluted in 10% goat serum. Sections were incubated with (1) human monoclonal anti-CXCR4 Ig, diluted 1:1500 (RD systems MAB173), (2) rabbit polyclonal anti-SPP1 Ig, diluted 1:600 (Abcam ab33046), and (3) rabbit polyclonal anti-serglycin Ig, diluted 1:400 (kindly provided from the laboratory of Dr. Niels Borregaard, (Niemann et al., 2004)).

The primary antibodies were detected using the peroxidase revelation method (DakoCytomation EnVision+DualLink System), according to the manufacturer's recommendations. As negative controls, sections were incubated with only the secondary antibody.

In silico analysis

The Craniofacial and Oral Gene Expression Network (COGENE) is a publicly available database, comprising gene expression profiles (Affymetrix GeneChip analysis and Serial Analysis of Gene Expression (SAGE)) from human embryonic craniofacial structures. Our results in palate tissue were compared to COGENE Affymetrix expression profiles from the following structures, since these structures develop into palatal tissue: 4th week pharyngeal arch 1, 5th week pharyngeal arch 1, 6th week maxilla, and 8.5th week palatine shelves. Expression in lip tissue was compared to COGENE Affymetrix expression profiles from: 4th week pharyngeal arch 1, 5th week pharyngeal arch 1, 6th week maxilla, 4th week frontonasal prominence, 5th week frontonasal prominence, 6th week median nasal prominence, and 8.5th week upper lip, as these structures develop into lip tissue. A few COGENE SAGE profiles were relevant for the lip development: 4th week frontonasal prominence, 5th week frontonasal prominence, and 8.5th week upper lip.

Signalling pathways were analyzed using a human protein interaction network, which we have previously constructed using several network databases (Lage et al., 2007).

Electronic-database information

The URLs for data presented herein are as follows: Craniofacial and Oral Gene Expression Network (COGENE), http://hg.wustl.edu/cogene/Gene Ontology (GO), http://www.geneontology.org/Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/omim/UCSC Genome Bioinformatics site, March 2006 assembly, http://www.genome.ucsc.edu/