Association between genetic variants in the HNF4A gene and childhood-onset Crohn’s disease

Article metrics


Hepatocyte nuclear 4 alpha (HNF4α), involved in glucose and lipid metabolism, has been linked to intestinal inflammation and abnormal mucosal permeability. Moreover, in a genome-wide association study, the HNF4A locus has been associated with ulcerative colitis. The objective of our study was to evaluate the association between HNF4α genetic variants and Crohn’s disease (CD) in two distinct Canadian pediatric cohorts. The sequencing of the HNF4A gene in 40 French Canadian patients led to the identification of 27 single nucleotide polymorphism (SNP)s with a minor allele frequency >5%. To assess the impact of these SNPs on disease susceptibility, we first conducted a case–control discovery study on 358 subjects with CD and 542 controls. We then carried out a replication study in a separate cohort of 416 cases and 1208 controls. In the discovery cohort, the genotyping of the identified SNPs revealed that six were significantly associated with CD. Among them, rs1884613 was replicated in the second CD cohort (odds ratio (OR): 1.33; P<0.012) and this association remained significant when both cohorts were combined and after correction for multiple testing (OR: 1.39; P<0.004). An 8-marker P2 promoter haplotype containing rs1884613 was also found associated with CD (P<2.09 × 10−4 for combined cohorts). This is the first report showing that the HNF4A locus may be a common genetic determinant of childhood-onset CD. These findings highlight the importance of the intestinal epithelium and oxidative protection in the pathogenesis of CD.


Inflammatory bowel disease (IBD) refers to two chronic inflammatory disorders affecting the intestinal mucosa: Crohn’s disease (CD, (MIM 266600)) and ulcerative colitis (UC, (MIM 191390)). CD is common in developed countries, with a prevalence estimated at 100–300/100 000.1, 2 The etiology of CD has not yet been elucidated, but is considered to involve a complex interaction between predisposing genes, environmental factors and impaired immune response to the commensal gut microbiome. The understanding of the genetic contribution to risk of CD has advanced enormously as a result of recent case–control and genome-wide association studies (GWAS).3, 4, 5, 6 Indeed, GWAS,7 followed by deep sequencing of GWAS loci,8 have identified 85 distinct loci associated with the disease. However, the genes identified thus far only explain 23% of the genetic contribution to CD.7

Hepatocyte nuclear factor 4 alpha (HNF4α, NR2A1) belongs to the nuclear hormone receptor superfamily.9 It is expressed in the liver, kidney, pancreatic islets and gut.9, 10, 11 HNF4α interacts with regulatory elements in promoters and enhancers of genes involved in cholesterol, fatty acid and glucose metabolism.12 Genes transactivated by HNF4α encode various transcription factors, enzymes and proteins involved in numerous processes, including hematopoiesis, blood coagulation/fibrinolysis, as well as hepatic development and function.13, 14, 15, 16, 17 HNF4A is located on locus 20q13.1–13.2. Thirteen exons have been identified, and alternative splicing of these exons result in at least nine isoforms of the protein. The transcription of three of these isoforms is driven by an alternate promoter known as P2, which is located 45.6 kb upstream P1 promoter.18, 19 It has been suggested that P2 promoter drives transcription in pancreatic β cells,18, 19 while the P1 promoter is mainly active in liver cells.18, 20 Both promoters appear to be effective in the intestine.20

The key hepatic and pancreatic functions of HNF4α are well established. It activates gluconeogenesis in hepatocytes,21 maintains glucose homeostasis by regulating gene expression in pancreatic β cells,12, 22 activates insulin genes through both direct and indirect mechanisms22, 23 and regulates the expression of many genes, such as apolipoproteins.24 Rare loss-of-function mutations in the HNF4A gene cause a monogenic form of type 2 diabetes (T2D), type 1 maturity-onset diabetes of the young (MODY1).25 Also, HNF4α has been reported to be associated with the risk of late-onset T2D in several populations.26, 27, 28 In the gut, HNF4α has a role in colonic development,29 lipid transport30 as well as intestinal epithelial cell differentiation and phenotype expression.31, 32 It has also been associated with susceptibility to abnormal intestinal permeability, inflammation and oxidative stress.33, 34 Of particular relevance, a recent GWAS demonstrated associations between the 20q13.1 locus that harbors the HNF4A gene and risk of developing UC.35 Interestingly, no associations with CD were found. In this study, we have hypothesized that HNF4A gene polymorphisms are associated with the risk of developing CD. We comprehensively examined the association between variants in and around the HNF4A gene and CD in two distinct cohorts of Canadian children.


SNP discovery by sequencing

To determine the single nucleotide polymorphism (SNP) content of HNF4α in our population, 30 selected fragments of the HNF4A gene were sequenced in 40 IBD French Canadian patients. As summarized in Table 1, sequencing of the gene led to the identification of 27 SNPs with a minor allele frequency >5%. Among the identified SNPs, one was non-synonymous (rs1800961, T130I) and 26 were located either in intronic or in promoter regions. All SNPs had been previously reported in dbSNP (build 131). Most of the variants identified in this study were previously associated with the risk of developing T2D36 and dyslipidemia.37 The relative positions of SNPs on the HNF4A locus are illustrated in Figure 1.

Table 1 Summary of the identified SNPs in the targeted HNF4A regions
Figure 1

Schematic illustration of the location of 27 SNPs identified in the HNF4A gene. Relative position of 27 SNPs revealed by sequencing within the HNF4A locus. The labeled shaded regions are exons, numbered 1–10. , Non-synonymous SNP, □, synonymous intronic SNP.

Genotyping for association with Crohn’s disease in discovery cohort

A total of 356 (271 French Canadian, 57 Jewish and 30 non-Caucasian) subjects with CD and 542 controls were included for genotyping. The descriptive and clinical characteristics of participants of the discovery cohort are shown in Table 2. There was a non-significant higher proportion of males among the cases (53.35%). The mean age at diagnosis (15.41±7.63 years) was similar to age of controls (13.67±2.72 years). Based on the Montreal Classification,38 most cases (n=224, 62.57%) had ileocolonic location (L3±L4) and inflammatory disease (B1±p) (n=287, 80.17%). The majority of the population was of Caucasian ancestry (n=271, 75.70%).

Table 2 Characteristics of controls and Crohn’s disease subjects in discovery, replication and combined cohorts

Among the 27 SNPs identified, three could not be adequately genotyped owing to technical difficulties (rs2425640, rs16988991 and rs3212184). The remaining 24 SNPs were analyzed for association. Table 3 shows the distribution of the frequencies of the corresponding alleles in cases and controls. Six SNPs demonstrated significant associations with CD: rs4810424 (P<0.007), rs1884613 (P<0.004), rs1884614 (P<0.005), rs2144908 (P<0.003), rs3212172 (P<0.044) and rs1800963 (P<0.048). Analysis including only individuals of Caucasian ancestry revealed similar results. However, the associations for two SNPs (rs3212172 and rs1800963) were no longer significant probably owing to reduced power.

Table 3 Distribution of allele frequencies for controls and Crohn’s disease subjects in discovery cohort

Genotyping for association with Crohn’s disease in replication cohort

For replication, we selected 10 SNPs significantly associated with CD in the single SNP and haplotype analyses of the discovery study. A total of 416 Caucasian subjects with CD and 1208 controls were included for genotyping. The descriptive and clinical characteristics of participants of the replication cohort are shown in Table 2. The proportion of males among the cases was higher (56.49%), but the difference was not significant. The mean age at diagnosis (12.69±3.41 years) was similar to that of controls (12.71±2.98 years). A high percentage of cases (n=200, 48.08%) had ileocolonic location (L3±L4) and inflammatory disease (B1±p) (n=365, 87.75%). All subjects in replication cohort were of Caucasian ancestry. Table 4 shows the distribution of the frequencies of the corresponding alleles in cases and controls. All SNPs were in Hardy–Weinberg equilibrium. Among the 10 SNPs genotyped for replication, rs1884613 remained significantly associated with CD (odds ratio (OR): 1.327; P<0.012).

Table 4 Distribution of allele frequencies for controls and Crohn’s disease subjects in replication and combined cohorts

Single SNP analysis in combined cohorts

The descriptive and clinical characteristics of participants of the combined discovery and replication cohorts are shown in Table 2. Association analysis revealed a significant association for three of the six SNPs associated in the discovery cohort, namely rs1884613 (OR: 1.389, P<0.0001), rs1884614 (OR: 1.295, P<0.001) and rs2144908 (OR: 1.260, P<0.006) (Table 4). After correction for multiple testing (40 tests), the association for rs1884613 and rs1884614 remained significant (P<0.004 and P<0.04, respectively).

Haplotype analysis

Linkage disequilibrium (LD) analysis (Figure 2) showed that the SNPs were distributed within six major haplotype blocks: a first block including eight SNPs overlapping Promoter 2 and spanning on a 14-kb region (rs4810424, rs1884613, rs1884614, rs6031543, rs2144908, rs6031550, rs6031551 and rs6031552); a second block of two adjacent intronic SNPs (rs6103716 and rs6031558); a third block of three SNPs (3 kb) in the intronic region between both promoters (rs6130608, rs2425637 and rs2425639); a fourth block of two SNPs (4 kb) also located in the intronic region between the two promoters (rs2071197 and rs736824); a fifth block of two intronic SNPs (rs745975 and rs3212183, respectively, in introns 1 and 2); and finally a sixth block of three SNPs (5 kb) located in introns 3 and 4 (rs3212195 and rs3212198). Table 5 shows the results of the haplotype analyses performed on the SNPs within each block of LD in the discovery cohort. One 8-marker haplotype was significantly associated with CD (haplotype IndexTermCGTCACTC, χ2=8.276, P<0.004). Subsequently, association analysis was replicated for the significant P2 promoter haplotype. In the replication cohort, the association with the IndexTermCGTCACTC haplotype remained significant (χ2=8.266, P<0.004) (Table 5). Combining both cohorts, the significant association was also replicated (χ2=19.997, P<7.755 × 10−6), even after correcting for 27 haplotype comparisons (P<2.09 × 10−4). Moreover, a second haplotype was found significantly associated with CD (IndexTermGCCCGTCA, (χ2=4.038, P<0.045)) when both cohorts were combined.

Figure 2

Illustration of the 6 major haplotype blocks in the HNF4A gene. LD plot in the HNF4A region is displayed. Haplotype analysis was carried out using HAPLOVIEW Software version 3.11.

Table 5 Distribution of haplotype frequencies for controls and Crohn’s disease subjects in discovery and combined cohorts

Oxidant and antioxidant status

To assess the oxidative status of CD patients in comparison with controls and according to their rs1884613 genotype, plasma malondialdehyde (MDA) was measured. Results show that MDA levels were significantly elevated in CD subjects compared with controls (P<0.0001) (Figure 3a), but no significant difference was noted when MDA levels were separated according to rs18834613 genotype (Figure 3b).

Figure 3

Oxidative stress status in control and CD subjects. Plasma MDA was assessed in CD patients compared with healthy controls (a) and according to their rs1884613 genotype (b). Plots indicate individual MDA levels and means±s.e.m. are specified. *P<0.0001 vs controls.

Subjects’ antioxidant profile was assessed by measuring plasma retinol, β-carotene, γ-tocopherol and α-tocopherol. Compared with controls, the plasma concentrations of β-carotene were reduced in CD (P<0.0001) (Figure 4a), whereas retinol (Figure 4b) and γ-tocopherol (Figure 4c) levels were elevated (P<0.0001 and P<0.001, respectively). No significant difference was observed in α-tocopherol levels (Figure 4d). Figure 5 shows the differences in vitamin levels according to the rs1884613 genotype in CD subjects. A tendency of lower levels of retinol, γ-tocopherol and α-tocopherol was observed in the homozygote carriers of the rare allele (G), but the differences did not reach statistical significance. Importantly, a large inter-individual disparity was observed in these experiments.

Figure 4

Antioxidant vitamins status in control and CD subjects. Plasma levels of β-carotene (a), retinol (b), γ-tocopherol (c) and α-tocopherol (d) were quantified in controls and CD patients. Plots indicate individual vitamin levels and means±s.e.m. are specified. *P<0.0011; **P<0.0001 vs controls.

Figure 5

Antioxidant vitamins status according to rs1884613 genotype. Plasma levels of β-carotene (a), retinol (b), γ-tocopherol (c) and α-tocopherol (d) were compared among the rs1884613 genotypes. Plots indicate individual vitamin levels and means±s.e.m. are specified.


This is the first study reporting an association between genetic variants in the HNF4A gene and risk for CD. In a discovery study, we found that six HNF4A SNPs were significantly associated with CD. In a replication study performed on distinct cohorts of CD subjects and controls, one SNP (rs1884613) remained significantly associated with CD. Combining both cohorts, the single SNP analysis demonstrated significant associations for three of the six SNPs (rs1884613, rs1884614 and rs2144908), due to the gain in power. The associations for rs1884613 and rs1884614 remained significant after correcting for multiple testing. Moreover, haplotype analyses underlined the association between CD and a 8-marker haplotype containing the SNPs found to be associated in the single SNP analysis.

In line with our findings, recent studies have provided evidence for a role of HNF4α in inflammation.33 Our group has previously explored the effects of HNF4α knockdown gene expression in an intestinal epithelial cell model and found that reduced HNF4α gene and protein expression amplified lipid peroxidation, reduced cellular antioxidant defences and increased cellular vulnerability to iron-ascorbate-generating oxidative stress.34 In line with our observations, HNF4α expression was significantly decreased in patients with IBD.39 Furthermore, dextran sulfate sodium-induced colitis was more severe in the intestine-specific HNF4α knockout mouse model that was characterized by an increase in pro-inflammatory cytokines.39 Darsigny et al.33 reported that loss of HNF4α affects colonic ion transport and causes chronic inflammation resembling IBD in a knockout mouse model. Finally, a crosstalk between HNF4α and NF-κB was reported,40, 41 supporting its role in inflammation.

We believe our findings are of high interest in view of the association between the HNF4A region and the risk of UC revealed in a whole genome study.35 This association was seen at rs6017342, which maps 5 kb distal to the 3′-untranslated region of the HNF4A gene, within a recombination hot spot. However, rs6017342 was not in high LD with the identified variant associated with CD in our study (rs1884613). In fact, none of the SNPs associated with CD in the discovery study were in strong LD with rs6017342, which can be explained by the fact that rs60317342 is located within a recombination hot spot. In addition, in the GWAS United Kingdom (UK) cohort, the rs60317342 locus did not show any association to CD, suggesting that different signals on the HNF4A gene are associated with different types of IBD. Hence, it is possible that the associations are independent and it is also probable that they may even be linked to different genes within the 12q12–13 region. Cryptic differences in the genetic structure of the French Canadian ‘founder’ population, compared with the UK population used in the GWAS, could also explain the different associations in the HNF4A locus. Moreover, it has been put forward that some genes/loci may be specific to early onset CD patients42 and that new variants in many genes could have been missed by GWAS in this specific population.43

Under the control of its two promoters, the HNF4A gene encodes a total of nine isoforms44 with various 3′ truncations. The liver-specific P1 promoter drives the expression of transcripts HNF4α1–6, which include exons 1A and 2–10 (HNF4α1–3) or exons 1A, 1B and 2–10 (HNF4α4–6). Transcripts HNF4α7–9 are expressed from the pancreatic P2 promoter located 46 kb upstream of the HNF4α transcription start and exhibit splicing of the upstream exon 1D to exon 2, without the inclusion of sequences from either exon 1A or 1B.45 The observed genetic variations in our study suggest a contribution of the P2 promoter in HNF4α implication in regulating inflammatory processes.

In our study, the P2 promoter variant rs1884613 was the only one that was replicated in a second independent cohort of cases and controls. This P2 promoter genetic variant has been associated with type 2 diabetes mellitus (T2DM) in several studies, pointing out to HNF4α’s potential role in inflammation. In fact, rs1884613 was found to be associated with T2DM in Ashkenazi,46 Mexican American,47 and Scandinavian populations.48 Moreover, a link between rs1884613 and insulin resistance was noted.49 However, the association with T2DM was not replicated in UK46 and a Finnish population,36 nor in a broader meta-analysis with additional populations.50

The identification of HNF4α, which has been associated with MODY1 and T2DM, as a CD-susceptibility gene is in line with the recent concept of shared genetic determinants for clinically distinct disorders.51 GWAS have identified several genes conferring susceptibility to multiple conditions, such as CD, ankylosing spondylitis, rheumatoid arthritis, systemic lupus erythematosus and type I diabetes.52 It has been suggested that there may be a general set of susceptibility genes for autoimmunity, which are modulated by disease-specific genes, as well as the host’s human leukocyte antigen status. A specific combination of polymorphisms, combined with environmental factors, could determine the type of disease developed by a subject.53

To predict the effect of the P2 promoter SNP rs1884613, we investigated the impact on putative transcription factor-binding sites. Our in silico analyses show that variations in that SNP could theoretically modify the binding of the ras-responsive element binding protein 1 (RREB1), a transcription factor involved in DNA repair by modulating p53 transcription54 and associated with immune tolerance.55 Thus, studying the impact of rs1884613 and other P2 promoter SNPs on HNF4α gene expression and function might help understand the role of this gene in inflammation and IBD.

During liver development, HNF4α regulates the expression of cell adhesion proteins.56 It also provokes the expression of tight-junction adhesion molecules and the modulation of subcellular distribution of junction and cell polarity proteins, resulting in junction formation and epithelial polarization in embryonal carcinoma cells.57 Moreover, using an adult mouse model lacking HNF4α in the intestinal epithelium, HNF4α was shown to have a pivotal role in the homeostasis of the intestinal epithelium, in the epithelial cell architecture, and in intestinal barrier function.58 These results underline the potential role of HNF4α in epithelial integrity in IBD physiopathology.

In an attempt to explore the mechanisms behind the rs1884613-(G/G) haplotype, we measured oxidative stress biological markers in controls and CD subjects. CD patients displayed higher oxidative stress status, as documented by the elevated MDA levels and the reduced β-carotene. Yet, the average plasma γ-tocopherol was increased in subjects with CD; such elevation in CD was previously described in the literature.59 Although no significant difference was observed in MDA and vitamin levels in the case of rs1884613 genotype, an apparent trend was noted for the levels of retinol, γ-tocopherol and α-tocopherol when compared with CC and CG genotypes. Discriminating patients according to C-reactive protein levels or disease activity could not contribute to explain the differences in antioxidant levels (data not shown). Given the limited number of patients with the rare genotype available in our study, larger cohorts are needed to focus on this aspect.

In conclusion, our results suggest that the HNF4A locus may be a common genetic determinant of CD, but its relative contribution may differ between populations. Further replication of these data in international IBD cohorts is necessary to estimate the effect of the HNF4α polymorphisms on risks for CD and UC. Functional studies are also necessary to investigate the impact of the aforementioned genetic variants on HNF4α protein functions.

Materials and methods


For the discovery cohort, patients were recruited from the IBD clinics of tertiary pediatric and adult hospitals in Montreal (CHU-Sainte-Justine, Montreal General, Royal Victoria and Montreal Children’s Hospitals) between 30 June 2008 and 30 January 2010. For the replication cohort, patients were those diagnosed and followed at the pediatric gastroenterology clinics of three hospitals across Canada: CHU-Sainte-Justine, Montreal; the British Columbia’s Children’s Hospital, Vancouver; and the Children’s Hospital of Eastern Ontario, Ottawa. These patients were recruited from 1 January 2003 to 30 June 2011. The diagnosis of CD was confirmed based on standard clinical, endoscopic, radiological and histopathological criteria.60, 61 Clinical and demographic information acquired included age at diagnosis, gender and ethnicity. Disease location and clinical phenotype were classified according to World Gastroenterology Organization’s Montreal classification (L1, ileum; L2, colon; L3, ileocolon; L4, upper GI tract; B1, non-stricturing and non-penetrating; B2, structuring; B3, penetrating; p, perianal modifier).38 The designation of French Canadian, Jewish or other ethnicity was based on self-report. Self-identified race/ethnicity has previously been shown to highly correlate with genetic cluster categories.62 For all patients, blood or saliva was collected for DNA analysis. Controls were chosen from the 1999 Quebec Child and Adolescent Health and Social Survey, a school-based survey of youth aged 9, 13 and 16 years providing DNA samples.63 The institutional Ethics Review Boards of all centers approved the study and informed consent was acquired from all participating subjects.

DNA extraction

Genomic DNA was prepared from white blood cells, total blood or saliva with the Puregene DNA Isolation kit (Gentra Systems, Qiagen Inc., Toronto, ON, Canada) using methods described by the manufacturer.

DNA variants detection by direct sequencing

To identify SNPs present in our population, we first sequenced the HNF4A gene in a total of 40 French Canadian patients diagnosed with childhood-onset IBD (20 CD and 20 UC patients). The sequencing targeted the coding regions, the P1 promoter region (2.5 kb upstream exon 1a) and other regions containing SNPs previously associated with the risk of developing diseases, such as T2D and dyslipidemia.26, 36, 37, 50, 64, 65 In total, 30 fragments were sequenced. Genomic DNA (2 ng) was amplified in a total volume of 50 μl volume using 5 μl PCR buffer (10 × ), 1.5 μl MgCl2 (50 mM), 2 μl dNTPs (2.5 mM), 0.4 μM of each corresponding primer (25 μM) and 1.0 units of Platinum Taq DNA Polymerase (Invitrogen, Carlsbad, CA, USA). The PCR amplifications were performed using a GeneAmp PCR System 9700 (Applied Biosystems, Carlsbad, CA, USA) under the following profile: 35 cycles of amplification were used at 95 °C for 30 s, 58 °C for 30 s and 72 °C for 45 s. Amplicons were verified on standard ethidium bromide stained 1.5% agarose gel. The specific primers for each fragment and the amplicon size are available upon request. Amplified fragments were sent to the McGill University Genome Quebec Innovation Center in Montreal for sequencing using Applied Biosystem’s 3730 × l DNA Analyzer technology. Complete sequences were aligned, assembled and compared using the MultiAlign software.66 Visual inspection of chromatograms was used for identification of each candidate SNP.


Discovery cohort

Based on sequencing results, identified SNPs were genotyped using the Luminex × MAP/Autoplex Analyser CS1000 system (Perkin Elmer, Waltham, MA, USA). The 27 selected SNPs were amplified in a single multiplex assay and hybridized to Luminex MicroPlex–xTAG Microspheres67 for genotyping using allele-specific primer extension. Amplification and reaction conditions are available upon request. Allele calls were assessed and compiled using the Automatic Luminex Genotyping software.68 For quality control purposes, genotyping of a systematic random sample of 20% of the specimens was repeated.

Replication cohort

Replication genotyping was performed on the SNPs significantly associated with CD in the discovery study (in the single SNP and haplotype analyses). In total, 10 SNPs were genotyped using Sequenom-based primer-extension methods. These methods are designed for high-throughput SNP genotyping. The platform has a high assay conversion rate (85%), high genotyping success rate (95%) and minimal error rates (0.5–1%). Genotyping was carried out at the McGill University and Genome Quebec Innovation Center in Montreal.

Biological studies

Blood samples

In order to examine the levels of plasma MDA and antioxidant vitamins, blood samples were collected in tubes containing EDTA 1 gl−1. Plasma was separated immediately by centrifugation (700 g for 20 min at 4 °C). CD patients were characterized according to their rs1884613 genotype.


The amount of free MDA in plasma was determined by HPLC in 48 CD patients and 213 healthy controls using an improved method previously described by our unit.69

Antioxidant vitamins

The antioxidant profile was determined by measuring antioxidant vitamin levels (β-carotene, retinol, γ-tocopherol and α-tocopherol) in 45 CD patients and 112 healthy controls using an improved method previously described by our unit.70

In silico analysis

To explore the potential interaction between transcription factors and the HNF4α P2 promoter polymorphism rs1884613, we performed in silico analyses using the Genomatix MatInspector program (Genomatix Software GmbH, Munich, Germany) with a standard (0.75) core similarity. Transcription factor recognition site sequences were identified in the HNF4A gene region containing the SNP.

Statistical analysis

Potential genotyping errors were assessed using χ2-tests, which evaluate the deviation of each SNP from Hardy–Weinberg equilibrium. Allelic association for individual SNPs was carried out using logistic regression by fitting an additive model. Genotype and allele frequencies were compared between cases and controls using χ2-tests and Fisher’s exact tests where appropriate. OR and 95% confidence intervals were estimated. In addition to single SNP analysis, haplotype analysis was carried out. LD blocks were defined using the ‘single gamete rule’ implemented in the HAPLOVIEW Software, version 3.11.71 The association of specific haplotypes within blocks with the outcome was examined and P-values were estimated. For the biological studies, statistical differences were assessed by Anova and Student’s two-tailed t-test. P-values <0.05 after correction for multiple hypotheses were considered significant in the genetic analysis based on the combined cohorts. Adjusting for multiple comparisons was made using Bonferroni methods separately for the single SNP and haplotype analysis for the combined analysis. For the single SNP analysis, we tested 24 SNPs in the discovery cohort, 10 in the replication cohort and 6 in the combined cohort, we therefore accounted for 40 comparisons. As for the haplotype analysis, we tested 19 haplotypes in the discovery cohort, 4 in the replication cohort and 4 in the combined cohort, thus we accounted for 27 comparisons.

Power estimations

Based on findings of the discovery cohort, the power required to replicate associations in an independent cohort was made after considering the observed allele frequencies and ORs, assuming an alpha level of significance of 0.05, an available case sample of 450 cases and a control population of 1300 subjects. Based on this pre-defined sample size, it was estimated that the replication cohort would have >80% power to replicate associations noted in the discovery cohort. Power analysis was carried out using QUANTO Software, version 1.2.4 (


  1. 1

    Bernstein CN, Wajda A, Svenson LW, MacKenzie A, Koehoorn M, Jackson M et al. The epidemiology of inflammatory bowel disease in Canada: a population-based study. Am J Gastroenterol 2006; 101: 1559–1568.

  2. 2

    Kappelman MD, Rifas-Shiman SL, Kleinman K, Ollendorf D, Bousvaros A, Grand RJ et al. The prevalence and geographic distribution of Crohn’s disease and ulcerative colitis in the United States. Clin Gastroenterol Hepatol 2007; 5: 1424–1429.

  3. 3

    Rioux JD, Xavier RJ, Taylor KD, Silverberg MS, Goyette P, Huett A et al. Genome-wide association study identifies new susceptibility loci for Crohn disease and implicates autophagy in disease pathogenesis. Nat Genet 2007; 39: 596–604.

  4. 4

    Parkes M, Barrett JC, Prescott NJ, Tremelling M, Anderson CA, Fisher SA et al. Sequence variants in the autophagy gene IRGM and multiple other replicating loci contribute to Crohn's disease susceptibility. Nat Genet 2007; 39: 830–832.

  5. 5

    Prescott NJ, Fisher SA, Franke A, Hampe J, Onnie CM, Soars D et al. A nonsynonymous SNP in ATG16L1 predisposes to ileal Crohn’s disease and is independent of CARD15 and IBD5. Gastroenterology 2007; 132: 1665–1671.

  6. 6

    Amre DK, Mack D, Israel D, Morgan K, Lambrette P, Law L et al. Association between genetic variants in the IL-23R gene and early-onset Crohn’s disease: results from a case-control and family-based study among Canadian children. Am J Gastroenterol 2008; 103: 615–620.

  7. 7

    Franke A, McGovern DP, Barrett JC, Wang K, Radford-Smith GL, Ahmad T et al. Genome-wide meta-analysis increases to 71 the number of confirmed Crohn’s disease susceptibility loci. Nat Genet 2010; 42: 1118–1125.

  8. 8

    Rivas MA, Beaudoin M, Gardet A, Stevens C, Sharma Y, Zhang CK et al. Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease. Nat Genet 2011; 43: 1066–1073.

  9. 9

    Sladek FM, Zhong WM, Lai E, Darnell JE . Liver-enriched transcription factor HNF-4 is a novel member of the steroid hormone receptor superfamily. Genes Dev 1990; 4: 2353–2365.

  10. 10

    Taraviras S, Monaghan AP, Schutz G, Kelsey G . Characterization of the mouse HNF-4 gene and its expression during mouse embryogenesis. Mech Dev 1994; 48: 67–79.

  11. 11

    Miquerol L, Lopez S, Cartier N, Tulliez M, Raymondjean M, Kahn A . Expression of the L-type pyruvate kinase gene and the hepatocyte nuclear factor 4 transcription factor in exocrine and endocrine pancreas. J Biol Chem 1994; 269: 8944–8951.

  12. 12

    Stoffel M, Duncan SA . The maturity-onset diabetes of the young (MODY1) transcription factor HNF4alpha regulates expression of genes required for glucose transport and metabolism. Proc Natl Acad Sci USA 1997; 94: 13209–13214.

  13. 13

    Chien CC, Yen BL, Lee FK, Lai TH, Chen YC, Chan SH et al. In vitro differentiation of human placenta-derived multipotent cells into hepatocyte-like cells. Stem Cells 2006; 24: 1759–1768.

  14. 14

    Fiegel HC, Lioznov MV, Cortes-Dericks L, Lange C, Kluth D, Fehse B et al. Liver-specific gene expression in cultured human hematopoietic stem cells. Stem Cells 2003; 21: 98–104.

  15. 15

    Lian G, Wang C, Teng C, Zhang C, Du L, Zhong Q et al. Failure of hepatocyte marker-expressing hematopoietic progenitor cells to efficiently convert into hepatocytes in vitro. Exp Hematol 2006; 34: 348–358.

  16. 16

    Parviz F, Matullo C, Garrison WD, Savatski L, Adamson JW, Ning G et al. Hepatocyte nuclear factor 4alpha controls the development of a hepatic epithelium and liver morphogenesis. Nat Genet 2003; 34: 292–296.

  17. 17

    Sladek FM . Orphan receptor HNF-4 and liver-specific gene expression. Receptor 1993; 3: 223–232.

  18. 18

    Boj SF, Parrizas M, Maestro MA, Ferrer J . A transcription factor regulatory circuit in differentiated pancreatic cells. Proc Natl Acad Sci USA 2001; 98: 14481–14486.

  19. 19

    Thomas H, Jaschkowitz K, Bulman M, Frayling TM, Mitchell SM, Roosen S et al. A distant upstream promoter of the HNF-4alpha gene connects the transcription factors involved in maturity-onset diabetes of the young. Hum Mol Genet 2001; 10: 2089–2097.

  20. 20

    Briancon N, Weiss MC . In vivo role of the HNF4alpha AF-1 activation domain revealed by exon swapping. EMBO J 2006; 25: 1253–1262.

  21. 21

    Rhee J, Inoue Y, Yoon JC, Puigserver P, Fan M, Gonzalez FJ et al. Regulation of hepatic fasting response by PPARgamma coactivator-1alpha (PGC-1): requirement for hepatocyte nuclear factor 4alpha in gluconeogenesis. Proc Natl Acad Sci USA 2003; 100: 4012–4017.

  22. 22

    Wang H, Maechler P, Antinozzi PA, Hagenfeldt KA, Wollheim CB . Hepatocyte nuclear factor 4alpha regulates the expression of pancreatic beta-cell genes implicated in glucose metabolism and nutrient-induced insulin secretion. J Biol Chem 2000; 275: 35953–35959.

  23. 23

    Bartoov-Shifman R, Hertz R, Wang H, Wollheim CB, Bar-Tana J, Walker MD . Activation of the insulin gene promoter through a direct effect of hepatocyte nuclear factor 4 alpha. J Biol Chem 2002; 277: 25914–25919.

  24. 24

    Tegude H, Schnabel A, Zanger UM, Klein K, Eichelbaum M, Burk O . Molecular mechanism of basal CYP3A4 regulation by hepatocyte nuclear factor 4alpha: evidence for direct regulation in the intestine. Drug Metab Dispos 2007; 35: 946–954.

  25. 25

    Fajans SS, Bell GI, Polonsky KS . Molecular mechanisms and clinical pathophysiology of maturity-onset diabetes of the young. N Engl J Med 2001; 345: 971–980.

  26. 26

    Bagwell AM, Bento JL, Mychaleckyj JC, Freedman BI, Langefeld CD, Bowden DW . Genetic analysis of HNF4A polymorphisms in Caucasian-American type 2 diabetes. Diabetes 2005; 54: 1185–1190.

  27. 27

    Hansen SK, Rose CS, Glumer C, Drivsholm T, Borch-Johnsen K, Jorgensen T et al. Variation near the hepatocyte nuclear factor (HNF)-4alpha gene associates with type 2 diabetes in the Danish population. Diabetologia 2005; 48: 452–458.

  28. 28

    Vaxillaire M, Dina C, Lobbens S, Dechaume A, Vasseur-Delannoy V, Helbecque N et al. Effect of common polymorphisms in the HNF4alpha promoter on susceptibility to type 2 diabetes in the French Caucasian population. Diabetologia 2005; 48: 440–444.

  29. 29

    Garrison WD, Battle MA, Yang C, Kaestner KH, Sladek FM, Duncan SA . Hepatocyte nuclear factor 4alpha is essential for embryonic development of the mouse colon. Gastroenterology 2006; 130: 1207–1220.

  30. 30

    Frochot V, Alqub M, Cattin AL, Carriere V, Houllier A, Baraille F et al. The transcription factor HNF-4alpha: a key factor of the intestinal uptake of fatty acids in mouse. Am J Physiol Gastrointest Liver Physiol 2012; 302: G1253–G1263.

  31. 31

    Lussier CR, Babeu JP, Auclair BA, Perreault N, Boudreau F . Hepatocyte nuclear factor-4alpha promotes differentiation of intestinal epithelial cells in a coculture system. Am J Physiol Gastrointest Liver Physiol 2008; 294: G418–G428.

  32. 32

    Babeu JP, Darsigny M, Lussier CR, Boudreau F . Hepatocyte nuclear factor 4alpha contributes to an intestinal epithelial phenotype in vitro and plays a partial role in mouse intestinal epithelium differentiation. Am J Physiol Gastrointest Liver Physiol 2009; 297: G124–G134.

  33. 33

    Darsigny M, Babeu JP, Dupuis AA, Furth EE, Seidman EG, Levy E et al. Loss of hepatocyte-nuclear-factor-4alpha affects colonic ion transport and causes chronic inflammation resembling inflammatory bowel disease in mice. PLoS One 2009; 4: e7609.

  34. 34

    Marcil V, Seidman E, Sinnett D, Boudreau F, Gendron FP, Beaulieu JF et al. Modification in oxidative stress, inflammation and lipoprotein assembly in response to hepatocyte nuclear factor 4 alpha knockdown in intestinal epithelial cells. J Biol Chem 2010; 285: 40448–40460.

  35. 35

    Barrett JC, Lee JC, Lees CW, Prescott NJ, Anderson CA, Phillips A et al. Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region. Nat Genet 2009; 41: 1330–1334.

  36. 36

    Bonnycastle LL, Willer CJ, Conneely KN, Jackson AU, Burrill CP, Watanabe RM et al. Common variants in maturity-onset diabetes of the young genes contribute to risk of type 2 diabetes in Finns. Diabetes 2006; 55: 2534–2540.

  37. 37

    Weissglas-Volkov D, Huertas-Vazquez A, Suviolahti E, Lee J, Plaisier C, Canizales-Quinteros S et al. Common hepatic nuclear factor-4alpha variants are associated with high serum lipid levels and the metabolic syndrome. Diabetes 2006; 55: 1970–1977.

  38. 38

    Silverberg MS, Satsangi J, Ahmad T, Arnott ID, Bernstein CN, Brant SR et al. Toward an integrated clinical, molecular and serological classification of inflammatory bowel disease: report of a Working Party of the 2005 Montreal World Congress of Gastroenterology. Can J Gastroenterol 2005; 19 (Suppl A): 5–36.

  39. 39

    Ahn SH, Shah YM, Inoue J, Morimura K, Kim I, Yim S et al. Hepatocyte nuclear factor 4alpha in the intestinal epithelial cells protects against inflammatory bowel disease. Inflamm Bowel Dis 2008; 14: 908–920.

  40. 40

    De BK, Vanden Berghe W, Haegeman G . Cross-talk between nuclear receptors and nuclear factor kappaB. Oncogene 2006; 25: 6868–6886.

  41. 41

    Nikolaidou-Neokosmidou V, Zannis VI, Kardassis D . Inhibition of hepatocyte nuclear factor 4 transcriptional activity by the nuclear factor kappaB pathway. Biochem J 2006; 398: 439–450.

  42. 42

    Amre DK, Mack DR, Morgan K, Israel D, Deslandres C, Seidman EG et al. Association between genome-wide association studies reported SNPs and pediatric-onset Crohn’s disease in Canadian children. Hum Genet 2010; 128: 131–135.

  43. 43

    Bianco AM, Zanin V, Girardelli M, Magnolato A, Martellossi S, Tommasini A et al. A common genetic background could explain early-onset Crohn’s disease. Med Hypotheses 2012; 78: 520–522.

  44. 44

    Sladek FM, Seidel SD . Hepatocyte nuclear factor 4á. In: Burris TP, McCabe E (eds). Nuclear Receptors and Genetic Diseases. Academic Press London, UK, 2001, pp 309–361.

  45. 45

    Ellard S, Colclough K . Mutations in the genes encoding the transcription factors hepatocyte nuclear factor 1 alpha (HNF1A) and 4 alpha (HNF4A) in maturity-onset diabetes of the young. Hum Mutat 2006; 27: 854–869.

  46. 46

    Barroso I, Luan J, Wheeler E, Whittaker P, Wasson J, Zeggini E et al. Population-specific risk of type 2 diabetes conferred by HNF4A P2 promoter variants: a lesson for replication studies. Diabetes 2008; 57: 3161–3165.

  47. 47

    Lehman DM, Richardson DK, Jenkinson CP, Hunt KJ, Dyer TD, Leach RJ et al. P2 promoter variants of the hepatocyte nuclear factor 4alpha gene are associated with type 2 diabetes in Mexican Americans. Diabetes 2007; 56: 513–517.

  48. 48

    Johansson S, Raeder H, Eide SA, Midthjell K, Hveem K, Sovik O et al. Studies in 3523 Norwegians and meta-analysis in 11,571 subjects indicate that variants in the hepatocyte nuclear factor 4 alpha (HNF4A) P2 region are associated with type 2 diabetes in Scandinavians. Diabetes 2007; 56: 3112–3117.

  49. 49

    Saif-Ali R, Harun R, Al-Jassabi S, Wan Ngah WZ . Hepatocyte nuclear factor 4 alpha P2 promoter variants associate with insulin resistance. Acta Biochim Pol 2011; 58: 179–186.

  50. 50

    Winckler W, Graham RR, de Bakker PI, Sun M, Almgren P, Tuomi T et al. Association testing of variants in the hepatocyte nuclear factor 4alpha gene with risk of type 2 diabetes in 7883 people. Diabetes 2005; 54: 886–892.

  51. 51

    Seldin MF, Amos CI . Shared susceptibility variations in autoimmune diseases: a brief perspective on common issues. Genes Immun 2009; 10: 1–4.

  52. 52

    Huang W, Wang P, Liu Z, Zhang L . Identifying disease associations via genome-wide association studies. BMC Bioinformatics 2009; 10 (Suppl 1): S68.

  53. 53

    Eleftherohorinou H, Wright V, Hoggart C, Hartikainen AL, Jarvelin MR, Balding D et al. Pathway analysis of GWAS provides new insights into genetic susceptibility to 3 inflammatory diseases. PLoS One 2009; 4: e8068.

  54. 54

    Liu H, Hew HC, Lu ZG, Yamaguchi T, Miki Y, Yoshida K . DNA damage signalling recruits RREB-1 to the p53 tumour suppressor promoter. Biochem J 2009; 422: 543–551.

  55. 55

    Flajollet S, Poras I, Carosella ED, Moreau P . RREB-1 is a transcriptional repressor of HLA-G. J Immunol 2009; 183: 6948–6959.

  56. 56

    Battle MA, Konopka G, Parviz F, Gaggl AL, Yang C, Sladek FM et al. Hepatocyte nuclear factor 4alpha orchestrates expression of cell adhesion proteins during the epithelial transformation of the developing liver. Proc Natl Acad Sci USA 2006; 103: 8419–8424.

  57. 57

    Satohisa S, Chiba H, Osanai M, Ohno S, Kojima T, Saito T et al. Behavior of tight-junction, adherens-junction and cell polarity proteins during HNF-4alpha-induced epithelial polarization. Exp Cell Res 2005; 310: 66–78.

  58. 58

    Cattin AL, Le BJ, Barreau F, Saint-Just S, Houllier A, Gonzalez FJ et al. Hepatocyte nuclear factor 4alpha, a key factor for homeostasis, cell architecture, and barrier function of the adult intestinal epithelium. Mol Cell Biol 2009; 29: 6294–6308.

  59. 59

    Genser D, Kang MH, Vogelsang H, Elmadfa I . Status of lipidsoluble antioxidants and TRAP in patients with Crohn's disease and healthy controls. Eur J Clin Nutr 1999; 53: 675–679.

  60. 60

    Lennard-Jones JE . Classification of inflammatory bowel disease. Scand J Gastroenterol Suppl 1989; 170: 2–6.

  61. 61

    Sands BE . From symptom to diagnosis: clinical distinctions among various forms of intestinal inflammation. Gastroenterology 2004; 126: 1518–1532.

  62. 62

    Tang H, Quertermous T, Rodriguez B, Kardia SL, Zhu X, Brown A et al. Genetic structure, self-identified race/ethnicity, and confounding in case-control association studies. Am J Hum Genet 2005; 76: 268–275.

  63. 63

    Paradis G, Lambert M, O’Loughlin J, Lavallee C, Aubin J, Berthiaume P et al. The Quebec Child and Adolescent Health and Social Survey: design and methods of a cardiovascular risk factor survey for youth. Can J Cardiol 2003; 19: 523–531.

  64. 64

    Andrulionyte L, Laukkanen O, Chiasson JL, Laakso M . Single nucleotide polymorphisms of the HNF4alpha gene are associated with the conversion to type 2 diabetes mellitus: the STOP-NIDDM trial. J Mol Med 2006; 84: 701–708.

  65. 65

    Muller YL, Infante AM, Hanson RL, Love-Gregory L, Knowler W, Bogardus C et al. Variants in hepatocyte nuclear factor 4alpha are modestly associated with type 2 diabetes in Pima Indians. Diabetes 2005; 54: 3035–3039.

  66. 66

    Corpet F . Multiple sequence alignment with hierarchical clustering. Nucleic Acids Res 1988; 16: 10881–10890.

  67. 67

    Koo SH, Ong TC, Chong KT, Lee CG, Chew FT, Lee EJ . Multiplexed genotyping of ABC transporter polymorphisms with the Bioplex suspension array. Biol Proced Online 2007; 9: 27–42.

  68. 68

    Bourgey M, Lariviere M, Richer C, Sinnett DALG . Automated genotype calling of luminex assays. PLoS One 2011; 6: e19368.

  69. 69

    Courtois F, Suc I, Garofalo C, Ledoux M, Seidman E, Levy E . Iron-ascorbate alters the efficiency of Caco-2 cells to assemble and secrete lipoproteins. Am J Physiol Gastrointest Liver Physiol 2000; 279: G12–G19.

  70. 70

    Levy E, Rizwan Y, Thibault L, Lepage G, Brunet S, Bouthillier L et al. Altered lipid profile, lipoprotein composition, and oxidant and antioxidant status in pediatric Crohn disease. Am J Clin Nutr 2000; 71: 807–815.

  71. 71

    Barrett JC, Fry B, Maller J, Daly MJ . Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 2005; 21: 263–265.

Download references


This study was supported by the Canadian Institutes of Health Research Team Grant (CTP-82942; ES, FB, FPG, JFB, DM and EL), the JA DeSève Research Chair in Nutrition (EL), the Canada Research Chair in Immune Mediated Gastrointestinal Disorders (ES), the Canadian Institutes of Health Research Fellowship Award and The Richard and Edith Strauss Postdoctoral Fellowships Award in Medicine, McGill University (VM). A preliminary version of this study, presented by VM, was awarded with a Presidential Poster Award at the American College of Gastroenterology 2010 Annual Scientific Meeting. We thank Mrs Schohraya Spahis for her technical assistance.

Author information

Correspondence to E Levy.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Rights and permissions

Reprints and Permissions

About this article


  • Crohn’s disease
  • HNF4α
  • genetic variants
  • oxidative stress

Further reading