Obesity and type 2 diabetes (T2D) are metabolic disorders influenced by lifestyle and genetic factors that are characterized by insulin resistance in skeletal muscle, a prominent site of glucose disposal. Numerous genetic variants have been associated with obesity and T2D, of which the majority are located in non-coding DNA regions. This suggests that most variants mediate their effect by altering the activity of gene-regulatory elements, including enhancers. Here, we map skeletal muscle genomic enhancer elements that are dynamically regulated after exposure to the free fatty acid palmitate or the inflammatory cytokine TNFα. By overlapping enhancer positions with the location of disease-associated genetic variants, and resolving long-range chromatin interactions between enhancers and gene promoters, we identify target genes involved in metabolic dysfunction in skeletal muscle. The majority of these genes also associate with altered whole-body metabolic phenotypes in the murine BXD genetic reference population. Thus, our combined genomic investigations identified genes that are involved in skeletal muscle metabolism.
The prevalence of obesity and T2D comorbidity is reaching epidemic proportions worldwide, with currently 1.9 billion adults estimated as being overweight or obese1 and 380 million suffering from T2D2. Skeletal muscle constitutes the largest metabolic organ and accounts for 30% of the basal metabolic rate3, and as the most prominent site of insulin-mediated glucose uptake in humans, insulin resistance (IR) in muscle is considered a contributing defect during development of T2D4. While the molecular basis for the pathology of obesity and T2D is incompletely understood, it is clear that both genetic and environmental factors contribute, probably in a synergistic manner5. Genome-wide association studies (GWAS) have identified a plethora of genetic variants associated with T2D and obesity traits6,7,8. However, only a minority (<5%) of GWAS identified variants are located in coding sequences9, which makes functional characterization complex. Several studies have identified that a substantial proportion of the disease-associated variants lie within regulatory regions, including enhancer elements9,10,11.
Enhancers serve as binding sites for transcription factors and co-regulators that assist in DNA looping and recruitment of the transcriptional machinery to targeted promoters. With an estimated 50,000 to 100,000 active enhancers in any given mammalian cell type12, enhancers are thought to account for the complexity of gene regulation. Enhancers are characterized by the presence of histone modifications including monomethylation of histone 3 lysine 4 (H3K4me1) and acetylation of histone 3 lysine 27 (H3K27ac)13,14,15. Thus, by determining the genome-wide distribution of these histone marks, it is possible to generate genome-wide maps of active enhancers (the enhancerome) in a specific tissue. Mapping the enhancerome in various cell types and during embryonic stem cell differentiation has demonstrated that enhancer activation is highly cell-type specific and dynamic16,17, and several studies have proposed that impaired enhancer activation could be at the origin of disease18,19,20,21. Besides interacting with nearby promoters, enhancers also engage in long-range interactions. Indeed, it is estimated that approximately 35–40% of all promoter-enhancer interactions are intervened by at least one gene22, which makes exact enhancer-target prediction challenging. Long-range enhancers interactions can be identified by chromosome conformation capture methods23,24.
In the present study, we aimed to identify target genes of GWAS SNPs in human skeletal muscle by using cultured myotubes subjected to metabolic stress by either palmitate or TNFα exposure. Elevation of plasma levels of free fatty acids and proinflammatory cytokines associates with increasing adiposity25 and represent an important link between obesity, skeletal muscle IR, and T2D. By RNA profiling and genome-wide mapping of enhancer elements in myotubes, we found that palmitate or TNFα treatment led to massive changes in gene transcription, as well as alterations in the activity of enhancers. Moreover, we showed that enhancers regulated by palmitate or TNFα exposure, overlapped SNPs from GWAS of BMI, waist-to-hip ratio (WHR), IR or T2D. Moreover, by mapping global promoter-enhancer interactions by chromatin conformation analysis, we directly couple these enhancers to promoters, where we found a concurrent change in gene transcription by the respective treatments. Thus, we established physical links between numerous GWAS SNPs and muscle-expressed genes and provided insight into the association between the identified genes and metabolic function in vivo.
Transcriptomic profiling of human skeletal muscle cells
To study concurrent changes in gene transcription, enhancer activities and chromatin conformation, we used primary human skeletal muscle cells differentiated into myotubes that were subjected to metabolic stress by treatment with either palmitate or TNFα (Supplementary Fig. 1A). As previously reported26,27,28,29, both treatments lowered insulin sensitivity, as confirmed by decreased AKT Ser-473 phosphorylation in response to insulin stimulation (Supplementary Fig. 1B–E).
First, we performed transcriptomic analysis by RNA-sequencing (RNA-seq). Multidimensional Scaling (MDS) plots showed a clear sample separation based on palmitate or TNFα treatment (Fig. 1a). In total, we detected expression of 14,402 genes in skeletal muscle cells, of which 1542 were regulated by palmitate treatment (621 downregulated and 921 upregulated; Fig. 1b, Supplementary Data 1, and Supplementary Fig. 2A) and 4522 were changed by TNFα treatment (2247 downregulated and 2275 upregulated; Fig. 1c, Supplementary Data 1, and Supplementary Fig. 2B). Gene ontology (GO) analysis of the differentially expressed genes (Supplementary Data 2) demonstrated strong upregulation of genes involved in lipid metabolism, as well as regulation of inflammatory responses (Fig. 1d) by palmitate exposure, whereas terms related to nucleosome assembly were specifically downregulated (Fig. 1e). GO analysis of TNFα upregulated genes returned several terms related to immune signaling (Fig. 1f), whereas downregulated genes were related to protein targeting to the endoplasmic reticulum (ER), insulin-like growth factor signaling and muscle filament sliding (Fig. 1g). Interestingly, both treatments seemed to significantly upregulate genes involved in inflammation (Supplementary Data 2 and Fig. 1h), and to downregulate genes related to muscle contraction (Supplementary Data 2 and Fig. 1i), both of which are processes related to skeletal muscle dysfunction and insulin resistance. Thus, our transcriptomic analyses of human muscle myotubes reveal thousands of target genes of which many are related to metabolic dysfunction.
The dynamic enhancerome of skeletal muscle cells
Through chromatin immunoprecipitation followed by sequencing (ChIP-seq), we mapped the distribution of the enhancer-associated histone H3 modifications, H3K4me1 and H3K27ac, in the muscle myotubes treated with TNFα or palmitate. Genome-wide, we identified 107,405 and 80,388 significant peaks of H3K4me1 or H3K27ac, respectively (Fig. 2a). These were mostly located in non-coding DNA, such as introns and intergenic regions, as well as in promoters (Supplementary Fig. 3). In order to identify enhancers, we subtracted active promoter regions (defined by the promoter-associated H3K4me3 mark). We found that most (95.5%) of the non-promoter associated H3K27ac peaks overlapped a H3K4me1 peak, whereas only 36.9% of the H3K4me1 peaks overlapped H3K27ac (Fig. 2a). These findings support the notion that enhancers can be primed (marked by only H3K4me1) or active (marked by both H3K4me1 and H3K27ac)16,17. MDS plots of non-promoter associated H3K27ac and H3K4me1 ChIP-seq data demonstrated a clear treatment-based separation of samples for H3K27ac (Fig. 2b), whereas this was less obvious for H3K4me1 (Fig. 2c), underlining the assumption that especially H3K27ac undergoes dynamic regulation in response to external stimuli and determines enhancer activity16,30,31. Therefore, to identify enhancers that were differentially activated after palmitate or TNFα treatment, we searched for peaks within the 62,866 identified active enhancers (covered by both H3K4me1 and H3K27ac) that showed significant changes in H3K27ac levels. This analysis returned 2243 enhancers with altered activity after palmitate treatment (FDR < 0.01, 1190 with a decreased activity and 1053 with an increased activity) (Fig. 2d and Supplementary Data 3), and 17,037 enhancers that changed activity after TNFα treatment (FDR < 0.01, 12,380 with a decreased activity and 4,657 with an increased activity) (Fig. 2e and Supplementary Data 3). Examples of enhancers with a strong increase in H3K27ac after palmitate treatment included elements located 10 kb upstream of the PDK4 promoter and 9 kb upstream of ANGPTL4 (Fig. 2f, g)—two genes known to play a role in fatty acid metabolism. Moreover, some enhancers strongly regulated by TNFα were located close to cytokine genes, exemplified by enhancers located 21 kb downstream of CCL11 and 17 kb upstream of CCXL8 (Fig. 2h, i). The changes in H3K27ac were validated independently by ChIP-qPCR (Supplementary Fig. 4A), which further confirmed the presence of H3K4me1 at these sites (Supplementary Fig. 4B, C). None of the validated enhancer regions showed enrichment of the promoter-associated H3K4me3 mark (Supplementary Fig. 4D), ruling out that these genomic regions act as alternative promoters. Consistent with increased enhancer activity, expression of PDK4, ANGPTL4, CCL11, and CCXL8 were markedly upregulated after palmitate or TNFα treatment (Fig. 2j, k), supporting a regulatory role of these enhancers on expression of their nearby promoters. To further validate the cis-regulatory activity of the identified regions, we cloned the PDK4-10 kb and the CCXL8-17kb enhancers into a luciferase reporter vector. When transfected into muscle cells, luciferase activity was markedly increased in response to palmitate or TNFα treatment (Supplementary Fig. 5), confirming a regulation of enhancer activity by these treatments. Collectively, our results identify thousands of dynamic enhancer activities in human skeletal muscle cells after treatment with palmitate or TNFα.
Capture Hi-C identifies enhancer-promoter interactions
Besides interacting with nearby promoters, enhancers can also engage in long-range interactions, which makes enhancer-target prediction challenging. To overcome this, we performed genome-wide mapping of enhancer-promoter interactions in skeletal muscle cells by the use of high-resolution Promoter Capture Hi-C22,24. First, we tested if treatment of myotubes with palmitate or TNFα was associated with a dynamic reorganization of promoter-enhancer interactions. Hi-C libraries were generated from skeletal muscle myotubes followed by hybridization-based capture of 21,841 human promoters, using a collection of 37,608 biotinylated RNA baits (approximately two baits per promoter) previously designed and tested by others22. By sequencing the captured ligation fragments and testing for a difference in mapped Hi-C interactions by palmitate or TNFα treatment, we did not detect any significant changes (Supplementary Fig. 6A-B), suggesting that acute exposure to these treatments does not cause major changes to chromatin structure. This agrees with another study showing that TNFα-responsive enhancers are already in contact with their target promoters before transient activation or repression of enhancer activity by TNFα treatment in human fibroblasts32.
Next, we pooled all Promoter Capture Hi-C conditions in order to obtain a general chromatin conformation capture of myotubes. This identified 36,809 significant promoter-enhancer interactions (Fig. 3a and Supplementary Data 4). Interactions covered 47% of tested promoters and 51% of identified enhancers regions (Fig. 3a) and largely spanned the entire genome (Supplementary Fig. 7). Genomic distances of identified promoter-enhancer interactions ranged up to 6.2 Mb, with a median distance of 93.8 kb (Fig. 3b) and each of the captured promoters were on average connected to 4 enhancer regions (Fig. 3c).
To validate if our Promoter Capture Hi-C data identified functional enhancer-promoter interactions, i.e., where a dynamic change in enhancer activity also associate with a concurrent change in promoter transcription, we divided the promoters captured in our chromatin interaction data into three groups (Fig. 3d): promoters connected to enhancers that did not change H3K27ac in response to palmitate or TNFα treatment (“None”) and promoters connected to enhancers that either gained H3K27ac (“Up”) or lost H3K27ac (“Down”). Empirical cumulative distribution function (EDCF) plots of gene expression changes (RNA-seq logFC values) in the different groups revealed that promoters connected to enhancers with gained activity have higher logFC values than the “None” group (Fig. 3e, f), whereas promoters connected to enhancers with decreased activity have significantly lower logFC values for both palmitate and TNFα treatments (Fig. 3g, h), supporting a regulatory role of the connected enhancers. Taken together, we have generated an enhancer-promoter connectivity map of skeletal muscle myotubes and demonstrated a general capture of promoter-enhancer pairs with concurrent changes in activity by palmitate or TNFα treatment.
Chromatin interaction data predict enhancer target genes
Given that the vast majority of disease-associated variants are predicted to be located in regulatory regions9,10,11, our data represent an opportunity to identify target genes of GWAS SNPs in skeletal muscle cells by combining our enhancer mapping with information on chromatin conformation and gene transcription. For this, we used four sets of GWAS SNPs associated with T2D6, IR33,34,35,36,37,38, BMI8 or WHR7, as well as tagged SNPs in high linkage disequilibrium (LD, r2 > 0.8) (Fig. 4a). After overlapping the variants with enhancer regions regulated by either palmitate or TNFα treatment, we identified 58 palmitate-regulated enhancers and 522 TNFα-regulated enhancers each harboring one or more GWAS SNPs (Fig. 4b). Next, we selected enhancers that were both captured by our Promoter Capture Hi-C analysis and linked to genes differentially expressed after palmitate or TNFα treatment. When only considering enhancer-gene pairs where enhancer activity and gene expression were regulated in the same direction (i.e., either upregulated or downregulated), our analysis retrieved 11 palmitate-regulated, and 124 TNFα-regulated enhancers interacting with 11 and 99 predicted target gene promoters, respectively (Fig. 4b and Supplementary Data 5). The predicted target genes included several known players in metabolism such as IRS1, IGFBP3, PPARG, SOCS2, and LEPR, providing a link between disease-associated SNPs and the ability of skeletal muscle to adapt to metabolic and inflammatory stress. To further narrow down the list of potential gene targets, we investigated the association between genotype of the enhancer-overlapping GWAS SNPs and the basal expression of each of their target genes in skeletal muscle biopsies of 139 individuals (by expression quantitative trait locus (eQTL) analysis). This approach identified 13 significant skeletal muscle eGenes (CEP68, GAB2, LAMB1, MACF1, EIF6, PABPC4, BTBD1, FILIP1L, TCEA3, NRP1, ZHX3, TBX15, and TNFAIP8) for 61 GWAS-SNPs, located within 20 distinct enhancer regions (Fig. 4c, d and Supplementary Data 6). Thus, by overlapping our genomic datasets, we have identified numerous putative target genes of metabolic GWAS SNPs, which may play a functional role under lipid toxicity or in response to proinflammatory stimuli. Moreover, for 13 genes, we demonstrate a significant association between GWAS SNP genotype and basal gene expression levels in human skeletal muscle.
Identified target genes are linked to energy metabolism
In order to understand the role of the identified putative GWAS-SNP target genes in whole body metabolism in vivo, we analyzed the association between 48 metabolic traits in the BXD murine genetic reference population fed a control diet (CD) or high fat diet (HFD)39,40,41 (Supplementary Data 7), and expression levels of the 13 identified eGenes in skeletal muscle (Supplementary Data 8), adipose tissue (Supplementary Data 9) and liver (Supplementary Data 10). Strikingly, expression of 12 out of the 13 genes (Cep68, Gab2, Lamb1, Macf1, Eif6, Btbd1, Filip1l, Tcea3, Nrp1, Zhx3, Tbx15, and Tnfaip8) showed associations with metabolic measures, such as blood glucose levels during glucose tolerance tests (GTTs), plasma lipid levels, body composition, and exercise performance, in at least one of the tested tissues (Table 1). For some target genes, metabolic measurements were specifically associated with expression in skeletal muscle. For example, expression of Tbx15 (Fig. 5a), which we found linked to SNPs associated with WHR in humans, was positively associated with lean body mass (Fig. 5b) and VO2 max (Fig. 5c), as well as negatively associated with total body fat mass (Fig. 5d) and blood glucose levels during an oral GTT (Fig. 5e) in the BXD mice. Interestingly, the expression of Cep68, which we find linked to SNPs associated with T2D, was correlated with blood glucose levels during GTTs in HFD-fed mice in both muscle and liver (Fig. 5f). More specifically, Cep68 expression was negatively correlated with blood glucose levels during an intraperitoneal GTT in skeletal muscle of both male (Fig. 5g) and female (Fig. 5h) mice, and oral GTT in liver tissue (Fig. 5i). Moreover, Cep68 association with body fat mass and lean mass percentages in adipose tissue (Fig. 5j) suggests that CEP68 has a role in T2D through dysregulated expression in multiple organs. Collectively, these data demonstrate that the expression of identified putative GWAS SNP targets correlates with metabolic measures in mice, and suggest a role for these genes in the regulation of energy metabolism in vivo.
Long-range interactions connect WHR SNPs to EIF6 expression
For some candidate genes identified as regulated by non-coding GWAS SNPs, including EIF6, the gene was not located at close vicinity of the differentially activated enhancer region, but connected through long-range chromatin interactions. The SNPs that we found linked to EIF6 are located within the UQCC1 locus and associate with WHR (Fig. 6a). We identified four enhancer regions, UQCC1 + 100 kb, UQCC1 + 26 kb, UQCC1 + 16 kb, and UQCC1 + 13 kb, that were all regulated by TNFα (Fig. 6b) and captured by our Promoter Capture Hi-C data. The enhancer regions overlapped several highly linked WHR-associated SNPs (Fig. 6a). From our chromatin interaction data, we found all enhancers to interact with the promoter of EIF6 (Fig. 6a). Moreover, the UQCC1 + 100 kb enhancer also interacted with MMP24 and EDEM2, whereas UQCC1 + 26 kb, UQCC1 + 16 kb, and UQCC1 + 13 kb enhancer regions looped to the GDF5/CEP250 shared promoter (Fig. 6a). Out of these genes, MMP24, EIF6 and GDF5 remained candidates to be under the regulation of the enhancers, since the expression of these genes was concurrently decreased by TNFα treatment (Fig. 6c). Importantly, the UQCC1 promoter was not found linked to the enhancer nor did UQCC1 change expression by TNFα. While GDF5 expression was below detection limit in skeletal muscle and could not be analyzed for eQTLs, we found associations of several LD-linked WHR associated SNPs, including rs878639, with the expression of EIF6 (Supplementary Data 6 and Fig. 6d), but not with MMP24 (Fig. 6e). In the case of rs878639, the major allele associates with an increased WHR, which establishes a link between lower EIF6 expression and an unhealthy body fat distribution. Consistently, we found that Eif6 expression in muscle from BXD mice positively associates with running distance (Fig. 6f, Supplementary Data 8), VO2 basal (Fig. 6g, Supplementary Data 8) and VO2 max levels after training (Fig. 6h, Supplementary Data 8), suggesting better aerobic capacity in animals with higher skeletal muscle Eif6 expression. To further validate our findings, we used siRNAs (siEif6#1 and siEif6#2) to knock down Eif6 expression in skeletal muscle cells (Fig. 6i and Fig. S8A). We assessed mitochondrial respiration by measuring oxygen consumption rate (OCR) at basal state or during FCCP-induced uncoupling (Fig. 6j and Supplementary Fig. 8B) and found that decreased Eif6 expression resulted in lower OCR (Fig. 6k), especially during maximal FCCP-induced respiration (Fig. 6l and Supplementary Fig. 8C). Moreover, after differentiating C2C12 cells into myotubes, we found that Eif6 knockdown (Supplementary Fig. 9A) led to reduced protein levels of the mitochondrial oxidative phosphorylation complex II (Fig. 6m and Supplementary Fig. 9B), whereas we did not detect any changes in insulin-stimulated glucose uptake (Supplementary Fig. 9C), glycogen synthesis (Supplementary Fig. 9D), or AKT phosphorylation (Supplementary Fig. 9E, F).
Thus, long-distance interactions networks suggest that EIF6 is regulated by genetic variants associated with body fat distribution. Accordingly, we identified correlations between lower skeletal muscle Eif6 expression and reduced exercise performance, and further provide evidence for a role of EIF6 in the regulation of mitochondrial function in skeletal muscle.
Here, we mapped the transcriptome and enhancerome of human skeletal muscle cells subjected to lipid-induced toxicity or a proinflammatory cytokine. We demonstrate a profound transcriptional reprogramming with thousands of promoter and enhancer regions showing altered activity. Integrating these data with GWAS of T2D, IR, BMI and WHR measures as well as genome-wide chromatin interaction studies, allowed us to detect concurrent changes in the activity of enhancers encompassing GWAS SNPs and transcription from a connected promoter, thereby establishing links between numerous non-coding disease-associated SNPs and gene targets. Using the murine BXD genetic reference population we provide further insight into the role of the identified target genes in the regulation of metabolic phenotypes like body composition, glucose response and exercise performance in vivo. In particular, we provide evidence that one of our identified targets, Eif6, controls mitochondrial respiration in skeletal muscle cells.
Our cell-system using chronic exposure with palmitate or TNFα in human primary muscle cells allowed investigation into the distinct mechanisms by which the metabolic function of the skeletal muscle cell is impaired. Palmitate induces insulin resistance at the level of AKT phosphorylation42, impairs mitochondrial function43, lowers expression of the master regulator of mitochondrial function peroxisome proliferator-activated receptor-gamma coactivator (PGC)-1 α44, and induces ER stress45. Interestingly, incubation of skeletal muscle cells with palmitate induces TNFα secretion by the muscle cell, suggesting that while saturated fatty acids and TNFα appear to activate distinct intracellular pathways, these pathways may share common nodes46. Saturated free fatty acid and TNFα treatment both alter upstream insulin signaling, but TNFα treatment does not alter insulin-stimulated glucose uptake in muscle cells whereas palmitate does42,47. In vivo however, TNFα infusion is associated with both lower activation of the upstream insulin-signal pathway and impaired glucose transport48. Even though TNFα exposure is not associated with lower fatty acid oxidation in muscle ex vivo49, we identified EIF6 as a gene regulated by TNFα exposure and show EIF6 plays a role in fatty-acid oxidation. The discrepancy between the effects of palmitate and TNFα on primary skeletal muscle cell cultures compared to in vivo may be due to specific tissue-culture conditions, different extracellular milieus or the influence of systemic factors.
While the activity of enhancers and promoters were markedly changed after palmitate or TNFα exposure, promoter-enhancer interactions did not appear to be affected. These findings are consistent with a previous study showing that enhancers-promoter interactions are unchanged in fibroblasts treated with TNFα32. We cannot rule out, however, that palmitate or TNFα exposure could remodel chromatin in myotubes, as low sequencing depth or low power may have limited our capacity to detect subtle changes. From previous studies it seems clear that dynamic remodeling of promoter-enhancer interactions occurs during cellular differentiation, particularity at cell type-specific enhancers23,50,51,52,53. Interestingly, the discrepancy between activation of cell type-specific enhancers and enhancers induced by treatments such as TNFα seems to correlate with H3K4me1 levels. Indeed, treatment-induced enhancers appear to exhibit largely unchanged levels of H3K4me1, despite a quick induction of H3K27ac, whereas cell type-specific enhancers display highly variable H3K4me1 levels32. This is consistent with our data, where palmitate- and TNFα-induce large changes in H3K27ac levels at enhancers but only minor changes in H3K4me1. Still, certain chromatin interactions were recently described to be variable in a circadian fashion54, suggesting that promoter-enhancer interactions can indeed be dynamic even within a defined cell type.
Our mapping of the chromatin interactome of human myotubes identified 36,809 specific enhancer-promoter interactions. Integrating these data with RNA transcription and enhancer activity analyses allowed us to specifically capture enhancer-promoter interactions where 1) the enhancer overlaps one or more SNPs associated with T2D, IR, BMI or WHR and 2) the enhancer activity and gene expression were regulated in the same direction by either palmitate or TNFα exposure. Our analysis retrieved more than 100 predicted GWAS target genes, which included several known players in metabolism such as IRS1, IGFBP3, PPARG, SOCS2, and LEPR. However, our eQTL analysis did not detect an association between genotype and gene expression for most of these genes. We therefore speculate that GWAS SNPs may be functionally linked with gene expression in situations of cellular stress encountered in metabolic disease such as increased plasma levels of fatty acids or proinflammatory cytokines.
For the genes identified as significant eGenes in our eQTL analysis, we analyzed the association between their expression levels in skeletal muscle, adipose, or liver tissue and measures of 48 metabolic traits in the BXD murine genetic reference population. We found that 12 out of 13 genes (Cep68, Gab2, Lamb1, Macf1, Eif6, Btbd1, Filip1l, Tcea3, Nrp1, Zhx3, Tbx15, and Tnfaip8) exhibited marked associations with metabolic phenotypes in one or more of the tested tissues. For some targets, including Tbx15, the associations appeared specific for skeletal muscle expression and were not detected in either adipose or liver tissue, suggesting a muscle-specific role of Tbx15. This is consistent with the earlier finding that Tbx15 regulates muscle metabolism in mice and Tbx15 knockout animals are resistant to diet induced obesity and impaired glucose tolerance55. For other targets, such as Cep68, we identified associations in all of the tested tissues revealing the metabolic role of these genes in multiple organs. Linking gene expression with metabolic phenotypes represents a valuable tool to gain insight into gene function, although it does not infer on causality. Circulating leptin levels, for instance, are positively associated with fat mass56, but loss-of-function mutations of LEP are associated with obesity57. In our study, we observed a similar phenomenon where the CEP68 T2D risk variants are associated with increased CEP68 expression, but Cep68 expression is negatively associated with blood glucose levels during GTTs in mice. While further investigations are warranted to establish causal relationships and the mechanism by which CEP68 may regulate whole body metabolism, we speculate that dysregulated expression of CEP68 is involved in the pathogenesis of T2D.
For some genes that we identified as potential targets of metabolic GWAS SNPs, the SNP-enhancer locus was not located in close proximity to the predicted target gene, but engaged in long-range DNA looping formations. For example, we identified interactions between the promoter of the translation initiation factor EIF6 and several enhancers located within the UQCC1 gene, each spanning SNPs associated with WHR in humans. We found both enhancers and EIF6 expression were downregulated by TNFα and we detected significant eQTLs for EIF6 expression with SNPs of all loci. In the BXD mice, Eif6 muscle expression was associated with increased running distance, as well as with basal and maximal VO2 uptake after training. These findings are consistent with a study linking EIF6 to the regulation of energy metabolism during endurance training in humans and showing reduced exercise performance in Eif6 haploinsufficient mice58. Moreover, hypermethylation of the EIF6 promoter is linked to childhood obesity59. In support of this, we demonstrate that Eif6 knockdown in murine muscle cells causes lower mitochondrial respiration and reduced levels of the mitochondrial oxidative complex II. The identified link between EIF6 and modulation of WHR are consistent with data demonstrating that genetic variants within mitochondrial genes are associated with metabolic measures including WHR60. Notably, we did not detect a physical link between the UQCC1 intronic enhancers and the UQCC1promoter, nor did UQCC1 change expression by TNFα. A recent study has shown that human UQCC1 coding variants are associated with WHR61. Interestingly, eQTL analysis indicates that these variants associate not only with the expression levels of UQCC1, but also EIF661, suggesting that several genes within this locus could contribute to the modulation of WHR in humans. Thus, our data demonstrate that EIF6 expression is regulated by TNFα and suggest a role for muscle-specific expression of Eif6/EIF6 in the regulation of mitochondrial function and exercise performance in mice, as well as in WHR ratio in humans.
In conclusion, our study identified skeletal muscle enhancer elements that are dysregulated in the context of lipid-toxicity or under exposure of the proinflammatory cytokine TNFα. We identify hundreds of dysregulated enhancers which overlap with genetic loci previously implicated in metabolic disease and, using chromatin conformation assay, we predict the corresponding gene targets. We identify genes with known roles in metabolism, as well as targets that have not previously been linked to human metabolic disease, and demonstrate their association with metabolic phenotypes in mice. Given the influence of lifestyle and genetic factors in the development of obesity and T2D, and the prominent contribution of skeletal muscle in energy metabolism in humans, our investigations constitute a resource for identifying genes participating in the progression of metabolic disorders.
Human skeletal muscle cells (CC-2561 from Lonza) were cultured in DMEM/F-12, GlutaMAX™ (Life Technologies) supplemented with 20% FBS (Sigma-Aldrich) and 1% penicillin/streptomycin (Life Technologies) during proliferation. Differentiation was initiated when cells were 80% confluent by addition of differentiation media (DMEM/F-12, GlutaMAX™ supplemented with 2% FBS (Sigma-Aldrich) and 1% penicillin/streptomycin). Cells were differentiated for 5–7 days. For palmitate and TNFα treatment, the differentiated myotubes were added 0.5 μM palmitate for 48 h (on day 5–7 of differentiation) or 10 ng/ml TNFα for 24 h (on day 6–7 of differentiation).
Murine C2C12 myoblasts (CRL-1772 from ATCC) were cultured in DMEM (Life Technologies) supplemented with 10% FBS (Sigma-Aldrich) and 1% penicillin/streptomycin (Life Technologies). Differentiation was initiated when cells were 80% confluent by addition of differentiation media (DMEM/F-12, GlutaMAX™ supplemented with 2% horse serum (Sigma-Aldrich) and 1% penicillin/streptomycin). Cells were differentiated for 5 days.
Insulin stimulation experiments for human skeletal muscle cells and C2C12 cells were performed by serum depriving differentiated myotubes for 4 h before stimulating with either 10 or 100 nmol/L insulin for 5 min.
For Western blotting, protein lysates were prepared using a phospho-protein lysis buffer (20 mM Tris (pH 7.4), 150 mM NaCl, 5 mM EDTA, 150 mM NaF, 2 mM Na3VO4, 10 mM sodium pyrophosphate, 0.5 mM phenylmethylsulfonyl fluoride). Immunoblotting was performed according to standard protocols using total-AKT (CST-9272S, 1:1000 dilution), Phospho-Ser473 AKT (CST-9271S, 1:1000 dilution), or OXPHOS cocktail (Abcam 110413, 1:5000 dilution) as primary antibodies and goat anti-rabbit (Bio-Rad 170-6515, 1:10,000 dilution) or goat anti-mouse IgG (Bio-Rad 170-6516, 1:10,000 dilution) horseradish peroxidase conjugate secondary antibodies. Total protein on the membrane was quantified using Bio-Rad stain-free gels. Images were developed with Immun-Star WesternC Chemiluminescence kit (Bio-Rad) using a Molecular Imager ChemiDoc XRS + (Bio-Rad) and analyzed using ImageLab software (Bio-Rad).
Genomic DNA was purified from human skeletal muscle cells using DNeasy Blood & Tissue Kit (Qiagen). The PDK4-10 kb (hg38, chr7:95,608,432-95,609,655) and CXCL8-17kb (hg38, chr4:73,725,619-73,726,851) enhancers were PCR amplified (primers are listed in in Supplementary Data 12) from skeletal muscle DNA and ligated into the pGL4.23[luc2/minP] vector (Promega) using KpnI and NheI restriction enzymes for the PDK4-10kb and NheI and XhoI for CXCL8-17kb. All constructs were verified by Sanger sequencing. pGL4.23[luc2/minP] plasmids, either empty or containing enhancer fragments, were co-transfected with 1:100 pRL-SV40 vector (Promega) into skeletal muscle cells in 96-well plates using TransIT-X2 (Mirius) according to manufactures protocol (n = 5 independent transfections). After 24 h, cell media was changed and cells to be treated with palmitate or TNFα were added 0.5 mM palmitate or 10 ng/ml TNFα, respectively. After 48 h, luminescence was determined using the Dual-Glo Luciferase Assay System (Promega) and a Hidex Sense microplate reader. Firefly luciferase counts were normalized to Renilla luciferase counts.
siRNA mediated knockdown of Eif6
C2C12 myoblasts were reverse transfected by seeding cells together with transfection mix containing siScr (SIC001-10NMOL, Sigma-Aldrich) or siRNAs against Eif6 using siEif6#1 (SASI_Mm01_00034707: 5′-cucuggacuuuggcucauu-3′, Sigma-Aldrich) or siEif6#2 (SASI_Mm01_00034708: 5′-gucagagcgucguucgaga-3′, Sigma-Aldrich). Transfections were performed using TransIT-X2 (Mirius) according to manufactures protocol and cells were assayed 48 h after transfection (myoblasts) or 5 days after differentiation (myotubes).
Measurement of oxygen consumption rate (OCR)
Real-time measurements of OCR were performed using a Seahorse XFe96 Extracellular Flux Analyzer (Agilent Technologies). C2C12 myoblasts were reverse transfected by seeding 5000 cells per well in Seahorse XFe96 Cell Culture Microplates (Agilent Technologies) together with transfection mix containing siScr or siRNAs against Eif6 (siEif6#1, n = 6 biological replicates or siEif6#2, n = 8 biological replicates). Cells were assayed 48 h after transfection using the Seahorse XF Cell Mito Stress Test kit (Agilent Technologies). OCR was measured under basal conditions and after injection of final concentrations of 1 µM oligomycin, 2.3 µM FCCP, or 2.55 µM antimycin A combined with 1 µM rotenone. The measured OCR values were normalized to protein levels by lysing the cells and performing BCA protein assay (Pierce BCA Protein Assay Kit from Thermo Scientific).
Differentiated myotubes in 12-well plates were serum-starved for 4 h, washed with PBS then stimulated in the presence or absence of 10 nM insulin in 500 µl of KRP buffer (pH 7.3) for 20 min. 25 µl of 0.2 mM 2-deoxy-glucose, 10 µCi/ml [1,2-3 H (N)] 2-deoxy-glucose (Perkin Elmer) was added for the last 5 min of insulin stimulation and then cells were washed 3 times with cold PBS on ice and cells were lysed in 200 µl of phospho-protein lysis buffer. Radioactivity was determined by liquid scintillation counting after the addition of Ultima Gold LSC. Values were subtracted from background samples for each condition (cells treated with 2 µl of 10 mM Cytochalasin B during the insulin stimulation). Values were normalized to protein levels performing BCA protein assay (Pierce BCA Protein Assay Kit from Thermo Scientific).
Differentiated myotubes in 12-well plates were serum-starved for 4 h, followed by a 1 h incubation in KRP buffer (pH 7.3) containing 5 mM glucose, 2 µCi/ml Glucose, D-[U-14C] (Perkin Elmer), in the absence or presence of 100 nM insulin. Cells were washed 3 times in cold PBS, harvested in 200 µl of 1 M NaOH and heated to 70 °C for 15 min. Ten microliter was taken for the determination of protein (BCA) and to the remainder, 25 µl saturated Na2SO4, and 900 µl ice-cold ethanol was added, vortexed and frozen for 30 min at −80 °C, followed by a centrifugation step (10 min, 16,000×g, 4 °C). Pellets were resuspended in 100 µl H2O, followed by addition of 1 ml ice-cold ethanol and re-centrifugation. The final pellet was resuspended in 100 µl H2O and radioactivity was determined by liquid scintillation counting after the addition of Ultima Gold LSC. Values were normalized to protein levels performing BCA protein assay (Pierce BCA Protein Assay Kit from Thermo Scientific).
Total RNA was purified from human skeletal myotubes (control, palmitate or TNFα treated) using AllPrep DNA/RNA/miRNA Universal Kit (Qiagen). For quantification, total RNA was reverse-transcribed using iScript™ cDNA Synthesis Kit (Bio-Rad), according to the manufacturer’s instructions and analyzed by real-time PCR using Brilliant III Ultra-fast SYBR Green QPCR Master Mix (AH Diagnostic) and a C1000 Thermal cycler (Bio-Rad). mRNA primer sequences are listed in Supplementary Data 12.
One microgram of total RNA was depleted of rRNA and subsequently used to generate libraries using the TruSeq standard total RNA with Ribo-Zero Gold kit (Illumina). The PCR cycle number for each library amplification was optimized by running 10% of the library DNA in a real-time PCR reaction using Brilliant III Ultra-fast SYBR Green QPCR Master Mix (AH Diagnostic) on a C1000 Thermal cycler (Bio-Rad) (Supplementary Data 11). Libraries were sequencing on a NextSeq500 system (Illumina) using the NextSeq 500/550 High Output v2 kit (75 cycles). An overview of all RNA-seq experiments are given in Supplementary Data 11.
For bioinformatic analysis of RNA-seq data, reads were aligned to the hg38 GENCODE Comprehensive gene annotations62 version 27 using STAR v2.5.3a63. Read summation onto genes was performed by featureCounts v1.5.364. Differential expression testing was performed with edgeR v3.14.065 using a model of the form ~0 + group + block, where group was a factor containing information on both passage and treatment, and block encoded the two replicates. Differential expression was found by testing e.g., (P5_Palmitate + P6_Palmitate)/2 – (P5_Control + P6_Control)/2 using the quasi-likelihood tests in edgeR. GO enrichments were found using the camera function66, which takes both inter-gene correlations and the distribution of log fold changes in the data-set into consideration and is part of the edgeR package. Only gene ontologies containing between 10 and 500 genes were investigated. Initial visualization of samples was performed by multi-dimensional scaling (MDS) plots, which are similar to PCA plots but use average log fold changes of the 500 most divergent interactions.
Skeletal muscle myotubes were treated with palmitate or TNFα (n = 4 biological replicates using cells from two different passages), and cross-linked in 1% formaldehyde in PBS for 10 min at room temperature followed by quenching with glycine (final concentration of 0.125 M) to stop the cross-linking reaction. Cells were washed with PBS and harvested in 1 ml SDS Buffer (50 mM Tris-HCl (pH 8), 100 mM NaCl, 5 mM EDTA (pH 8.0), 0.2% NaN3, 0.5% SDS, 0.5 mM phenylmethylsulfonyl fluoride) and centrifuged for 6 min at 250 × g. The pelleted nuclei were lysed in 1.5 ml ice-cold IP Buffer (67 mM Tris-HCl (pH 8), 100 mM NaCl, 5 mM EDTA (pH 8.0), 0.2% NaN3, 0.33% SDS, 1,67% Triton X-100, 0.5 mM phenylmethylsulfonylfluoride) and sonicated (Diagenode, Biorupter) to an average length of 200–500 bp (between 15 and 20 cycles, high intensity). Before starting the ChIP experiment, chromatin was cleared by centrifugation for 30 min at 20,000 × g. For each ChIP, 2–10 μg DNA was combined with 2.5 μg antibody and incubated with rotation at 4 °C for 16 h. The following antibodies were used for ChIP: H3K27ac (Ab4729), H3K4me1 (Ab8895), H3K4me3 (CST-9751S), H3 (Ab1791). Immunoprecipitation was performed by incubation with Protein G Sepharose beads (GE healthcare) for 4 h followed by three washes with low-salt buffer (20 mM Tris-HCl (pH 8.0), 2 mM EDTA (pH 8.0), 1% Triton X-100, 0.1% SDS, 150 mM NaCl) and two washes with high-salt buffer (20 mM Tris-HCl (pH 8.0), 2 mM EDTA (pH 8.0), 1% Triton X-100, 0.1% SDS, 500 mM NaCl). Chromatin was de-cross-linked in 120 μl 1%SDS and 0.1 M NaHCO3 for 6 h at 65 °C, and DNA was subsequently purified using Qiagen MinElute PCR purification kit. For library preparation and sequencing, 3–10 ng of immunoprecipitated DNA was used to generate adapter-ligated DNA libraries using the NEBNext® Ultra DNA library kit for Illumina (New England Biolabs, E7370L) and indexed multiplex primers for Illumina sequencing (New England Biolabs, E7335). The PCR cycle number for each library amplification was optimized by running 10% of the library DNA in a real-time PCR reaction using Brilliant III Ultra-fast SYBR Green QPCR Master Mix (AH Diagnostic) and a C1000 Thermal cycler (Bio-Rad) (Supplementary Data 11). DNA libraries were sequenced on a HiSeq2000 by 50‐bp single‐end sequencing at the National High‐Throughput Sequencing Centre (University of Copenhagen, Denmark). An overview of all ChIP-seq experiments are given in Supplementary Data 11.
ChIP-qPCR validations were performed by ChIP followed by real-time PCR using Brilliant III Ultra-fast SYBR Green QPCR Master Mix (AH Diagnostic) and a C1000 Thermal cycler (Bio-Rad). All reactions were analyzed in quadruplicates. ChIP-qPCR primer sequences are listed in Supplementary Data 12.
For bioinformatic analysis of ChIP-seq data, sequenced reads were aligned using the sub-read aligner v1.5.067 against a full index of the main chromosomes of the hg38 reference genome, as genomic DNA and keeping only uniquely mapped reads. Duplicate reads were removed using Picard tools (http://broadinstitute.github.io/picard). Peaks were called using MACS2 v22.214.171.1245073168 with input control. H3K4me1 peaks were called as broad peaks, while H3K27ac peaks were called as narrow peaks. The quality of individual samples was assessed by testing whether fragment lengths could be estimated and whether more than 200,000 peaks could be called with a P-value cutoff of 0.05. These individual peak lists were only used to identify samples where the IP-step had failed and were not used in the downstream analysis. All samples passed these two tests. The consensus peak list used in the analysis was generated following the ENCODE 2012 IDR pipeline. For each histone modification a consensus peak set was generated as follows. All samples were pooled and the pooled reads were shuffled and split in two (pseudo replicates). Initial peak lists were called as above on each of these three samples (pool and two pseudo replicates), with a P-value cutoff of 0.05 and sorted by P-value. Finally, a consensus peak list was generated using the irreproducible discovery rate (IDR) software v2.0.269 with the pseudo replicate peak lists as input and the pooled peak list as oracle peak list. The IDR is analogous to an FDR, and has been shown to be a better measure of reproducibility in peak-calling experiments70. A lenient IDR threshold of 0.05 was used. For each sample, reads were summarized into consensus peaks using featureCounts v1.20.664. Differentially bound peaks were detected in edgeR v3.22.0 as described65, using reads along the entire peak and the same model and testing procedure as in the RNA-seq analysis. Peaks were considered overlapping if they overlapped by any amount.
H3K4me3 peaks from human skeletal muscle myotubes derived from skeletal muscle myoblasts were downloaded from Roadmap Epigenomics71 (sample E121), lifted to hg38 using the UCSC liftOver tool72 and filtered to keep peaks with a FDR < 0.05. Active promoters were defined as RefSeq gene73 promoters with a H3K4me3 peak within 3000 bp upstream or 1000 bp downstream of its TSS. Enhancers were defined as regions that contained a consensus peak of both H3K27ac and H3K4me1, as defined in the ChIP-seq, and was more than 3000 bp upstream or 1000 bp downstream of the TSS of an active promoter.
Promoter capture Hi-C
Two 15 cm plates of in vitro differentiated myotubes (n = 3 biological replicates using three different passages of cells) were treated with either palmitate or TNFα or left untreated as control. Promoter Capture Hi-C was performed using similar protocols as described in22,24. Cells were cross-linked in 2% formaldehyde for 10 min followed by quenching with glycine (final concentration of 0.125 M). After washing with PBS, the cells were centrifuged for 10 min at 400 × g and frozen at −80 °C until further analysis. Cells were lysed in 50 ml ice-cold lysis buffer (10 mM Tris-HCl pH 8, 10 mM NaCl, 0.2% Igepal CA-630 and protease inhibitor cocktail (Roche complete, EDTA-free)). After 30 min incubation on ice, nuclei were pelleted by centrifugation at 650 × g for 5 min. The pellet was resuspended in 1.25× NEBuffer 2 and added SDS (final concentration of 0.3%) followed by rotation at 37 °C for 1 h. Triton X-100 was added (final concentration of 1.7%) and the samples were incubated shaking at 37 °C for 1 h. After digesting with HindIII (NEB R0104T, 1500 units per 5 million cells starting material) at 37 °C overnight, restriction fragment overhangs were filled by Klenow (NEB) using biotin-14-dATP (Life Technologies), dCTP, dGTP and dTTP (all at a final concentration of 30 µM) and incubating for 60 min at 37 °C. Enzymes were deactivated by adding SDS (final concentration of 1.47%) and incubated shaking for 30 min at 65 °C. Ligation was performed using 50 units T4 DNA ligase (Invitrogen) per 5 million cells starting material in a total volume of 8.2 ml 1X ligation buffer (NEB B0202S) containing 100 µg/ml BSA (NEB) and 0.9% Triton X-100, and by incubating for four hours at 16 °C followed by 30 min at room temperature. Cross-links were reversed by incubation with Proteinase K at 65 °C overnight. After 16 h, additional Proteinase K was added and the samples were further incubated for 2 h at 65 °C. RNase A treatment was performed for 60 min at 37 °C, and DNA was purified by two sequential phenol-chloroform extractions. DNA concentration was measured using a Qubit Fluorometer and Qubit dsDNA HS Assay Kit (Life technologies). In order to remove biotin from non-ligated DNA ends, 40 μg DNA was incubated with T4 DNA polymerase in a buffer containing 1× NEBuffer 2, 0.1 mg/ml BSA, and 0.1 mM dATP for 4 h at 20 °C followed by phenol-chloroform extraction. DNA was sheared by sonication (Diagenode, Biorupter) to an average length of 400 bp (20 cycles, low intensity), followed by DNA end-repair by incubation with T4 DNA polymerase (NEB M0203L), T4 DNA Polynucleotide kinase (NEB M0201L), Klenow (NEB M0210L), and dNTP mix (0.25 mM) in 1X ligation buffer (NEB B0202S). After 30 min incubation at room temperature, DNA was purified using Qiagen MinElute PCR purification kit. For addition of dATP to the Hi-C libraries, DNA was incubated with Klenow exo- and 0.23 mM dATP in 1X NEBuffer 2 for 30 min at 37 °C. Enzymes were inactivated by incubation at 65 °C for 20 min. DNA fragments were size-selected by a double-sided SPRI bead purification (SPRI beads solution volume to sample volume to 0.6:1 followed by 0.9:1). Biotin-marked ligation products were isolated using MyOne Streptavidin C1 Dynabeads (LifeTechnologies). After washing the beads in tween buffer (5 mM Tris, 0.5 mM EDTA, 1 M NaCl, 0.05% Tween), binding of DNA was performed in binding buffer (5 mM Tris, 0.5 mM EDTA, 1 M NaCl) for 30 min at room temperature, followed by two washes in binding buffer, and one wash in ligation buffer (NEB B0202S). The beads were resuspended in ligation buffer and adapters (from SureSelect XT library prep kit ILM, Agilent Technologies) were ligated to the bead-bound DNA by the addition of T4 DNA ligase (NEB) and incubation for 2 h at room temperature. The beads were subsequently washed twice in tween buffer, once in binding buffer and twice in 1X NEBuffer 2 before resuspending the beads in 40 µl 1X NEBuffer 2. The bead-bound library DNA was amplified with 12–14 PCR amplification cycles according to the SureSelect XT library prep kit ILM (Agilent Technologies) protocol before promoter capture. Promoter capture was performed by using 37,608 biotin-labeled RNA baits (each 120 nucleotides) covering 21,841 human promoters (approximately two baits per promoter, targeting each end of a HindIII fragment22). The RNA baits were synthesized by Agilent Technologies and hybridization was performed using the Sure Select Target Enrichment kit ILM (Agilent Technologies) and SureSelect XT library prep kit ILM (Agilent Technologies) according to manufacturer’s instructions. DNA libraries were paired-end sequenced on a NextSeq500 system (Illumina) using the NextSeq 500/550 High-Output v2.5 Kit (150 cycles).
For bioinformatic analysis of Promoter Capture Hi-C data, di-tags reads were filtered and mapped against the main chromosomes of the hg38 reference genome by the HiCUP pipeline v0.6.174 using bowtie2 v2.2.675 without limits on maximum and minimum di-tag length. The HiCUP pipeline also removes PCR duplicate reads and filters out re-ligations and other experimental artefacts. Downstream analysis was performed with diffHic76 as follows: di-tags were filtered keeping only DNA fragments shorter than 600 bp, and with a minimum inwards and outwards facing gap distance of 1000 and 25,000, respectively, as recommended in the manual. All conditions and passages of cells were then pooled to obtain a general chromatin conformation capture of myotubes; all enhancers were widened to 10 kb and interactions between a promoter and a histone mark were extracted and filtered to remove weak interactions so that only interactions with an average mean of 5 counts per million, calculated by the aveLogCPM function, and with a signal at least two-fold above the expected were kept (as calculated by the filterTrended function—see the diffHic manual for code examples). The connectCounts function was used to count reads supporting interactions for each library, interactions with enough reads to test for differential binding were selected using the filterByExpr function of edgeR and differential binding was performed using the edgeR quasi-likelihood function as in the RNA-seq and ChIP-seq experiments but without the replicate blocking factor, resulting in a model of the form ~0+ group. These criteria and cut-offs were as described in the diffHic package manual. The set of interactions interrogated for differential interactions is the one used in downstream analysis and reported in the Supplementary tables. To visualize Promoter Capture Hi-C data as heatmaps, rotated plaid plots were generated by the rotPlaid function supplied by the diffHic package on the merged dataset. Each chromosome was split in 1000 bins, and colored by the amount of reads in the interaction. Any interaction with more than 20 reads was colored a solid red.
Overlapping enhancer regions with GWAS SNPs
GWAS studies for T2D6, BMI8, and WHR7 have identified 402, 941 and 463 distinct association signals, respectively. For IR we collected distinctive GWAS signals covering from studies of fasting insulin (FI) with and without adjustment for BMI34,36,37, HOMA-IR33, the modified Stumvoll Insulin Sensitivity Index (ISI)38, and 53 genomic variants associated with both higher FI levels adjusted for BMI, lower HDL cholesterol levels and higher triglyceride levels35, leading to a total of 82 distinct association signals with IR. Each of these four sets was expanded with SNPs in high LD (R2 > 0.8) with the original distinct association signals. Specifically, plink19 (http://www.cog-genomics.org/plink/1.9/)77 was used to extract high LD SNPs within a 1 MB range of each SNP based on a subset (6148 Danish individuals) of the HRC imputed dataset used in the T2D GWAS6. The variant positions were converted into genome build38 before overlapping them with palmitate and TNFα responsive enhancer regions.
The ADIGEN study participants79,80 were selected from the Danish draft boards records. The study was approved by the Ethics Committee from the Capital Region of Denmark and informed consent was obtained from all participants in accordance with the Declaration of Helsinki II. Juvenile obesity was defined as weight 45% above the Metropolitan desirable weight (BMI ≥ 31 kg/m2) at the draft board visit. 1930 obese individuals and 3601 randomly selected individuals for the population-representative control group were invited to participate in the study. In total 557 individuals volunteered to participate. From a subset of these Danish white men, 71 juvenile obese and 74 age-matched control individuals, skeletal muscle biopsies were taken under lidocaine local anesthesia from their right thigh using a thin Bergström needle and snap frozen in liquid nitrogen. The participants were healthy by self-report and under 65 years of age at the time of ADIGEN examination.
Gene expression analysis was performed by extracting total RNA using miRNeasy kit (Qiagen). The yield was optically measured and a randomly selected subset of the RNA samples were examined using an Experion electrophoresis station (BioRad) for integrity (RIN value), which was good in all cases. Gene expression of ~47,000 transcripts was measured by the HumanExpression HT-12 Chip (Illumina, USA). cRNA was synthesized from total RNA using the Nano Labeling Kit from Illumina (Epicentre), and the cRNA concentration was measured by Qubit fluorescent dye (Invitrogen, Germany) before loading the arrays. Hybridization was performed as recommended by Illumina and the Illumina HiScan was used to obtain the raw probe intensity level data. For failed expression arrays cRNA was resynthesized and rerun. The raw probe intensity values were exported from GenomeStudio without background correction and imported into R where the lumi package81 was used for pre-processing. The array pre-processing included; quantile normalization, log2 transformation and probe filtering to remove probes with a detected P-value above 0.01.
The participants were genotyped using the Illumina CoreExome Chip v1.0 containing 538,448 genetic variants of which more than 240,000 are common. Genotypes were called using the Genotyping module (version 1.9.4) of GenomeStudio software (version 2011.1, Illumina) and Illumina HumanCoreExome-12v1-0_B.egt cluster file. The genotype data were subjected to standard quality control and then phased with EAGLE282 and imputed with the 1000 Genomes Project Phase III panel using Minimac383. We selected 29 or 420 SNPs located within 11 or 124 enhancer regions, which changed activity by palmitate or TNFα treatment, respectively (see text for further description on how SNPs were selected). Only SNPs that were missing in less than 10% of the individuals, with an imputation quality (R2) higher than 0.4 and no significant deviation from Hardy-Weinberg equilibrium were extracted.
Matrix eQTL84 was used to assess the association between 461 (TNFα) and 39 (palmitate) gene-SNP pairs (selected based on our Promoter Capture Hi-C data) in a total of 140 individuals with both expression and SNP data available (R version 3.5.0). To account for complex non-genetic factors, we used probabilistic estimation of expression residuals (PEER)85. Specifically, eQTL analysis was performed on inverse normal-transformed expression residuals adjusted for age, BMI-group (obese or control) and 15 PEER factors which is the number of factors recommended by the GTEX consortium86 for studies with less than 150 individuals. The models were also run without the adjustment for BMI. Significant e-genes were identified after hierarchical multiple testing correction of the p-values from TNFα and palmitate eQTL tests using the Bonferroni-BH procedure recommended by Huang et al.87.
Correlation analysis in BXD mice strains
We selected and extracted the mean values of 48 metabolic phenotypes (Supplementary Data 7) that were measured across 42 and 37 BXD cohorts fed on CD and HFD, respectively39,40,41,88. Moreover, we extracted gene expression values of Cep68, Gab2, Lamb1, Macf1, Eif6, Pabpc4, Btbd1, Filip1l, Tcea3, Nrp1, Zhx3, Tbx15, and Tnfaip8 from skeletal muscle tissue (quadriceps) (GSE60151)89,90,91, adipose tissue (GN779; accessible on http://www.genenetwork.org/) and liver tissue (GSE60149)40,41,91 from the different BXD mice strains. Spearmans rank correlation analysis was performed to determine significant associations between phenotypes and gene expression. The p-values from the 48 correlations from each diet and tissue were adjusted using false discovery rate correction (FDR)92.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
For analysis of RNA-seq data, reads were aligned to the hg38 GENCODE Comprehensive gene annotations [https://www.gencodegenes.org/]. H3K4me3 ChIP-seq peaks from human skeletal muscle myotubes were downloaded from Roadmap Epigenomics (sample E121) [https://egg2.wustl.edu/roadmap/data/byFileType/peaks/consolidated/narrowPeak/E121-H3K4me3.narrowPeak.gz]. Gene expression values from BXD cohorts were downloaded from the GEO data base entry GSE60151 and GSE60149, or from http://www.genenetwork.org/ [http://gn1.genenetwork.org/webqtl/main.py?FormID=sharinginfo&GN_AccessionId=779]. All novel sequencing data have been deposited in the NCBI Gene Expression Omnibus (GEO) and are accessible through GEO SuperSeries accession number GSE126102. RNA-seq data from GSE126101 have been used to generate Fig. 1a–i, Fig. 2j, k, Fig. 3e–h, Fig. 6c, and Supplementary Fig. 2. ChIP-seq data from GSE126099 have been used to generate Fig. 2a–i, Fig. 3e–h, Fig. 6b, Supplementary Fig. 3, and Supplementary Fig. 4. Promoter Capture Hi-C data from GSE126100 have been used to generate Fig. 3a–h, Fig. 6a, Supplementary Fig. 6, and Supplementary Fig. 7. The source data underlying Figs. 1H, I, 2G–J, 6B–E, 6I–M and Supplementary Fig. 1B–E, 4A–B, 4D, 5, 8A–D, and 9A–F are provided as a Source Data file.
All custom computer codes used for sequencing data analysis and figure generation are available at https://github.com/lars-work-sund/Skeletal-muscle-enhancer-interactions-identify-genes-controlling-human-metabolism [https://github.com/lars-work-sund/Skeletal-muscle-enhancer-interactions-identify-genes-controlling-human-metabolism].
WHO. W. H. O. Obesity and overweight—fact sheet, http://www.who.int/mediacentre/factsheets/fs311/en/. (2016).
Zheng, Y., Ley, S. H. & Hu, F. B. Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat. Rev. Endocrinol. 14, 88–98 (2018).
Zurlo, F., Larson, K., Bogardus, C. & Ravussin, E. Skeletal muscle metabolism is a major determinant of resting energy expenditure. J. Clin. Invest. 86, 1423–1427 (1990).
DeFronzo, R. A. & Tripathy, D. Skeletal muscle insulin resistance is the primary defect in type 2 diabetes. Diabetes Care 32(Suppl 2), S157–S163 (2009).
Huang, T. & Hu, F. B. Gene-environment interactions and obesity: recent developments and future directions. BMC Med. Genomics 8(Suppl 1), S2 (2015).
Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505–1513 (2018).
Pulit, S. L. et al. Meta-analysis of genome-wide association studies for body fat distribution in 694,649 individuals of European ancestry. Hum. Mol. Genet. https://doi.org/10.1093/hmg/ddy327 (2018).
Yengo, L. et al. Meta-analysis of genome-wide association studies for height and body mass index in approximately 700000 individuals of European ancestry. Hum. Mol. Genet. 27, 3641–3649 (2018).
Maurano, M. T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science 337, 1190–1195 (2012).
Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43–49 (2011).
Trynka, G. et al. Chromatin marks identify critical cell types for fine mapping complex trait variants. Nat. Genet. 45, 124–130 (2013).
Corradin, O. & Scacheri, P. C. Enhancer variants: evaluating functions in common disease. Genome Med. 6, 85 (2014).
Heintzman, N. D. & Ren, B. Finding distal regulatory elements in the human genome. Curr. Opin. Genet. Dev. 19, 541–549 (2009).
Heintzman, N. D. et al. Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nat. Genet. 39, 311–318 (2007).
Koch, C. M. et al. The landscape of histone modifications across 1% of the human genome in five human cell lines. Genome Res. 17, 691–707 (2007).
Creyghton, M. P. et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl Acad. Sci. USA 107, 21931–21936 (2010).
Rada-Iglesias, A. et al. A unique chromatin signature uncovers early developmental enhancers in humans. Nature 470, 279–283 (2011).
Akhtar-Zaidi, B. et al. Epigenomic enhancer profiling defines a signature of colon cancer. Science 336, 736–739 (2012).
Pasquali, L. et al. Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants. Nat. Genet. 46, 136–143 (2014).
Stitzel, M. L. et al. Global epigenomic analysis of primary human pancreatic islets provides insights into type 2 diabetes susceptibility loci. Cell Metab. 12, 443–455 (2010).
Tuupanen, S. et al. Characterization of the colorectal cancer-associated enhancer MYC-335 at 8q24: the role of rs67491583. Cancer Genet. 205, 25–33 (2012).
Mifsud, B. et al. Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nat. Genet. 47, 598–606 (2015).
Novo, C. L. et al. Long-range enhancer interactions are prevalent in mouse embryonic stem cells and are reorganized upon pluripotent state transition. Cell Rep. 22, 2615–2627 (2018).
Schoenfelder, S. et al. The pluripotent regulatory circuitry connecting promoters to their long-range interacting elements. Genome Res. 25, 582–597 (2015).
Boden, G. Obesity and free fatty acids. Endocrinol. Metab. Clin. North Am. 37, 635–646 (2008).
Roden, M. et al. Mechanism of free fatty acid-induced insulin resistance in humans. J. Clin. Invest. 97, 2859–2865 (1996).
Hommelberg, P. P. et al. Palmitate-induced skeletal muscle insulin resistance does not require NF-kappaB activation. Cell Mol. Life Sci. 68, 1215–1225 (2011).
Uysal, K. T., Wiesbrock, S. M., Marino, M. W. & Hotamisligil, G. S. Protection from obesity-induced insulin resistance in mice lacking TNF-alpha function. Nature 389, 610–614 (1997).
de Alvaro, C., Teruel, T., Hernandez, R. & Lorenzo, M. Tumor necrosis factor alpha produces insulin resistance in skeletal muscle by activation of inhibitor kappaB kinase in a p38 MAPK-dependent manner. J. Biol. Chem. 279, 17070–17078 (2004).
Bonn, S. et al. Tissue-specific analysis of chromatin state identifies temporal signatures of enhancer activity during embryonic development. Nat. Genet. 44, 148–156 (2012).
Zhang, B. et al. A dynamic H3K27ac signature identifies VEGFA-stimulated endothelial enhancers and requires EP300 activity. Genome Res. 23, 917–927 (2013).
Jin, F. et al. A high-resolution map of the three-dimensional chromatin interactome in human cells. Nature 503, 290–294 (2013).
Dimas, A. S. et al. Impact of type 2 diabetes susceptibility variants on quantitative glycemic traits reveals mechanistic heterogeneity. Diabetes 63, 2158–2171 (2014).
Dupuis, J. et al. New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat. Genet. 42, 105–116 (2010).
Lotta, L. A. et al. Integrative genomic analysis implicates limited peripheral adipose storage capacity in the pathogenesis of human insulin resistance. Nat. Genet. 49, 17–26 (2017).
Manning, A. K. et al. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance. Nat. Genet. 44, 659–669 (2012).
Scott, R. A. et al. Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways. Nat. Genet. 44, 991–1005 (2012).
Walford, G. A. et al. Genome-wide association study of the modified stumvoll Insulin sensitivity Index Identifies BCL2 and FAM19A2 as novel Insulin sensitivity loci. Diabetes 65, 3200–3211 (2016).
Andreux, P. A. et al. Systems genetics of metabolism: the use of the BXD murine reference panel for multiscalar integration of traits. Cell 150, 1287–1299 (2012).
Williams, E. G. et al. Systems proteomics of liver mitochondria function. Science 352, aad0189 (2016).
Wu, Y. et al. Multilayered genetic and omics dissection of mitochondrial activity in a mouse reference population. Cell 158, 1415–1430 (2014).
Storz, P., Doppler, H., Wernig, A., Pfizenmaier, K. & Muller, G. Cross-talk mechanisms in the development of insulin resistance of skeletal muscle cells palmitate rather than tumour necrosis factor inhibits insulin-dependent protein kinase B (PKB)/Akt stimulation and glucose uptake. Eur. J. Biochem. 266, 17–25 (1999).
Hirabara, S. M., Curi, R. & Maechler, P. Saturated fatty acid-induced insulin resistance is associated with mitochondrial dysfunction in skeletal muscle cells. J. Cell Physiol. 222, 187–194 (2010).
Coll, T. et al. Palmitate-mediated downregulation of peroxisome proliferator-activated receptor-gamma coactivator 1alpha in skeletal muscle cells involves MEK1/2 and nuclear factor-kappaB activation. Diabetes 55, 2779–2787 (2006).
Salvado, L. et al. Oleate prevents saturated-fatty-acid-induced ER stress, inflammation and insulin resistance in skeletal muscle cells through an AMPK-dependent mechanism. Diabetologia 56, 1372–1382 (2013).
Jove, M. et al. Palmitate induces tumor necrosis factor-alpha expression in C2C12 skeletal muscle cells by a mechanism involving protein kinase C and nuclear factor-kappaB activation. Endocrinology 147, 552–561 (2006).
Hirabara, S. M. et al. Time-dependent effects of fatty acids on skeletal muscle metabolism. J. Cell Physiol. 210, 7–15 (2007).
Plomgaard, P. et al. Tumor necrosis factor-alpha induces skeletal muscle insulin resistance in healthy human subjects via inhibition of Akt substrate 160 phosphorylation. Diabetes 54, 2939–2945 (2005).
Bruce, C. R. & Dyck, D. J. Cytokine regulation of skeletal muscle fatty acid metabolism: effect of interleukin-6 and tumor necrosis factor-alpha. Am. J. Physiol. Endocrinol. Metab. 287, E616–E621 (2004).
Rubin, A. J. et al. Lineage-specific dynamic and pre-established enhancer-promoter contacts cooperate in terminal differentiation. Nat. Genet. 49, 1522–1528 (2017).
Siersbaek, R. et al. Dynamic rewiring of promoter-anchored chromatin loops during adipocyte differentiation. Mol. Cell 66, 420–435 (2017). e425.
Javierre, B. M. et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell 167, 1369–1384 (2016). e1319.
Greenwald, W. W. et al. Subtle changes in chromatin loop contact propensity are associated with differential gene regulation and expression. Nat. Commun. 10, 1054 (2019).
Kim, Y. H. et al. Rev-erbalpha dynamically modulates chromatin looping to control circadian gene transcription. Science 359, 1274–1277 (2018).
Lee, K. Y. et al. Tbx15 controls skeletal muscle fibre-type determination and muscle metabolism. Nat. Commun. 6, 8054 (2015).
Maffei, M. et al. Leptin levels in human and rodent: measurement of plasma leptin and ob RNA in obese and weight-reduced subjects. Nat. Med. 1, 1155–1161 (1995).
Farooqi, I. S. & O’Rahilly, S. Mutations in ligands and receptors of the leptin-melanocortin pathway that lead to obesity. Nat. Clin. Pr. Endocrinol. Metab. 4, 569–577 (2008).
Clarke, K. et al. The role of Eif6 in skeletal muscle homeostasis revealed by rndurance training co-expression networks. Cell Rep. 21, 1507–1520 (2017).
Ding, X. et al. Genome-wide screen of DNA methylation identifies novel markers in childhood obesity. Gene 566, 74–83 (2015).
Kraja, A. T. et al. Associations of mitochondrial and nuclear mitochondrial variants and genes with seven metabolic traits. Am. J. Hum. Genet. 104, 112–138 (2019).
Justice, A. E. et al. Protein-coding variants implicate novel genes related to lipid homeostasis contributing to body-fat distribution. Nat. Genet. 51, 452–469 (2019).
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Lun, A. T. & Smyth, G. K. De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly. Nucleic Acids Res. 42, e95 (2014).
Wu, D. & Smyth, G. K. Camera: a competitive gene set test accounting for inter-gene correlation. Nucleic Acids Res. 40, e133 (2012).
Liao, Y., Smyth, G. K. & Shi, W. The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Res. 41, e108 (2013).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Li, Q. H., Brown, J. B., Huang, H. Y. & Bickel, P. J. Measuring reproducibility of high-throughput experiments. Ann. Appl Stat. 5, 1752–1779 (2011).
Bailey, T. et al. Practical guidelines for the comprehensive analysis of ChIP-seq data. PLoS Comput Biol. 9, e1003326 (2013).
Bernstein, B. E. et al. The NIH roadmap Epigenomics Mapping Consortium. Nat. Biotechnol. 28, 1045–1048 (2010).
Hinrichs, A. S. et al. The UCSC genome browser database: update 2006. Nucleic Acids Res. 34, D590–D598 (2006).
O’Leary, N. A. et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733–D745 (2016).
Wingett, S. et al. HiCUP: pipeline for mapping and processing Hi-C data. F1000Res 4, 1310 (2015).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Lun, A. T. & Smyth, G. K. diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data. BMC Bioinforma. 16, 258 (2015).
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Pruim, R. J. et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337 (2010).
Buemann, B. et al. Lower-body fat mass as an independent marker of insulin sensitivity–the role of adiponectin. Int. J. Obes. (Lond.) 29, 624–631 (2005).
Larsen, L. H. et al. Mutation analysis of the preproghrelin gene: no association with obesity and type 2 diabetes. Clin. Biochem. 38, 420–424 (2005).
Du, P., Kibbe, W. A. & Lin, S. M. lumi: a pipeline for processing Illumina microarray. Bioinformatics 24, 1547–1548 (2008).
Loh, P. R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443–1448 (2016).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Shabalin, A. A. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28, 1353–1358 (2012).
Stegle, O., Parts, L., Piipari, M., Winn, J. & Durbin, R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat. Protoc. 7, 500–507 (2012).
Consortium, G. T. et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
Huang, Q. Q., Ritchie, S. C., Brozynska, M. & Inouye, M. Power, false discovery rate and Winner’s Curse in eQTL studies. Nucleic Acids Res. 46, e133 (2018).
Pirinen, E. et al. Pharmacological Inhibition of poly(ADP-ribose) polymerases improves fitness and mitochondrial function in skeletal muscle. Cell Metab. 19, 1034–1041 (2014).
Williams, E. G. et al. An evolutionarily conserved role for the aryl hydrocarbon receptor in the regulation of movement. PLoS Genet. 10, e1004673 (2014).
Ryu, D. et al. NAD+ repletion improves muscle function in muscular dystrophy and counters global PARylation. Sci. Transl. Med. 8, 361ra139 (2016).
Williams, E. G. et al. Quantifying and localizing the mitochondrial proteome across five tissues in a mouse population. Mol. Cell Proteom. 17, 1766–1777 (2018).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate—a practical and powerful approach to multiple testing. J. R. Stat. Soc. B Met. 57, 289–300 (1995).
We thank Professor Peter Fraser for providing us with details about the Promoter Capture Hi-C protocol. We thank Professor Thorkild IA Sørensen for providing us with post-hoc access to data from the ADIGEN study. The Novo Nordisk Foundation Center for Basic Metabolic Research (http://www.metabol.ku.dk) is an independent research Center at the University of Copenhagen, partially funded by an unrestricted donation from the Novo Nordisk Foundation. This work was supported by an individual postdoc grant from the Independent Research Fund Denmark (DFF).
The authors declare no competing interests.
Peer review information Nature Communications thanks Sander Kersten and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Williams, K., Ingerslev, L.R., Bork-Jensen, J. et al. Skeletal muscle enhancer interactions identify genes controlling whole-body metabolism. Nat Commun 11, 2695 (2020). https://doi.org/10.1038/s41467-020-16537-6
The Journal of Physiology (2020)