Promoter interactome of human embryonic stem cell-derived cardiomyocytes connects GWAS regions to cardiac gene networks

Choy, Mun-Kit; Javierre, Biola M.; Williams, Simon G.; Baross, Stephanie L.; Liu, Yingjuan; Wingett, Steven W.; Akbarov, Artur; Wallace, Chris; Freire-Pritchett, Paula; Rugg-Gunn, Peter J.; Spivakov, Mikhail; Fraser, Peter; Keavney, Bernard D.

doi:10.1038/s41467-018-04931-0

Download PDF

Article
Open access
Published: 28 June 2018

Promoter interactome of human embryonic stem cell-derived cardiomyocytes connects GWAS regions to cardiac gene networks

Nature Communications volume 9, Article number: 2526 (2018) Cite this article

6297 Accesses
36 Citations
24 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 12 November 2018

This article has been updated

Abstract

Long-range chromosomal interactions bring distal regulatory elements and promoters together to regulate gene expression in biological processes. By performing promoter capture Hi-C (PCHi-C) on human embryonic stem cell-derived cardiomyocytes (hESC-CMs), we show that such promoter interactions are a key mechanism by which enhancers contact their target genes after hESC-CM differentiation from hESCs. We also show that the promoter interactome of hESC-CMs is associated with expression quantitative trait loci (eQTLs) in cardiac left ventricular tissue; captures the dynamic process of genome reorganisation after hESC-CM differentiation; overlaps genome-wide association study (GWAS) regions associated with heart rate; and identifies new candidate genes in such regions. These findings indicate that regulatory elements in hESC-CMs identified by our approach control gene expression involved in ventricular conduction and rhythm of the heart. The study of promoter interactions in other hESC-derived cell types may be of utility in functional investigation of GWAS-associated regions.

A compendium of promoter-centered long-range chromatin interactions in the human genome

Article 09 September 2019

Inkyung Jung, Anthony Schmitt, … Bing Ren

Fine mapping spatiotemporal mechanisms of genetic variants underlying cardiac traits and disease

Article Open access 28 February 2023

Matteo D’Antonio, Jennifer P. Nguyen, … Kelly A. Frazer

A reference map of murine cardiac transcription factor chromatin occupancy identifies dynamic and conserved enhancers

Article Open access 28 October 2019

Brynn N. Akerberg, Fei Gu, … William T. Pu

Introduction

Long-range chromosomal interactions play an important role in differentiating cells, where they organise three-dimensional promoter-enhancer networks that regulate lineage-specifying developmental genes^1,2. Distal promoter contacts are highly cell-type specific³ and form networks of co-regulated genes correlated with their biological functions¹. The cis-regulatory contact upon lineage commitment is a dynamic process that includes acquisition and loss of specific promoter interactions⁴. Cardiomyocytes differentiated from human embryonic stem cells (hESC-CMs) have become a popular model to elucidate functional genomics of cardiac disease; however, there is little information supporting the notion that genomic regions associated with adult phenotypes (such as heart rate) play important roles in hESC-CMs, which have an immature phenotype. More generally, it is essential, in the context of potential cardiac regenerative medicine approaches, to understand how promoters make specific interactions in hESC-CMs to regulate the gene expression patterns underlying the process of cardiomyocyte development. In this study we identify the promoter interactome of hESC-CMs using promoter capture Hi-C (PCHi-C)⁵, which allows physical promoter-enhancer contacts in cells to be identified. We demonstrate that the promoter interactome of cardiac genes is reorganised after cardiac differentiation and cardiac enhancers are enriched in the promoter-interacting regions (PIRs). The PIRs also share the same target genes with expression quantitative trait loci (eQTLs) of cardiac left ventricles and overlap genetic variants associated with conduction and rhythm of the heart.

Results

PCHi-C identifies promoter interactions in hESC-CMs

In order to elucidate the spatial promoter-enhancer organisation after the differentiation process of cardiomyocytes from hESCs, we used the PCHi-C approach⁵ on three biological replicates of 20 million hESC-CMs. From the three biological replicates, we mapped and obtained 774 million valid, unique and on-target captured read pairs (63.3% capture efficiency) with Hi-C User Pipeline (HiCUP)⁶, and called 179,880 significant hESC-CM promoter–genome interactions (Supplementary Data 1) with Capture Hi-C Analysis of Genomic Organisation (CHiCAGO)⁷. Of the promoter interactions, 23.7% were common in all three biological replicates (Supplementary Fig. 1). We detected interactions between 18,159 unique promoters and 107,145 unique hESC-CM promoter-interacting regions (cPIRs) at an average degree of one promoter to 35.1 cPIRs (median = 21; maximum = 433; minimum = 1). In turn, each cPIR interacted with an average of 2.9 promoters (median = 2; maximum = 57; minimum = 1). Also, 60.5% of the interactions occurred within previously published topologically associating domains of hESCs⁸. PIRs are known to be associated with the presence of histone modifications that are hallmarks of functional activities^1,3,4,5,7. We identified the locations of hESC-CM histone marks, H3K4me3, H3K27me3 and H3K36me3, by analysing published data of histone chromatin immunoprecipitation-sequencing (ChIP-seq) in hESC-CMs⁹ using Model-based Analysis of ChIP-Seq (MACS)¹⁰. We also determined whether the genes corresponding to the promoters were transcriptionally active in hESC-CMs by analysing published RNA-seq data in hESC-CMs¹¹. The active histone marks of H3K4me3 and H3K36me3 were significantly associated with transcriptionally active promoters involved in hESC-CM interactions and the repressive histone mark of H3K27me3 with the inactive ones (Fig. 1). Consistent with the reported observations that the transcriptional status of a promoter can be reflected in the histone marks of its interacting partners^4,5, H3K4me3 and H3K36me3 were also significantly enriched in the cPIRs interacting with active promoters, while H3K27me3 was enriched in those interacting with inactive ones (Fig. 1). Thus, our approach identified functionally significant promoter interactions during cardiomyocyte development.

Cardiac VISTA enhancers are specifically enriched in cPIRs

In order to further verify the functional roles of the cPIRs in cardiac cells, we examined whether the cPIRs intersect with enhancers that are conserved and experimentally verified in the murine heart listed in the VISTA Enhancer Browser¹². cPIRs were found to overlap with cardiac VISTA enhancers highly significantly when compared to the random background (Bonferroni-adjusted p = 0.006; Fig. 2a; Supplementary Data 2). The two non-cardiac control sets of enhancers, active in branchial arch and limb respectively, showed no significant overlap (Bonferroni-adjusted p > 0.05; Fig. 2a). By contrast, PIRs detected in hESCs significantly overlapped with VISTA enhancers of all three tested developmental tissues, reflecting the pluripotency of these cells (p < 0.05; Fig. 2a). The target genes of the VISTA enhancers have previously been predicted based on the enhancer-gene linear proximity alone. Using the promoter interactomic data, we mapped the VISTA enhancers to their interacting gene promoters (Supplementary Data 2). We functionally confirmed a promoter–cPIR/VISTA enhancer interaction for VISTA enhancer mm172. This enhancer is conserved, cardiac specific in mice (Fig. 2b) and involved in heart development^13,14. It has been predicted by VISTA to interact with the promoter of either Ednra or Ttc29 based on proximity. Our promoter interactomic data showed an interaction between the human orthologue of mm172 and the promoter of EDNRA (Fig. 2c), a gene that is important for myocardium formation¹⁵. We deleted the rat orthologue of mm172 in the rat cardiac myoblast cell line H9c2 (2-1) using CRISPR-Cas9 technology and showed that the gene expression of Ednra was up-regulated, indicating that mm172 can function as a repressive regulatory element for Ednra (Fig. 2d; Supplementary Figs. 2 and 3).

hESC-CM promoter interactome predicts target genes of eQTLs

Since overlap with eQTLs can provide evidence for PIR regulatory functions³, we also investigated if the promoter interactome predicts the target genes of eQTLs in two cardiac tissues, the left ventricle and atrial appendage, using publicly available data from the Genotype-Tissue Expression (GTEx) Portal¹⁶. Where PIRs in our data overlapped a GTEx eQTL, we ascertained, separately for cPIRs and hESC PIRs, whether the target gene of the eQTL was the same as the target gene of the PIR, and expressed the agreement as a percentage of all eQTL-overlapping PIR–gene pairs. We tested whether there was evidence for enhanced agreement within the cPIRs or hESC PIRs compared with the random background. Previous data indicate that higher Hi-C read counts provide more robust support for interactions between distal regulatory elements and target genes¹⁷, and therefore we explored the effect of different read count cut-offs on the agreement between eQTL–gene and PIR–gene pairs. As expected, the agreement percentage increased with sequencing read counts of promoter interactions (Fig. 3). The highest significance was achieved with the top 20% hESC-CM promoter interactions and left ventricular eQTLs (Bonferroni-adjusted p < 0.05), with a similar trend observed in the atrial appendage (Fig. 3). The higher significance of agreement between target genes of cPIRs and eQTLs in the left ventricle is consistent with the finding that hESC-CMs typically mature into ventricular-like cardiomyocytes¹⁸.

Of the 1264 top 20% PIR–gene interactions overlapping with lead left ventricular eQTLs (in strong linkage disequilibrium blocks (LD; r² > 0.8)), 16.5% of them target the same genes as the eQTLs. eQTLs are linked to trait-associated single-nucleotide polymorphisms (SNPs)¹⁹ and therefore are important information for functional elucidation of genetic findings. Given that the top 20% cPIRs for read counts showed the most significant agreement of target genes with ventricular eQTLs, we reasoned that these top 20% of promoter interactions represented the most relevant subset mediating eQTL–gene interactions.

Cardiac promoter networks reorganise upon differentiation

Within the 35,960 top 20% hESC-CM promoter interactions, 6376 unique promoters interacted with 27,123 unique cPIRs at an average degree of one promoter to 13.7 cPIRs (median = 10; maximum = 74; minimum = 1). However most cPIRs interacted with only one promoter (average = 1.8; median = 1; maximum = 11; minimum = 1). Also, 89.9% of the interactions occurred within previously published topologically associating domains of hESCs⁸. The target genes of the top 20% hESC-CM promoter interactions included key regulators of cardiovascular development and function (Supplementary Table 1). The top 20% hESC-CM promoter interactions were found to constitute 3547 subnetworks with connected nodes using Cytoscape²⁰. The subnetwork with the highest average degree (5.85) had 40 nodes, including 12 promoter nodes and 28 cPIR nodes, and 117 edges (Fig. 4a; Supplementary Data 3). The 12 promoter nodes represent 17 genes in the proximity of MYH6 (most connected) and MYH7 genes (Fig. 4a) and the MYH6-MIR208A-MYH7-MIR208B network is known as myomiR network that regulates myosin heavy chains in the heart during hypertrophic responses²¹. Given the paucity of promoter interactions in MYH6 region observed in hESCs (~5 times less; Fig. 4b), the highly connected cluster of interactions in this locus observed in hESC-CMs has likely emerged through dynamic reorganisation⁴ of chromosomal interactions upon differentiation. MYH6 and MYH7 are key genes in cardiac development and function. Mutations in MYH6 cause congenital heart diseases (CHDs), in particular defects in the atrial and ventricular septum, via dysfunction of cardiac myofibrils²². MYH7 mutations are a major cause of hypertrophic cardiomyopathy, and have also been reported in CHD families²². The identification of promoter interaction networks in hESC-CMs involving key cardiac genes, such as MYH6 and MYH7 in our data, illustrates the potential of this approach to map the relationships between genes and regulatory elements in genomic regions of cardiac importance.

hESC-CM promoter interactions are associated with heart rate

Promoter interactions detected in trait-relevant tissues hold the clues to the putative target genes of non-coding genome-wide association study (GWAS) SNPs²³. We therefore explored if the hESC-CM promoter interactome was associated with any of the three cardiovascular phenotype groups by considering their GWAS profiles: (1) CHD^24,25; (2) coronary artery disease (CAD; http://www.cardiogramplusc4d.org)²⁶; and (3) cardiac conduction and rhythm disorders (CRD)²⁷. Our prior hypothesis was that we would find an association between the hESC-CM promoter interactome and CRD, which is a phenotypic group largely deriving from dysfunction that involves myocardial cells, but not for CHD or CAD. For each GWAS, we obtained the p values of SNPs located in cPIRs or hESC PIRs and constructed QQ plots against averaged p values of SNPs located in 1000 sets of randomised cPIRs or hESC PIRs (Fig. 5a). For SNPs located in cPIRs (cPIR SNPs), the highest inflation was observed in GWAS p values of CRD (inflation factor, λ = 1.237). On the other hand, SNPs located in hESC PIRs showed the highest inflation in GWAS p values of CAD (inflation factor, λ = 1.250). We then investigated whether all the inflated SNPs (p < 0.001) from each of these GWAS studies overlapped significantly with the top 20% cPIRs or hESC PIRs. Using GoShifter (https://github.com/immunogenomics/goshifter)²⁸, we observed a significant overlap for only CRD GWAS SNPs (Bonferroni-adjusted p = 0.045). This indicated that cPIRs are enriched for regulatory regions controlling genes involved in cardiac rhythm, a phenotype in which cardiomyocytes directly participate.

Some of the significant cPIR SNPs (at the suggestive threshold of p < 1 × 10⁻⁵) associated with CRD were interacting with promoters of known heart rate-associated genes²⁷ such as CCDC141 on chromosome 2, GJA1 on chromosome 6 and MYH6 on chromosome 14 (Fig. 5b; Supplementary Data 4). With the promoter interactomic data, the interactions of variants within a GWAS signal can be further identified, for example, the complex MYH6 interaction network in MYH6 region and CCDC141-SESTD1 interactions in CCDC141 region. Interacting partners of SESTD1 such as TRPC (transient receptor potential-canonical) channels are important regulators of the calcium-dependent hypertrophy pathway^29,30. The significant cPIR SNPs also coincided with published regions of TFPI on chromosome 2, SLC35F1 on chromosome 6 and HCN4 on chromosome 15 but, upon integration of the chromosomal interaction information, those regions were found interacting with ZSWIM2, PLN and NPTN respectively (Fig. 5b; Supplementary Data 4). PLN is a highly credible candidate gene because PLN is a crucial regulator of cardiac contractility³¹. Neuroplastin (NPTN) forms a complex with plasma membrane Ca²⁺ ATPases to modulate calcium homoeostasis^32,33. The Np55 variant of NPTN is expressed in the heart³⁴ and the disruption of NPTN has been shown to cause increased heart weight in the International Mouse Phenotyping Consortium database (IMPC; https://www.mousephenotype.org/)³⁵. ZSWIM2 is an E3 ubiquitin-protein ligase, a family of proteins that may play a role in degrading mal-processed ion channel proteins in the heart³⁶. It has been reported that genes some distance from a strong candidate locus harbouring a GWAS “hit” SNP may have an important influence on complex phenotypes³⁷. For example, polymorphism in the FTO gene are the strongest GWAS signals for obesity, but the functional mechanism of the association appears chiefly due to the effects of the SNPs on two neighbouring genes, IRX3 and RPGRIP1L³⁷. Therefore, promoter interactions of the neighbouring genes within a GWAS signal may provide crucial information to pinpoint the true target genes.

In addition, we identified a peak in chromosome 19 with significant cPIR SNPs interacting with ACTN4–CAPN12 (Figs. 5b and 6a; Supplementary Data 4). This peak did not reach conventional significance (p < 5 × 10⁻⁸) in the GWAS study²⁷. However, by incorporating eQTL data from the GTEx Portal¹⁶, the significant cPIR SNPs in chromosome 19 in our data were shown to be significant eQTLs for ACTN4 and CAPN12 in the left ventricle of human hearts (Fig. 6b). Functionally, calpains may play a role in atrial fibrillation³⁸ and ACTN4 may promote muscular differentiation as a transcriptional regulator³⁹. One candidate explanation for “missing heritability” in genetic studies of complex traits is that many biologically significant (and potentially druggable) genetic effects are of insufficient size to be detected at conventional significant levels even in large GWAS studies⁴⁰. By focusing on interactions in regions of borderline significant genetic associations, we were able to identify potentially physiologically relevant target genes of GWAS SNPs through functional rather than proximity considerations. This may, in the future, facilitate the identification of new genes influencing cardiac conduction and rhythm disorders.

Discussion

The hESC-CM interactomic map of gene promoters generated in this study not only enabled us to understand the promoter–genome networks of established cardiac genes but also those of a number of novel genes possibly playing a role in cardiac development and function. We mapped known conserved cardiac-specific VISTA enhancers to their target genes using the promoter interactome. We validated one of the VISTA enhancers involved in cardiac development, mm172, by deleting it using CRISPR-Cas9 and showed that it indeed was a regulatory element of its interacting gene EDNRA, which is a critical gene in chamber myocardium formation during heart development¹⁵. Somewhat unexpectedly, we observed that deleting this enhancer resulted in an increase of EDNRA expression, consistent with a number of repressive lineage-specifying promoter interactions detected in the pluripotent state² and in early hESC-derived neural progenitors⁴. There remains a possibility that silencers of the same gene could be disrupted as well when a cPIR/enhancer was targeted by CRISPR-Cas9 since enhancers and silencers can be located in the same locus or clustered in the same locus control region⁴¹. Further work will be required to elucidate this possibility.

The GWAS SNPs associated with cardiac conduction and rhythm disorders were enriched in cPIRs. While the target genes of GWAS SNPs are usually identified by gene proximity, it is known that non-coding variations can influence genes situated at substantial distance, rendering selection of a candidate in a region problematic. Using promoter interactions in hESC-CMs, we were able to identify distal target genes of GWAS SNPs for a phenotype relevant to the specific cell type and new genomic regions associated with cardiac conduction and rhythm disorders. The hESC-CM promoter interactome, rather than the hESC promoter interactome, also had a significantly higher likelihood, compared with the random background, mapping to the target genes of the left ventricular eQTLs located within the cPIRs. Taken together with the GWAS findings, it suggests that hESC-CMs are a relevant model to study ventricular conduction and rhythm. Recently, an “omnigenic” model of complex disease involving few “core” genes and many “non-core” genes influencing a particular phenotype has been proposed⁴²; the most complex interactomic subnetwork of cardiomyocytes in our data were represented prominently among GWAS “hits” for the heart rate, which would be in keeping with this model. Moreover, our integration of interactomics with GWAS data suggest that similar approaches to that we present may prove particularly useful to identify “core” genes. This approach may be extended to other hESC-derived cell types to functionally elucidate GWAS regions.

Methods

The hESC-CM differentiation

The hESC line WA09 (H9) was obtained from WiCell, maintained and cultured in monolayers according to WiCell Feeder-Independent Pluripotent Stem Cell Protocols using mTeSR1 Medium (SOP-SH-002). The hESC-CM differentiation was performed on WA09 hESC monolayers (passage 32) at 80–100% confluence using PSC Cardiomyocyte Differentiation Kit (Gibco; A2921201). After the treatments of Cardiomyocyte Differentiation Medium A and B (2 days each), the cells were allowed to recover in Cardiomyocyte Maintenance Medium for a day. The cells were then passaged (1:1.5 ratio) into plates coated with fibronectin (12.5 µg/ml in 0.02% gelatin; Sigma) and allowed to grow for 6 days in RPMI1640-B27 minus insulin (Gibco) before changing to RPMI1640-B27 complete (Gibco) for further 2 days. The cardiomyocytes were then enriched in glucose-depleted-lactate-enriched medium (Dulbecco’s modified Eagle's medium (DMEM) without glucose (Gibco; 11966025); 4 mM lactate (Sigma); non-essential amino acids (Gibco); GlutaMAX (Gibco)) for 6 days and recovered in Iscove’s modified Dulbecco’s medium (Gibco) containing 20% foetal bovine serum (FBS) for 2 days. The hESC-CMs were cultured in RPMI1640-B27 complete (Gibco) for 7 more days before the quality check (at least 80–90% purity by flow cytometry and quantitative polymerase chain reaction (qPCR) using myosin heavy chain and TNNT2 as markers) and the fixation for PCHi-C.

PCHi-C

PCHi-C was performed as previously described³ and summarised as follows: hESC-CMs were fixed by 2% formaldehyde (Agar Scientific; R1026) for 10 min at room temperature, quenched by cold 0.125 M glycine (5 min at room temperature followed by 15 min on ice), flash frozen in liquid nitrogen and stored at − 80 °C. Hi-C libraries were prepared with in-nucleus ligation and PCHi-C was performed with Agilent SureSelect Target Enrichment System. Then, 22,076 HindIII fragments containing a total of 31,253 annotated promoters for 18,202 protein-coding and 10,929 non-protein genes according to Ensembl v.75 (http://grch37.ensembl.org) were captured. A post-capture PCR amplification step was carried out using PE PCR 1.0 and 2.0 primers with four PCR amplification cycles. High-throughput sequencing reactions were performed on the Illumina HiSeq2500 platform. Sequencing reads were processed and mapped with HiCUP⁶ and PCHi-C interaction was called using CHiCAGO⁷ with default parameters.

Data processing and statistical analyses

After calling the significant PCHi-C interactions with CHiCAGO⁷, the biologically replicated read counts of the promoter interactions in hESC-CMs (three biological replicates) and those previously obtained from WA09 hESCs (two biological replicates; Gene Expression Omnibus (GEO); GSE86821; in-solution ligation)⁴ were normalised using DESeq2 (https://bioconductor.org/packages/release/bioc/html/DESeq2.html; local dispersion fit)⁴³ and averaged. The peak calling for published H3K4me3, H3K27me3 and H3K36me3 histone ChIP-seq (GEO; GSE35583) of hESC(WA07[H7])-CM (14 days; one biological replicate each)⁹ was performed using MACS 2.1.0 (peak mode for H3K4me3, and block mode for H3K27me3 and H3K36me3; q < 0.05; mfold = 10, 30; band width = 300)¹⁰. The RPKM (reads per kilobase per million mapped reads) for each gene was calculated from published RNA-seq data (GEO; GSE69618) of hESC(WA01[H1])-CMs (10 days; four biological replicates)¹¹. For significant tests of enrichment/agreement percentages, randomly generated promoter–PIR interactions were created to retain the distribution of interaction distances of the observed promoter–PIRs in each set. For each promoter bait, the relative positions of its PIRs were calculated and randomly transplanted to another randomly selected promoter bait. For each observed set, 1000 random sets were generated with each set containing the same number of interactions as the observed set, and a one-tailed permutation test was performed. Each random PIR set was then assessed for its enrichment percentage of VISTA enhancers or agreement percentage of eQTLs in the same way as the observed data and used for comparisons. For the eQTL analysis, single-tissue cis-eQTL data were downloaded from GTex (V6p; https://gtexportal.org/home/)¹⁶. For each tissue, lead SNP-gene pairs were selected based on them having the lowest p values within each LD block calculated using the Ensembl API (1000 genomes phase 3; EUR population; r² > 0.8; window size = 500 kb)⁴⁴. To assess the agreement percentage of eQTL and PIR target genes for each set of hESC-CM or hESC promoter interactions, the gene promoters that each PIR interacts with were compared to the target genes of eQTLs that were found to be located within the PIR region. The agreement percentage is then expressed as the percentage of PIRs containing at least one eQTL targeting the same gene as the PIR among all eQTL-overlapping PIR–gene pairs. GTex eQTLs were limited to those SNP-gene pairs occurring within one megabase distance and therefore promoter–PIR interactions occurring more than one megabase distance were filtered out for the eQTL agreement assessment⁴⁵. For GWAS/QQ plot analyses, the overlap of PIRs with GWAS SNPs was determined for each set of observed hESC-CM and hESC promoter interactions. The same was done for each of the 1000 randomly generated interaction sets for each observed set. The p values of the observed overlapping SNPs and those overlapping the randomly generated PIRs were compared, first by interpolating the lists of p values to the shorted list (observed or random) and then taking the mean of each rank-ordered p value list across the 1000 permutations based on QQperm package (https://cran.r-project.org/web/packages/QQperm/index.html). The inflation factor (lambda, λ) of observed/random QQ plots were estimated using QQperm (estlambda2) as well. All sequenced reads were mapped to the human reference genome (build hg19/GRCh37).

CRISPR-Cas9 and lentiviral transduction

The guide RNA (gRNA) sequences (gRNA1-GGA CCA TAA CTC AAG CAG GGC AGG; gRNA2-GGC TGT GAC TTT GAT GGG CCA AGG) used to delete VISTA enhancer mm172 in rat cardiomyoblast H9c2 2-1 (Sigma; cultured in DMEM (Gibco) supplemented with 10% FBS and 1× penicillin–streptomycin at 37 ˚C and 5% CO₂) were cloned into plentiCRISPRv2 (Addgene; plasmid 52961) according to published paired gRNA design⁴⁶ using human U6 promoter for gRNA1 and murine one for gRNA2. The cloned plentiCRISPRv2 and second-generation packaging plasmids psPAX2 (Addgene; plasmid 12260) and pMD2.G (Addgene; plasmid 12259) were co-transfected into HEK293T cells (cultured in DMEM (Gibco) supplemented with 10% FBS and 1× penicillin–streptomycin at 37 ˚C and 5% CO₂) in the proportion of 4:3:2 using Lipofectamine 2000 (Invitrogen) for lentivirus production. Lentivirus-containing media were collected 48 h after transfection, centrifuged to remove cell debris and 0.45 µm filtered prior to infecting H9c2 cell monolayers in the presence of 10 µg/ml polybrene (Millipore). At 24 h post infection, the transduced H9c2 cells were selected in 1 µg/ml puromycin media for 48 h.

Polymerase chain reaction

Genomic DNA (gDNA) was extracted from cells using PureLink Genomic DNA Mini Kit (Invitrogen) and total RNA using RNAqueous-Micro Kit (Ambion). First-strand complementary DNA (cDNA) was synthesised from DNase (Promega) treated total RNA using M-MLV Reverse Transcriptase (Promega) and both random hexamer and oligo(dT) primers (Promega). PCR was performed using OneTaq 2×Master Mix (New England BioLabs) and the following primers: (1) mm172 (forward TGATTGGTAGAGGAGAGCGG; reverse CAAGGCTGCATCACTCTGAC; product length 2766 bp (gDNA)); (2) Ednra (forward AGCCTCTCTCTGATCCAACG; reverse CCATAGAACTGCACGGAAGC; product length 59804 bp ((gDNA) or 1892 bp (cDNA)); (3) Actb (forward CTATGTTGCCCTAGACTTCG; reverse AGGTCTTTACGGATGTCAAC; product length 318 bp (gDNA) or 228 bp (cDNA)); and (4) plentiCRISPRv2 (forward CGCAAATGGGCGGTAGGCGTG; reverse AATCATGGGAAATAGGCCCTC; product length 1857 bp (gDNA/plasmid DNA)).

Immunoblot analysis

Whole-cell lysates were prepared by resuspending cells in RIPA buffer containing 5× cOmplete Mini Protease Inhibitor Cocktail (Sigma) and homogenised with a 19-gauge syringe. Equal amounts of protein samples were boiled in Laemmli buffer, resolved by a Mini-PROTEAN TGX Precast Gel (Bio-Rad), transferred to a nitrocellulose membrane by a Trans-Blot Turbo Transfer System (Bio-Rad), incubated with primary antibodies against EDNRA (Abcam; ab117521; 1:2500 dilution; 48 kDa and glycosylated EDNRA 60–90 kDa⁴⁷) and ACTB (Cell Signalling; #8457; 1:1000 dilution; 45 kDa), and subsequently a goat anti-rabbit horseradish peroxidase secondary antibody (Pierce; 1:1000 dilution). The membrane was incubated with SuperSignal West Femto Maximum Sensitivity Substrate (Thermo Scientific) before being imaged with a Molecular Imager ChemiDo XRS+System (Bio-Rad).

Data availability

The PCHi-C data for hESC-CMs have been deposited in Gene Expression Omnibus (GEO) with accession number GSE100720 and Capture HiC Plotter (CHiCP; https://www.chicp.org/).

Change history

12 November 2018
In the original version of the Article, the gene symbol for tissue factor pathway inhibitor was inadvertently given as ‘TFP1’ instead of ‘TFPI’. This has now been corrected in both the PDF and HTML versions of the Article.

References

Schoenfelder, S. et al. The pluripotent regulatory circuitry connecting promoters to their long-range interacting elements. Genome Res. 25, 582–597 (2015).
Article CAS Google Scholar
Schoenfelder, S. et al. Polycomb repressive complex PRC1 spatially constrains the mouse embryonic stem cell genome. Nat. Genet. 47, 1179–1186 (2015).
Article CAS Google Scholar
Javierre, B. M. et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell 167, 1369–1384.e19 (2016).
Article CAS Google Scholar
Freire-Pritchett, P. et al. Global reorganisation of cis-regulatory units upon lineage commitment of human embryonic stem cells. Elife 6, e21926 (2017).
Article Google Scholar
Mifsud, B. et al. Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nat. Genet. 47, 598–606 (2015).
Article CAS Google Scholar
Wingett, S. et al. HiCUP: pipeline for mapping and processing Hi-C data [version 1; referees: 2 approved, 1 approved with reservations]. F1000Research 4, 1310 (2015).
Article Google Scholar
Cairns, J. et al. CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data. Genome Biol 17, 127 (2016).
Article Google Scholar
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Article ADS CAS Google Scholar
Paige, S. L. et al. A temporal chromatin signature in human embryonic stem cells identifies regulators of cardiac development. Cell 151, 221–232 (2012).
Article CAS Google Scholar
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Article Google Scholar
Busser, B. W. et al. An orthologous epigenetic gene expression signature derived from differentiating embryonic stem cells identifies regulators of cardiogenesis. PLoS One 10, e0141066 (2015).
Article Google Scholar
Visel, A., Minovitsky, S., Dubchak, I. & Pennacchio, L. A. VISTA Enhancer Browser-a database of tissue-specific human enhancers. Nucleic Acids Res. 35, D88–D92 (2007).
Article CAS Google Scholar
Ounzain, S. et al. Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease. J. Mol. Cell Cardiol. 76, 55–70 (2014).
Article CAS Google Scholar
He, A. et al. Dynamic GATA4 enhancers shape the chromatin landscape central to heart development and disease. Nat. Commun. 5, 4907 (2014).
Article CAS Google Scholar
Asai, R. et al. Endothelin receptor type A expression defines a distinct cardiac subdomain within the heart field and is later implicated in chamber myocardium formation. Development 137, 3823–3833 (2010).
Article CAS Google Scholar
Consortium, G. T. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Article Google Scholar
Lu, Y., Zhou, Y. & Tian, W. Combining Hi-C data with phylogenetic correlation to predict the target genes of distal regulatory elements in human genome. Nucleic Acids Res. 41, 10391–10402 (2013).
Article CAS Google Scholar
Lian, X. et al. Robust cardiomyocyte differentiation from human pluripotent stem cells via temporal modulation of canonical Wnt signaling. Proc. Natl Acad. Sci. USA 109, E1848–E1857 (2012).
Article CAS Google Scholar
Nicolae, D. L. et al. Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet. 6, e1000888 (2010).
Article Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS Google Scholar
van Rooij, E., Liu, N. & Olson, E. N. MicroRNAs flex their muscles. Trends Genet. 24, 159–166 (2008).
Article Google Scholar
England, J. & Loughna, S. Heavy and light roles: myosin in the morphogenesis of the heart. Cell Mol. Life Sci. 70, 1221–1239 (2013).
Article CAS Google Scholar
Mishra, A. & Hawkins, R. D. Three-dimensional genome architecture and emerging technologies: looping in disease. Genome Med. 9, 87 (2017).
Article Google Scholar
Cordell, H. J. et al. Genome-wide association study of multiple congenital heart disease phenotypes identifies a susceptibility locus for atrial septal defect at chromosome 4p16. Nat. Genet. 45, 822–824 (2013).
Article CAS Google Scholar
Cordell, H. J. et al. Genome-wide association study identifies loci on 12q24 and 13q32 associated with tetralogy of Fallot. Hum. Mol. Genet. 22, 1473–1481 (2013).
Article CAS Google Scholar
Nikpay, M. et al. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–1130 (2015).
Article CAS Google Scholar
den Hoed, M. et al. Identification of heart rate-associated loci and their effects on cardiac conduction and rhythm disorders. Nat. Genet. 45, 621–631 (2013).
Article Google Scholar
Trynka, G. et al. Disentangling the effects of colocalizing genomic annotations to functionally prioritize non-coding variants within complex-trait loci. Am. J. Hum. Genet. 97, 139–152 (2015).
Article CAS Google Scholar
Miehe, S. et al. The phospholipid-binding protein SESTD1 is a novel regulator of the transient receptor potential channels TRPC4 and TRPC5. J. Biol. Chem. 285, 12426–12434 (2010).
Article CAS Google Scholar
Wu, X., Eder, P., Chang, B. & Molkentin, J. D. TRPC channels are necessary mediators of pathologic cardiac hypertrophy. Proc. Natl. Acad. Sci. USA 107, 7000–7005 (2010).
Article ADS CAS Google Scholar
MacLennan, D. H. & Kranias, E. G. Phospholamban: a crucial regulator of cardiac contractility. Nat. Rev. Mol. Cell Biol. 4, 566–577 (2003).
Article CAS Google Scholar
Korthals, M. et al. A complex of neuroplastin and plasma membrane Ca(2+) ATPase controls T cell activation. Sci. Rep. 7, 8358 (2017).
Article ADS Google Scholar
Herrera-Molina, R. et al. Neuroplastin deletion in glutamatergic neurons impairs selective brain functions and calcium regulation: implication for cognitive deterioration. Sci. Rep. 7, 7273 (2017).
Article ADS Google Scholar
Langnaese, K., Beesley, P. W. & Gundelfinger, E. D. Synaptic membrane glycoproteins gp65 and gp55 are new members of the immunoglobulin superfamily. J. Biol. Chem. 272, 821–827 (1997).
Article CAS Google Scholar
Koscielny, G. et al. The International Mouse Phenotyping Consortium Web Portal, a unified point of access for knockout mice and related phenotyping data. Nucleic Acids Res. 42, D802–D809 (2014).
Article CAS Google Scholar
Rougier, J. S., Albesa, M. & Abriel, H. Ubiquitylation and SUMOylation of cardiac ion channels. J. Cardiovasc. Pharmacol. 56, 22–28 (2010).
Article CAS Google Scholar
Tung, Y. C., Yeo, G. S., O’Rahilly, S. & Coll, A. P. Obesity and FTO: changing focus at a complex locus. Cell. Metab. 20, 710–718 (2014).
Article CAS Google Scholar
Bukowska, A., Lendeckel, U., Bode-Boger, S. M. & Goette, A. Physiologic and pathophysiologic role of calpain: implications for the occurrence of atrial fibrillation. Cardiovasc. Ther. 30, e115–e127 (2012).
Article CAS Google Scholar
Foley, K. S. & Young, P. W. The non-muscle functions of actinins: an update. Biochem. J. 459, 1–13 (2014).
Article CAS Google Scholar
Williams, S. M. & Haines, J. L. Correcting away the hidden heritability. Ann. Hum. Genet. 75, 348–350 (2011).
Article Google Scholar
Maston, G. A., Evans, S. K. & Green, M. R. Transcriptional regulatory elements in the human genome. Annu. Rev. Genom. Hum. Genet. 7, 29–59 (2006).
Article CAS Google Scholar
Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: from polygenic to omnigenic. Cell 169, 1177–1186 (2017).
Article CAS Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Article Google Scholar
Yates, A. et al. The Ensembl REST API: ensembl data for any language. Bioinformatics 31, 143–145 (2015).
Article CAS Google Scholar
Consortium, G. T. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Article Google Scholar
Vidigal, J. A. & Ventura, A. Rapid and efficient one-step generation of paired gRNA CRISPR-Cas9 libraries. Nat. Commun. 6, 8083 (2015).
Article ADS CAS Google Scholar
Lupp, A. et al. Reassessment of endothelin receptor A expression in normal and neoplastic human tissues using the novel rabbit monoclonal antibody UMB-8. Peptides 66, 19–25 (2015).
Article CAS Google Scholar

Download references

Acknowledgements

We thank Heather J. Cordell and Ruth J. Loos for sharing their GWAS data. We thank David A. Eisner, Andrew W. Trafford, Luigi A. Venetucci and Yatong Li for verifying the electrophysiological phenotypes of our hESC-CMs. We also thank members of The University of Manchester, especially David Talavera and Sabu Abraham, and The Babraham Institute who provided constructive comments and shared materials/methods. This work was supported by the following grants: BHF (CH/13/2/30154), BHF (RG/15/12/31616), BBSRC (BB/J004480/1), MRC (MC_UP_1302/5, MR/L007150/1) and Wellcome Trust (097820/Z/11/A, WT107881). M.-K.C. is a Life Member of Clare Hall, Cambridge, S.L.B. was supported by FS/16/58/32734 BHF 4-Year PhD Studentship Programme, Y.L. was supported by The University of Manchester PUHSC Alliance and China Scholarships Council and A.A. was supported by HEFCE. The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health, and by NCI, NHGRI, NHLBI, NIDA, NIMH and NINDS.

Author information

Authors and Affiliations

Division of Cardiovascular Sciences, The University of Manchester, Manchester, M13 9PT, UK
Mun-Kit Choy, Simon G. Williams, Stephanie L. Baross, Yingjuan Liu, Artur Akbarov & Bernard D. Keavney
Nuclear Dynamics Programme, The Babraham Institute, Cambridge, CB22 3AT, UK
Biola M. Javierre, Steven W. Wingett, Paula Freire-Pritchett, Mikhail Spivakov & Peter Fraser
Josep Carreras Leukaemia Research Institute, Campus ICO-Germans Trias I Pujol, Badalona, 08916, Barcelona, Spain
Biola M. Javierre
MRC Biostatistics Unit, University of Cambridge, Cambridge, CB2 0SR, UK
Chris Wallace
Department of Medicine, University of Cambridge, Cambridge, CB2 0QQ, UK
Chris Wallace
Division of Cell Biology, Medical Research Council Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK
Paula Freire-Pritchett
Epigenetics Programme, The Babraham Institute, Cambridge, CB22 3AT, UK
Peter J. Rugg-Gunn
Department of Biological Science, Florida State University, Tallahassee, 32306, FL, USA
Peter Fraser

Authors

Mun-Kit Choy
View author publications
You can also search for this author in PubMed Google Scholar
Biola M. Javierre
View author publications
You can also search for this author in PubMed Google Scholar
Simon G. Williams
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie L. Baross
View author publications
You can also search for this author in PubMed Google Scholar
Yingjuan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Steven W. Wingett
View author publications
You can also search for this author in PubMed Google Scholar
Artur Akbarov
View author publications
You can also search for this author in PubMed Google Scholar
Chris Wallace
View author publications
You can also search for this author in PubMed Google Scholar
Paula Freire-Pritchett
View author publications
You can also search for this author in PubMed Google Scholar
Peter J. Rugg-Gunn
View author publications
You can also search for this author in PubMed Google Scholar
Mikhail Spivakov
View author publications
You can also search for this author in PubMed Google Scholar
Peter Fraser
View author publications
You can also search for this author in PubMed Google Scholar
Bernard D. Keavney
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.-K.C., B.M.J., M.S. and P.F. conceived the idea and designed the study. M.S., B.D.K. and P.F. secured funding for experiments and sequencing. M.-K.C., B.M.J., S.L.B., Y.L., P.J. R-G., P.F. and B.D.K. performed or supervised laboratory work. M.-K.C., S.G.W., S.W.W., A.A., C.W., P.F-P., M.S. and B.D.K. performed or supervised statistical analyses. M.-K.C. wrote the first draft. All authors commented critically on and revised the draft.

Corresponding authors

Correspondence to Mun-Kit Choy, Peter Fraser or Bernard D. Keavney.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Choy, MK., Javierre, B.M., Williams, S.G. et al. Promoter interactome of human embryonic stem cell-derived cardiomyocytes connects GWAS regions to cardiac gene networks. Nat Commun 9, 2526 (2018). https://doi.org/10.1038/s41467-018-04931-0

Download citation

Received: 26 July 2017
Accepted: 29 May 2018
Published: 28 June 2018
DOI: https://doi.org/10.1038/s41467-018-04931-0

This article is cited by

Priority index for critical Covid-19 identifies clinically actionable targets and drugs
- Zhiqiang Zhang
- Shan Wang
- Hai Fang
Communications Biology (2024)
Low input capture Hi-C (liCHi-C) identifies promoter-enhancer interactions at high-resolution
- Laureano Tomás-Daza
- Llorenç Rovirosa
- Biola M. Javierre
Nature Communications (2023)
Long-range linkage disequilibrium in French beef cattle breeds
- Abdelmajid El Hou
- Dominique Rocha
- Romain Philippe
Genetics Selection Evolution (2021)
A novel RNA-mediated mechanism causing down-regulation of insulating promoter interactions in human embryonic stem cells
- Yingjuan Liu
- Simon G. Williams
- Mun-Kit Choy
Scientific Reports (2021)
Detecting chromosomal interactions in Capture Hi-C data with CHiCAGO and companion tools
- Paula Freire-Pritchett
- Helen Ray-Jones
- Valeriya Malysheva
Nature Protocols (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.