It is evident that epigenetic factors, especially DNA methylation, have essential roles in obesity development. Here, using pig as a model, we investigate the systematic association between DNA methylation and obesity. We sample eight variant adipose and two distinct skeletal muscle tissues from three pig breeds living within comparable environments but displaying distinct fat level. We generate 1,381 Gb of sequence data from 180 methylated DNA immunoprecipitation libraries, and provide a genome-wide DNA methylation map as well as a gene expression map for adipose and muscle studies. The analysis shows global similarity and difference among breeds, sexes and anatomic locations, and identifies the differentially methylated regions. The differentially methylated regions in promoters are highly associated with obesity development via expression repression of both known obesity-related genes and novel genes. This comprehensive map provides a solid basis for exploring epigenetic mechanisms of adipose deposition and muscle growth.
Obesity can be considered an epidemic that has become a major threat to the quality of human life in modern society1. By 2030, up to 58% of the world's adult population might be either overweight or obese1. Adipose tissues (ATs) and skeletal muscle tissues (SMTs) have important roles in the pathogenesis of obesity and its comorbidities by secreting cytokines involved in the regulation of metabolism2,3. The metabolic risk factors of obesity and increased body weight are more related to adipose distribution rather than the total adipose mass4,5. ATs located within the abdominal and thoracic cavity, known as visceral ATs (VATs), have been recognised to be anatomically, functionally and metabolically distinct from that of the compartmental subcutaneous ATs (SATs)6, and have been found to be related to a series of diseases, including cardiovascular disease, type II diabetes mellitus and metabolic syndrome7. Nonetheless, SATs can have direct and beneficial effects on the control of body weight and metabolism8.
Pig (Sus scrofa) is emerging as an attractive biomedical model for studying energy metabolism and obesity in humans, because of their similar metabolic features, cardiovascular systems, proportional organ sizes and lack of brown adipose postnatally9. The pig model offers, in fact, the advantages of low genetic variance, homogeneous feeding regime and remitting confounding factors typical of humans, such as smoking, alcohol drinking and so on. In the modern industry, pigs have undergone strong genetic selection in the relatively inbred commercial lines for lean meat production, or in some cases, for adipose production, which has led to remarkable phenotypic changes and genetic adaptation, making these breed lines a perfect model for comparative studies10,11.
There has been extensive research to hunt for 'obesity alleles', most recently by whole-genome association studies12,13. It is evident that DNA sequence polymorphism alone does not provide adequate explanations for mechanisms of obesity regulation. Recently, epigenetic factors, especially DNA methylation that is a stably inherited modification affecting gene regulation and cellular differentiation, has gained a greater appreciation as an alternative perspective on the aetiology of complex diseases14,15. Nevertheless, current understanding of the roles of DNA methylation in the aetiology of obesity remains fairly rudimentary16.
Here, for three well-defined pig breeds displaying distinct fat contents in comparable environments, we collected eight ATs from different body sites and two phenotypically distinct SMTs, and studied genome-wide DNA methylation differences among breeds, sexes and anatomic locations. We showed the landscape of methylome distribution in the genome, analysed differentially methylated regions (DMRs) and identified genes that were involved in the development of obesity. The work performed here will serve as a valuable resource for future functional validation and aid in searching for epigenetic biomarkers for obesity prediction and prevention, and promoting further development of pig as a model organism for human obesity research.
Samples and their obesity-related phenotypes
We chose three pig breeds in this study, based on known history of breed formation and measurement of obesity-related phenotypes (see Supplementary Methods). The Landrace breed has been selected for less adipose for more than 100 years in Europe, whereas the Rongchang breed was selected for extreme adipose. The Tibetan breed is almost a feral breed that has undergone very little artificial selection. On average, adult females exhibit higher fat percent than males upon reaching sexual maturity at 210 days old. To investigate sexual differences, we also separated males and females in the comparison. As expected, body density, which negatively correlates with fat percent, showed significant difference among the three breeds (two-way analysis of variance (ANOVA), PB=6.98×10−10) and between male and female (two-way ANOVA, PS=0.02; Fig. 1a). Measurement of metabolism indicators in serum also revealed the same ranking (Supplementary Fig. S1).
To study adipocyte regulation in different anatomic locations, we sampled eight ATs from various body regions (Fig. 1b), which exhibited dissimilar fatty acid composition (Fig. 1c) and significantly different adipocyte volumes (three-way ANOVA, PT=6.74×10−12) among the three breeds (three-way ANOVA, PB<10−16), and between males and females (three-way ANOVA, PS=10−16; Fig. 1d; Supplementary Fig. S2). We also sampled two SMTs, white longissimus dorsi muscle (LDM) and red psoas major muscle (PMM; Fig. 1b), representing two different fibre types, of which PMM has a higher percentage of capillaries, myoglobin, lipids and mitochondria17. Compared with PMM, LDM has higher myofibre cross-sectional area (three-way ANOVA, PT=9.66×10−12; Fig. 1e; Supplementary Fig. S2) and ratio of fast to slow twitch myofibre (three-way ANOVA, PT<10−16; Fig. 1f; Supplementary Fig. S2). There is also significant divergence in myofibre cross-sectional area (three-way ANOVA, PB<10−16, PS=0.005) and myofibre type ratio (three-way ANOVA, PB=4.42×10−10, PS=5.45×10−9) among the three breeds and between the two sexes. These phenotypic differences for ATs and SMTs between breeds, sexes and anatomic locations imply the intrinsically epigenomic differences.
Landscape of the DNA methylomes
We generated a total of 1,381 Gb methylated DNA immunoprecipitation sequencing (MeDIP-seq) data from 180 samples (~7.67 Gb per sample), of which 1,067 Gb (77.3%) clean reads were aligned on the pig genome. After removing the ambiguously mapped reads and reads that may have come from duplicate clones, we used 993 Gb (71.9%) uniquely aligned non-duplicate reads in the following analysis (Supplementary Table S1). To avoid false positives in enrichment, we required at least ten reads to determine a methylated CpG in a sample. On average, 16.1% of the CpGs were covered by this threshold (Supplementary Fig. S3).
Measurement of DNA methylation level along chromosomes showed that the X chromosome is globally hypermethylated in females compared with males (Fig. 2a), which can be explained by the X chromosome inactivation in females18. Through comparison of DNA methylation level between each pair of samples, we found variable correlation rates in different categories (Fig. 2b). The biological replicates highly correlated with each other (median Pearson's r=0.95 for SMTs and r=0.94 for ATs), which suggested both experimental reliability and epigenetic consistency within the same breed/sex/tissue type group. The correlation rates were relatively lower between male and female (median Pearson's r=0.92 for SMTs and r=0.91 for ATs), and even lower between different anatomic locations (median Pearson's r=0.91 for SMTs, r=0.89 for between ATs and SMTs, and r=0.87 for ATs) and between different breeds (median Pearson's r=0.88 for ATs and r=0.84 for SMTs), indicating significant biological differences in the latter categories.
We observed that methylation level correlates negatively with chromosome length (Pearson's r=−0.614, P=0.005) and positively with GC content (Pearson's r=0.784, P=7×10−5), repeat density (Pearson's r=0.336, P=0.159), gene density (Pearson's r=0.535, P=0.018), and especially with observed over expected number of CpG (CpGo/e) ratio (Pearson's r=0.902, P=1.33×10−7) (Fig. 3a), which is consistent with previous reports19. More detailed analysis showed that the regions with GC content around 46% and CpGo/e ratio around 0.35 tend to have a higher methylation level (Fig. 3b,c), where the average GC content in the pig genome is 40% and CpGo/e ratio is 0.21 (Fig. 3d,e). Nonetheless, there is no significant correlation between the GC content and CpGo/e ratio on a genomic scale (Pearson's r=0.10, P=0.106; Fig. 3f). The analysis showed that CpGo/e ratio has a higher correlation rate with DNA methylation level than with GC content. Single-nucleotide polymorphism (SNP) density also positively correlated with the methylation level (Pearson's r=0.597, P=0.007; Fig. 3a). Although substantial association of SNP density and DNA methylation level has been found20, the mechanism is still unclear. Furthermore, the gene-rich subtelomeric region (7 Mb from each telomere) has significantly higher methylation in most (~80%, 15 out of 19) chromosomes (Student's t-test, P <0.01; Fig. 3g).
Characterisation of DMRs
We used statistics to measure the methylation rate changes and defined DMRs across breeds (B-DMRs), sexes (S-DMRs) and tissues (T-DMRs; see Supplementary Methods). The high correlattion (average Pearson's r=0.994) between the number of DMRs, the number of CpGs in DMRs and the length of DMRs implied that DMR detection in regions of different length and its embeded number of CpGs was non-biased. The number of DMRs varied considerably between categories (for example, 387 muscle S-DMRs versus 218,623 muscle B-DMRs; Table 1).
A macroscopical display of DMRs along chromosomes shows that DMR-rich regions also predominantly have higher CpGo/e ratios (~0.35) than the genomic average (0.21; Fig. 4a). Over 20% of DMRs are located in subtelomeric regions, which only occupy 11.76% of the whole genome. Among the 282 pig genes that were orthologs to known human obesity-related genes2,12,13,21, 223 (~80%) were within our defined DMRs (Fig. 4a; Supplementary Data 1 and 2), which suggested that the DMRs have high association with the known obesity-related genes and these genes may have functional roles in obesity development by ways of methylation rate changes.
We then looked at DMRs in the 31 categories of functional genomic elements. We separated promoters into three types according to the CpG representation as previously described22 (Fig. 4b), and also classified CpG islands (CGIs) into five classes according to their genomic locations as previously described23 (see Methods; Fig. 4c). DMRs occur more frequently in intermediate CpG promoter (ICP) than in high CpG promoter (HCP) and low CpG promoter (LCP) regions (two-way ANOVA, P=0.056; Fig. 4d). The ICP class contains many weak CGIs24 (<500 bp, have moderate CpG richness and/or have a GC content below 55%). This result validated previous findings that weak CGIs are more predisposed to regulation by DNA methylation and preferential targeting of weak CGIs is a general phenomenon in mammals22. Promoter hypermethylation has a critical role in suppressing gene expression, yet in gene bodies, it is also important in regulating alternative promoters and preventing spurious transcription initiation23. Interestingly, the first exon regions have the lowest DMRs within the gene body (Fig. 4d), which may be because of some functional motifs overlapping between the proximal region of promoters and first exons. In addition, the distal (D) regions of both mRNA and microRNA (miRNA) promoters have more DMRs than the intermediate (I) and proximal (P) regions (Fig. 4d), suggesting that changes in methylation at D regions of promoters may be a more prevalent mechanism for producing transcriptional variability. We found that most DMRs are located in CGI shores rather than in CGIs (Fig. 4d) in all five classes of genomic elements (two-way ANOVA, P=0.002), which is consistent with previous reports25,26,27. CpGo/e ratio of most CGI shores is around 0.35–0.36, whereas CGIs have CpGo/e ratios far greater than this cutoff (Wilcoxon rank-sum test, P<10−16; Fig. 4e), as shown by the plot of reads distribution against CpGo/e ratio in Fig. 3c.
Promoter methylation and transcriptional repression
We explored the correlation between methylation rate in promoters and expression level of associated mRNAs and miRNAs (Fig. 5; Supplementary Fig. S4). It is believed that DNA methylation in promoters is only one of the several mechanisms for regulating gene expression; hence, it is logical that not all genes have correlated methylation and expression patterns. The order of correlation level in mRNA promoters, from high to low, was HCP, ICP, LCP and P, I, D (Fig. 5), which validated previous report28. Nonetheless, as DMRs are enriched in ICPs and D regions of promoters (Fig. 4d), there were more mRNA-DMR pairs exhibiting correlation in ICP (8,257) than in HCP (6,217) and LCP (3,099), and more pairs in the D region (9,313) than in the I (4,923) and P (3,337) regions (Fig. 5). Further, it indicated that the CpG content difference of promoters has a more profound impact on mRNA expression (two-way ANOVA, P=5.56×10−4) than the distance to transcription start site (TSS) of the regions in promoters (two-way ANOVA, P=0.07).
There was correlation between miRNA expression and methylation rate in P regions of miRNA promoters (Pearson's r=−0.368, P=4.44×10−8), but almost no correlation in I (Pearson's r=−0.116, P=0.146) and D (Pearson's r=0, P=0.996) regions of miRNA promoters (Supplementary Fig. S4). Primary miRNA transcripts can be several thousand bases long and embeds a ~70 nucleotide long stem-loop precursor (pre-miRNA)29. Although little is known about the TSS of primary miRNA transcripts30, the results here suggested that DNA methylation in 5′ upstream of pre-miRNAs could have a role in transcriptional silencing of mature miRNA.
Methylation in CGI shores had stronger (Pearson's r=−0.146, P=6.15×10−29) correlation with mRNA expression than in CGIs (Pearson's r=−0.128. P=0.066; Supplementary Fig. S4), which is consistent with previous studies25,26,27. Nonetheless, miRNA expression has little or no correlation with CGI (Pearson's r=−0.133, P=0.241) and CGI shore methylation (Pearson's r=0.099, P=0.268). A recent study also showed that a common feature of DNA methylation-repressed miRNAs is the absence of CGIs in the promoter region31.
Promoter DMRs best discriminate breeds and tissues
To analyse whether DMRs exhibit any breed, sex and/or anatomic locations specific pattern, we performed unsupervised clustering for all samples using DMRs of each category of genomic elements. The adipose and muscle B-DMRs in promoters were well clustered by breed (Fig. 6a,b). Clustering of samples by corresponding mRNA expression is generally similar (Supplementary Fig. S5), indicating consistent relationships between DNA methylation in the promoters and gene expression. The clustering by B-DMRs in CGI shores can group most samples from each breed, but not as distinctly as that by B-DMRs in promoters. This suggested that, although methylation in CGI shores is important in regulating gene expression25,26,27, methylation differences at promoters are better predictors of differences among the breeds. The clustering by B-DMRs of other genomic elements is even less distinct, suggesting that most methylation in these genomic elements may have weak or no direct association with functional divergence of the three breeds.
The B-DMRs in promoters of ATs showed that the Rongchang and Tibetan breed are closer to each other than the Landrace pig (Fig. 6a). The same analysis in muscle tissues showed that the Landrace breed is closer to the Tibetan than the Rongchang breed (Fig. 6b). The same distance relationship pattern of the three breeds is reflected by the corresponding mRNA expression data clustering (Supplementary Fig. S5). The different clustering patterns may be explained by the marked phenotypic changes between the feral Tibetan, the leaner Landrace and the fatty Rongchang pig breeds because of opposite breeding direction, which results in differences not only at the genetic level, but also in the epigenetic state, and potential genotype–epigenotype interactions32 as well.
In addition, the T-DMRs in promoters could largely cluster samples of the same tissue type together (Fig. 6c), indicating that promoter methylation also correlates with adipose distribution across the anatomic locations. It is well established that VATs have intrinsic features distinct from SATs, and are more highly correlated with the metabolic risk factors of obesity than SATs4,6,7,8. Interestingly, intermuscular adipose (IAD), which deposited between muscle bundles, was more similar to VATs in terms of methylation. This observation suggests that IAD may be a new risk factor for obesity-related diseases. Pericardial adipose (PAD) around coronary arteries is a higher correlative risk factor for cardiovascular disease than other VATs, and although thoracic PAD shares a common embryonic origin with other abdominal VATs—the splanchnic mesoderm33, we observed significant-site specific differences in methylation rate between them (Fig. 6c). Tissue types are also better discriminated by T-DMRs in promoters than in other genomic elements (Supplementary Fig. S5). X chromosome methylation between male and female is expected to be significant because of the overriding effect of X chromosome inactivation in females18. Clustering of S-DMRs by sex was less distinct after removing DMRs on the X chromosome (Supplementary Fig. S5).
Genes involved in phenotypic divergence
To study the association of differential methylation in promoter regions with phenotypic divergence, we first investigated the relationship between DNA methylation at promoters and the expression data of known obesity-related genes obtained through and MassArray and quantitative PCR (q-PCR). For example, FTO (fat mass- and obesity-associated gene) is a gene unequivocally associated with obesity and is ubiquitously expressed34,35. From the leaner Landrace, feral Tibetan to the fatty Rongchang breed, and across both adipose and muscle tissue types, FTO is hypermethylated in the D region of the promoter with a lower gene expression level (Fig. 7a). The fact that the level of methylation is highest in Landrace pig and lowest in Rongchang pig is consistent with the observation that loss of FTO expression and/or function protects against obesity and food intake35. ATP1B1, which encodes the ubiquitously expressed β subunit of Na+/K+ ATPase, is required for the proper cellular positioning of ATPase and its stability. Decreased ATPase activity precedes obesity and hyperinsulinaemia by influencing thermogenesis and energy balance36. COL8A2, which encodes the α2 chain of type VIII collagen, is necessary for mesangial matrix expansion, as well as for hypercellularity. Lack of COL8A2 confers renoprotection in diabetic nephropathy37. Both ATP1B1 and COL8A2 have hypermethylation in I region of promoter and lower gene expression level that is more pronounced in the VATs and IAD than SATs (Fig. 7b), suggesting that hypermethylation in promoters of these two genes are potential biomarkers of high-risk visceral obesity.
We also found correlation between methylation in promoter and gene expression, and reasonable association to breed and anatomic location divergence, for many other genes with known roles in adipose deposition and muscle growth. For example, ESD that increases expression in obesity-prone models38; PPP1R3C that functions against intramyocellular lipid build-up, and reduces circulating leptin and triglycerides39; GHSR that promotes growth hormone-release and increased lean, but not fat, mass in obese subjects40; LIPA that inhibits intramuscular lipid stores41; MC4R that inhibits food intake and prevents hyperinsulinaemia and hyperglycinaemia42; and PROX1 that prevents lymphatic vascular defects that cause adult-onset obesity43 (Supplementary Fig. S6). The genes preferentially expressed in adipose (such as HEXB and HTR2A) or muscle (such as ACE, PRKAR1A and PRKCQ) tissues only, were validated by the methylation in the promoter and gene expression data as well (Supplementary Fig. S6). The full list of candidate obesity-related genes we collected together with their DNA methylation pattern in promoters is provided in Supplementary Data 1 and 2.
In addition, out of the 2,311 genes/~282.57 Mb quantitative trait loci (QTLs) region assembled from 901 high confidence and narrowed (<2 Mb) QTLs affecting fatness and pork quality in the PigQTL database44; 1,669 (72.22%) genes overlap with the defined DMRs (Supplementary Data 3, 4, 5). This high consistency highlights the potential of identifying candidate regions or genes of quantitative traits (such as obesity) based on genome-wide DNA methylation data, such as the newly developed methylation QTL analysis15. Notably, out of 77 putative genes located in these QTLs region, 66 (85.71%) overlap with our defined DMRs. Methylation level of these gene's promoters strongly inversely correlated with the gene expression, suggesting that these uncharacterised protein coding genes may be involved in adipose deposition and muscle growth. Typical examples are shown in Supplementary Fig. S7.
Furthermore, numerous miRNAs having known or potential roles in obesity were also identified (Supplementary Data 6). We found an S-DMR with hypermethylation in males compared with females. This S-DMR is located in the promoter region of an miRNA cluster that includes adjacent miR-99b, let-7e and miR-125a (Fig. 7c). Although no previous evidence exists for a direct relationship of these three miRNAs to obesity, the key functions and targets of these miRNAs are associated with the suppression of prostate cancer in male45 and breast cancer in female46, and therefore, potentially contribute to sexual differences in obesity development.
To identify novel genes potentially responsible for phenotypic differences, we performed enrichment analysis for genes with DMRs in promoters (Supplementary Fig. S8; Supplementary Data 7 and 8). As expected, most enriched functional Gene Ontology categories of adipose B-DMRs in promoters were related to the pathogenesis of obesity, such as 'homeostasis of sterol, lipid, cholesterol', 'lipase inhibitor activity', 'type I diabetes mellitus' and 'dyslipidaemia' (Fig. 7d). Notably, the multigene family of glutathione transferase and cytochrome P450, two important groups of multifunctional detoxifying enzymes responsible for metabolising an array of xenobiotic compounds47,48, were among the enriched adipose B-DMRs in promoters (Fig. 7d). 'Trans-1,2-dihydrobenzene-1,2-diol dehydrogenase activity' enzymes, which also participate in metabolism of endocrinally disruptive xenobiotics49, were universally identified among enriched adipose T-DMRs in promoters (Supplementary Data 8). Pigs as well as humans are exposed to an increasing numbers of environmental xenobiotics through ingestion of contaminated food or water, inhalation of polluted air or even dermal exposure. A link between exposure to endocrinally disruptive xenobiotics and obesity has been proposed50. Our finding suggests that DNA methylation rate changes of genes coding for detoxifying enzymes induced by pollutants may potentially explain the pathogenicity of obesity caused by chemical environmental endocrine disruptors.
Our analyses also revealed many other functional gene categories that were potentially involved in adipose and muscle regulation (Supplementary Data 8). For example, immune-related gene categories, including 'RIG-I-like receptor signalling pathway', 'interferon-α/β receptor binding', 'natural killer cell-mediated cytotoxicity' and 'antigen processing and presentation', were identified among the enriched muscle B-DMRs in promoters, which is consistent with previous finding of obesity-induced immune dysfunction51. Intriguingly, given that AT derives from mesoderm, the identification of 'mesoderm development' gene category among enriched adipose S-DMRs in promoters indicated that sex-specific obesity is potentially related to the establishment of differential methylation during embryonic development. In addition, the enriched gene categories of muscle T-DMRs, and adipose versus muscle T-DMRs, in promoters reflected the well-characterised tissue-specific functions. Methylation differences in genes coding for proteins involved in GTP-related energy metabolism may be responsible for the differences in percentage of mitochondria between the two phenotypically distinct SMTs17. Differential methylation of genes involved in 'cytoskeletal protein binding', 'regulation of cellular protein metabolic process' and 'enzyme activator activity' may explain the developmental differences between adipose and muscle tissues52.
This study reports the comprehensive genome-wide epigenetic survey of various adipose and SMTs, based on directly sequenced animal DNA methylomes. Through identification of DMRs among breeds, sexes and anatomic locations, and classification of the DMRs according to their locations in various genomic elements, we found that DMRs in promoters can repress gene expression and are highly associated with phenotypic variation. Identified DMRs were preferentially situated in ICP and in CGI shores. This validated the hypothesis that weak CGIs are more prone to regulation by DNA methylation, as the higher feasibility for weak CGIs to become de novo methylated regions, and preferentially associated with the general phenomenon and non-malignant, common complex diseases (such as obesity) instead of the highly heterogeneous lesions (such as cancer)22. We also found that the intermuscular IAD was more similar to the VATs in methylation pattern, which provided the first epigenomic evidence for IAD as a candidate risk factor for obesity. The data set and research here shed new light on the epigenomic regulation of adipose deposition and muscle growth.
It is considered that pigs can serve as a good biomedical model for human obesity studies because they share the same general physiology with human. Indeed, we found that about 80% of the known or candidate human obesity-related genes and 72% of genes in the QTL regions that affect fatness and pork quality were within our defined DMRs. Detailed analysis indicated that the methylation regulation patterns of these genes are consistent with their known biological functions. We also predicted many novel candidate genes that were associated with variation in obesity-related phenotypes and that require further experimental validation. Domesticated breeds also provide additional advantage of highly homogeneous genetic backgrounds, large litter size (10~12 piglets per litter; 24~36 piglets per year), short generation interval (12 months) and a homogeneous feeding regime, which are particularly suitable for survey of transgenerational epigenetic inheritance53. In addition to providing new information for biomedical research, genomic/epigenomic studies of pigs may also help uncover the molecular basis that underlies economic traits in pig, which can be used to improve the efficiency of artificial selection, hence the production of healthier pork.
Nine females and nine males at 210-day-old for each of the Landrace (a leaner, Western breed), the Tibetan (a feral, indigenous Chinese pig that has not undergone artificial selection) and the Rongchang (a fatty, Chinese breed) pig breeds were used in this study. There is no direct and collateral blood relationship within the last 3 generations among the 18 pigs from each of the breeds. The piglets were weaned simultaneously at 28±1 day of age. A starter diet provided 3.40 Mcal kg−1 metabolisable energy (ME), 20.00% crude protein and 1.15% lysine from the thirtieth to sixtieth day after weaning. From the sixty-first to one hundred and twentieth day, the diet contained 3.40 Mcal kg−1 ME, 17.90% crude protein and 0.83% lysine. From the one hundred and twenty-first to two hundred and tenth day, the diet contained 3.40 Mcal kg−1 ME, 15.00% crude protein and 1.15% lysine. The animals were allowed access to feed and water ad libitum and lived under the same normal conditions.
Animals were humanely killed as necessary to ameliorate suffering and not fed the night before they were slaughtered. All the animals and samples used in this study were collected according to the guidelines for the care and use of experimental animals established by the Ministry of Agriculture of China.
Eight ATs from different body sites and two phenotypically distinct SMTs were rapidly separated from each carcass, immediately frozen in liquid nitrogen and stored at −80 °C until RNA and DNA extraction. The eight ATs are divided into four groups: (1) three types of SATs (that is, abdominal subcutaneous adipose, upper layer of backfat and inner layer of backfat near the last third or fourth rib); (2) three types of VATs in the abdominal cavity (i.e., greater omentum, mesenteric adipose and retroperitoneal adipose); (3) one type of VAT in the thoracic cavity (i.e., PAD, which is located between visceral and parietal pericardium); and (4) IAD, the adipose visible between muscle groups and beneath the muscle fascia in the hips. Two phenotypically distinct SMTs are LDM (typical white SMT) near the last third or fourth rib and the intermediate section of PMM (typical red SMT).
Measurements of obesity-related phenotype
Measurements of pig body density, concentrations of 24 serum-circulating indicators of metabolism, adipocyte volume, myofibre cross-sectional area, myofibre type rate (fast/slow) and fatty acid composition are described in detail in Supplementary Methods.
Methylated DNA immunoprecipitation sequencing
We randomly selected three pigs with a specific sex from each breed as biological replicates. And we have ten tissues for each individual; so in total, 180 samples were sequenced separately. MeDIP DNA libraries were prepared following the protocol as our previous description54. Each MeDIP library was subjected to paired-end sequencing using Illumina HiSeq 2000 and a 50 bp read length. Details are listed in Supplementary Methods.
Identification of DMRs
After filtering the low-quality reads, the MeDIP-seq data were aligned to the UCSC pig reference genome (Sscrofa9.2) using SOAP2 (Version 2.21)55. The genomic regions enriched in methylated CpGs B-DMRs, S-DMRs and T-DMRs were identified using our newly developed method by calculating variation of single CpG. Additional details for the process are listed in Supplementary Methods.
Definition of genomic elements
We referred to the UCSC pig reference genome (Sscrofa9.2) annotation data for the identification of genomic elements. All 17,930 promoters (−2,200 to +500 bp) were classified into three types according to CpG representation as previously described22. There were 7,249 HCPs, 6,629 ICPs and 4,052 LCPs (Fig. 4b). Each promoter of 2,700 bp length was divided into three regions as previously described28: P (−200 to +500 bp), I (−200 to −1,000 bp) and D (−1,000 to −2,200 bp). We obtained genomic locations of 38,778 CGIs and definitions of 21,533 Ensembl genes from the UCSC pig reference genome (Sscrofa9.2), as well as genomic locations of 803 pre-miRNAs based on our small RNA-seq results. We grouped CGIs into five classes on the basis of their distance to Ensembl genes or pre-miRNAs as previously described23 with some modifications (Fig. 4c). There are (1) 7,126 promoter CGIs (if a CGI ends after 1,000 bp upstream of a gene's TSS, and starts before 300 bp downstream of a gene's TSS); (2) 13,611 intragenic CGIs (if a CGI starts after 300 bp downstream of a gene's TSS and ends before 300 bp upstream of a gene's transcription end site (TES)); (3) 2,305 3′-transcript CGIs (if a CGI ends after 300 bp upstream of a gene's TES and starts before 300 bp downstream of a gene's TES); (4) 16,954 intergenic CGIs (if a CGI starts after 300 bp downstream of a gene's TES and ends before 1,000 bp upstream of a gene's TSS); (5) 169 miRNA promoter CGIs (if there was a >60% overlap of a CGI with 2 kb upstream of the pre-miRNA). There are overlaps between five classes of CGIs, because of the overlapping gene annotation at a specific genome coordinate. CGI shores were defined as extending up to 2 kb from CGIs.
We also identified the genomic locations of the 17,932 first exons, 14,760 first introns, 117,200 internal exons, 116,186 internal introns and 15,259 last exons of the 21,533 Ensembl genes, together with the 13,626 intergenic regions, 4,309,043 repeats and 59,385 SNPs by referring to the UCSC Genome Browser for pig.
The DNA isolated from three biological replicates for each breed/sex/tissue type combination were pooled in equal quantities and treated with bisulphite using an EZ DNA methylation-Gold Kit (ZYMO Research) according to the manufacturer's specifications. Quantitative methylation analysis of the DMRs was performed using the Sequenom MassARRAY platform (CapitalBio, Beijing, China) as described previously56. PCR primers were designed using the EpiDesigner software (Sequenom). The oligo sequences and the genomic coordinates of the amplicons across which DNA methylation was assessed in this study are given in Supplementary Table S2. The resultant methylation calls were analysed with EpiTyper software v1.0 (Sequenom) to generate quantitative results for each CpG or an aggregate of multiple CpGs.
Gene expression microarray
Genome-wide gene expression analysis of 180 samples that corresponded to the samples used for MeDIP-seq was performed using the Agilent Pig Gene Expression Oligo Microarray (Version 2). Data analysis was performed with MultiExperiment Viewer. Details are listed in Supplementary Methods.
RNase-free DNase I (TaKaRa) was used for removal of genomic DNA from RNA samples used for microarray analysis. cDNA was synthesised using the oligo (dT) and random 6-mer primers provided in the PrimeScript RT Master Mix kit (TaKaRa). q-PCR was performed using the SYBR Premix Ex Taq kit (TaKaRa) on a CFX96 Real-Time PCR detection system (Bio-Rad). Primer sequences used for the q-PCR are shown in Supplementary Table S3. All measurements contained a negative control (no cDNA template), and each RNA sample was analysed in triplicate. Porcine ACTB, TBP and TOP2B were simultaneously used as endogenous control genes. Relative expression levels of objective mRNAs were calculated using the ΔΔCt method.
miRNA discovery and profiling
Eight ATs and two SMTs of the three female Landrace pigs were used for small RNA-seq. The construction of small RNA libraries and single-end sequencing in 36 bp reads using Illumina Genome Analyzer II, generated a total of 7.12 Gb reads for the ten libraries. The bioinformatics pipeline for miRNA discovery was carried out as our previous description57, with some improvements. Details are listed in Supplementary Methods. Our results extend the repertoire of pig miRNAome to 803 pre-miRNAs (174 known, 210 novel and 419 candidate), encoding for 1,014 mature miRNAs, of which 952 are unique (Supplementary Data 9).
Acccession codes: The high-throughput sequencing data and microarray data have been deposited in NCBI's Gene Expression Omnibus under GEO Series accession numbers GSE30344 (MeDIP-seq data), GSE30343 (gene expression microarray data) and GSE30334 (small RNA-seq data).
How to cite this article: Li, M. et al. An atlas of DNA methylomes in porcine adipose and muscle tissues. Nat. Commun. 3:850 doi: 10.1038/ncomms1854 (2012).
Kelly, T., Yang, W., Chen, C. S., Reynolds, K. & He, J. Global burden of obesity in 2005 and projections to 2030. Int. J. Obes. (Lond) 32, 1431–1437 (2008).
MacDougald, O. A. & Burant, C. F. The rapidly expanding family of adipokines. Cell Metab. 6, 159–161 (2007).
Rosen, E. D. & Spiegelman, B. M. Adipocytes as regulators of energy balance and glucose homeostasis. Nature 444, 847–853 (2006).
Despres, J. P. & Lemieux, I. Abdominal obesity and metabolic syndrome. Nature 444, 881–887 (2006).
Lusis, A. J., Attie, A. D. & Reue, K. Metabolic syndrome: from epidemiology to systems biology. Nat. Rev. Genet. 9, 819–830 (2008).
Ibrahim, M. M. Subcutaneous and visceral adipose tissue: structural and functional differences. Obes. Rev. 11, 11–18 (2010).
Tran, T. T., Yamamoto, Y., Gesta, S. & Kahn, C. R. Beneficial effects of subcutaneous fat transplantation on metabolism. Cell Metab. 7, 410–420 (2008).
Satoor, S. N. et al. Location, location, location: beneficial effects of autologous fat transplantation. Sci. Rep. 1, 81 (2011).
Spurlock, M.E. & Gabler, N. K. The development of porcine models of obesity and the metabolic syndrome. J. Nutr. 138, 397–402 (2008).
Rocha, D. & Plastow, G. Commercial pigs: an untapped resource for human obesity research? Drug Discov. Today 11, 475–477 (2006).
Andersson, L. How selective sweeps in domestic animals provide new insight into biological mechanisms. J. Intern. Med. 271, 1–14 (2012).
Heid, I. M. et al. Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat. Genet. 42, 949–960 (2010).
Speliotes, E. K. et al. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat. Genet. 42, 937–948 (2010).
Feinberg, A. P. Epigenomics reveals a functional genome anatomy and a new approach to common disease. Nat. Biotechnol. 28, 1049–1052 (2010).
Rakyan, V. K., Down, T. A., Balding, D. J. & Beck, S. Epigenome-wide association studies for common human diseases. Nat. Rev. Genet. 12, 529–541 (2011).
Feinberg, A. P. et al. Personalized epigenomic signatures that are stable over time and covary with body mass index. Sci. Transl. Med. 2, 49ra67 (2010).
Lefaucheur, L., Ecolan, P., Plantard, L. & Gueguen, N. New insights into muscle fiber types in the pig. J. Histochem. Cytochem. 50, 719–730 (2002).
Hellman, A. & Chess, A. Gene body-specific methylation on the active X chromosome. Science 315, 1141–1143 (2007).
Weber, M. et al. Chromosome-wide and promoter-specific analyses identify sites of differential DNA methylation in normal and transformed human cells. Nat. Genet. 37, 853–862 (2005).
Kerkel, K. et al. Genomic surveys by methylation-sensitive SNP analysis identify sequence-dependent allele-specific DNA methylation. Nat. Genet. 40, 904–908 (2008).
Rankinen, T. et al. The human obesity gene map: the 2005 update. Obesity (Silver Spring) 14, 529–644 (2006).
Weber, M. et al. Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome. Nat. Genet. 39, 457–466 (2007).
Maunakea, A. K. et al. Conserved role of intragenic DNA methylation in regulating alternative promoters. Nature 466, 253–257 (2010).
Takai, D. & Jones, P. A. Comprehensive analysis of CpG islands in human chromosomes 21 and 22. Proc. Natl Acad. Sci. USA 99, 3740–3745 (2002).
Doi, A. et al. Differential methylation of tissue- and cancer-specific CpG island shores distinguishes human induced pluripotent stem cells, embryonic stem cells and fibroblasts. Nat. Genet. 41, 1350–1353 (2009).
Irizarry, R. A. et al. The huan colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nat. Genet. 41, 178–186 (2009).
Ji, H. et al. Comprehensive methylome map of lineage commitment from haematopoietic progenitors. Nature 467, 338–342 (2010).
Koga, Y. et al. Genome-wide screen of promoter methylation identifies novel markers in melanoma. Genome Res. 19, 1462–1470 (2009).
Carthew, R. W. & Sontheimer, E. J. Origins and mechanisms of miRNAs and siRNAs. Cell 136, 642–655 (2009).
Zhou, X., Ruan, J., Wang, G. & Zhang, W. Characterization and identification of microRNA core promoters in four model species. PLoS Comput. Biol. 3, e37 (2007).
Vrba, L., Garbe, J. C., Stampfer, M. R. & Futscher, B. W. Epigenetic regulation of normal human mammary cell type specific miRNAs. Genome Res. 21, 2026–2037 (2011).
Gertz, J. et al. Analysis of DNA methylation in a three-generation family reveals widespread genetic influence on epigenetic regulation. PLoS Genet. 7, e1002228 (2011).
Ho, E. & Shimada, Y. Formation of the epicardium studied with the scanning electron microscope. Dev. Biol. 66, 579–585 (1978).
Cecil, J. E., Tavendale, R., Watt, P., Hetherington, M. M. & Palmer, C. N. An obesity-associated FTO gene variant and increased energy intake in children. N. Engl. J. Med. 359, 2558–2566 (2008).
Church, C. et al. Overexpression of Fto leads to increased food intake and results in obesity. Nat. Genet. 42, 1086–1092 (2010).
Iannello, S., Milazzo, P. & Belfiore, F. Animal and human tissue Na,K-ATPase in obesity and diabetes: a new proposed enzyme regulation. Am. J. Med. Sci. 333, 1–9 (2007).
Biswas, S. et al. Missense mutations in COL8A2, the gene encoding the alpha2 chain of type VIII collagen, cause two forms of corneal endothelial dystrophy. Hum. Mol. Genet. 10, 2415–2423 (2001).
Wang, X. et al. Differential expression of liver proteins between obesity-prone and obesity-resistant rats in response to a high-fat diet. Br. J. Nutr. 106, 612–626 (2011).
Crosson, S. M., Khan, A., Printen, J., Pessin, J. E. & Saltiel, A. R. PTG gene deletion causes impaired glycogen synthesis and developmental insulin resistance. J. Clin. Invest. 111, 1423–1432 (2003).
Sun, Y., Wang, P., Zheng, H. & Smith, R. G. Ghrelin stimulation of growth hormone release and appetite is mediated through the growth hormone secretagogue receptor. Proc. Natl Acad. Sci. USA 101, 4679–4684 (2004).
Du, H., Dardzinski, B. J., O'Brien, K. J. & Donnelly, L. F. MRI of fat distribution in a mouse model of lysosomal acid lipase deficiency. Am. J. Roentgenol. 184, 658–662 (2005).
Farooqi, I. S. et al. Clinical spectrum of obesity and mutations in the melanocortin 4 receptor gene. N. Engl. J. Med. 348, 1085–1095 (2003).
Harvey, N. L. et al. Lymphatic vascular defects promoted by Prox1 haploinsufficiency cause adult-onset obesity. Nat. Genet. 37, 1072–1081 (2005).
Hu, Z. L., Fritz, E. R. & Reecy, J. M. AnimalQTLdb: a livestock QTL database tool set for positional QTL information mining and beyond. Nucleic. Acids Res. 35, D604–D609 (2007).
Sun, D. et al. miR-99 family of MicroRNAs suppresses the expression of prostate-specific antigen and prostate cancer cell proliferation. Cancer Res. 71, 1313–1324 (2011).
Fu, S. W., Chen, L. & Man, Y. G. miRNA biomarkers in breast cancer detection and management. J. Cancer. 2, 116–122 (2011).
Fabrini, R. et al. The extended catalysis of glutathione transferase. FEBS Lett. 585, 341–345 (2011).
Yang, X. et al. Systematic genetic and genomic analysis of cytochrome P450 enzyme activities in human liver. Genome Res. 20, 1020–1036 (2010).
Chow, K. C., Lu, M. P. & Wu, M. T. Expression of dihydrodiol dehydrogenase plays important roles in apoptosis- and drug-resistance of A431 squamous cell carcinoma. J. Dermatol. Sci. 41, 205–212 (2006).
Tang-Peronard, J. L., Andersen, H. R., Jensen, T. K. & Heitmann, B. L. Endocrine-disrupting chemicals and obesity development in humans: a review. Obes. Rev. 12, 622–636 (2011).
Iyer, A., Fairlie, D. P., Prins, J. B., Hammock, B. D. & Brown, L. Inflammatory lipid mediators in adipocyte function and obesity. Nat. Rev. Endocrinol. 6, 71–82 (2010).
Harrison, B. C. & Leinwand, L. A. Fighting fat with muscle: bulking up to slim down. Cell Metab. 7, 97–98 (2008).
Braunschweig, M., Jagannathan, V., Gutzwiller, A. & Bee, G. Investigations on transgenerational epigenetic response down the male line in F2 pigs. PLoS One 7, e30583 (2012).
Li, N. et al. Whole genome DNA methylation analysis based on high throughput sequencing technology. Methods 52, 203–212 (2010).
Li, R. et al. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 25, 1966–1967 (2009).
Ehrich, M. et al. Quantitative high-throughput analysis of DNA methylation patterns by base-specific cleavage and mass spectrometry. Proc. Natl Acad. Sci. USA 102, 15785–15790 (2005).
Li, M. et al. MicroRNAome of porcine pre- and postnatal development. PLoS One 5, e11541 (2010).
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645 (2009).
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
We thank Cheng Ye for help with data management, and Carol Yueng and Chris Heble for help with manuscript editing. This work was supported by grants from the National Special Foundation for Transgenic Species of China (2009ZX08009-155B and 2011ZX08006-003), the Specialized Research Fund of Ministry of Agriculture of China (NYCYTX-009), the Project of Provincial Twelfth 5 Years' Animal Breeding of Sichuan Province (2011YZGG15) and the International Science & Technology Cooperation Program of China (2011DFB30340) to X.L., and the National Natural Science Foundation of China (30901024) to M.L.
The authors declare no competing financial interests.
Supplementary Figures S1-S8, Supplementary Tables S1-S3, Supplementary Methods and Supplementary References. (PDF 8464 kb)
List of pig genes which are orthologs to the known human obesity-related genes. (XLS 118 kb)
List of DMRs which overlap 223 pig genes that are orthologs to the known human obesity-related genes. (XLS 644 kb)
List of 2,311 genes in the QTLs region that impact fatness and pork quality. (XLS 441 kb)
List of 901 high confidence and narrowed (< 2Mb) porcine QTLs which impact fatness and pork quality. (XLS 116 kb)
List of DMRs that overlap 1,669 pig genes in the QTLs region which affect fatness and pork quality. (XLS 3312 kb)
Identified miRNAs with known or potential role in obesity. List of DMRs overlapping miRNA promoters. (XLS 264 kb)
Enrichment analysis for genes with DMRs in promoters. List of DMRs overlapping mRNA promoters. (XLS 6783 kb)
Significant functional gene categories for the DMRs in promoter. (XLS 637 kb)
Profile of porcine miRNAs. (XLS 620 kb)
About this article
Cite this article
Li, M., Wu, H., Luo, Z. et al. An atlas of DNA methylomes in porcine adipose and muscle tissues. Nat Commun 3, 850 (2012). https://doi.org/10.1038/ncomms1854
Transcriptome analysis reveals the long intergenic noncoding RNAs contributed to skeletal muscle differences between Yorkshire and Tibetan pig
Scientific Reports (2021)
Integrative analysis of genome‐wide DNA methylation and gene expression profiles reveals important epigenetic genes related to milk production traits in dairy cattle
Journal of Animal Breeding and Genetics (2021)
Nitric Oxide in the Spinal Cord Is Involved in the Hyperalgesia Induced by Tetrahydrobiopterin in Chronic Restraint Stress Rats
Frontiers in Neuroscience (2021)
Generation of a Biobank From Two Adult Thoroughbred Stallions for the Functional Annotation of Animal Genomes Initiative
Frontiers in Genetics (2021)