Identifying key genes in milk fat metabolism by weighted gene co-expression network analysis

Mu, Tong; Hu, Honghong; Ma, Yanfen; Wen, Huiyu; Yang, Chaoyun; Feng, Xiaofang; Wen, Wan; Zhang, Juan; Gu, Yaling

doi:10.1038/s41598-022-10435-1

Download PDF

Article
Open access
Published: 27 April 2022

Identifying key genes in milk fat metabolism by weighted gene co-expression network analysis

Tong Mu¹,
Honghong Hu¹,
Yanfen Ma^1,2,
Huiyu Wen³,
Chaoyun Yang¹,
Xiaofang Feng¹,
Wan Wen⁴,
Juan Zhang¹ &
…
Yaling Gu¹

Scientific Reports volume 12, Article number: 6836 (2022) Cite this article

1540 Accesses
6 Citations
Metrics details

Subjects

Abstract

Milk fat is the most important and energy-rich substance in milk, and its content and composition are important reference elements in the evaluation of milk quality. However, the current identification of valuable candidate genes affecting milk fat is limited. IlluminaPE150 was used to sequence bovine mammary epithelial cells (BMECs) with high and low milk fat rates (MFP), the weighted gene co-expression network (WGCNA) was used to analyze mRNA expression profile data in this study. As a result, a total of 10,310 genes were used to construct WGCNA, and the genes were classified into 18 modules. Among them, violet (r = 0.74), yellow (r = 0.75) and darkolivegreen (r = − 0.79) modules were significantly associated with MFP, and 39, 181, 75 hub genes were identified, respectively. Combining enrichment analysis and differential genes (DEs), we screened five key candidate DEs related to lipid metabolism, namely PI4K2A, SLC16A1, ATP8A2, VEGFD and ID1, respectively. Relative to the small intestine, liver, kidney, heart, ovary and uterus, the gene expression of PI4K2A is the highest in mammary gland, and is significantly enriched in GO terms and pathways related to milk fat metabolism, such as monocarboxylic acid transport, phospholipid transport, phosphatidylinositol signaling system, inositol phosphate metabolism and MAPK signaling pathway. This study uses WGCNA to form an overall view of MFP, providing a theoretical basis for identifying potential pathways and hub genes that may be involved in milk fat synthesis.

Weighted gene co-expression network analysis identifies modules and functionally enriched pathways in the lactation process

Article Open access 27 January 2021

Milk proteome from in silico data aggregation allows the identification of putative biomarkers of negative energy balance in dairy cows

Article Open access 04 July 2019

TWAS revealed significant causal loci for milk production and its composition in Murrah buffaloes

Article Open access 16 December 2023

Introduction

Milk is not only the source of nutrition for newborn cows, it is also an important source of protein, sugar, lipids, and other nutrients for humans¹. Milk fat is the most important and energy-rich substance in milk and is an important component in the production of butter and yogurt, with a content of about 3–5% in milk. Milk fat also plays an important role in the nutrition and metabolism of human growth and development² Polyunsaturated fatty acids such as conjugated and non-conjugated linoleic acid (C18:2) contained in milk fat play a beneficial role in lowering blood lipids, suppressing immune responses, and stimulating lipid metabolism³; while high concentrations of saturated fatty acids such as myristic acid (C14:0), lauric acid (C12:0) and palmitic acid (C16:0) increase the concentration of low density lipoproteins in the blood, which is associated with cardiovascular disease⁴. Therefore, the content and composition of milk fat is the main reference element to evaluate the quality of milk. Nowadays, milk fat content is not only one of the important indicators for the core competitiveness of dairy products, but also a major target trait for dairy cattle breeding⁵. Exploring the theory and methods of milk fat formation and regulation to improve milk fat content in dairy cows has become a hot spot in international lactation biology research.

In the past, many scholars have extensively studied the complex regulatory mechanisms of mammary gland development and elucidated the major pathways of milk fat synthesis (including de novo synthesis and FA uptake in the blood)⁶. Breeding researchers have identified a range of potent genes and biomarkers in milk fat metabolism with the widespread use of next-generation sequencing technologies and the dramatic reduction in sequencing costs⁷. Despite the transcriptome determining the characteristic of spatial and temporal specificity, there is less linkage to phenotypic data⁸. Weighted gene co-expression network analysis (WGCNA) can combine gene expression with phenotypic data^9,10 and gather genes with similar expression patterns into one module¹¹. Genes in modules are often involved in the same function or pathway, which can be used in data analysis of complex processes, and playing an important role in exploring the characteristics of gene networks associated with complex diseases¹². For example, several biomarker genes screened and identified using the WGCNA method are associated with many biological problems such as cancer¹³, types I diabetes¹⁴, rheumatoid arthritis¹⁵, feed efficiency¹⁶, and meat quality¹⁷. Potential of this approach for grouping genes into the functional modules and revealing regulatory mechanisms underlying the complex traits have been highlighted in many recent studies¹⁸. In addition, there have been some studies on functional characteristics and lactation properties in ruminant mammary gland¹⁹, however, the application of WGCNA in milk fat metabolism in dairy cows has not been reported.

In this study, we used WGCNA to comprehensively analyze the mRNA expression profile data of high and low milk fat percentage (MFP) dairy cow mammary epithelial cells which measured by Illumina PE150 in the early stage of our group. Enrichment analysis of hub genes in modules closely related to MFP reveal its potential functions. In order to explore the signature genes and important functional enrichment pathways associated with MFP and provide a theoretical basis for understanding the complex biology of the milk fat synthesis process in dairy cows.

Results

Overview of BMECs sequencing data

Using Illumina PE150 sequencing platform, 81,605,996–97,102,888 and 78,710,246–88,676,080 raw_reads were obtained in high and low MFP BMECs, and 80,633,532–94,731,948 and 76,807,276–86,508,476 clean_reads were obtained after removing the adapter related, containing N and low quality, respectively. The sequencing error rate of the 8 samples is 0.02%, Q20 is greater than 97%, Q30 is greater than 94%, the GC content is about 53%, which ensures the accuracy of the subsequent analysis. The mapping rates for each sample after comparing clean_reads to the reference genome were 94.43% (H_2046), 94.96% (H_2098), 94.79% (H_2190), 94.97% (H_2226), 94.08% (L_2034), 94.32% (L_2037), 94.71% (L_2137) and 94.69% (L_2170).

Principal component and correlation analysis of samples

The correlation of gene expression levels between samples and principal component analysis are important indicators to test the reliability of experimental samples. Principal component and correlation analysis was performed based on TPM values of all genes in each sample (Fig. 1). It was found that the samples of the high-milk fat group and the low-milk fat group were significantly different, and the correlation coefficients within the group were all above 0.89, which indicated a high similarity of expression patterns within the samples, and no outlier samples were found. Therefore, the transcriptome sequencing results are reliable and can be used for subsequent analysis.

Weighted correlation network analysis

WGCNA analyzed 10 310 genes obtained after data preprocessing. When the scale-free topology model fit reached 0.8 (R² = 0.8), a soft thresholding power was 14 (β = 14) (Fig. 2A). The 41 co-expression modules were constructed by WGCNA (Fig. 2B), and 18 modules were obtained after merging modules with a similarity greater than 0.75 (Fig. 2C). The module containing the most genes was the green module (2 626 genes), followed by the pink module (1 890 genes), blue module (874 genes) and skyblue3 module (828 genes) (Supplementary Table 1).

The correlation between the co-expression module and the MFP phenotype was analyzed. The results showed that multiple modules were associated with MFP phenotype, among them violet (r = 0.74) and yellow (r = 0.75) modules are significantly positively correlated with MFP (Fig. 2D), including 169 genes (Fig. 3B) and 547 genes (Fig. 3C) respectively, while darkolivegreen module was significantly negatively correlated with MFP (r = − 0.79) (Fig. 2D), including 336 genes (Fig. 3A).There were 39, 181 and 75 genes that had high gene significance (GS > 0.4, Fig. 3D) with MFP phenotype in these three modules, respectively, and the correlation between all these genes and module members (MM) is greater than 0.9, so these genes were considered as hub genes (Supplementary Table 2).

After obtaining hub genes of the three modules, the intersection was taken with the 915 DEs screened from the transcriptome data (Supplementary Table 3) (Fig. 3E), and it was found that the hub genes were isolated between the three modules, and the yellow module contains 14 DEs, which is the largest number, followed by the darkolivegreen module (5 DEs) and the violet module (4 DEs). The DEs contained in each module are shown in Table 1.

Table 1 DEs contained within the darkolivegreen, violet and yellow modules.

Full size table

Functional enrichment analysis of hub genes

To determine the specific functions of the genes within the three modules significantly associated with MFP, we performed GO and KEGG enrichment analysis on 295 hub genes in the three modules. A total of 1 301 GO terms were enriched as a result (Supplementary Table 4), and 180 GO terms were significantly enriched (P < 0.05). There were 37 significantly enriched GO terms related to lipid metabolism, and Fig. 4A shows 11 significantly enriched biological processes (BP) related to lipid metabolism, 15 molecular functions (MF) related to lipid metabolism and 7 representative cellular components (CC). GO terms closely related to milk fat synthesis include acylglycerol lipase activity, ubiquitin protein ligase binding, intermembrane lipid transfer and lipid binding. Notably, SLC16A1, ATP8A2 and PI4K2A are hub genes related to lipid metabolism and also DEs screened in the transcriptome data (P < 0.05), which were mainly enriched in monocarboxylic acid transport, phospholipid transport and phosphatidylinositol phosphorylation terms.

The KEGG enrichment results showed that 239 pathways were enriched (Supplementary Table 4), among them 66 pathways were significantly enriched (P < 0.05) and 13 pathways were associated with lipid metabolism. Figure 4B show the hub genes contained in 13 lipid metabolism-related pathways, with PI4K2A, VEGFD and ID1 being the DEs and significantly enriched to phosphatidylinositol signaling system, inositol phosphate metabolism, Rap1, MAPK and Ras signaling pathway. Interestingly, the down-regulated gene PI4K2A within the yellow module was significantly enriched in GO terms and KEGG pathways related to lipid metabolism (Fig. 5A). The PI4K2A gene is involved in diacylglycerol and glycerophospholipid metabolism of the phosphatidylinositol and inositol phosphate metabolic pathways, respectively (Fig. 6), which suggesting that PI4K2A may play an important function in milk fat synthesis. In addition, some hub genes were not significantly different between the high and low milk fat groups, but they were significantly enriched in the GO terms and KEGG pathways related to lipid metabolism (Table 2), and these genes are likely to play a potential role in milk fat synthesis.

Table 2 Hub genes significantly enriched to lipid metabolism-related GO terms and the KEGG pathway (non-DEs).

Full size table

Protein interaction network analysis

The results of hub genes enrichment analysis intersected with DEs to identify SLC16A1, ATP8A2, PI4K2A, VEGFD and ID1 may be the key candidate genes involved in milk fat synthesis (Fig. 5A). We performed protein interaction network analysis on these five candidate genes. Since these genes are not directly functionally related to each other, we have selected the top 10 interacting proteins that are similar in function to the candidate gene. PI4K2A and VEGFD (FIGF) had the highest strength of data support with the 10 proteins with a common function (Fig. 5B,F) and the confidence levels of 0.9 and 0.7 respectively, followed by ID1 with a confidence level of 0.7 (Fig. 5E). The strength of data support for SLC16A1 and ATP8A2 with the top 10 proteins was low (Fig. 5C,D), with moderate confidence (0.4), but still have reliable reference value.

Tissue expression profile analysis of key candidate genes

The expression levels of SLC16A1, ATP8A2, PI4K2A, VEGFD and ID1 varied in different tissues. PI4K2A gene expression was relatively highest in the mammary gland (Fig. 7E), which significantly higher than small intestine, liver, kidney, heart and ovary tissues (P < 0.05), and slightly lower in uterus than mammary gland. The expression level of ATP8A2 and ID1 genes in mammary gland ranks second (Fig. 7C,D); VEGFD gene expression in mammary gland was significantly lower than heart and similar to that in uterus (Fig. 7B). The SLC16A1 gene was highly expressed in the kidney, followed by the liver, with lower expression in other tissues and non-significant differences (Fig. 7A). In addition, we examined the relative expression levels of key candidate genes for milk fat synthesis in BMECs from the high and low milk fat groups (three technical replicates), it was found that the trends of the qRT-PCR experiment results and the RNAseq sequencing results are consistent by using log₂FoldChange to convert the difference multiples (Fig. 7F), confirming the accuracy of the transcriptome sequencing.

Discussion

Some new analytical methods are gradually making up for the limitations of traditional biological research with the continuous innovation of sequencing technology and the rapid development of bioinformatics, which can fully and effectively explore the biological significance of the massive amount of data^20,21. Compared to other regulatory networks, WGCNA is an effective data mining method that modularizes large datasets based on similar expression patterns of genes to obtain co-expression modules with high biological significance²². In recent years, WGCNA has been applied to explore the characteristics of human and plant life activities^23,24,25. However, the use of WGCNA on the metabolism of milk fat has not been reported. Milk fat synthesis in dairy cows is related to many physiological and metabolic changes. To gain new insights into the expression and regulation of key genes in milk fat synthesis, we used WGCNA to comprehensively analyze the mRNA expression profile data of high and low MFP dairy cow mammary epithelial cells which measured by Illumina PE150 in the early stage of our group. Clustering the important genes into modules of specific biological pathways that may be associated with MFP in cows, thereby improving the efficiency of identification of important genes.

In this study, we constructed the first gene co-expression network for high and low MFP in Holstein cows, and obtained three modules significantly associated with MFP, violet, yellow and darkolivegreen respectively. The hub gene enrichment analysis in the three modules showed that SLC16A1, ATP8A2, VEGFD, ID1 and PI4K2A genes, which overlap with DEs, were significantly enriched to lipid metabolism-related pathways. Among them, the SLC16A1, ATP8A2 and PI4K2A genes were significantly enriched for monocarboxylate transport, phospholipid transport and phosphatidylinositol phosphorylation terms. It is well known that many monocarboxylic acids were utilised by the body’s metabolism, and acetic acid and β-hydroxybutyric acid are the main substrates for the de novo synthesis of fatty acids in ruminants and are essential for meeting the energy requirements (70%) and milk fat synthesis in cows. Acetic acid and β-hydroxybutyric acid play a positive regulatory role in the de novo synthesis of fatty acids, the transport and desaturation of long-chain fatty acids and the synthesis of triglycerides^26,27,28; Phosphatidylinositol is a phospholipid, it is also one of the five main polar lipids in milk (less than 2% of total fat) in milk²⁹. Polar lipids are the main component of the milk fat globule membrane (MFGM), which is responsible for wrapping the lipid droplets secreted by BMECs³⁰. Therefore, SLC16A1, ATP8A2 and PI4K2A may be key candidate genes for the regulation of milk fat synthesis. The MAPK signalling pathway plays a key role in the inflammatory response and induces the expression of a variety of inflammatory mediators and pro-inflammatory cytokines³¹. The RAP1 pathway is a key component of the BMECs³², and during peak lactation in dairy cows, MAPK and RAP1 signaling pathways can increase milk production by regulating the proliferation and differentiation of BMECs^33,34. Rap1 has been shown to antagonize Ras signals in inactive complexes by capturing its effector protein (serine/threonine kinase Raf)³⁵. In this study, KEGG enrichment analysis revealed that PI4K2A, VEGFD and ID1 genes (i.e. DEs and hub genes) were significantly enriched in the phosphatidylinositol, Rap1, MAPK and Ras signaling pathway, which suggested that PI4K2A, VEGFD and ID1 genes were likely to be involved in the lactation process of cows. Notably, the PI4K2A gene was significantly enriched to the GO terms and KEGG pathways associated with lipid metabolism and was involved in diacylglycerol and glycerophospholipid metabolism in the phosphatidylinositol and inositol phosphate metabolic pathways, respectively, which indicates its potential to be involved in regulating milk fat synthesis.

PI4K2A is a key enzyme for the synthesis of phosphatidylinositol 4-phosphate with multiple cell signaling functions³⁶, which is critical for epidermal growth factor receptor degradation³⁷, transferrin receptor recycling³⁸, autophagy-lysosome fusion³⁹ and prognosis of breast cancer patients⁴⁰. A genome-wide association study of milk fatty acid composition in Italian Simmental and Italian Holstein cows by using single nucleotide polymorphism arrays⁴¹, which revealed that PI4K2A may be involved in milk fat metabolism. In addition, the CDIPT gene, which significantly interacts with PI4K2A, and it also plays an important role in fatty acid and energy metabolism⁴², which reflecting the potential importance of the PI4K2A gene in milk fat metabolism. This study found that the expression level of PI4K2A was significantly higher in dairy cows’ mammary tissue than the small intestine, liver, kidney, heart and ovary. It further indicates that PI4K2A may have an important function in milk fat synthesis in dairy cows, and a more in-depth functional verification of specific mechanisms is required. DNA binding inhibitor 1 (ID1) is a helix-loop-helix transcription factor that is highly expressed in brown adipose tissue⁴³ and promotes obesity by inhibiting brown fat thermogenesis and white fat browning⁴⁴. Functionally ID1 is involved in regulating the transcriptional activity of ADD1/SREBP-1c, thereby regulating adipogenesis⁴⁵. Marcin et al.⁴⁶ showed that Mammalian target of rapamycin can regulate mammary epithelial cells growth through ID1. ATP8A2 is a P4-ATPase that transfers (flips) phosphatidylserine and phosphatidylethanolamine from the ectoplasmic lobules of the cell membrane lipid bilayer to the cytoplasmic lobules, resulting in asymmetric lipid partitioning between membrane lobules⁴⁷. Vascular endothelial growth factor D (VEGFD) is considered the main angiogenic component of adipose tissue⁴⁸, which enhances lymphangiogenesis and reduces obesity-related immune accumulation in mouse adipose tissue⁴⁹, but VEGFD deficiency does not affect adipose tissue development in mice⁵⁰. The expression levels of ATP8A2, ID1 and VEGFD were significantly higher in mammary tissue than small intestine, liver, kidney and ovary of dairy cows, which suggested that they may have potential biological functions in the mammary gland. SLC16A1 has an important role in short-chain fatty acid transport⁵¹. Hu et al.⁵² studies suggested that SLC16A1 may be involved in hepatic lipid metabolism in pigs, which is consistent with the high-level expression results of SLC16A1 in liver tissues of this study. Although the expression level of the SLC16A1 gene in dairy cows’ mammary gland tissue is significantly lower than that in liver and kidney. However, compared with other tissues, the expression abundance of SLC16A1 is still at the upper-middle level, so the role of SLC16A1 in dairy cows’ milk fat metabolism cannot be ignored. In the future, members of our group will continue to investigate the functional mechanisms of these key candidate DEs (SLC16A1, VEGFD, ID1, ATP8A2 and PI4K2A) in lipid metabolism screened by WGCNA, and in order to elucidate their specific regulatory mechanisms.

Conclusion

In this study, a comprehensive analysis of mRNA expression profile data based on Illumina PE150 sequencing of high and low MFP BMECs was performed by WGCNA, resulting in three modules (violet, yellow and darkolivegreen) that were significantly associated with MFP. After enrichment analysis, a total of 5 candidate DEs related to lipid metabolism were screened out, namely PI4K2A, SLC16A1, ATP8A2, VEGFD and ID1. Among them, PI4K2A is more likely to be involved in milk fat metabolism. The results of this study provide a new way to understand the function of genes in milk fat synthesis in dairy cows and it also provide a new perspective on the study of the lactation process in cattle.

Materials and methods

Ethics statement

Animal experiments were conducted in accordance with the Regulations for the Administration of Affairs Concerning Experimental Animals (Ministry of Science and Technology, China, 2004). It is authorized by the Animal Ethics Committee of Ningxia University (permit number NXUC20200619). The cattle used in the experiments was electric shocked before being released. Take tissue samples immediately, making all efforts to minimize its suffering. This work also conformed to the requirements of American Veterinary Medical Association (AVMA) Guidelines. This study is reported in accordance with the recommendations put forward by the ARRIVE guidelines.

Data source and preprocessing

The data of 14 543 mRNA expression profiles in this study were obtained from the results of Illumina PE150 sequencing of BMECs of high and low MFP cows in the previous study by our research group (Supplementary Table 5). Sequencing samples were obtained from the Maosheng pasture of He Lanshan in Ningxia state farm, where the test cows were fed the same balanced total mixed diet. A total of 245 Holstein cows of similar age and in the mid and late lactation were selected. Collect milk samples of each cow in the morning, at noon, and in the evening for dairy herd improvement (DHI). Screen 8 Holstein cows with somatic cell counts within 100,000/mL and extreme differences in MFP (Table 3). BMECs were isolated from fresh milk by aseptic collection⁵³, and the library construction and transcriptome sequencing were carried out by Beijing Nuohe Zhiyuan Biotechnology Co., Ltd.

Table 3 High and low MFP of Holstein cattle.

Full size table

A chain-specific library was constructed by removing ribosomal RNA. After passing the library inspection, Illumina PE150 sequencing was performed. After the original data is obtained, the reads with adapter, N (undetermined base information) ratio greater than 0.002, and low-quality bases with a read length of more than 50% are removed. After sequencing error rates (Q20 and Q30) and GC content distribution checks, clean reads for subsequent analysis were obtained. Hisat2 (http://ccb.jhu.edu/software/hisat2, version 2.1.0) software were used to compare and analyze the RNA sequencing (RNA-seq) data (the reference genome version is bos_taurus_Ensembl_97)⁵⁴.

Since the mRNA expression profile data in transcriptome sequencing is represented by the FPKM value, the FPKM was converted to TPM by using the colSums function of the tidyverse package of the R (version 4.1.1). After that, the principal component and correlation analysis of the eight samples was performed by online post-sale tool platform provided in Beijing Nuohe Zhiyuan Biotechnology Co., Ltd (https://magic.novogene.com/customer/ main#/home/2d9dc26d1e059b931b9ac5364 9482c7c).

Construction of co-expression network

The median absolute deviation of different gene expression profiles were first calculated by the apply function in R to eliminate outliers and abnormal values in the data set, and then the goodSamplesGenes function was used to detect missing values and samples below the sample threshold. And finally, 10,310 genes with relatively high expression were obtained. The co-expression network of mRNA expression profile data was constructed by the R package WGCNA⁵⁵. The construction of a weighted gene network requires the optimal selection of soft thresholding power β that improves co-expression similarity and calculates the adjacency. Therefore, picking the optimal soft thresholding power β was performed using the function pickSoftThreshold (based on the criterion of approximate scale-free topology) in the R package WGCNA. When 0.8 is used as the correlation coefficient threshold (R² = 0.8), the soft thresholding power β was 14 and the minimum number of genes in the module is 111, and the number of genes to construct the co-expression network is set to 100. The module detection sensitivity was 2 (deepSplit = 2), and the cut height for merging of modules was 0.25 (mergeCutHeight = 0.25, i.e., merge into one module if the correlation coefficient of eigengenes within the module is greater than 0.75).

Identification of key candidate genes

In the module-trait correlation analysis, hub genes were considered as genes with GS greater than 0.4 and high module group members (MM, weighted correlation index > 0.9), indicating a significant correlation with milk fat percentage.

Functional enrichment and protein interaction network analysis

Here, functional enrichment analysis was performed using the KOBAS website (http://kobas.cbi.pku.edu.cn/genelist/, version 3.0) and its results were visualized by the R package GOplot (version 1.0.2). The hub genes were intersected with the differential genes (DEs) screened by the transcriptome (P < 0.05) and combining the results of enrichment analysis to screen key candidate genes for milk fat metabolism, and protein interaction network analysis carried by String website (https://www.string-db.org/https://www.string-db.org/, version 11.5).

qRT-PCR validation of key candidate genes

Small intestine, liver, kidney, heart, ovary, uterus and mammary gland tissues were collected from three cows in the mid and late lactation with similar age, and the tissues were cut and quickly placed in liquid nitrogen and brought back to the laboratory for total RNA extraction and first-strand cDNA synthesis. Real-time quantitative reverse transcription PCR (qRT-PCR) was used to detect the expression of key candidate genes for milk fat in different tissues and to verify their expression levels in BMECs of high and low MFP cows.

Total RNA was extracted by using RNA simple Total RNA Kit (Tiangen Biochemical Technology Co., Ltd). First-strand cDNA synthesis was performed by using PrimeScript RT Kit (Takara, Dalian, China). qRT-PCR (three replicates) was performed by SYBR Premix Ex Taq II (TaKaRa, Dalian, China) on the Bio-Rad CFX96 Touch Real-Time PCR Detection System (Bio-Rad, Hercules, CA, USA). Amplification procedure: 95 °C for 30 s, 95 °C for 5 s, annealing for 30 s, 40 cycles. qRT-PCR primers were designed by using Primer Premier 5.0 and the primers span at least one intron, the sequence and annealing temperature of each primer were shown in Table 4.

Table 4 Primer sequence and annealing temperature.

Full size table

Statistical analysis

The statistical significance of differences between the two groups was analyzed using a non-parametric test or t-test based on the data distribution characteristics. All the analyses were conducted using the software R; the P < 0.05 was considered statistically significant. The relative expression of DEs was analyzed by the 2^−ΔΔCt method and normalized using the glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene.

Institutional review board statement

The Animal Ethics Committees of Ningxia University approved the experimental design and animal sample collection for the present study (permit number NXUC2 0200619). And animal experiments were conducted strictly followed the guidelines of the Regulations for the Administration of Affairs Concerning Experimental Animals (Ministry of Science and Technology, China, 2004).

Data availability

All data generated or analyzed in this study are included in this article [and its Supplementary Information File], and the datasets have been submitted to the SRA database with the Accession Number PRJNA730595. Access to the data of permanent link to https://www.ncbi.nlm.nih.gov/sra/PRJNA730595.

References

Wu, Y. et al. Effect of calcium on absorption properties and thermal stability of milk during microwave heating. Int. J. Mol. Sci. 19, 1747. https://doi.org/10.3390/ijms19061747 (2018).
Article CAS PubMed Central Google Scholar
Chen, Z. et al. MicroRNA-106b regulates milk fat metabolism via ATP binding cassette subfamily A member 1 (ABCA1) in bovine mammary epithelial cells. J. Agric. Food Chem. 67, 3981. https://doi.org/10.1021/acs.jafc.9b00622 (2019).
Article CAS PubMed Google Scholar
Belury, M. A. Dietary conjugated linoleic acid in health: Physiological effects and mechanisms of action. Annu. Rev. Nutr. 22, 505. https://doi.org/10.1146/annurev.nutr.22.021302.121842 (2002).
Article CAS PubMed Google Scholar
Zhou, C., Shen, D., Li, C., Cai, W. & Zhang, S. Comparative transcriptomic and proteomic analyses identify key genes associated with milk fat traits in chinese Holstein cows. Front. Genet. 10, 672. https://doi.org/10.3389/fgene.2019.00672 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, D. et al. MiR-486 regulates lactation and targets the PTEN gene in cow mammary glands. PLoS ONE 10, e0118284. https://doi.org/10.1371/journal.pone.0118284 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bauman, D. E., Mather, I. H., Wall, R. J. & Lock, A. L. Major advances associated with the biosynthesis of milk. J. Dairy Sci. 89, 1235. https://doi.org/10.3168/jds.S0022-0302(06)72192-0 (2006).
Article CAS PubMed Google Scholar
Xu, B. et al. Multiple roles for the non-coding RNA SRA in regulation of adipogenesis and insulin sensitivity. PLoS ONE 5, e14199. https://doi.org/10.1371/journal.pone.0014199 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Fan, Y., Arbab, A., Zhang, H., Yang, Y. & Yang, Z. Lactation associated genes revealed in Holstein dairy cows by weighted gene co-expression network analysis (WGCNA). Animals 11, 314. https://doi.org/10.3390/ani11020314 (2021).
Article PubMed PubMed Central Google Scholar
Yuan, Y. D., Zhang, B., Tang, X. G., Zhang, J. C. & Lin, J. Comparative transcriptome analysis of different dendrobium species reveals active ingredients-related genes and pathways. Int. J. Mol. Sci. 21, 861. https://doi.org/10.3390/ijms21030861 (2020).
Article CAS PubMed Central Google Scholar
Ali, M. et al. Comparative transcriptomic analysis to identify the genes related to delayed gland morphogenesis in Gossypium bickii. Genes 11, 472. https://doi.org/10.3390/genes11050472 (2020).
Article CAS PubMed Central Google Scholar
Edgardo, G. V. & Ernesto, P. R. Identification of modules with similar gene regulation and metabolic functions based on co-expression data. Front. Mol. Biosci. 6, 139. https://doi.org/10.3389/fmolb.2019.00139 (2019).
Article CAS Google Scholar
Yao, Q., Song, Z., Wang, B., Qin, Q. & Zhang, J. A. Identifying key genes and functionally enriched pathways in sjgren’s syndrome by weighted gene co-expression network analysis. Front. Genet. 10, 1142. https://doi.org/10.3389/fgene.2019.01142 (2019).
Article CAS PubMed PubMed Central Google Scholar
Jia, R., Zhao, H. & Jia, M. Identification of co-expression modules and potential biomarkers of breast cancer by WGCNA. Gene 750, 144757. https://doi.org/10.1016/j.gene.2020.144757 (2020).
Article CAS PubMed Google Scholar
Medina, I. R. & Lubovac-Pilav, Z. Gene co-expression network analysis for identifying modules and functionally enriched pathways in type 1 diabetes. PLoS ONE 11, e0156006. https://doi.org/10.1371/journal.pone.0156006 (2016).
Article CAS Google Scholar
Ma, C. et al. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis. Int. J. Rheum. Dis. 20, 971. https://doi.org/10.1111/1756-185X.13063 (2017).
Article CAS PubMed Google Scholar
Salleh, S. M., Mazzoni, G., Løvendahl, P. & Kadarmideen, H. N. Gene co-expression networks from RNA sequencing of dairy cattle identifies genes and pathways affecting feed efficiency. BMC Bioinform. 19, 513. https://doi.org/10.1186/s12859-018-2553-z (2018).
Article CAS Google Scholar
Bordini, M., Zappaterra, M., Soglia, F., Petracci, M. & Davoli, R. Weighted gene co-expression network analysis identifies molecular pathways and hub genes involved in broiler white striping and wooden breast myopathies. Sci. Rep. 11, 1776. https://doi.org/10.1038/s41598-021-81303-7 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Bakhtiarizadeh, M. R., Hosseinpour, B., Shahhoseini, M., Korte, A. & Gifani, P. Weighted gene co-expression network analysis of endometriosis and identification of functional modules associated with its main hallmarks. Front. Genet. 9, 453. https://doi.org/10.3389/fgene.2018.00453 (2018).
Article CAS PubMed PubMed Central Google Scholar
Deng, T. et al. Campanile G. Integrative analysis of transcriptome and GWAS data to identify the hub genes associated with milk yield trait in buffalo. Front. Genet. 10, 36. https://doi.org/10.3389/fgene.2019.00036 (2019).
Article CAS PubMed PubMed Central Google Scholar
Guo, R., Zhao, Y., Zou, Q., Fang, X. & Peng, S. Bioinformatics applications on apache spark. GigaScience 8, 098. https://doi.org/10.1093/gigascience/giy098 (2018).
Article CAS Google Scholar
Chen, Y. C. et al. Systematic elucidation of the mechanism of genistein against pulmonary hypertension via network pharmacology approach. Int. J. Mol. Sci. 20, 5569. https://doi.org/10.3390/ijms20225569 (2019).
Article CAS PubMed Central Google Scholar
Zhao, W. et al. Weighted gene coexpression network analysis: State of the art. J. Biopharm. Stat. 20, 281. https://doi.org/10.1080/10543400903572753 (2010).
Article MathSciNet PubMed Google Scholar
Wu, Y. D. et al. Co-expression of key gene modules and pathways of human breast cancer cell lines. Biosci. Rep. 39, 20181925. https://doi.org/10.1042/BSR20181925 (2019).
Article Google Scholar
Xu, Y., Zhu, C., Xu, C., Sun, J. & Chen, K. Integration of metabolite profiling and transcriptome analysis reveals genes related to volatile terpenoid metabolism in finger citron (C. medica var. sarcodactylis). Molecules 24, 2564. https://doi.org/10.3390/molecules24142564 (2019).
Article CAS PubMed Central Google Scholar
Ye, Z., Sun, B., Mi, X. & Xiao, Z. D. Gene co-expression network for analysis of plasma exosomal miRNAs in the elderly as markers of aging and cognitive decline. PeerJ 8, e8318. https://doi.org/10.7717/peerj.8318 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sun, Y. et al. Effect of short-chain fatty acids on triacylglycerol accumulation, lipid droplet formation and lipogenic gene expression in goat mammary epithelial cells. Anim. Sci. J. 87, 242. https://doi.org/10.1111/asj.12420 (2016).
Article CAS PubMed Google Scholar
Urrutia, N. L. & Harvatine, K. J. Effect of conjugated linoleic acid and acetate on milk fat synthesis and adipose lipogenesis in lactating dairy cows. J. Dairy. Sci. 100, 1. https://doi.org/10.3168/jds.2016-12369 (2017).
Article CAS Google Scholar
Ali, I., Li, C., Li, L., Kuang, M. & Wang, G. Effect of acetate, β-hydroxybutyrate and their interaction on lipogenic gene expression, triglyceride contents and lipid droplet formation in dairy cow mammary epithelial cells. In Vitro Cell. Dev. Biol. Anim. 57, 66. https://doi.org/10.1007/s11626-020-00538-2 (2021).
Article CAS PubMed Google Scholar
Lígia, P., Ana, G., Manuela, P. & Miguel, R. Isolation and analysis of phospholipids in dairy foods. J. Anal. Methods Chem. 2016, 9827369. https://doi.org/10.1155/2016/9827369 (2016).
Article CAS Google Scholar
Monks, J. et al. Xanthine oxidoreductase mediates membrane docking of milk-fat droplets but is not essential for apocrine lipid secretion. J. Physiol. 594, 5899. https://doi.org/10.1113/JP272390 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kim, S. D. et al. Baicalein inhibits agonist- and tumor cell-induced platelet aggregation while suppressing pulmonary tumor metastasis via cAMP-mediated VASP phosphorylation along with impaired MAPKs and PI3K-Akt activation. Biochem. Pharmacol. 92, 251. https://doi.org/10.1016/j.bcp.2014.09.019 (2014).
Article CAS PubMed Google Scholar
Itoh, M., Nelson, C. M., Myers, C. A. & Bissell, M. J. Rap1 integrates tissue polarity, lumen formation, and tumorigenic potential in human breast epithelial cells. Cancer. Res. 67, 4759. https://doi.org/10.1158/0008-5472 (2007).
Article PubMed PubMed Central Google Scholar
Fata, J. E., Mori, H., Ewald, A. J., Hui, Z. & Bissell, M. J. The MAPK(ERK-1,2) pathway integrates distinct and antagonistic signals from TGFalpha and FGF7 in morphogenesis of mouse mammary epithelium. Dev. Biol. 306, 193. https://doi.org/10.1016/j.ydbio.2007.03.013 (2010).
Article CAS Google Scholar
Farhadian, M., Rafat, S. A., Panahi, B. & Mayack, C. Weighted gene co-expression network analysis identifies modules and functionally enriched pathways in the lactation process. Sci. Rep. 11, 2367. https://doi.org/10.1038/s41598-021-81888-z (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Kosuru, R. & Chrzanowska, M. Integration of Rap1 and calcium signaling. Int. J. Mol. Sci. 21, 1616. https://doi.org/10.3390/ijms21051616 (2020).
Article CAS PubMed Central Google Scholar
Tai, A. W., Bojjireddy, N. & Balla, T. A homogeneous and nonisotopic assay for phosphatidylinositol 4-kinases. Anal. Biochem. 417, 97. https://doi.org/10.1016/j.ab.2011.05.046 (2011).
Article CAS PubMed PubMed Central Google Scholar
Minogue, S. Phosphatidylinositol 4-kinase is required for endosomal trafficking and degradation of the EGF receptor. J. Cell Sci. 119, 571. https://doi.org/10.1242/jcs.02752 (2006).
Article CAS PubMed Google Scholar
Ketel, K. et al. A phosphoinositide conversion mechanism for exit from endosomes. Nature 529, 408. https://doi.org/10.1038/nature16516 (2016).
Article ADS CAS PubMed Google Scholar
Baba, T., Toth, D. J., Sengupta, N., Kim, Y. J. & Balla, T. Phosphatidylinositol 4,5-bisphosphate controls Rab7 and PLEKMH1 membrane cycling during autophagosome–lysosome fusion. EMBO. J. 38, e100312. https://doi.org/10.15252/embj.2018100312 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pataer, A., Ozpolat, B., Shao, R. P., Cashman, N. R. & Swisher, S. G. Therapeutic targeting of the PI4K2A/PKR lysosome network is critical for misfolded protein clearance and survival in cancer cells. Oncogene 39, 1. https://doi.org/10.1038/s41388-019-1010-4 (2019).
Article CAS Google Scholar
Palombo, V. et al. Genome-wide association study of milk fatty acid composition in Italian Simmental and Italian Holstein cows using single nucleotide polymorphism arrays. J. Dairy. Sci. 101, 11004. https://doi.org/10.3168/jds.2018-14413 (2018).
Article CAS PubMed Google Scholar
Fu, C. Z., Wang, H., Mei, C. G., Wang, J. L. & Jiang, B. J. SNPs at 3’-UTR of the bovine CDIPT gene associated with Qinchuan cattle meat quality traits. Genet. Mol. Res. 12, 775. https://doi.org/10.4238/2013.March.13.6 (2013).
Article CAS PubMed Google Scholar
Zhang, K. et al. A novel role of Id1 in regulating oscillatory shear stress-mediated lipid uptake in endothelial cells. Ann. Biomed. Eng. 46, 849. https://doi.org/10.1007/s10439-018-2000-3 (2018).
Article PubMed Google Scholar
Patil, M. et al. Id1 promotes obesity by suppressing brown adipose thermogenesis and white adipose browning. Diabetes 66, 1611. https://doi.org/10.2337/db16-1079 (2017).
Article CAS PubMed PubMed Central Google Scholar
Moldes, M. et al. Functional antagonism between inhibitor of DNA binding (Id) and adipocyte determination and differentiation factor 1/sterol regulatory element-binding protein-1c (ADD1/SREBP-1c) trans-factors for the regulation of fatty acid synthase promoter in adipocytes. Biochem. J. 344, 873. https://doi.org/10.1042/0264-6021:3440873 (1999).
Article CAS PubMed PubMed Central Google Scholar
Marcin, J., Bernd, G. & Sylvane, D. Mammalian target of rapamycin regulates the growth of mammary epithelial cells through the inhibitor of deoxyribonucleic acid binding Id1 and their functional differentiation through Id2. Mol. Endocrinol. 20, 2369. https://doi.org/10.1210/me.2006-0071 (2006).
Article CAS Google Scholar
Andersen, J. P. et al. P4-ATPases as phospholipid flippases—Structure, function, and enigmas. Front. Physiol. 7, 275. https://doi.org/10.3389/fphys.2016.00275 (2016).
Article PubMed PubMed Central Google Scholar
Hausman, G. J. & Richardson, R. L. Adipose tissue angiogenesis. J. Anim. Sci. 82, 925. https://doi.org/10.1051/gse:2003061 (2004).
Article CAS PubMed Google Scholar
Chakraborty, A., Barajas, S., Lammoglia, G. M., Reyna, A. J. & Rutkowski, J. M. Vascular endothelial growth factor–D (VEGF-D) overexpression and lymphatic expansion in murine adipose tissue improves metabolism in obesity. Am. J. Pathol. 189, 924. https://doi.org/10.1016/j.ajpath.2018.12.008 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lijnen, H. R., Frederix, L., Hoef, B. V. & Dewerchin, M. Deficiency of vascular endothelial growth factor-D does not affect murine adipose tissue development. Biochem. Biophys. Res. Commun. 378, 255. https://doi.org/10.1016/j.bbrc.2008.11.032 (2009).
Article CAS PubMed Google Scholar
Stumpff, F. A look at the smelly side of physiology: Transport of short chain fatty acids. Pflugers. Arch. 470, 571. https://doi.org/10.1007/s00424-017-2105-9 (2018).
Article CAS PubMed Google Scholar
Hu, Y., Chen, D., Yu, B., Yan, H. & Luo, L. Effects of dietary fibres on gut microbial metabolites and liver lipid metabolism in growing pigs. J. Anim. Physiol. Anim. Nutr. 104, 1484. https://doi.org/10.1111/jpn.13429 (2020).
Article CAS Google Scholar
Duan, A. Q. et al. Isolation, culture and identification of mammary epithelial cells in buffalo milk. Chin. Anim. Husb. Vet. Med. 44, 3243. https://doi.org/10.16431/j.cnki.1671-7236.2017.11.019 (2017).
Article Google Scholar
Mu, T., Hu, H., Feng, X., Ma, Y. & Gu, Y. Screening and joint analysis of key lncRNAs for milk fat metabolism in dairy cows. Front. Genet. 13, 772115. https://doi.org/10.3389/fgene.2022.772115 (2022).
Article PubMed PubMed Central Google Scholar
Langfelder, P. & Horvath, S. WGCNA: An R package for weighted correlation network analysis. BMC Bioinform. 9, 559. https://doi.org/10.1186/1471-2105-9-559 (2008).
Article CAS Google Scholar

Download references

Acknowledgements

Thanks to all the teachers who helped with this experiment, and to all the authors of this paper for their hard work.

Funding

This project is supported by the special breeding project of high-quality and high yield dairy cows in the Ningxia Autonomous region (Grant No: 2019NYYZ05).

Author information

Authors and Affiliations

School of Agriculture, Ningxia University, Yinchuan, 750021, China
Tong Mu, Honghong Hu, Yanfen Ma, Chaoyun Yang, Xiaofang Feng, Juan Zhang & Yaling Gu
Key Laboratory of Ruminant Molecular and Cellular Breeding, Ningxia Hui Autonomous Region, Ningxia University, Yinchuan, 750021, China
Yanfen Ma
Maosheng Pasture of He Lanshan in Ningxia State Farm, Yinchuan, 750001, China
Huiyu Wen
Animal Husbandry Extension Station, Yinchuan, 750001, China
Wan Wen

Authors

Tong Mu
View author publications
You can also search for this author in PubMed Google Scholar
Honghong Hu
View author publications
You can also search for this author in PubMed Google Scholar
Yanfen Ma
View author publications
You can also search for this author in PubMed Google Scholar
Huiyu Wen
View author publications
You can also search for this author in PubMed Google Scholar
Chaoyun Yang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofang Feng
View author publications
You can also search for this author in PubMed Google Scholar
Wan Wen
View author publications
You can also search for this author in PubMed Google Scholar
Juan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yaling Gu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Data processing and article writing, T.M. and C.Y. Article modification and visualization, H.H. Sample collection and verified by qRT-PCR, H.W., X.F., W.W. and J.Z. Article grammar modification, Y.M. Conceptual analysis, writing-review and editing, Y.G. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Yaling Gu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Legends.

Supplementary Table 1.

Supplementary Table 2.

Supplementary Table 3.

Supplementary Table 4.

Supplementary Table 5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mu, T., Hu, H., Ma, Y. et al. Identifying key genes in milk fat metabolism by weighted gene co-expression network analysis. Sci Rep 12, 6836 (2022). https://doi.org/10.1038/s41598-022-10435-1

Download citation

Received: 19 November 2021
Accepted: 21 March 2022
Published: 27 April 2022
DOI: https://doi.org/10.1038/s41598-022-10435-1

This article is cited by

Integrated transcriptomic and WGCNA analyses reveal candidate genes regulating mainly flavonoid biosynthesis in Litsea coreana var. sinensis
- Na Xie
- Qiqiang Guo
- Lan Yang
BMC Plant Biology (2024)
Genetic improvement of economic traits in Murrah buffalo using significant SNPs from genome-wide association study
- Linda George
- Rani Alex
- Archana Verma
Tropical Animal Health and Production (2023)
Identification of milk-related genes and regulatory networks in Bactrian camel either supplemented or under grazing
- Lili Guo
- DaoLema
- Wenguang Zhang
Tropical Animal Health and Production (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Overview of BMECs sequencing data

Principal component and correlation analysis of samples

Weighted correlation network analysis

Functional enrichment analysis of hub genes

Protein interaction network analysis

Tissue expression profile analysis of key candidate genes

Discussion

Conclusion

Materials and methods

Ethics statement

Data source and preprocessing

Construction of co-expression network

Identification of key candidate genes

Functional enrichment and protein interaction network analysis

qRT-PCR validation of key candidate genes

Statistical analysis

Institutional review board statement

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links