Introduction

Clear visible estrus behaviors, such as standing reflex, reddening and swelling of the vulva, mucus discharge from the vulva, etc., contribute to the reproductive performance of sows1. However, the proportion of gilts and sows that do not express estrus behaviors is increasing in recent years2,3,4. Estrus expression of gilts or sows is associated with clearly visible physiological and behavioral changes5, which reach the maximum intensity before the onset of estrus6. Reproductive behavior culminates in the standing reflex, which indicates sexual receptivity6, 7. Previous studies showed that Chinese indigenous pigs which originated in Taihu Lake Basin, such as Meishan, Erhualian, and Mi pigs, had superior reproductive performance and estrus expression traits than European pigs8,9,10,11. The difference on estrus expression between pig breeds indicates that the clear visible expression of estrus is heritable1, and could be improved by selection12, 13. A positive correlation was observed between the serum estrogen concentration of gilts and the intensity of estrus behaviors14,15,16. The synthesis of ovarian estrogen is under the action of luteinizing hormone (LH) and follicle-stimulating hormone (FSH) in the theca cell and granulosa cells17. During the growth and development of follicles, LH stimulates the theca interna to secrete testosterone, and then under the stimulation of FSH, granulosa cells convert testosterone to estradiol, namely “double cell dual-sex hormone effect mode”17.

The plasma concentration of estrogen is affected by the dynamic equilibrium of estrogen synthesis and metabolism, which requires the activity of enzymes, such as cytochrome P450 19A1 (CYP19A1), estradiol 17-beta-dehydrogenase 12 (HSD17B12), sulfotransferase family 1 C member 3 (SULT1C3), etc18, 19. Previous studies found that differentially expressed genes (DEGs)), were related to follicular development and hormone metabolism20,21,22.

Chinese indigenous Mi pigs, as well as Meishan and Erhualian pigs, originated in the lake Taihu basin in Jiangsu province (east China)23. These Chinese pig breeds have similar biological characteristics24, such as greater litter size and clearer expression of estrus behaviors. Gilts of these Chinese pig breeds reach puberty at an earlier age25, express behavioral estrus longer, and have slightly shorter estrus cycles8 than those of Landrace and Large White. The duration of high estradiol concentrations was greater in Meishan than in Large White hybrid gilts during the follicular phase, the periovulatory period, and the early luteal phase26.

In the present study, we investigated the mRNA expression profiles of follicular tissue from Large White gilts at diestrus (LD) and estrus (LE), and Mi gilts at diestrus (MD) and estrus (ME) using RNA-seq. The goals of this study were to screen genes and new transcripts that are differentially expressed between two breeds or/and two stages of estrus cycle, and to find which biological processes and pathways have the greatest number of DEGs which could contribute to identifying molecular genetic markers associated with the expression of estrus in pigs.

Results

Sequencing data summary

A sample from a Mi gilt at diestrus was discarded because of degradation. We generated 11 RNA-seq libraries of follicle tissue from six Large White gilts and five Mi gilts. Utilizing paired-end Illumina sequencing technology, about 630 million raw reads were obtained, and 610 million high-quality reads were obtained after removing low quality reads. The number of clean reads was more than 54 million in each group (Table 1). The ratio of Q30 (with a base quality > 30 and error rate < 0.001) bases comprised more than 87% clean data. In four groups, 77.33% to 80.55% clean reads were mapped onto the Sus scrofa reference genome (sscrofa10.2), and 4.33% to 5.00% clean reads were multi map reads (Table 1). The genic distribution of clean reads was comprised of 66.95% exonic, 8.02% intron, and 25.03% intergenic regions (Supplementary Table S3). To ensure the reliability of the analysis results, the unique mapped reads were used for subsequent statistical analysis. Reads per kilobase millon mapped reads (RPKM), a normalized transcription level, can eliminate the influence of gene length and sequence difference on gene expression27. Therefore, here RPKM was used to compare the gene expression between different samples.

Table 1 Quality analyses of the digital gene expression profiling library of follicles tissue in Large white and Mi gilts at estrus and diestrus1.

SNP and InDel analysis

We detected approximately 122,804 to 335,295 SNPs and 6,140 to 14,947 InDel in the 11 samples (Table 2). The most frequent SNPs were A>G, C>T, G>A, and T>C (Supplementary Fig. S1). The most frequent length of InDel was one nucleotide (Supplementary Fig. S2).

Table 2 Statistics of SNP and InDel of the genes expressed in follicles tissue of Large white and Mi gilts at estrus and diestrus.

Alternative splicing analysis

We detected 12 types of alternative splicing (AS) events: Skipped exon (SKIP), Multi-exon SKIP (MSKIP), Intron retention (IR), Multi-IR (MIR), Alternative exon ends (5’or/and 3’) (AE), Transcription Start Site (TSS), Transcription Terminal Site (TTS), Approximate SKIP (XSKIP), Approximate MSKIP (XMSKIP), Approximate IR (XIR), Approximate MIR (XMIR), and Approximate AE (XAE). The top two AS events were TSS and TTS AS events, which accounted for more than 70% of AS events in each library (Fig. 1).

Figure 1
figure 1

Statistics of alternative splicing events of the mRNA expressed in follicles tissue of Large white and Mi gilts at estrus and diestrus. LD group: NO_1, NO_2, NO_3; LE group: NO_4, NO_5, NO_6; MD group: NO_7, NO_8; ME group: NO_9, NO_10, NO_11. SKIP: Skipped exon; MSKIP: Multi-exon SKIP; IR: Intron retention; MIR: Multi-IR; AE: Alternative exon ends (5’or/and 3’) ; TSS: Transcription Start Site; TTS: Transcription Terminal Site; XSKIP: Approximate SKIP; XMSKIP: Approximate MSKIP; XIR: Approximate IR; XMIR: Approximate MIR; XAE: Approximate AE.

Identification and analysis of DEGs

A total of 20,910 genes were expressed in the 11 samples (Supplementary Table S4). The number of expressed genes was not significantly different between libraries. As shown in Table 3, reference genes expressed at RPKM < 10 and RPKM > 500 were 68.03% to 72.81% and 0.75% to 0.78%, respectively. In addition, the gene expression levels between groups were not significantly different (P > 0.05, Fig. 2A). A DEG heatmap showed that the DEGs from LD vs MD comparison group gather together, while the DEGs from LE vs ME comparison group gather in another class (Fig. 2B). Subsequently, 2,838 DEGs were identified (Supplementary Table S5) using DESeq (v1.14.0). The distribution of DEGs is revealed in a Venn diagram (Fig. 2C), where tartrate-resistant acid phosphatase type 5 (ACP5) and GPI transamidase component PIG-S-like (PIGS) were the common DEGs among the four comparison groups. The number of DEGs was much greater (One-Way ANOVA, P = 0.018) at estrus than diestrus. The number of DEGs was not different (One-Way ANOVA, P = 0.543) between Large White and Mi gilts at estrus or diestrus (Fig. 2D). Twenty-six common DEGs were found in the LD vs MD and LE vs ME comparison groups (Table 4). To screen potential candidate genes associated with estrus expression, the fold change was further analyzed and the P-values of up-regulated and down-regulated DEGs estimated (Supplementary Fig. S3). A DEG whose reads count was zero in one of the four groups was defined as a specific expressed gene (Supplementary Table S6). A new gene (ENSSSCG00000028235) was exclusively expressed in Mi gilts, while another new gene (ENSSSCG00000021903) was exclusively expressed in Large White gilts. In addition, the new gene ENSSSCG00000003073 was only expressed at estrus.

Table 3 Number of genes within different reads per kilobase transcriptome per million mapped reads (RPKM) intervals.
Figure 2
figure 2

Identification and analysis of DEGs of the genes expressed in follicles tissue of Large white and Mi gilts at estrus and diestrus. LD, Large white gilts at diestrus; LE, Large white gilts at estrus; MD, Mi gilts at diestrus; ME, Mi gilts at estrus. (A) Comparison of expression levels of genes detected in the LD, MD, LE, and ME groups. (B) Heatmap analysis of DEGs in the LD, MD, LE, and ME groups. (C) Venn diagram of the 2838 differentially expressed genes detected among LD vs MD, LE vs ME, LE vs LD, ME vs MD comparison groups. Different color represents different combination, the number in the overlap region represents the overlapped DEGs number. (D) Number of enriched DEGs. The orange and cyan pillars represents up-regulated and down-regulated gene number, respectively.

Table 4 List of 26 overlaped DEGs in LD vs MD and LE vs ME comparison groups1.

GO and KEGG pathway enrichment analysis

The enrichment of DEGs in the GO terms was tested to gain the insights into the biological implications. These DEGs fell into three major GO categories: biological process, cellular component, and molecular function. In the GO category biological process, the most abundant GO terms were single-organism process, cellular process, metabolic process, and biological regulation. In the LD vs MD, LE vs ME, LE vs LD, and ME vs MD comparison groups, the number of biological processes with significant different (q < 0.05) DEGs were 10, 108, 178, and 495, respectively (Fig. 3; Supplementary Table S7). In the GO category cellular component, the most abundant GO terms were cell part and organelle. In the GO category molecular function, the most abundant GO terms were binding and catalytic activity.

Figure 3
figure 3

GO annotation of DEGs of the genes expressed in follicles tissue of Large white and Mi gilts at estrus and diestrus. LD, Large white gilts at diestrus; LE, Large white gilts at estrus; MD, Mi gilts at diestrus; ME, Mi gilts at estrus. DEGs number of the most enriched 20 GO terms derived from biological processes from LD vs MD (A), LE vs ME (B), LE vs LD (C), ME vs MD (D) comparison groups, respectively. Red shows higher expression and blue shows lower expression. The figures in parentheses refer to the number of DGEs in this term.

To identify the potential biological pathways involved in estrous behavior, a KEGG analysis was performed. The DEGs were enriched in 315 pathways in the LD vs MD, LE vs ME, LE vs LD, and ME vs MD comparison groups. There were 3, 10, 24, and 23 pathways with significant differences (q < 0.05) of DEGs in the LD vs MD, LE vs ME, LE vs LD, and ME vs MD comparison groups, respectively (Fig. 4; Supplementary Table S8). The representative KEGG pathways, such as ECM−receptor interaction, olfactory transduction, ovarian steroidogenesis, cell cycle, steroid biosynthesis, cell cycle-yeast, phagosome, and ribosome, were enriched with these DEGs. Most DEGs were upregulated in the MD and LE groups. In the ECM−receptor interaction pathway (Table 5), five novel genes were upregulated in the MD group, but none in the LD group. The CD36 molecule (CD36) and a novel gene were upregulated in the ME group. Twelve genes, such as collagen type XI alpha 1 chain (COL11A1), thrombospondin 1 (THBS1), etc., were upregulated in the LE group. Forty genes were upregulated in the MD group, such as laminin subunit beta 2 (LAMB2), collagen type VI alpha 5 chain (COL6A5), hyaluronan mediated motility receptor (HMMR), etc. Six genes, syndecan 1 (SDC1), secreted phosphoprotein 1 (SPP1), integrin subunit beta 3 (ITGB3), etc., were upregulated in the ME group. In the steroid biosynthesis pathway (Table 5), nine genes, such as hydroxysteroid (17-beta) dehydrogenase 7 (HSD17B7), 24-dehydrocholesterol reductase (DHCR24), methylsterol monooxygenase 1 (MSMO1), etc., were upregulated in the LD group. However, no gene was upregulated in the LE group. In addition, six genes, such as DHCR24, MSMO1, transmembrane 7 superfamily member 2 (TM7SF2), etc., were upregulated in the MD group. Lipase A, lysosomal acid type (LIPA) was upregulated in the ME group. Fifteen of 23 genes involved in the ovarian steroidogenesis pathway (Table 5), such as low-density lipoprotein receptor (LDLR), CYP19A1, HSD3B1, etc., were upregulated in the LD group. Eight genes, such as arachidonate 5-lipoxygenase (ALOX5), phospholipase A2 group IVA (PLA2G4A), adenylate cyclase 3 (ADCY3), etc., were upregulated in the LE group. In constrast, 14 genes (LDLR, HSD3B1, CYP17A1, etc.) were upregulated in the MD group, while one novel gene was upregulated in the ME group.

Figure 4
figure 4

KEGG annotation for DEGs functions of the genes expressed in follicles tissue of Large white and Mi gilts at estrus and diestrus. LD, Large white gilts at diestrus; LE, Large white gilts at estrus; MD, Mi gilts at diestrus; ME, Mi gilts at estrus. DEGs number of the most enriched 20 pathways from LD vs MD (A), LE vs ME (B), LE vs LD (C), ME vs MD (D) comparison groups, respectively. Red shows higher expression and blue shows lower expression. The figures in parentheses refer to the number of DGEs in this pathway.

Table 5 Several representative heat-related pathways in LD vs MD, LE vs ME, LE vs LD, and ME vs MD comparison groups for DEGs.

qRT-PCR confirmation

To validate the results of RNA-seq, SULT1C3, apolipoprotein E (APOE), complement component C7 precursor (C7), CYP17A1, complement component 4 binding protein, alpha (C4BPA), gap junction protein alpha 1 (GJA1), and ACP5 were selected for quantitative PCR (qPCR). Based on qRT-PCR, these 7 DEGs were differently expressed in at least one of four comparison groups. Some DEGs have been reported to be involved in follicular development processes. For example, ACP5 was differentially expressed in the four groups; APOE, C4BPA, C7, and ACP5 were highly expressed at estrus; SULT1C3, CYP17A1, and GJA1 were highly expressed at diestrus. In addition, it has been reported that SULT1C3 28 and CYP17A1 29 are related to steroid metabolism pathway, APOE was related to lipid metabolism30, and C7 31 and C4BPA 32 were related to the immune system. The expression patterns of these 7 DEGs were in agreement with our RNA-seq findings. As shown in Table 6, the correlation coefficients between qRT-PCR and RNA-seq results are greater than 0.9 6 of 7 DEGs; the P value are less than 0.05 in 4 DEGs and less than 0.10 in 6 of 7 DEGs. For C7 gene, the correlation coefficient between results of qRT-PCR and RNA-seq is 0.86 (P = 0.14). The qRT-PCR results were consistent with the sequencing data (shown in Supplementary Fig. S4). Although the relative expression of C7 in LE vs LD and GJA1 LE vs ME comparison groups differed using two approaches, the difference was not significant, which was probably caused by biological differences between samples as well as the sensitivity and capability of the different methods.

Table 6 Validation of RNA-seq results using quantitative RT-PCR.

Discussion

In recent years, high-throughput sequencing have been performed using animal reproductive tissues, such as endometrium20, ovaries21, placenta33, follicles34 and granulosa cells35, to identify the genes and genetic loci that affect litter size. Compared to litter size, the studies on the molecular genetic mechanism of estrus expression have seldom been reported. In commercial pig production, estrus characteristics influence sow fecundity36. Previous studies have investigated the expression profiles in ovaries of sheep at estrous and anestrous37, in porcine endometrial tissue on 938, 1239, 1422, and 1538 d of pregnancy, and in porcine endometrial tissue on 1239 d after onset of estrus. In the present study, we investigated the expression profiles in follicle tissue of Large White and Mi gilts at diestrus and estrus using RNA-seq, to mine the DEGs between two breeds and two stages of estrus cycle.

In our present study, 610 million clean reads were generated. In each group, the clean reads were more than 54 million, which is similar to the previous studies on RNA-seq in placenta33 and longissimus muscle40. A total of 122,804–335,295 SNPs and 6,140-14,947 InDel were detected in the 11 samples. The number of SNPs and InDel were much greater for Mi gilts than Large White gilts at both estrus and diestrus stages, which indicates the difference on estrus expression between two breeds was due to the difference of gene expression. In addition, the most SNPs (A>G, C>T, G>A, and T>C) and the most length of InDel (one nucleotide) could play important roles in the regulation of gene expression41. AS generates multiple transcripts from the same gene, and widely exists in eukaryotes42. AS is an important way to regulate gene expression and protein diversity43. The most common AS events (TSS and TTS) can produce different transcripts or different proteins which affect the biological process43.

A total of 2,838 DEGs produced in the LD vs MD, LE vs ME, LE vs LD, and ME vs MD comparison groups were annotated in the GO and KEGG databases in the present study. Some DEGs had been identified in follicle development and hormone metabolism by RNA-seq37, 44. Further validations had been made using quantitative PCR for a part of DEGs, such as insulin-like 3 precursor (INSL3), SULT1C3, and hypoxia-inducible factor 1-alpha isoform 3 (HIF1ß)20, 21, 37. Previous studies suggested that INSL3 might be an important intrafollicular modulator of theca interna cell function/steroidogenesis45 and theca cell-derived growth factor for pre antral follicle46. It has been reported that SULT1C3, a member of sulfotransferase family, catalyzes the sulfate conjugation of several hormones47, 48, and regulates estrogen removal in estrogen metabolic pathway19. In addition, HIF1ß is a critical mediator of follicle development and ovulation and a regulator of gene expression in ovary49, 50. Several genes, such as INSL3, STAR, and HIF1ß, were associated with follicular development, ovulation, and steroid secretion in the ovary of Qira black sheep at estrous and anestrous37. VEGF and STAR play important roles in follicle development and steroid synthesis in Qira Black Sheep and Hetian Sheep21. Angiogenesis is critical for female ovulatory cycle, including follicular development, ovulation, and corpora lutea formation51. The VEGF system is the most important signaling pathway in angiogenesis52. STAR is essential for steroid hormone synthesis and involved in the regulation of follicular development in mammalian ovary53,54,55. But we did not detect VEGF and STAR in the DEGs, which could be due to the difference between different species. In our present study, there were 26 DEGs between breeds at both diestrus and estrus, which may be caused by the differences between breeds8. These 26 DEGs were mainly involved in the single-organism process, and mainly related to binding and catalytic activity. The predicted novel genes in our present study also provided clues for the research on estrus expression between different breeds and different stages of estrus cycle. The function of these new genes has not been identified. Further studies are needed to evaluate if whether they are new candidate genes related to estrus expression.

Follicles, as important tissues of animal reproductive organs, fulfill many pivotal functions, including oocyte and granulosa cells production and hormone secretion. They have obvious differences in shape and biological activity during estrous cycle21, 37, 56. In the ovarian follicles, sex steroids are synthesized from granulosa and thecal cells in the follicular wall34. Previous study found that two signaling networks (Notch, SLIT/ROBO and PI3K signaling, and ITGB5 and extracellular matrix signaling) were associated with follicular development through extracellular signal related kinases (ERKs)57. In our present study, the GO annotation and KEGG pathway analysis clearly revealed that some hormone related genes were involved in steroid biosynthesis and ovarian steroidogenesis pathways, which indicates that the two pathways were activated in the ovaries of Large White and Mi gilts at estrus. In the steroid biosynthesis and ovarian steroidogenesis pathway, most of the DEGs were upregulated in the LD and MD groups, which suggests that these genes were activated and overexpressed during the diestrus. Most of these genes belong to cytochrome P450 family and oxidoreductases family, and participate in the redox reaction19. SCARB1 and LDLR have been shown to be involved in biosynthesis of steroid hormones through affecting on the absorption of cholesterol substrates from circulating lipoproteins58,59,60, which suggests that lipid metabolism also play a role at estrus. Other genes related to steroid hormones could also be important to ovarian function in regulation of female reproduction.

In addition, follicular development also integrates the proliferation and differentiation of cells in the ovarian follicular wall with the transportation of nutrients, and the supplementation of energy61, 62. Results of our present study provided abundant information for some signaling pathways, such as ECM–receptor interaction, olfactory transduction, cell cycle, cell cycle-yeast, phagosome, ribosome, PI3K-Akt signaling, metabolic pathways, DNA replication, cell adhesion molecules (CAMs), and biosynthesis of secondary metabolites. Results of our present study indicates that these pathways potentially regulate estrus expression. Among these signaling pathways, ECM-receptor interaction and olfactory transduction had significant difference in three of four comparison groups. The extracellular matrix (ECM) consists of a complex mixture of structural and functional macromolecules, and plays an important role in tissue and organ morphogenesis63. Specific interaction between cells and the ECM leads to the direct or indirect control of cellular activities, such as adhesion, migration, differentiation, proliferation, and apoptosis64. In addition, a previous study performed a genome-wide analysis of domestic pigs and Tibetan wild boars, and found that 35 genes were enriched in olfactory transduction pathway65. It is a better response to a wider range of food types because of resource-poor plateau65. Moreover, a better sense of smell may also be favored by artificial selection because it enhances disease aversion and facilitates the estrus expression and ovulation of females, which is stimulated by the pheromones in the saliva of boars and then enhances sexual receptive behavior of females66, 67. In the present study, most DEGs in the pathways affected by different stages of estrus within breeds were activated and overexpressed at estrus. These pathways also have important roles in modulating the balance of integrating cell proliferation, energy supplementation, nutrient delivery, and steroid hormone synthesis, which are important processes for sustaining normal folliculogenesis34 and estrus.

In addition, 16 genes involved in cell adhesion molecules (CAMs) were also downregulated at estrus. CAMs, as cell surface proteins, were involved in many biological processes, such as cell adhesion, proliferation, differentiation, developmental processes of fertilization, embryogenesis, and embryonic development68, 69. During the rapid growth and development of follicles, a large amount of energy is required to supply ATP for DNA and protein synthesis in the follicular membrane70. Cell cycle, cell cycle-yeast, ribosome, DNA replication, biosynthesis of secondary metabolites and metabolic pathways are related to the growth and development of the follicle34, as well as expression of estrus. In our present study, most of the DEGs involved in metabolic pathways (91/135) and cell cycle (26/27) were downregulated at estrus, which indicates that these DGEs involved in metabolic pathways and cell cycle pathways could inhibit the follicular development, and ovulation. However, the exact relationships between these signaling networks are not fully understood, and no evidences has been reported to support the effects of these pathways at estrus. In our present study, some DEGs repeatedly appeared in various signaling pathways, and their roles at estrus should be studied in the future.

In conclusion, we generated the expression profile of mRNA in pig follicles of Large White gilts at diestrus (LD) and estrus (LE), Mi gilts at diestrus (MD) and estrus (ME) using RNA-seq. A total of 122,804-335,295 SNPs, 6,140-14,947 InDel and 12 types of AS events (39.57% TSS, 34.90% TTS) were detected in the 11 samples. A total of 2,838 genes were differentially expressed between the groups. These DEGs are involved in important biological processes, such as single-organism process, binding, catalytic activity, cell adhesion and enriched in ECM-receptor interaction, olfactory transduction, ovarian steroidogenesis, cell cycle, steroid biosynthesis, CAMs, phagosome, and ribosome signaling pathways, which depicts a complex mechanism underlying the modulation of estrus characteristics. Thus, these newly identified DEGs should be considered as new candidate genes with functions in regulating estrus. These results provide useful information for future in-depth studies on estrus expression.

Materials and Methods

Ethics Statement

All experimental procedures were conducted according to the “Guidelines for Experimental Animals” of the Ministry of Science and Technology (Beijing, China). This study was reviewed and approved by the ethics committee of Nanjing Agricultural University. Animals were humanely sacrificed as necessary to ameliorate suffering.

Animals

Thirty Large White gilts and 30 Mi gilts were observed for expression of estrus at Yong Kang Agricultural Science and Technology Co., Ltd in Changzhou city, Jiangsu Province, China. The gilts were selected before their second estrus cycle. Their breed, group, body weight, and backfat thickness information are shown in Supplementary Table S1. The Large White and Mi gilts were raised in under a standardized feeding regimen with free access to water.

Estrus detection and sample collection

Estrus detection was carefully performed twice daily (at 7:00 and 15:00) at pro-estrus. Expression of estrus were defined as standing reflex, and reddening and swelling of the vulva71. Inspections to evaluate standing reflex, and reddening and swelling of the vulva were made on gilts individually by an experienced technician using a standardized routine3. A sexual mature boar was introduced into each pen for at least 15 min daily to ensure adequate stimulation, during which the gilts had directly head-to-head contact with the boar. Standing reflex was assessed by the back-pressure test. Reddening and swelling of the vulva was measured visually. When a standing reflex occurred, the expression of estrus were scored (0: no; 1: weak; 2: strong)13, 36, 72 according to a standard scoring system (Table 7).

Table 7 Scoring systems for estrus expression of Large white and Mi gilts.

After the first scoring, Large White and Mi gilts with scores of 2 points were selected (sampling flow chart is shown in Fig. 5). At the onset of standing reflex in the second estrus cycle, amount of 5 ml blood was collected from the anterior vena cava of gilts at the 10th day of estrus cycle. Blood samples were centrifuged at 3,000 rpm for 10 min, immediately after collection. The plasma was stored at −20 °C until analyses. Before the third estrus cycle, 6 Large White gilts and 6 Mi gilts were selected. The estrus cycles were about 21 d in length and the scores of the estrus expression were two points. At the first day of the third estrous, the day the gilts were at onset of exhibiting standing reflex, three Large White and three Mi gilts were slaughtered humanely by anesthesia. At the 10th day of estrous cycle, 3 Large White and 3 Mi gilts were slaughtered in the same way. Ovaries were dissected and all samples were collected with better ovulation points or preovulatory follicles on the surfaces of the ovaries. All samples were immediately frozen in liquid nitrogen and stored at −80 °C until RNA was isolated.

Figure 5
figure 5

The sampling flow chart.

Total RNA isolation

Total RNA was extracted from the follicles using TRIzol reagent (Invitrogen) and treated with DNase I following the manufacturer’s instructions. Degradation and contamination of total RNA were monitored on 1% agarose gels. RNA concentration was measured using Qubit RNA Assay Kit in Qubit 2.0 Flurometer (Life Technologies, CA, USA), its purity was assessed using the NanoPhotometer® Spectrophotometer (IMPLEN, CA, USA), the concentration and integrity were assessed using a RNA Nano 6000 Assay Kit of the Agilent Bioanalyzer 2100 system (Agilent Technologies, CA, USA).

Library preparation for RNA-seq

According to the manufacturer’s manual, sequencing libraries were generated using NEBNext Ultra™ RNA Library Prep Kit for Illumina (NEB, USA). Amount of 3 μg RNA per sample was used to purify mRNA using poly-T oligo attached magnetic beads and then the purified mRNA was interrupted into short fragments (about 200 bp) by the fragmentation buffer. First strand cDNA was synthesized using random hexamer primer and M-MuLV Reverse Transcriptase (RNase H). After adding buffer, dNTPs, DNA polymerase I (New England Biolabs) and RNase H (Invitrogen), the second strand cDNA synthesis was subsequently performed using DNA polymerase I and RNase H. After adenylation of 3’ ends of DNA fragments, NEBNext Adapter was ligated to prepare for hybridization. The cDNA fragments with 150–200 bp in length were selected for PCR amplification to create cDNA libraries. The library preparations were sequenced on an Illumina Hiseq 2500 platform and generated 125 bp paired-end reads.

Data analysis

Raw data sequenced from the last step were saved as a format of fastq. All clean reads were obtained by rejecting low quality sequences or reads (it has more than 5% unknown nucleotides) and sequencing adapters. In addition, the Q20, Q30, and GC content were calculated. The reference genomes and the annotation file were downloaded from ENSEMBL database (http://www.ensembl.org/index.html). The software Bowtie2 (v2.2.3) was used for building the genome index. Clean data were mapped to the reference genome using TopHat (v2.0.12). More than 2 bp of mismatches were not considered. Unmapped or multi-position matched reads were excluded from further analyses. In addition, sequence saturation analyses of the four libraries were executed to provide an overview of the project. Reads count of each gene was counted using HTSeq (v0.6.0). RPKMs were then calculated to estimate the expression level of genes, which can eliminate the effect of sequencing depth and gene length on gene expression levels and the data could be compared directly27.

Identification of SNP, InDel, and Alternative splicing (AS)

SNP and InDel analyses were performed by mapping RNA-seq reads into the genomic of Sus scrofa (sscrofa10.2) using Samtools (v1.3.1)73. The AS events for each library using ASprofile.b (v1.0.4).

Identification of differentially expressed genes (DEGs)

The software DESeq (v1.16) was used for differential gene expression analysis between two samples with biological replicates using a model based on the negative binomial distribution. The P-value had been assigned to each gene and adjusted by the Benjamini and Hochberg’s approach for controlling the false discovery rate74. Genes with q ≤ 0.05 and |log2_ratio| ≥ 1 were identified as differentially expressed genes (DEGs).

GO and KEGG enrichment analysis

The GO (Gene Ontology, http://geneontology.org/) enrichment of DEGs was implemented by the hypergeometric test75, in which P-value was calculated and adjusted as q-value, and data background was genes in the whole genome. GO terms with q < 0.05 were considered to be significantly enriched. GO enrichment analysis can exhibit the biological functions of the DEGs. KEGG (Kyoto Encyclopedia of Genes and Genomes, http://www.kegg.jp/) is a database resource containing a collection of manually drawn pathway maps representing our knowledge on the molecular interaction and reaction networks76. The KEGG enrichment of DEGs was also implemented by the hypergeometric test, in which p-value was adjusted by multiple comparisons as q-value. KEGG terms with q < 0.05 were considered to be significantly enriched.

Real-time RT-PCR

RNA samples from the 11 gilts in the RNA-seq experiment were used to validate the results by quantitative real-time RT-PCR (qPCR), which was performed as previously described77. Total cDNA was synthesized using reverse transcriptase Kit (TaKaRa, Dalian). To amplify specific fragments referring to selected regulated genes, specific primers were designed with NCBI Primer-BLAST (Supplementary Table S2). Real-time PCR reactions were performed as described by the manufacturer in triplicate with MaximaTM SYBR Green/ROX qPCR Master Mix (Fermentas) on an Applied Biosystems Step One Plus system using the following program: 10 min at 95 °C; followed by 40 cycles of 15 s at 95 °C and 60 s at 60 °C; then a 55–95 °C melting curve detection. The GAPDH gene was used as a control in the experiments. All amplifications were followed by dissociation curve analysis of the amplified products. Relative expression levels of the selected target genes were calculated with the 2(-Delta Delta C[T]) method78. The statistical difference in gene expression between groups was analyzed by IBM SPSS Statistics (v20.0). The results are presented as mean ± standard deviation. The correlation between the results of RNA-seq and qPCR was calculated using correlation test (pearson correlation) by IBM SPSS Statistics (v20.0)38.