Article | Open | Published:

# Polymorphisms of the cryptochrome 2 and mitoguardin 2 genes are associated with the variation of lipid-related traits in Duroc pigs

## Abstract

The genetic factors determining the phenotypic variation of porcine fatness phenotypes are still largely unknown. We investigated whether the polymorphism of eight genes (MIGA2, CRY2, NPAS2, CIART, ARNTL2, PER1, PER2 and PCK1), which display differential expression in the skeletal muscle of fasted and fed sows, is associated with the variation of lipid and mRNA expression phenotypes in Duroc pigs. The performance of an association analysis with the GEMMA software demonstrated that the rs330779504 SNP in the MIGA2 gene is associated with LDL concentration at 190 days (LDL2, corrected P-value = 0.057). Moreover, the rs320439526 SNP of the CRY2 gene displayed a significant association with stearic acid content in the longissimus dorsi muscle (LD C18:0, corrected P-value = 0.015). Both SNPs were also associated with the mRNA levels of the corresponding genes in the gluteus medius skeletal muscle. From a biological perspective these results are meaningful because MIGA2 protein plays an essential role in mitochondrial fusion, a process tightly connected with the energy status of the cell, while CRY2 is a fundamental component of the circadian clock. However, inclusion of these two SNPs in chromosome-wide association analyses demonstrated that they are not located at the peaks of significance for the two traits under study (LDL2 for rs330779504 and LD C18:0 for rs320439526), thus implying that these two SNPs do not have causal effects.

## Introduction

The genome-wide analysis of gene expression data obtained from RNA-seq experiments can provide valuable information in order to understand the biology of production phenotypes and how they are genetically regulated. Cardoso et al.1 compared the muscle transcriptomic profiles of Duroc sows before and after feeding and, in doing so, they demonstrated that the ingestion of food is associated with changes in the mRNA levels of several circadian genes including the cryptochrome circadian regulator 2 (CRY2), neuronal PAS domain protein 2 (NPAS2), circadian associated repressor of transcription (CIART), aryl hydrocarbon receptor nuclear translocator like 2 (ARNTL2), period circadian regulator 1 (PER1) and period circadian regulator 2 (PER2). The identification of circadian clock regulator genes is particularly relevant because they have been broadly reported as major contributors to lipid metabolism and energy homeostasis2,3,4,5,6,7, driving changes in the expression of multiple transcripts and modulating cell response to different stimuli such as food intake5,8,9. Two other interesting genes identified by Cardoso et al.1 as differentially expressed before and after eating were mitoguardin 2 (MIGA2), which regulates mitochondrial fusion10, a process tightly connected with energy homeostasis11, and phosphoenolpyruvate carboxykinase 1 (PCK1), an enzyme fundamental for the maintenance of glucose and lipid levels12.

The expression of the eight genes mentioned above (ARNTL2, CIART, CRY2, NPAS2, PER1, PER2, PCK1 and MIGA2) is affected by food intake and there is ample evidence that they have a key role in carbohydrate and lipid metabolism8,10,13,14,15. The main hypothesis that we aim to test in the current work is whether the variability of these eight genes is associated with lipid phenotypes recorded in a Duroc pig population denominated as Lipgen (Supplementary Table 1). To achieve this goal, we have first identified a total of 20 polymorphisms (Table 1) in these eight genes by using a previously published RNA-Seq data set corresponding to 52 pigs from the Lipgen population16. These 20 SNPs have been genotyped in 345 pigs from the Lipgen population with available records for a broad array of lipid traits listed in Supplementary Table 1, i.e. serum lipid concentrations17,18, longissimus dorsi (LD) and gluteus medius (GM) muscle fatty acid composition19 and backfat thickness. Subsequently, those SNPs showing significant associations (after correction for multiple testing) with a given lipid trait, have been further studied by investigating if they are associated with gene expression as well as by performing chromosome-wide association analyses based on Porcine SNP60 BeadChip data. The liver and GM muscle mRNA expression data sets18,20 and the Porcine SNP60 BeadChip genotypes18,21 used for this purpose were generated in previous studies (Supplementary Table 2).

## Results

### Association analyses for lipid traits

Previous data sets employed for making the association analyses with a wide variety of lipid-related traits are listed in Supplementary Table 2. Performance of association analyses between the 20 selected SNPs and the phenotypes listed in Supplementary Table 1 allowed us to identify several associations that were significant at the nominal level (Table 2). Three SNPs in the PER1 gene were associated with LD and GM C18:3, and there was also an association between the CIART genotype and backfat thickness. Two SNPs in the PCK1 gene were associated with LD C17:0, and CRY2 and MIGA2 genotypes showed associations with several serum lipid and fatty acid composition traits. These results were consistent with the relevant role of the genes under study on metabolism and energy homeostasis. However, only two associations remained significant after correction for multiple testing. The serum concentration of low-density-lipoproteins (LDL) measured at ~190 days was significantly associated with the rs330779504 SNP (Table 2), a splice region variant located in the beginning of intron 14 (1:269.360 Mb) of the mitoguardin 2 gene (MIGA2). Pigs inheriting the A-allele showed an increased LDL cholesterol concentration (Fig. 1A), with homozygous AA animals having a higher median blood LDL concentration (69.35 mg/dL) than GA (61.75 mg/dL) and GG (58.40 mg/dL) individuals. Kruskal-Wallis ranking test for differences in median LDL concentrations yielded a P-value of 5.14E-03 (Supplementary Table 3), thus supporting the existence of significant differences among the three rs330779504 genotypes. Besides, this MIGA2 polymorphism also displayed an additive effect on palmitic acid content in LD muscle, total serum cholesterol concentration at ~190 days of age and the ratio between omega-6 and omega-3 desaturation in LD, but only at the nominal P-value level of significance (Table 2). The proportion of variance in LDL cholesterol concentration explained by rs330779504 genotype was 2.16% (SE = 0.03%).

The other association that remained significant after correction for multiple testing was that between rs320439526 genotype and stearic acid content (C18:0) of the LD muscle (Table 2). This polymorphism is located in the 5′ end of the CRY2 gene, and it was annotated as having a putative stop gain effect in the former Sus scrofa assembly record (Sscrofa10.2). This led us to select it due to the high impact effect that the inactivation of this gene could have on the regulation of circadian clock rhythms and many other relevant metabolic processes. However, when interrogated in the last available assembly release for the porcine genome (Sscrofa11.1), this variant appeared to be located in the 5′-UTR of the CRY2 gene. The Kruskal-Wallis ranking test for differences in median C18:0 content in the LD muscle yielded a P-value of 5.71E-03 (Supplementary Table 3), with homozygous TT pigs having a higher median stearic acid content (12.52%) than their CT (11.54%) and CC (11.30%) counterparts (Fig. 1C). The proportion of variance in stearic acid content in LD muscle explained by the rs320439526 genotype was 8.87% (SE = 0.04%).

### Polymorphisms in the MIGA2 and CRY2 genes are associated with mRNA expression levels

To gain new insights into the molecular basis of the two significant associations found (Table 2), we investigated whether the rs330779504 and the rs320439526 SNPs are associated with the mRNA expression of the MIGA2 and CRY2 genes, respectively. Previously reported hepatic and GM muscle microarray data sets18,20 were employed for this purpose (Supplementary Table 2). Analysis with the GEMMA software revealed a significant association between the rs330779504 polymorphism and MIGA2 mRNA expression levels in the GM muscle (Table 3). Pigs inheriting the A-allele of the rs330779504 polymorphism showed a reduced MIGA2 mRNA expression (Fig. 1B). Performance of a test based on the analysis of variance (ANOVA) confirmed the existence of statistically significant differences amongst genotypes (Supplementary Table 4). Moreover, a weak but significant association between the SNP rs330779504 and one of the probes defining liver MIGA2 mRNA expression was also found (Table 3). With regard to the CRY2 gene, when we performed an association analysis with the GEMMA software, the rs320439526 5′-UTR variant happened to be significantly associated with the expression of the corresponding gene in the GM muscle (Table 3). When we compared the CRY2 mRNA levels corresponding to each one of the three rs320439526 genotypes (Fig. 1D) by using an ANOVA test, we found differences that almost reached significance (Supplementary Table 4).

### Inclusion of significant SNPs in a chromosome-wide association analysis

After demonstrating that in the Lipgen population the rs330779504 (MIGA2) and rs320439526 (CRY2) SNPs are associated with serum LDL concentration at 190 days and LD C18:0, respectively, we aimed to investigate whether other SNP markers located in the vicinity of rs330779504 and rs320439526 display associations with these two traits with a higher level of significance than those observed for rs330779504 and rs320439526. To achieve this goal, we merged the rs330779504 SNP with 7,188 SNPs mapping to pig chromosome 1 (SSC1) and the rs320439526 SNP with 3,684 SNPs mapping to SSC2. The SSC1 and SSC2 SNP data were extracted from Porcine SNP60 BeadChip genotyping data reported by Manunza et al.18 and González-Prendes et al.21 in the Lipgen population (Supplementary Table 2). The associations between the markers rs330779504 (MIGA2) and rs320439526 (CRY2) with LDL serum concentration at ~190 days of age and with stearic acid content in LD, respectively, were only detected at the nominal level (Fig. 2). Indeed, we did not find any significant association at the chromosome-wide level when correcting for multiple testing with the false discovery rate (FDR) approach22 (Fig. 2).

## Discussion

Another association that remained significant after correction for multiple testing was that between the MIGA2 rs330779504 SNP and serum LDL concentrations at ~190 days (Table 2, Supplementary Table 3). Moreover, this SNP was also associated with MIGA2 mRNA expression in the GM muscle and liver tissues (Table 3, Supplementary Table 4). The MIGA2 gene, also known as FAM73B, and its homolog MIGA1 (FAM73A) encode proteins localized to the outer membrane of mitochondria as membrane-integrated proteins and they have been previously associated with reduced body weight in mice29 and variations in backfat thickness in pigs30. In a study performed by Zhang et al.10, it was reported that MIGA1/2 proteins stabilize the dimeric complex formed by active MitoPLD, thus facilitating mitochondrial fusion31. Interestingly, the dynamics of mitochondrial fusion and fission is tightly related with the energy demand of cells. Indeed, nutrient abundance and starvation are associated with an increased frequency of fission and fusion events, respectively27,32. Besides, the capacity to produce ATP in response to changes in energy demand and supply is modulated by mitochondrial morphology33. A recent study reported that mitochondrial fusion induced by leptin could have important effects on the hepatic lipid accumulation34, but to the best of our knowledge it is currently unknown whether mitochondrial fusion/fission has any effect on cholesterol and lipoprotein metabolism. Noteworthy, the chromosome-wide analysis pictured in Fig. 2 evidenced that the association observed between the MIGA2 rs330779504 marker and serum LDL levels at ~190 days is probably not causal, as there are some other neighboring SNPs that show more significant associations with this trait.

## Conclusions

In this work, we wanted to test whether the variability of six circadian genes (ARNTL2, CIART, CRY2, NPAS2, PER1 and PER2) and two additional genes (MIGA2 and PCK1) with key roles in energy homeostasis is associated with a set of lipid phenotypes recorded in Duroc pigs (Lipgen population). We have observed multiple associations between the variation of circadian genes and muscle fatty acid composition, but only that between the rs320439526 SNP of the CRY2 gene and LD C18:0 content remained significant after correction for multiple testing. We have also detected a significant association between the rs330779504 SNP of the MIGA2 gene and LDL concentration at 190 days. In the light of the results of the chromosome-wide analyses, we conclude that none of these two associations are causal.

## Methods

### Ethics approval

Animal care and management procedures were performed following Spanish national guidelines for the Good Experimental Practices and they were approved by the Ethical Committee of the Institut de Recerca i Tecnologia Agroalimentàries (IRTA).

### Animal material and phenotype recording

As previously reported by Gallardo et al.35,36, a total of 345 Duroc barrows belonging to 5 half-sib families and distributed in 4 fattening batches were selected from a commercial pig line, devoted to high quality meat production. This line is characterized by its high content of intramuscular fat, a feature that results in the improvement of meat juiciness and taste, hence conferring a better consumer acceptance37. Pigs were bred under intensive conditions of feeding and handling, and slaughtered when they reached 122 kg of live weight (~190 days of age). Phenotypic measures for different traits (Supplementary Table 1) were recorded during the productive cycle or after slaughtering: Triglycerides (TG), total cholesterol (TotalCholest), high-density lipoprotein (HDL) and low-density lipoprotein (LDL) serum concentrations at ~45 and ~190 days of age as reported by Gallardo et al.17, whereas intramuscular fat content in the LD and GM muscles and fatty acid composition for LD and GM were determined as described by Quintanilla et al.19.

### Association analyses between twenty selected SNPs and porcine lipid-related traits

The PLINK software39 was used for processing genotyped data. Association analyses between genotyped polymorphisms and phenotypes were performed with the Genome wide efficient mixed-model association (GEMMA) software40. This package uses a mixed model approach to account for population stratification and relatedness by calculating a genomic kinship matrix with SNPs genotypes as random effects and provides an exact test of significance. We implemented a univariate mixed model as follows:

$$y=W\alpha +x\delta +u+\varepsilon$$

where y is the vector of phenotypic observations for every individual; α corresponds to a vector including the intercept plus the fixed effects, i.e. batch effect with 4 categories (all traits), farm origin effect with 3 categories (all traits), data of extraction with 2 categories within batch (only for TotalCholest, TG, HDL and LDL serum concentration, that were measured at approximately 45 and 190 days). The α vector also contains the regression coefficients of the following covariates: live weight at slaughterhouse for TotalCholest, TG, HDL and LDL serum concentrations, and IMF content in LD and GM for LD and GM fatty acid composition respectively; W is the incidence matrix relating phenotypes with the corresponding effects; x is the vector of the genotypes corresponding to the set of selected polymorphisms; δ is the allele substitution effect for each polymorphism; u is a vector of random individual effects with a n-dimensional multivariate normal distribution MVNn (0, λ τ−1 K), where τ−1 is the variance of the residual errors, λ is the ratio between the two variance components and K is a known relatedness matrix derived from the SNPs; and ε is the vector of residual errors.

The association between lipid-related traits and the twenty analysed polymorphisms was assessed on the basis of the estimated allele substitution effects. The significance of these effects was established by implementing a correction for multiple testing using the FDR method reported by Benjamini and Hochberg22. Moreover, we compared the phenotypic medians corresponding to each one of the three possible genotypes by applying the non-parametric Kruskal-Wallis test, due to the non-normal data distribution of lipid phenotypes under study.

### Association analyses between the rs330779504 and rs320439526 polymorphisms and the expression of the genes that contain them

Gluteus medius skeletal muscle and liver samples were collected from 103 Duroc pigs belonging to the Lipgen population. Samples were retrieved after slaughtering, and immediately frozen at −80 °C in liquid nitrogen. Total RNA was isolated from GM samples by using the TRIzol method41 and the RiboPure kit (Ambion, Austin, TX) following manufacturer’s recommendations. Transcriptomic mRNA expression profiles were then assessed by hybridization to the GeneChip Porcine arrays (Affymetrix Inc., Santa Clara, CA), as previously reported by Cánovas et al.20. Expression data corresponding to GM muscle and liver samples are deposited in NCBI’s Gene Expression Omnibus42 and can be accessed through GEO Series accession number GSE115484 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE115484). Data pre-processing, background correction, normalization and log-transformation of expression values between samples were carried out by computing a Robust Multi-array Average (RMA) as described by Irizarry et al.43.

The correspondence between genes and microarray expressed probes was assessed with the Biomart database available at Ensembl repositories (https://www.ensembl.org/biomart/martview/). Expression levels for selected genes were then extracted from microarray samples for both GM muscle and liver tissues and used as continuous variables in association analyses, following the same statistical model previously described for phenotype records and correcting for batch (4 categories), farm of origin (3 categories) and laboratory (2 categories) as fixed effects. Moreover, we compared the phenotypic means corresponding to each one of the three possible genotypes by applying an ANOVA test.

### Inclusion of the MIGA2 rs330779504 and CRY2 rs320439526 SNPs in a chromosome-wide association analysis

As previously described by Manunza et al.18 and González-Prendes et al.21, the population employed in the current experiment was typed with the Porcine SNP60 BeadChip (Illumina, San Diego, CA) which contains probes for 62,163 SNPs (Supplementary Table 2). The GenomeStudio software (Illumina) was used for quality control analyses, as reported by Manunza et al.18. The PLINK software39 was used for removing SNPs that (a) did not map to autosomal chromosomes, (b) had minor allele frequency (MAF) <0.05, (c) with rate of missing genotypes >0.05, (d) major departures from the Hardy-Weinberg equilibrium (P-value = 0.001), (e) had a GenCall score <0.15, (f) had a call rate <0.95, or (g) that could not be mapped to the pig reference genome. A total of 36,710 SNPs were finally retrieved after filtering and merged with genotyping data corresponding to the rs330779504 and the rs320439526 SNPs. Association analyses were performed with the GEMMA software40 as described before, and multiple testing correction was implemented with the FDR method22 by establishing a chromosome-wide threshold of significance.

## Data Availability

Expression data corresponding to GM muscle and liver samples are deposited at NCBI’s Gene Expression Omnibus and are accessible through GEO Series accession number GSE115484 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE115484). Genotypes and phenotypes for the 345 Duroc pigs (Lipgen population) have been deposited in the Figshare public repository (https://figshare.com/s/2e636697009360986794).

## References

1. 1.

Cardoso, T. F. et al. Nutrient supply affects the mRNA expression profile of the porcine skeletal muscle. BMC Genomics 18, 603 (2017).

2. 2.

Froy, O. The relationship between nutrition and circadian rhythms in mammals. Front. Neuroendocrinol. 28, 61–71 (2007).

3. 3.

Green, C. B., Takahashi, J. S. & Bass, J. The meter of metabolism. Cell 134, 728–742 (2008).

4. 4.

Laposky, A. D., Bass, J., Kohsaka, A. & Turek, F. W. Sleep and circadian rhythms: Key components in the regulation of energy metabolism. FEBS Lett. 582, 142–151 (2008).

5. 5.

Froy, O. & Miskin, R. Effect of feeding regimens on circadian rhythms: implications for aging and longevity. Aging 2, 7–27 (2010).

6. 6.

Paschos, G. K. Circadian clocks, feeding time, and metabolic homeostasis. Front. Pharmacol. 6, 112 (2015).

7. 7.

McGinnis, G. R. & Young, M. E. Circadian regulation of metabolic homeostasis: causes and consequences. Nat. Sci. Sleep 8, 163–80 (2016).

8. 8.

Patel, S. A., Velingkaar, N., Makwana, K., Chaudhari, A. & Kondratov, R. Calorie restriction regulates circadian clock gene expression through BMAL1 dependent and independent mechanisms. Sci. Rep. 6, 25970 (2016).

9. 9.

Chaudhari, A., Gupta, R., Makwana, K. & Kondratov, R. Circadian clocks, diets and aging. Nutr. Healthy Aging 4, 101–112 (2017).

10. 10.

Zhang, Y. et al. Mitoguardin regulates mitochondrial fusion through MitoPLD and is required for neuronal homeostasis. Mol. Cell 61, 111–24 (2016).

11. 11.

Westermann, B. Bioenergetic role of mitochondrial fusion and fission. Biochim. Biophys. Acta - Bioenerg. 1817, 1833–1838 (2012).

12. 12.

Millward, C. A. et al. Phosphoenolpyruvate carboxykinase (Pck1) helps regulate the triglyceride/fatty acid cycle and development of insulin resistance in mice. J. Lipid Res. 51, 1452–1463 (2010).

13. 13.

Grimaldi, B. et al. PER2 controls lipid metabolism by direct regulation of PPARγ. Cell Metab. 12, 509–20 (2010).

14. 14.

Machicao, F. et al. Glucose-raising polymorphisms in the human clock gene cryptochrome 2 (CRY2) affect hepatic lipid content. PLoS One 11, e0145563 (2016).

15. 15.

Jordan, S. D. et al. CRY1/2 selectively repress PPARδ and limit exercise capacity. Cell Metab. 26, 243–255 (2017).

16. 16.

Cardoso, T. F. et al. RNA-seq based detection of differentially expressed genes in the skeletal muscle of Duroc pigs with distinct lipid profiles. Sci. Rep. 7, 40005 (2017).

17. 17.

Gallardo, D. et al. Mapping of quantitative trait loci for cholesterol, LDL, HDL, and triglyceride serum concentrations in pigs. Physiol. Genomics 35, 199–209 (2008).

18. 18.

Manunza, A. et al. A genome-wide association analysis for porcine serum lipid traits reveals the existence of age-specific genetic determinants. BMC Genomics 15, 758 (2014).

19. 19.

Quintanilla, R. et al. Porcine intramuscular fat content and composition are regulated by quantitative trait loci with muscle-specific effects. J. Anim. Sci. 89, 2963–71 (2011).

20. 20.

Cánovas, A. et al. Segregation of regulatory polymorphisms with effects on the gluteus medius transcriptome in a purebred pig population. PLoS One 7, e35583 (2012).

21. 21.

González-Prendes, R. et al. Joint QTL mapping and gene expression analysis identify positional candidate genes influencing pork quality traits. Sci. Rep. 7, 39830 (2017).

22. 22.

Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B (Methodological) 57, 289–300 (1995).

23. 23.

Ruiter, M. et al. The daily rhythm in plasma glucagon concentrations in the rat is modulated by the biological clock and by feeding behavior. Diabetes 52, 1709–15 (2003).

24. 24.

Kumar Jha, P., Challet, E. & Kalsbeek, A. Circadian rhythms in glucose and lipid metabolism in nocturnal and diurnal mammals. Mol. Cell. Endocrinol. 418, 74–88 (2015).

25. 25.

Sahar, S. et al. Circadian control of fatty acid elongation by SIRT1 protein-mediated deacetylation of acetyl-coenzyme A synthetase 1. J. Biol. Chem. 289, 6091–6097 (2014).

26. 26.

Green, C. B. et al. Loss of Nocturnin, a circadian deadenylase, confers resistance to hepatic steatosis and diet-induced obesity. Proc. Natl. Acad. Sci. USA 104, 9888–9893 (2007).

27. 27.

Putti, R., Sica, R., Migliaccio, V. & Lionetti, L. Diet impact on mitochondrial bioenergetics and dynamics. Front. Physiol. 6, 109 (2015).

28. 28.

Mignone, F., Gissi, C., Liuni, S. & Pesole, G. Untranslated regions of mRNAs. Genome Biol. 3, REVIEWS0004 (2002).

29. 29.

Bassett, J. H. D. et al. Rapid-throughput skeletal phenotyping of 100 knockout mice identifies 9 new genes that determine bone strength. PLoS Genet. 8, e1002858 (2012).

30. 30.

Lee, K.-T. et al. Neuronal genes for subcutaneous fat thickness in human and pig are identified by local genomic sequencing and combined SNP association study. PLoS One 6, e16356 (2011).

31. 31.

Choi, S.-Y. et al. A common lipid links Mfn-mediated mitochondrial fusion and SNARE-regulated exocytosis. Nat. Cell Biol. 8, 1255–1262 (2006).

32. 32.

Vamecq, J. et al. Mitochondrial dysfunction and lipid homeostasis. Curr. Drug Metab. 13, 1388–1400 (2012).

33. 33.

Schrepfer, E. & Scorrano, L. Mitofusins, from mitochondria to metabolism. Mol. Cell 61, 683–694 (2016).

34. 34.

Hsu, W.-H., Lee, B.-H. & Pan, T.-M. Leptin-induced mitochondrial fusion mediates hepatic lipid accumulation. Int. J. Obes. 39, 1750–6 (2015).

35. 35.

Gallardo, D. et al. Alternative splicing at exon 28 of the acetyl-coenzyme A carboxylase α gene in adult pigs and embryos. Anim. Genet. 39, 205–206 (2008).

36. 36.

Gallardo, D. et al. Polymorphism of the pig acetyl-coenzyme A carboxylase α gene is associated with fatty acid composition in a Duroc commercial line. Anim. Genet. 40, 410–417 (2009).

37. 37.

Wood, J. D. et al. Fat deposition, fatty acid composition and meat quality: A review. Meat Sci. 78, 343–358 (2008).

38. 38.

Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly 6, 80–92 (2012).

39. 39.

Purcell, S. et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).

40. 40.

Zhou, X. & Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 44, 821–824 (2012).

41. 41.

Chomczynski, P. & Sacchi, N. Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal. Biochem. 162, 156–9 (1987).

42. 42.

Edgar, R., Domrachev, M. & Lash, A. E. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 30, 207–10 (2002).

43. 43.

Irizarry, R. A. et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4, 249–264 (2003).

## Acknowledgements

The authors are indebted to Selección Batallé S.A. for providing the animal material. We gratefully acknowledge to J. Reixach (Selecció Batallé), J. Soler (IRTA), C. Millan (IRTA), A. Quintana (IRTA) and A. Rossell (IRTA) for their collaboration in the experimental protocols and pig management. We also want to thank MINECO for the Center of Excellence Severo Ochoa 2016–2019 (SEV-2015-0533) grant awarded to the Centre for Research in Agricultural Genomics (CRAG), and the CERCA Programme of the Generalitat de Catalunya for their support. Part of the research presented in this publication was funded by grants AGL2013–48742-C2–1-R and AGL2013–48742-C2–2-R awarded by the Spanish Ministry of Economy and Competitivity, and grant 2014 SGR 1528 from the Agency for Management of University and Research Grants of the Generalitat de Catalunya. Tainã Figueiredo Cardoso was funded with a fellowship from the CAPES Foundation-Coordination of Improvement of Higher Education, Ministry of Education of the Federal Government of Brazil. Emilio Mármol-Sánchez was funded with a FPU pre-doctoral fellowship from the Spanish Ministry of Education (FPU15/01733).

## Author information

M.A., R.Q. and J.J. designed the experiment. R.Q. generated the animal material and collected the phenotypic and microarray data. E.M.S. and M.A. selected the SNPs to be genotyped. E.M.S. did all bioinformatic and statistical analyses. T.F.C. contributed to the analysis of gene expression data. E.M.S. and M.A. wrote the paper. All authors read and approved the content of the manuscript.

Correspondence to Marcel Amills.

## Ethics declarations

### Competing Interests

The authors declare no competing interests.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.