The climatic and genetic heritage of Italian goat breeds with genomic SNP data

Cortellari, Matteo; Barbato, Mario; Talenti, Andrea; Bionda, Arianna; Carta, Antonello; Ciampolini, Roberta; Ciani, Elena; Crisà, Alessandra; Frattini, Stefano; Lasagna, Emiliano; Marletta, Donata; Mastrangelo, Salvatore; Negro, Alessio; Randi, Ettore; Sarti, Francesca M.; Sartore, Stefano; Soglia, Dominga; Liotta, Luigi; Stella, Alessandra; Ajmone-Marsan, Paolo; Pilla, Fabio; Colli, Licia; Crepaldi, Paola

doi:10.1038/s41598-021-89900-2

Download PDF

Article
Open access
Published: 26 May 2021

The climatic and genetic heritage of Italian goat breeds with genomic SNP data

Matteo Cortellari ORCID: orcid.org/0000-0002-5161-0648¹^na1,
Mario Barbato²^na1,
Andrea Talenti^1,3,
Arianna Bionda¹,
Antonello Carta⁴,
Roberta Ciampolini⁵,
Elena Ciani⁶,
Alessandra Crisà⁷,
Stefano Frattini¹,
Emiliano Lasagna⁸,
Donata Marletta⁹,
Salvatore Mastrangelo¹⁰,
Alessio Negro¹,
Ettore Randi¹¹,
Francesca M. Sarti⁸,
Stefano Sartore¹²,
Dominga Soglia¹²,
Luigi Liotta¹³,
Alessandra Stella¹⁴,
Paolo Ajmone-Marsan²,
Fabio Pilla¹⁵,
Licia Colli² &
…
Paola Crepaldi¹

Scientific Reports volume 11, Article number: 10986 (2021) Cite this article

3337 Accesses
23 Citations
4 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 20 September 2021

This article has been updated

Abstract

Local adaptation of animals to the environment can abruptly become a burden when faced with rapid climatic changes such as those foreseen for the Italian peninsula over the next 70 years. Our study investigates the genetic structure of the Italian goat populations and links it with the environment and how genetics might evolve over the next 50 years. We used one of the largest national datasets including > 1000 goats from 33 populations across the Italian peninsula collected by the Italian Goat Consortium and genotyped with over 50 k markers. Our results showed that Italian goats can be discriminated in three groups reflective of the Italian geography and its geo-political situation preceding the country unification around two centuries ago. We leveraged the remarkable genetic and geographical diversity of the Italian goat populations and performed landscape genomics analysis to disentangle the relationship between genotype and environment, finding 64 SNPs intercepting genomic regions linked to growth, circadian rhythm, fertility, and inflammatory response. Lastly, we calculated the hypothetical future genotypic frequencies of the most relevant SNPs identified through landscape genomics to evaluate their long-term effect on the genetic structure of the Italian goat populations. Our results provide an insight into the past and the future of the Italian local goat populations, helping the institutions in defining new conservation strategy plans that could preserve their diversity and their link to local realities challenged by climate change.

Discovering novel clues of natural selection on four worldwide goat breeds

Article Open access 06 February 2023

Local adaptations of Mediterranean sheep and goats through an integrative approach

Article Open access 01 November 2021

Landscape genetics and the genetic legacy of Upper Paleolithic and Mesolithic hunter-gatherers in the modern Caucasus

Article Open access 09 September 2021

Introduction

The preservation of animal genetic diversity is fundamental to ensure food security and the development of farming communities¹. Among the key factors shaping genetic and phenotypic diversity there is climatic and environmental heterogeneity², with indigenous domestic breeds showing better adaptation to local environments than highly productive breeds kept in controlled farming systems in which the effects of climatic challenges are minimized^3,4,5. In this rapidly evolving situation, species that are mostly reared in marginal rural areas of the world such as goats are also more likely to be among those most affected by environmental changes⁶.

The Italian territory is characterized by a rich environmental diversity, spanning from the polar climate of the Alps to the Mediterranean climate of the south and isles⁷. This environmental richness is paired with a reservoir of genetic resources for the caprine species counting over 36 goat breeds registered by the National Goat and Sheep breeds Association (http://www.assonapa.it). Italian goat genetic resources are managed throughout diverse farming systems ranging from intensive and semi-intensive to traditional grazing and transhumance. Importantly, most of them pertain to marginal areas where they play a crucial socio-economic role, contributing to the management of landscapes, biodiversity preservation, and the production of niche traditional products⁸. The Italian caprine diversity and adaptation to climate has been previously investigated through genotype data analyses^6,9. However, no previous work included breeds from the central areas of the country, which hosts several minor, local, and niche populations¹⁰. Importantly, the latter are specifically adapted to the broad range of Italian eco-climatic scenarios, making them an ideal model to investigate genetic adaptation to climate, which only few studies tried to address in the goat species^11,12,13.To investigate the link between territory and genetics, we analyse the second release of the goat genotype dataset assembled by the Italian Goat Consortium, a collaboration across Italian universities that aims to enhance the understanding of the Italian goat genetic variability. This new release of the dataset, called IGC2, expands the previous version, improving the sampling coverage of the central-southern goat populations and completes the whole Italian panorama and its internal connections (Supplementary Fig. 1). To the best of our knowledge, IGC2 currently represents the largest national-level genotyping effort on goat biodiversity¹⁰.

The most recent climate predictions by the Koppen-Geiger climate classification foresee hotter and drier climate across the Italian peninsula over the next 70 years¹⁴. Such changes will likely affect locally adapted populations by reducing food availability (e.g., pasture and forage crop availability and quality¹⁵), increase temperature-related health problems (illness, death rates, increased diffusion of vector-borne diseases and parasites), and cause metabolic problems (decreasing productive and reproductive performance or depressing feed intake^8,16). Locally adapted breeds will be the hardest hit, mostly relying on grazing^15,17.

We performed genome-wide analyses on the IGC2 dataset to investigate the genetic structure of Italian goat breeds with particular attention to the newly sampled local populations, and link the heritage and genomic structure of current populations with the present and future climatic condition of the rearing areas. Our results will help to understand their environment-driven adaptation and draw effective management plans to face climate change^18,19.

Results and discussion

Genotyping control and datasets creations

After filtering the initial raw dataset of 1,071 goats for poor quality genotype and related animals we obtained the dataset used for the haplotype sharing analysis, which included 980 animals and 48,396 SNPs. This dataset was further pruned for linkage disequilibrium (LD; r² < 0.2).

We first balanced the number of animals in the different populations by reducing the size of the nine largest groups, leaving 42,088 SNPs and 802 individuals. We used this dataset to perform population structure analyses (Multi-Dimensional Scaling (MDS), Admixture, and Reynolds distances).

Upon the removal of 2nd-degree related individuals and animals without geographical coordinates, we retained 41,898 SNPs and 489 individuals for Landscape Genomics analyses. See Table 1 for the detailing of the different datasets.

Table 1 Composition of the datasets used for the different analysis, with names, codes and number of samples processed and filtered for each population and grouped by the type of analysis.

Full size table

Population structure

The MDS plot showed a north–south geographic gradient comparable with previous findings on Italian goat population structure⁶. The first MDS component identified three main groups corresponding to northern Italian, central-southern Italian, and Maltese populations. The second MDS component discriminated the insular Montecristo goat (MNT_I; Fig. 1a) from the other mainland breeds, likely due to the high inbreeding and prolonged geographical isolation (Somenzi et al., in preparation). For this reason, we excluded the two Montecristo populations (MNT_M and MNT_I) from the subsequent population structure and haplotype sharing analysis and to repeat the analyses without them. The new MDS plot without the two MNT populations (Fig. 1b) still separated the three main groups on the first component, a structure further supported by the bootstrapped Reynolds’ distances phylogenetic tree (Supplementary Fig. 2).

Although overlapping a previous investigation of the Italian caprine population structure⁶, our improved dataset identified a closer relationship between the central and southern Italian population, more in accordance with the recent known history and geography of the Country. Until 1860 Italy was divided in many states with tight connections to other European kingdoms (https://www.150anni.it). The north-western part of the country and Sardinia were part of the Sardinian kingdom, tightly connected with the French empire, whereas the north-east part (the Kingdom of Lombardy—Venetia) was under the political influence of the Austrian Empire. Central Italy was ruled by the Papal state, and southern Italy and Sicily were under the Kingdom of the two Sicilies ruled by the Borbone (Fig. 2)²⁰.

The ADMIXTURE analysis (Supplementary Fig. 3) at K = 2 separates the Maltese populations (purple component) and the Northern Italy breeds (yellow component), and improves the representation of the North–South gradient over previous studies on Italian goat populations⁶. At K = 3 it resembles the MDS plot distinguishing the central-southern Italian breeds led by the Girgentana (GIR; blue component) and the mean proportion for each breed overlap nicely with the political borders of Italy prior 1860 (Fig. 2). Each K above 3 distinguishes single or groups of breeds, such as Teramana (TER; K = 4) and Valdostana (VAL; K = 5). The lowest cross-validation error was recorded at K = 20 (Supplementary Fig. 4) and showed the similar genetic background of those breeds originated from the same geographical regions (north, central, south and Maltese), and some breeds identified by private clusters, once again confirming the uniqueness of GIR, ORO, VAL, TER and SAM, among others (Supplementary Fig. 3).

The haplotype sharing analysis across populations (Fig. 3) also highlights the three genetic groups corresponding to admixture K = 3 and consistently with the administrative and temporal history of the Italian Peninsula until 1860²⁰.

We observe that the Northern-Italian populations (yellow cluster) show no haplotype exchange with the other clusters, with the exception of SAA and TER probably due to a recent introgression event. Within the Northern-Italian cluster there is a more pronounced haplotype sharing among the Lombardy breeds (ORO, NVE, LIV and BIO) than among those from the rest of the Alps. The Val Passiria (VPS) together with the Garfagnana (GAR) are the only two populations that do not exchange haplotypes at all, perhaps suggesting a geographical and/or political isolation. Populations from Central-Southern Italy (blue cluster) show large haplotype sharing within and among different clusters, possibly due to breeding and management practices as well as local geographical conditions, such as breeds from the Lazio region (BIA, GCI, CAP, and FUL) have high haplotype sharing among themselves. Lastly, the populations from the isles and in particular the Maltese (MAL and SAM, purple cluster) and Sarda (SAR and MXS) are those that mostly shared haplotypes with all other southern breeds, probably as a consequence of their high productivity and diffusion over the territory. The green colour represents the outgroup Capra aegagrus that does not exchange haplotypes with any of the other breeds. Importantly, future investigations with dedicated experimental designs aimed to dissect the different effects of selection might aid unfolding the undergoing evolutionary dynamics.

The political subdivision of Italy preceding the unification of the country has probably contributed to maintain the ancient genetic flows from central-north Europe in the north of the country and from Africa and Spain in the south¹³, with only a minor impact on the population structure of the following 150 years of history of the country.

Landscape genomics

The landscape genomics analyses (LGA) were performed using the climatic variables representing the current climate applying two different approaches: Samβada²³ and LFMM²⁴. We observed no direct overlap between the two methods. However, this is not surprising as simulation studies showed that LFMM is overall more conservative than Samβada, and the two methods tend to have marginal overlap on co-selecting the same signals, with the most significant loci detected by Samβada ignored by LFMM²³. Samβada identified 252 genotypes belonging to 216 different SNPs significantly associated (FDR < 0.05) with at least one climatic variable (Supplementary Table 1). Among them, 75 SNPs mapped within a gene region annotated in the goat genome (ARS1.2), identifying a total of 62 different genes associated with at least one of the following four representative environmental variables: “Isothermality” (47 genes), “Mean diurnal range” (four genes), “Mean temperature wettest quarter” (three genes) and “Precipitation coldest Quarter” (11 genes) (Supplementary Table 2). Some of these genes had already been identified in other landscape genomics works in relation with different environmental variables, for example ANK3 and BTRC in relation to longitude, and RYR3 with Mean Temperature of the wettest quarter (BIO3)¹⁹. The DCLK1 gene, in particular, was found in association with the continental goat group compared to the rest of the world⁹. Details on correlations among representative and excluded variables are shown in Supplementary Table 3.

Initially, we investigated the role in biological pathways of the 62 genes identified by Samβada (Supplementary Table 2), splitting them according to the associated environmental variable. We identified only one significant pathway (“Circadian rhythm related genes WP3594”; adjusted p-value < 0.0045) for those genes associated with “Mean diurnal range” with two genes linked to the circadian clock regulation (MAPK9²⁵) and to hair follicle formation and hair growth in Cashmere goat (NTKR3²⁶).

We also analysed the 62 genes individually to better understand their function. Using the information found, we can clump the most interesting genes into four groups based on the phenotype they affect the most: (1) meat- and growth-related genes, (2) circadian rhythm-related genes, (3) fertility-related genes, and (4) inflammatory response genes.

The first group (meat and growth) is the largest and counts 24 genes, including HADC9, which has a role in the feedback inhibition of myogenic differentiation in sheep muscle²⁷, DLG1, that is related to adipogenesis and residual feed intake in cows²⁸, and KLF12, which is related to the formation of preadipocytes in goats²⁹. The second group (circadian rhythm) includes 12 genes, such as MAPK9 and EYA3, both related to melanin production and photoperiod regulation³⁰, and KCNJ1, associated with the production of polyunsaturated fatty acid (PUFA) and feed efficiency in cattle^5,31,32. The third group (fertility-related) includes 15 genes such as BTRC, whose mutations can affect spermatogenesis and mammary gland development in mice³³, PRKD1, associated to age at puberty in pigs³⁴, and DENND1A, related to anti-Mullerian hormone and superovulation in dairy cows and to polycystic ovarian syndrome in human³⁵. Finally, the fourth group (inflammatory response) includes eight genes such as BTLA, strongly related to rheumatoid arthritis³⁶. This last gene in particular is relevant as a candidate for one of the most relevant infective diseases of goats worldwide, the Caprine Arthritis Encephalitis Virus (CAEV). This virus belongs to the Retroviriade virus family, like the human immunodeficiency virus (HIV), and has rheumatoid arthritis among its principal symptoms^37,38. Due to the CAEV importance and the relevance of climatic factors and their change play into pathogens diffusion³⁹, this group of genes becomes a potential candidate for studies on livestock resilience to incoming climate challenges.

LFMM identified four SNPs significantly associated (FDR < 0.05) with three different climatic variables (Mean Diurnal Range, Mean Temperature Wettest Quarter, and SlopeP), two of which intercepting NBEA, a gene located within a region involved with wool production in sheep⁴⁰ and previously associated with continental goat group in the work of Bertolini et all 2018⁹, and the RHOBTB1 gene that is known to be associated to meat quality in cattle⁴¹ (Supplementary Table 4).

Future genotypes prediction

The data collected on the current Koppen-Geiger climate classification showed that 21 Italian breeds live in “Temperate” regions, eight in “Cold” regions (BIO, SAA, VLS, TER, MNT_M, LIV, ORO, VPS), two in “Arid” regions (GAR, MNT_I), and one in a “Polar” region (VAL; Fig. 4a). BEZ and MXS were not considered for the analysis due to lack of georeferenced information. If we compare the current Koppen-Geiger classification of their breeding areas with the future predictions (Fig. 4b), we observe that, in 70 years from now, only 11 breeds will live in regions that will not change their classification. Such a scenario will likely pose new threats to those populations living in colder climates, whereas those breeds coming from the warmer parts of the country might have a chance to expand their range, with direct repercussions on the genetic diversity and survival of these breeds.

Among them, nine (ASP, BIA, CAP, GRF, MES, NIC, RCC, RME, SAR) populate “Temperate dry hot summer (Csa)” areas, one (GAR) is present in an “Arid step cold (Bsk)” area, and one (NVE) in a “Temperate without dry season hot summer (Cfa)” region. The remaining 21 breeds populate regions with a warmer and drier climate in the future (Table 2).

Table 2 Present and future predicted Koppen-climate class and Anova classification for breed divided in the two groups: HOT/NOTHOT and DRY/NOTDRY (see Materials and Methods).

Full size table

A one-way ANOVA analysis applied on the groups based on the Koppen-Geiger classification identified 27 SNPs that significantly differentiate the groups DRY/NOTDRY (seven within a gene region) and 11 that differentiate the groups HOT/NOTHOT (two within a gene region) (Supplementary Table 5). The linear regression model, applied to verify the variation of the genotype frequencies over time based on the value of their related variables, allowed us to identify five significant SNPs out of nine, intercepting the genes CHD2, ARL13B, KLF12, and PAK5 for the DRY/NOTDRY group and RACGAP1 for the HOT/NOTHOT group (Supplementary Table 6). Then, we calculated the expected future variation of allelic and genotypic frequencies of the significant SNPs in these groups. For instance, the SNP “snp32991-scaffold385-133908” intercepts the ARL13B gene and is associated to “Isothermality” with the genotype GG. At present, the frequency of the G allele of this SNP is 0.4296 in the DRY group and 0.6109 in the NOTDRY group and the delta of the variable “Isothermality” for the two groups is respectively − 0.1253 for the DRY group and − 0.0935 for the NOTDRY group. Using the regressor of the linear regression model (b = 0.3278), we predicted the future G allele frequency for this SNP in both groups (0.3885 and 0.5802 for the DRY and NOTDRY group, respectively) and consequently the expected GG genotype frequency (respectively 0.1509 for the DRY group and 0.3366 in the NOTDRY group). This simplified model suggests a future reduction of the genotype currently associated with the reference variable (“Isothermality”) in both groups. Interestingly, the gene intercepted by ARL13B interacts with RABGEF1, related to the reduction of the circadian cycle in humans according to the GenomeRNAi human phenotypes database (http://www.genomernai.org). In general, the prediction analysis identified SNPs that might go to stabilization of the frequencies or fixation (see “snp44855-scaffold611-263638” and “snp40739-scaffold521-1667886”, respectively; Table 3).

Table 3 Predicted genotypic frequencies for the five polymorphisms recorded within genes identified to be significantly different between the groups HOT/NOTHOT or DRY/NOTDRY.

Full size table

Conclusions

This new release of the Italian goat consortium dataset (IGC2)—almost three times the size of the previous iteration—fills in the gaps in terms of completeness and representativeness of the Italian caprine diversity. Our analyses overlap and expand on previous studies providing insight into the past, present, and future evolution of the populations considered. We confirm the geographic gradient of goat diversity ranging from north to south⁶, provide fine scale population structure, and highlight the overlap with the geo-political situation in which the breeds evolved. Previous studies have shown how past migrations from Africa and Spain on the one hand, and central Europe and the Alps on the other hand, contributed to shaping the backbone of biodiversity along the peninsula. Nevertheless, the overlap among the three diversity clusters and the political subdivision of Italy up to 160 years ago²⁰ is an intriguing finding that suggests a role for the past socio-political scenario of the country in the current diversity of Italian goats breeds. By investigating the relationship between genotype and environment, we identified several genes which might play a role in the adaptation to temperature and humidity. Interestingly, we identified a gene that can be a fitting candidate for future studies on the caprine arthritis encephalitis virus (CAEV). Lastly, we predicted the future genotypic frequencies under the light of climate changes and foresee the directionality of changes in genotypes frequencies, an important starting point for future studies aiming at improving these analytical approaches. We infer that improved modelling approaches could deepen and perfect such results and shed light on today’s favorable genotypes for specific environmental conditions. These results will likely be instrumental in breeding schemes and genomic selection, assisting locally adapted breeds to cope with the expected climate change toward warmer and drier climates¹⁴.

Material and methods

Biological samples

Management and handling of the animals involved in this study were performed following the Italian and European legislation on animal welfare (D.lgs n. 146/2001, Council Directive 98/58/CE) and adhering to the ARRIVE ESSENTIAL 10 guidelines, where applicable. Blood samples were taken by official veterinary surgeons following the recommendations of the European directive 2010/63, without performing any actual experimental research on animals. Experimental protocol was approved by the Ethical committee of the Department of Veterinary Science of the University of Messina (code 046/2020).

Blood sampling collection of the new individuals was performed using Vacutainer tubes with the K-EDTA anticoagulant, then all the samples were stored at − 20 °C until genomic DNA was extracted using a commercial kit (NucleoSpin Blood, Macherey–Nagel, GmbH & Co KG, Germany) according to the manufacturer’s instructions⁶. DNA samples were genotyped using the GoatSNP50 BeadChip (Illumina Inc., San Diego, CA) developed by the International Goat Genome Consortium (IGGC) at the Agrotis srl (http://www.lgscr.it, Cremona, Italy), Porto Conte Ricerche s.r.l. (Alghero, Sassari, Italy), and University of Palermo facilities (Italy).

Genotyping control and datasets creations

The IGC2 successfully fills the gaps of the previous dataset⁶, intercepting the local diversity of several under-represented areas of the country (i.e., the central regions of Italy) and identifying small, indigenous breeds never characterized before. For this work, 19 new Italian goat populations, for a total amount of 586 individuals, were sampled and added to SNP genotyping data taken from previously studies^6,43,44, including seven Iranian Bezoar (Capra aegagrus) genotyped by the NEXTGEN project as outgroup for the analyses (“NEXTGEN Project” n.d.). From that, we obtained a final raw dataset consisting of 1071 goats from 33 Italian breeds and populations and one wild species, Capra aegagrus (Table 1). Geographical coordinates of the samples at the time of sampling were available for 998 samples (93% of all samples).

The raw dataset was updated to the latest goat genome map (ARS1.2) and pruned to retain SNPs having SNP call rate > 98%, individual call rate > 95% and minor allele frequency (MAF) > 0.05. Then, we pruned loci in linkage disequilibrium (LD), removing one of each couple of SNPs having LD > 0.2 using PLINK v1.90b6⁴⁵. Duplicated individuals (identity by state > 99%) were removed and for each pair of highly related animals (Mendelian Errors count < 100) we excluded the animal occurring in multiple pairs or having the highest missingness. Phasing and imputation of missing genotypes was performed using BEAGLE v4.1⁴⁶, using sliding windows of ~ 5 Mb with an overlap of ~ 2 Mb and allowing two SNPs trimming (~ 0.15 Mb). The resulting dataset was used for the haplotype sharing analysis. In order to investigate the population structure using comparable population sizes, we created a specific dataset reducing the number of individuals for each population to ≤ 30 while maintaining the overall within-population diversity, using the ‘representative.sample’ function implemented in the R package BITE v1.2.0007⁴⁷.

Lastly, individuals with more than second-degree relatedness were identified using the—genome flag in PLINK and removed to perform the Landscape genomics analisis.

Population structure analysis

Population structure analysis was conducted through MDS of the identity by state (IBS) distances obtained with the flag—cluster in PLINK. Maximum likelihood analysis of population structure was conducted using ADMIXTURE v1.3.0 Software⁴⁸. Unsupervised clustering was calculated for K values from 2 to 35. We used fivefold cross-validation (CV) errors for each K to evaluate the optimal partitioning, and plots for each K were generated using an in-house R script. A phylogeny tree based on Reynolds Genetic distances, with 100 bootstrap replicates, was computed using a custom script. A Neighbour‐joining consensus tree was generated using PHYLIP v3.697⁴⁹ and using Bezoar as an outgroup.

The proportion of haplotypes shared among breeds was determined as Identity By Descent (IBD) estimation among individuals, and calculated using RefinedIBD v4.1⁵⁰ on all the individuals that passed the initial quality check. Sliding windows size was set to of 1 Mb, reporting windows of at least 0.2 Mb and allowing 0.05 Mb overlap. We considered the shared haplotypes between two breeds as the median length of shared haplotypes among all the possible pairs of individuals belonging to the breeds considered (individual pairs with no haplotype sharing were assigned length = 0⁵¹).

Landscape genomics

For each georeferenced sample, we used the ‘extract’ function from the R package raster⁵² to retrieve the values of 19 bioclimatic and elevation variables available from the WorldClim database⁵³ as those referring to the time span between 1960 and 1990 as proxy for the current climate, and the estimated future values for 2070 (average for 2061–2080) (Supplementary Table 7). Altitude was used to compute terrain slope through the function raster::terrain. Each variable was retrieved as a raster layer with a spatial resolution of 2.5 arcminutes (~ 4 km). Pairwise correlations were calculated among the climatic variables using JMP⁵⁴.

LGA was performed to assess the genotype/environmental variable association using Samβada v.0.5.3²³ and LFMM v3.1.2²⁴. Analyses were performed using the ‘current’ bioclimatic variables and a more stringent subset of animals. To spare computation time, the number of environmental variables was reduced iteratively by randomly removing one of the two most correlated variables until the maximum correlation across all variables was lower than |r²|< 0.7 as implemented in the R function ‘caret::findCorrelation’⁵⁵. To reduce the risk of false positive detections, we evaluated the genetic structure of our dataset through principal component analysis, used the scree plot to identify the number of principal components to keep to adequately describe the dataset, and included the selected PCs as population structure predictors for the association analysis^56,57. A likelihood-ratio test comparing a null and an alternative model was carried out for each genotype. Specifically, null models included the population structure predictors alone, whereas alternative models included population structure predictors and the focal environmental variable. A genotype was considered significantly associated with the environmental variable if the resulting p-value associated with the likelihood-ratio test statistic was lower than the nominal significance threshold of 0.05 after Benjamini-Hochberg (BH) correction for multiple testing. The R function ‘p.adjust’ was used to perform corrections for multiple testing²¹.

Gene-level analysis

To further investigate the biology underlying the signals identified, we screened all the SNPs that resulted significantly associated with a WorldClim variable for annotated genes of interest at the exact location of each marker in the ARS1.2 goat genome version (https://www.ensembl.org/biomart). These genes were investigated individually (https://www.genecards.org) and used as input for an enrichment analysis for pathways and ontologies using the online tool Enricher (https://amp.pharm.mssm.edu/Enrichr/).

Future genotypes prediction

Comparing the LGA results with the Koppen-Geiger classification relative to the breeding areas of the different Italian breeds, we tried to predict the future frequencies of the genotypes significantly associated with one or more of the environmental variables considered. First, we estimated the extent of climate change that Italian breeds will face comparing the present and future Koppen-Geiger classification¹⁴ (Supplementary Table 8). Then, we grouped the breeds based on temperature according to the current Koppen-Geiger classification, creating the HOT and NOTHOT groups, and humidity, creating the DRY and NOTDRY groups. The HOT group included those breeds that live in Csa and Dfa regions and the NOTHOT group included breeds that live in Csb, Dfb, Dfc, Bsk, and EF regions. The DRY group included those breeds that live in Csa, Csb, and Bsk regions and the NOTDRY group included those breeds that live in Cfa, Dfb, Dfc, and EF regions (see Table 2 and Supplementary Table 8 for the detail of the environmental codes).

We calculated the MAF of all the significant SNPs from LGA in each Italian breed using a custom script. We summarized the MAF in each of the four groups and performed a one-way ANOVA analysis using the R base package²¹ considering the HOT/NOTHOT groups or the DRY/NOTDRY groups as source of variation, identifying the SNPs significantly different in the two couples of groups. We applied a linear regression model (R base Package) only on those SNPs that fall within genomic regions including an annotated gene considering the mean allelic frequencies of the SNP and the mean value of the environmental variable resulted significantly associated with that SNP for each breed in each group. Finally, we calculated the future hypothetical change in allelic and genotypic frequencies only for those SNPs with a statistically significant linear regression model. For each group, we multiplied the delta between the current and the projected future value of the environmental variable associated with the SNP with the regressor of the linear model.

Ethical approval

All authors declare that animal samples were obtained in compliance with local/national laws in force at the time of sampling. Genotyping data exchange was in accordance with national and international regulations.

Data availability

The genotyping data for the Italian goat considered in this study are deposited and publicly available on Mendeley Data (DOI: 10.17632/hnd59x6gmg.1; URL: https://data.mendeley.com/datasets/hnd59x6gmg/1).

Change history

20 September 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41598-021-98758-3

References

Hoffmann, I. Climate change and the characterization, breeding and conservation of animal genetic resources. Anim. Genet. 41(1), 32–46 (2010).
Article PubMed Google Scholar
Lv, F.-H. et al. Adaptations to climate-mediated selective pressures in sheep. Mol. Biol. Evol. 31, 3324–3343 (2014).
Article CAS PubMed PubMed Central Google Scholar
Bruford, M. W. et al. Prospects and challenges for the conservation of farm animal genomic resources, 2015–2025. Front. Genet. 6, 314–314 (2015).
Article PubMed PubMed Central CAS Google Scholar
Pariset, L., Joost, S., Marsan, P. A. & Valentini, A. Landscape genomics and biased FST approaches reveal single nucleotide polymorphisms under selection in goat breeds of North-East Mediterranean. BMC Genet. 10, 7–7 (2009).
Article PubMed PubMed Central CAS Google Scholar
Barbato, M. et al. Adaptive introgression from indicine cattle into white cattle breeds from Central Italy. Sci. Rep. 10, 1279–1279 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Nicoloso, L. et al. Genetic diversity of Italian goat breeds assessed with a medium-density SNP chip. Genet. Sel. Evol. 47, 1–10 (2015).
Article CAS Google Scholar
Fratianni, S. & Acquaotta, F. The Climate of Italy 29–38 (Springer, 2017). https://doi.org/10.1007/978-3-319-26194-2_4.
Book Google Scholar
Marino, R. et al. Climate change: production performance, health issues, greenhouse gas emissions and mitigation strategies in sheep and goat farming. Small Rumin. Res. 135, 50–59 (2016).
Article Google Scholar
Bertolini, F. et al. Signatures of selection and environmental adaptation across the goat genome post-domestication 06 biological sciences 0604 genetics. Genet. Sel. Evol. 50, 1–24 (2018).
Google Scholar
Talenti, A. et al. Italian Goat Consortium: a collaborative project to study the Italian caprine biodiversity. In ASPA 22nd congress 75–76 (Italian Journal of Animal Science, 2017).
Kim, E. S. et al. Multiple genomic signatures of selection in goats and sheep indigenous to a hot arid environment. Heredity 116, 255–264 (2016).
Article CAS PubMed Google Scholar
Mdladla, K., Dzomba, E. F. & Muchadeyi, F. C. The potential of landscape genomics approach in the characterization of adaptive genetic diversity in indigenous goat genetic resources: a South African perspective. Small Rumin. Res. 150, 87–92 (2017).
Article Google Scholar
Stella, A. et al. AdaptMap: exploring goat diversity and adaptation. Genet. Sel. Evol. 50, 61–61 (2018).
Article PubMed PubMed Central Google Scholar
Beck, H. E. et al. Present and future Köppen-Geiger climate classification maps at 1-km resolution. Sci. Data 5, 1–12 (2018).
Article ADS Google Scholar
Nardone, A., Ronchi, B., Lacetera, N., Ranieri, M. S. & Bernabucci, U. Effects of climate changes on animal production and sustainability of livestock systems. Livest. Sci. 130, 57–69 (2010).
Article Google Scholar
Rojas-Downing, M. M., Nejadhashemi, A. P., Harrigan, T. & Woznicki, S. A. Climate change and livestock: impacts, adaptation, and mitigation. Clim. Risk Manag. 16, 145–163 (2017).
Article Google Scholar
Rischkowsky, B. & Pilling, D. The State of the World’s Animal Genetic Resources for Food and Agriculture (Commission on Genetic Resources for Food and Agriculture, Food and Agriculture Organization of the United Nations, 2007).
Google Scholar
Rochat, E. & Joost, S. Spatial areas of genotype probability (SPAG): predicting the spatial distribution of adaptive genetic variants under future climatic conditions. bioRxiv https://doi.org/10.1101/2019.12.20.884114 (2019).
Article Google Scholar
Mdladla, K., Dzomba, E. F. & Muchadeyi, F. C. Landscape genomics and pathway analysis to understand genetic adaptation of South African indigenous goat populations. Heredity 120, 369–378 (2018).
Article CAS PubMed PubMed Central Google Scholar
Riall, L. The Italian Risorgimento (Routledge, 2002). https://doi.org/10.4324/9780203412343.
Book Google Scholar
R Development Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2020).
Google Scholar
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genom. Res. 19, 1639–1645 (2009).
Article CAS Google Scholar
Stucki, S. et al. High performance computation of landscape genomic models including local indicators of spatial association. Mol. Ecol. Resour. 17, 1072–1089 (2017).
Article CAS PubMed Google Scholar
Caye, K., Jumentier, B., Lepeule, J. & François, O. LFMM 2: fast and accurate inference of gene-environment associations in genome-wide studies. Mol. Biol. Evol. 36, 852–860 (2019).
Article CAS PubMed PubMed Central Google Scholar
Yoshitane, H. et al. JNK regulates the photic response of the mammalian circadian clock. EMBO Rep. 13, 455–461 (2012).
Article CAS PubMed PubMed Central Google Scholar
Liu, H., Wang, T., Wang, J., Quan, F. & Zhang, Y. Characterization of Liaoning Cashmere goat transcriptome: sequencing, De Novo assembly, functional annotation and comparative analysis. PLoS ONE 8, 1–11 (2013).
Google Scholar
Fleming-Waddell, J. N. et al. Effect of DLK1 and RTL1 but not MEG3 or MEG8 on muscle gene expression in callipyge lambs. PLoS ONE 4, e7399 (2009).
Article ADS PubMed PubMed Central CAS Google Scholar
Seabury, C. M. et al. Genome-wide association study for feed efficiency and growth traits in U.S. beef cattle. BMC Genom. 18, 1–25 (2017).
Article CAS Google Scholar
Xu, Q., Lin, Y., Wang, Y., Bai, W. & Zhu, J. Knockdown of KLF9 promotes the differentiation of both intramuscular and subcutaneous preadipocytes in goat. Biosci. Biotechnol. Biochem. 84, 1594–1602 (2020).
Article CAS PubMed Google Scholar
Dupré, S. M. et al. Identification of melatonin-regulated genes in the ovine pituitary pars tuberalis, a target site for seasonal hormone control. Endocrinology 149, 5527–5539 (2008).
Article PubMed CAS Google Scholar
Ibeagha-Awemu, E. M., Peters, S. O., Akwanji, K. A., Imumorin, I. G. & Zhao, X. High density genome wide genotyping-by-sequencing and association identifies common and low frequency SNPs, and novel candidate genes influencing cow milk traits. Sci. Rep. 6, 1–18 (2016).
Article CAS Google Scholar
Crislip, G. R. et al. Differences in renal BMAL1 contribution to Na+homeostasis and blood pressure control in male and female mice. Am. J. Physiol. Ren. Physiol. 318, F1463–F1477 (2020).
Article CAS Google Scholar
Van Den Berg, I. et al. Concordance analysis for QTL detection in dairy cattle: a case study of leg morphology. Genet. Sel. Evol. 46, 1–14 (2014).
Google Scholar
Nonneman, D. J. et al. Genome-wide association and identification of candidate genes for age at puberty in swine. BMC Genet. 17, 1–9 (2016).
Article CAS Google Scholar
Nawaz, M. Y. et al. Genomic heritability and genome-wide association analysis of anti-Müllerian hormone in Holstein dairy heifers. J. Dairy Sci. 101, 8063–8075 (2018).
Article CAS PubMed Google Scholar
Lin, S. C., Kuo, C. C. & Chan, C. H. Association of a BTLA gene polymorphism with the risk of rheumatoid arthritis. J. Biomed. Sci. 13, 853–860 (2006).
Article CAS PubMed Google Scholar
Colussi, S. et al. A single nucleotide variant in the promoter region of the CCR5 gene increases susceptibility to arthritis encephalitis virus in goats. BMC Vet. Res. 15, 1–6 (2019).
Article CAS Google Scholar
Schultz, E. B. et al. Short communication: genetic parameter estimates for caprine arthritis encephalitis in dairy goats. J. Dairy Sci. 10, 6407–6411 (2020).
Article CAS Google Scholar
Epstein, P. R. Climate change and emerging infectious diseases. Microb. Infect. 3, 747–754 (2001).
Article CAS Google Scholar
Wang, Z. et al. Genome-wide association study for wool production traits in a Chinese Merino sheep population. PLoS ONE 9, e107101 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Silva, D. B. S. et al. Spliced genes in muscle from Nelore Cattle and their association with carcass and meat quality. Sci. Rep. 10, 14701 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Rubel, F., Brugger, K., Haslinger, K. & Auer, I. The climate of the European Alps: Shift of very high resolution Köppen-Geiger climate zones 1800–2100. Meteorol. Z. 26, 115–125 (2017).
Article Google Scholar
Talenti, A. et al. A method for single nucleotide polymorphism selection for parentage assessment in goats. J. Dairy Sci. 99, 3646–3653 (2016).
Article CAS PubMed Google Scholar
Talenti, A. et al. The Valdostana goat: a genome-wide investigation of the distinctiveness of its selective sweep regions. Mamm. Genom. 28, 114–128 (2017).
Article CAS Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4, 7–7 (2015).
Article PubMed PubMed Central CAS Google Scholar
Browning, S. R. & Browning, B. L. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am. J. Hum. Genet. 81, 1084–1097 (2007).
Article CAS PubMed PubMed Central Google Scholar
Milanesi, M. et al. BITE: an R package for biodiversity analyses. bioRxiv https://doi.org/10.1101/181610 (2017).
Article Google Scholar
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1655 (2009).
Article CAS PubMed PubMed Central Google Scholar
Felsenstein., J. PHYLIP—Phylogeny Inference Package (Version 3.2) (1989).
Browning, B. L. & Browning, S. R. Improving the accuracy and efficiency of identity-by-descent detection in population data. Genetics 194, 459–471 (2013).
Article PubMed PubMed Central Google Scholar
Talenti, A. et al. Studies of modern Italian dog populations reveal multiple patterns for domestic breed evolution. Ecol. Evol. 8, 2911–2925 (2018).
Article PubMed PubMed Central Google Scholar
Hijmans, R. J. Geographic Data Analysis and Modeling [R Package Raster Version 3.3-13]. (2020).
Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G. & Jarvis, A. Very high resolution interpolated climate surfaces for global land areas. Int. J. Climatol. 25, 1965–1978 (2005).
Article Google Scholar
SAS Institute Inc. JMP.
Kuhn, M. et al. Package ‘ caret ’ (2020).
Rellstab, C., Gugerli, F., Eckert, A. J., Hancock, A. M. & Holderegger, R. A practical guide to environmental association analysis in landscape genomics. Mol. Ecol. 24, 4348–4370 (2015).
Article PubMed Google Scholar
Vajana, E. et al. Combining landscape genomics and ecological modelling to investigate local adaptation of indigenous Ugandan cattle to east coast fever. Front. Genet. 9, 385–385 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors thank all the researchers involved in the Italian Goat Consortium, the Italian National Association of Sheep and Goat breeders (ASSONAPA) and all the breeders for their availability in data, samples and information provision. The authors acknowledge support from the University of Milan through the APC initiative.

Author information

These authors contributed equally: Matteo Cortellari and Mario Barbato.

Authors and Affiliations

Dipartimento di Scienze Agrarie e Ambientali – Produzione, Territorio, Agroenergia, Università degli Studi di Milano, Via Celoria 2, 20133, Milan, Italy
Matteo Cortellari, Andrea Talenti, Arianna Bionda, Stefano Frattini, Alessio Negro & Paola Crepaldi
Dipartimento di Scienze Animali, della Nutrizione e degli Alimenti and BioDNA Centro di ricerca sulla Biodiversità e sul DNA Antico, Università Cattolica del Sacro Cuore, Via Emilia Parmense 84, 29122, Piacenza, Italy
Mario Barbato, Paolo Ajmone-Marsan & Licia Colli
The Roslin Institute, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
Andrea Talenti
Unità di Ricerca di Genetica e Biotecnologie, Agris Sardegna, 07100, Sassari, Italy
Antonello Carta
Dipartimento di Scienze Veterinarie, Università di Pisa, Viale delle Piagge 2, 56124, Pisa, Italy
Roberta Ciampolini
Dipartimento di Bioscienze Biotecnologie e Biofarmaceutica, Università degli Studi di Bari, Via Orabona 4, 70126, Bari, Italy
Elena Ciani
Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria (CREA) - Research Centre for Animal Production and Acquaculture, 00015, Monterotondo, Rome, Italy
Alessandra Crisà
Department of Agricultural, Food and Environmental Sciences, University of Perugia, 06121, Perugia, Italy
Emiliano Lasagna & Francesca M. Sarti
Department of Agriculture, Food and Environment, University of Catania, Via Valdisavoia 5, 95123, Catania, Italy
Donata Marletta
Dipartimento Scienze Agrarie, Alimentari e Forestali, University of Palermo, 90128, Palermo, Italy
Salvatore Mastrangelo
Department of Chemistry and Bioscience, Faculty of Engineering and Science, University of Aalborg, Aalborg, Denmark
Ettore Randi
Dipartimento di Scienze Veterinarie, Università degli Studi di Torino, largo Braccini 2, 10095, Grugliasco, Italy
Stefano Sartore & Dominga Soglia
Dipartimento di Scienze Veterinarie, University of Messina, Messina, Italy
Luigi Liotta
Institute of Biology and Biotechnology in Agriculture, National Research Council (CNR), Milan, Italy
Alessandra Stella
Dipartimento Agricoltura, Ambiente e Alimenti Universitá degli Studi del Molise, 86100, Campobasso, Italy
Fabio Pilla

Authors

Matteo Cortellari
View author publications
You can also search for this author in PubMed Google Scholar
Mario Barbato
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Talenti
View author publications
You can also search for this author in PubMed Google Scholar
Arianna Bionda
View author publications
You can also search for this author in PubMed Google Scholar
Antonello Carta
View author publications
You can also search for this author in PubMed Google Scholar
Roberta Ciampolini
View author publications
You can also search for this author in PubMed Google Scholar
Elena Ciani
View author publications
You can also search for this author in PubMed Google Scholar
Alessandra Crisà
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Frattini
View author publications
You can also search for this author in PubMed Google Scholar
Emiliano Lasagna
View author publications
You can also search for this author in PubMed Google Scholar
Donata Marletta
View author publications
You can also search for this author in PubMed Google Scholar
Salvatore Mastrangelo
View author publications
You can also search for this author in PubMed Google Scholar
Alessio Negro
View author publications
You can also search for this author in PubMed Google Scholar
Ettore Randi
View author publications
You can also search for this author in PubMed Google Scholar
Francesca M. Sarti
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Sartore
View author publications
You can also search for this author in PubMed Google Scholar
Dominga Soglia
View author publications
You can also search for this author in PubMed Google Scholar
Luigi Liotta
View author publications
You can also search for this author in PubMed Google Scholar
Alessandra Stella
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Ajmone-Marsan
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Pilla
View author publications
You can also search for this author in PubMed Google Scholar
Licia Colli
View author publications
You can also search for this author in PubMed Google Scholar
Paola Crepaldi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.C., M.C., L.C., M.B., and A.T. conceived the study. M.C., P.C., M.B. and A.T. performed the analyses. AB, AN, and SF performed lab work. A.C., A.S., D.M., D.S., E.L., E.R., F.M.S., F.P., L.C., P.A.M., R.C., S.M., S.S., L.L. and P.C. provided samples. M.C., M.B., A.T., L.C. and P.C. contributed to data interpretation and drafted the manuscript. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Andrea Talenti.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this Article was revised: The Data Availability section in the original version of this Article was omitted. It now reads: “The genotyping data for the Italian goat considered in this study are deposited and publicly available on Mendeley Data (DOI: 10.17632/hnd59x6gmg.1; URL: https://data.mendeley.com/datasets/hnd59x6gmg/1).”

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Supplementary Information 5.

Supplementary Information 6.

Supplementary Information 7.

Supplementary Information 8.

Supplementary Information 9.

Supplementary Information 10.

Supplementary Information 11.

Supplementary Information 12.

Supplementary Information 13.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cortellari, M., Barbato, M., Talenti, A. et al. The climatic and genetic heritage of Italian goat breeds with genomic SNP data. Sci Rep 11, 10986 (2021). https://doi.org/10.1038/s41598-021-89900-2

Download citation

Received: 18 November 2020
Accepted: 29 April 2021
Published: 26 May 2021
DOI: https://doi.org/10.1038/s41598-021-89900-2

This article is cited by

The demographic history and adaptation of Canarian goat breeds to environmental conditions through the use of genome-wide SNP data
- Gabriele Senczuk
- Martina Macrì
- Amparo Martínez
Genetics Selection Evolution (2024)
Characterization of heterozygosity-rich regions in Italian and worldwide goat breeds
- Giorgio Chessari
- Andrea Criscione
- Salvatore Mastrangelo
Scientific Reports (2024)
Genome-wide mapping of signatures of selection using a high-density array identified candidate genes for growth traits and local adaptation in chickens
- Salvatore Mastrangelo
- Slim Ben-Jemaa
- Martino Cassandro
Genetics Selection Evolution (2023)
Runs of homozygosity in the Italian goat breeds: impact of management practices in low-input systems
- Matteo Cortellari
- Arianna Bionda
- Paola Crepaldi
Genetics Selection Evolution (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and discussion

Genotyping control and datasets creations

Population structure

Landscape genomics

Future genotypes prediction

Conclusions

Material and methods

Biological samples

Genotyping control and datasets creations

Population structure analysis

Landscape genomics

Gene-level analysis

Future genotypes prediction

Ethical approval

Data availability

Change history

20 September 2021

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links