Adaptive fitness of Sapindus emarginatus Vahl populations towards future climatic regimes and the limiting factors of its distribution

Kumar Pal, Ashish; Vaishnav, Vivek; Meena, Baleshwar; Pandey, Nalini; Singh Rana, Tikam

doi:10.1038/s41598-020-60219-8

Download PDF

Article
Open access
Published: 02 March 2020

Adaptive fitness of Sapindus emarginatus Vahl populations towards future climatic regimes and the limiting factors of its distribution

Ashish Kumar Pal^1,2,
Vivek Vaishnav¹,
Baleshwar Meena¹,
Nalini Pandey² &
…
Tikam Singh Rana¹

Scientific Reports volume 10, Article number: 3803 (2020) Cite this article

1685 Accesses
4 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Sapindus emarginatus Vahl (Sapindaceae) also known as ‘Indian Soap nut’ is significantly important for saponin content in its fruits. However, its current population in India is heavily fragmented due to a lack of sustainable harvesting practices. Moreover, changing climatic regimes may further limit its distribution and possibly compromise the survival of the species in nature. The aim of the present study was to: predict the future distribution range of S. emarginatus; identify the bioclimatic variables limiting this distribution and to evaluate its adaptive fitness and genomic resilience towards these variables. To determine future species distribution range and identify limiting bioclimatic variables, we applied two different ecological niche models (ENMs; BioClim and MaxEnt) on real occurrence data (n = 88 locations). The adaptive fitness of the species was evaluated by quantifying the genetic variability with AFLP markers and marker-environmental associations, using AFLP-associated Bayesian statistics. We found 77% overlap between the baseline (2030) and predicted (2100) species distribution ranges, which were primarily determined by maximum temperature (T_MAX) and mean annual precipitation (MAP). The T_MAX and MAP contributed 43.1% and 27.1%, respectively to ENM model prediction. Furthermore, AFLP loci significantly associated with bioclimatic variables, and T_MAX and MAP represent the lowest proportion (6.15%), confirming to the severe response of the species genome towards these variables. Nevertheless, the very low Linkage disequilibrium (LD) in these loci (4.54%) suggests that the current sensitivity to T_MAX and MAP is subject to change during recombination. Moreover, a combination of high heterozygosity (0.40–0.43) and high within-population variability (91.63 ± 0.31%) confirmed high adaptive fitness to maintain reproductive success. Therefore, the current populations of S. emarginatus have substantial genomic resilience towards future climate change, albeit significant conservation efforts (including mass multiplication) are warranted to avoid future deleterious impacts of inbreeding depression on the fragmented populations.

Seed germination schedule and environmental context shaped the population genetic structure of subtropical evergreen oaks on the Yun-Gui Plateau, Southwest China

Article 26 November 2019

Assessing signals of selection and historical demography to develop conservation strategies in the Chilean emblematic Araucaria araucana

Article Open access 15 October 2021

Genetic diversity and population structure in Nothofagus pumilio, a foundation species of Patagonian forests: defining priority conservation areas and management

Article Open access 06 November 2020

Introduction

Sapindus emarginatus Vahl (Sapindaceae) also known as ‘Indian ‘Soap nut’ is a tree species native to Indian sub-continent. In India, it is considered to be originated in the Western-Ghats and later extended up to the West- Central- North Indian biogeographical regions¹. The species is of substantial importance for global trade in soap and perfumery industries due to the oil and saponin content obtained from its leaves, fruits and seeds. The global trade of saponin from all plant sources is expected to grow up to 970 million US$ in 2023, from 950 million US$ in 2017². India exports approximately 63,368 kg/year of saponin extract obtained from all plant sources to different countries³. The exact contribution of S. emarginatus to the global saponin market is unclear, but probably substantial as the species is widely used traditionally at Indian homes as detergent to wash precious clothes and ornaments and as shampoo to wash hair. Due to high commercial demand, the species has been enlisted under ‘priority species of economic importance’ in the country report of state of forest genetic resource in India⁴ and the available natural patches of the species in some places have been restored through reforestation⁴. Lack of sustainable harvesting practices has been causing problems in the natural regeneration, as the fruits are harvested without leaving seeds for its natural regeneration^5,6, which led to fragmentation of the populations from wild and now the plants are found scattered in dry and moist deciduous forests and its periphery. Further, the distribution of the plants is subjected to significant impact due to changing climatic regimes. Therefore, there is a need to evaluate the available genetic resource of S. emarginatus and its adaptive fitness in projected bioclimatic regimes to ensure its sustainability.

Species distribution modeling (SDM) based on fundamental ecological niche theory has become an integral tool to provide the biogeographic extent of a species distribution⁷. With the current occurrence data of a species, the ecological niche modeling (ENM) tools such as BioClim (bioclimatic analysis and prediction system) and MaxEnt (maximum entropy), have been employed to predict the suitable niches for species population and their response toward different bioclimatic variables integrating the projected climatic regimes^8,9. However, the actual climate tolerance of long-lived species is wider than the climatic envelope they currently occupy¹⁰. Specifically, the forest species are practiced to plant outside of its natural distribution for commercial, breeding and improvement purposes in monoculture. It has also been suggested for covering the commercial forestry trials, botanical gardens, and biodiversity database along with the natural distribution of a tree species to measure its climatic or environmental requirements through SDM/ENM¹¹. Nevertheless, these model-based predictions are not practically successful until the species population itself is not adaptively fit¹². A species needs to be resilient enough for local adaptation to maintain its survivability facing bottleneck selection pressures. In particular, the response of a species may vary toward dynamic extrinsic factors like changing climatic conditions. On the other hand, the existence of a high level of genetic heterogeneity within a population can lead to the increased adaptive fitness and evolutionary potential in longer-term^13,14,15,16. An adaptive fit population is tend to transfer the same genetic characteristic to the future generations¹⁷. Ultimately, the response of species to the predicted changing climatic condition with high genetic heterogeneity leading to the adaptive fitness is of significant importance for the sustainability of population in limited distribution¹⁸.

The DNA-based marker systems such as dominant markers i.e. amplified fragment length polymorphism (AFLP), inter-simple sequence repeats (ISSR), DAMD (directed amplification of minisatellite DNA) and co-dominant markers i.e. simple sequence repeats (SSR) have become an important tool to evaluate the distribution of heterogeneity and structure of the plant genetic resources. Among the dominant markers, AFLP exhibits higher reproducibility¹⁹, robust informativeness, higher multiplex ratio, wider genome coverage, higher heritability²⁰ and fewer artifacts compared to ISSR and other markers²¹. In a previous report on S. emarginatus, the AFLP markers revealed comparatively higher polymorphism (P%) and polymorphic information content (PIC) values than the ISSR and DAMD markers²². Using ISSR markers the genetic diversity and differentiation among S. emarginatus populations from three biogeographical regions in India has been evaluated²². Assuming the population in Hardy-Weinberg Equilibrium (HWE) and without consideration of spatial and geographical factors, the study resulted into three distinct groups showing affinities towards their biogeographical regions²². Unlike the classical genetic algorithm, the Bayesian algorithm developed for AFLP markers^23,24,25 is advantageous to estimate the genetic diversity and differentiation of populations without considering of HWE and inbreeding coefficient²⁶. Moreover, it is also able to handle the biased estimates of genetic diversity measures due to unequal sample sizes.

The present investigation was performed to address the major questions: How the future climatic regime is going to affect the species population distribution? What is the state of the genomic resilience of different populations of S. emarginatus to respond to the projected climatic regime? What are the bioclimatic variables contributing as limiting factor in population distribution? In order to resolve these questions, ENM was performed with real occurrence data of the populations of S. emarginatus from three biogeographical regions in India (Fig. 1) with multi-modeling approach with two different models viz. BioClim and MaxEnt, comparing the predicted species distribution pattern between baseline (the year 2030) and projected (the year 2100) climatic regimes. The fitness of the populations was evaluated assessing the genetic variability using AFLP markers with Bayesian statistics. The signature of adaptation in genome of the species was evaluated using genome-wide association (GWAS) between marker and bioclimatic variables.

Results

A significant correlation (p < 0.05, r = 0.85) was found between the predictions by both the models (BioClim and MaxEnt) for the species distribution on its range of occurrence in the baseline (the year 2030) and future (the year 2100) climatic regimes. Therefore, the species distribution prediction (Fig. 2) and output obtained from the MaxEnt was applied for all further estimations. The comparison between the species distribution patterns predicted for the baseline and future climatic regimes resulted into 77% proportion of overlapping range. The niches from Deccan peninsula biogeographic region were predicted to be the most suitable for survival of the species. The niches of species occurrence at Semi-arid biogeographical region showed comparatively less suitability for the survival of species for the year 2100 (Fig. 2). During prediction, MaxEnt resulted no change in area under curve (AUC = 0.99 ± 0.0) for both climatic regimes. The prediction of limiting bioclimatic variables (by BioClim) and their responses to the probability of presence (by MaxEnt) resulted in the relative proportion of maximum temperature (T_MAX = 43.10%), and annual precipitation (MAP = 27.10%) with the highest contribution in the model predictions (Fig. 3, Table 1). The species niches were found with sensitive response towards T_MAX (<10 °C), temperature range (T_RANGE; > 10 °C) and increased MAP (<1000 mm).

Table 1 Estimates of relative contribution (in %) of the climatic variables to the MaxEnt model and of AFLP loci in significant association with the climatic variables.

Full size table

The AFLP primers amplified 1957 loci (103 ± 39.37 loci/primer) with 94.79 ± 6.43% polymorphism (1859 polymorphic loci), 0.65 ± 0.02 major allele frequency (MAF) and 0.33 ± 0.01 PIC (Table 2). The 49 loci amplified by the AFLP primers ACG/CTA and ACA/CAG were excluded from the analysis due to low numbers of amplified loci (<50) and further genetic analysis was performed with the 1908 loci amplified by the 17 AFLP markers (Table 2). The Bayesian model-based analysis performed with Hickory resulted into the lowest deviance information criteria (DIC) value (35357.20) supporting the suitability of full-model (inbreeding) among the four models for the data. The average heterozygosity (Hs value) generated for the locations was 0.40 ± 0.0. For the samples as a whole, panmictic heterozygosity (Ht) was 0.43, whereas θ-I and f (F_IS) values were 0.18 and 0.98, respectively. The observed heterozygosity (Ho) ranged from 0.42 ± 0.0 to 0.44 ± 0.0 with an average of 0.43 ± 0.01 (Table 3) while F_ST value was 0.048. Allahabad (AL) population revealed the highest number of common alleles < =50%, and no rare and common (<=25%) allele was found in the populations analyzed.

Table 2 Genetic information revealed by the 19 AFLP primer combinations on 41 genotypes of S. emarginatus.

Full size table

Table 3 Genetic diversity of S. emarginatus based on AFLP markers.

Full size table

The Jaccard’s genetic similarity coefficient among the genotypes ranged from 0.24 to 0.61 with an average of 0.38 ± 0.05. The principal coordinate analysis (PCoA) differentiated the genotypes into three clusters (Fig. 4). Few genotypes from AL, KA, and VS populations (Table 4) were found clustering in distinct clusters but most of them are highly admixed with the genotypes of other locations. Analyzing the results in program STRUCTURE, based on the highest Delta-K value, the admixture model with independent allelic frequencies was found the most appropriate for our dataset through the program STRUCTURE HARVESTER. The most suitable cryptic population number was four (K = 4, Supplementary Fig. S1). The bar plot showed admixture in populations, and KA population distinctly (Supplementary Fig. S2). The populations VS, GJ, AL, and RJ were admixed, whereas GJ population was found in admixing with KA population. Populations like AL and VS were also found admixed distinctly. The F_ST values on these clusters ranged from 0.02 to 0.08. The pair-wise F_ST values among the locations ranged between 0.015 to 0.128 and the Nm exchanged among the locations ranged between 3.39 (AL-KA) to 32.39 (GJ-VS, Supplementary Fig. S2). The Mantel’s test revealed significant correlation (p < 0.01) between genetic distances and geographical distances matrix (Supplementary Fig. S3) and non-significant correlation (p > 0.05) was found between the number of migrants (Nm) exchanged among locations to geographic distance matrix (Supplementary Fig. S4) and altitudinal gradient (r = 0.59). The variation among locations was 6.46 ± 1.34% and within location among genotypes was 91.63 ± 0.31%.

Table 4 Geo-climatic variables at locations of S. emarginatus populations in different biogeographical regions of India.

Full size table

In order to detect the outlier loci with a signature of adaptation, the only locus E-AGC/M-CTG-71 (F_ST = 0.23) was found with the posterior odd (log₁₀PO) > 0.5 (Fig. 5). For the rest of the loci, their corresponding F_ST values were found real positive. Linkage disequilibrium (LD) was confirmed to avoid false-discovery in genome-wide association study (GWAS). Among all loci pairs, only 1.99% were found in significant LD (p < 0.01). Major LD decay was observed within a distance of 100 base pair (bp) and the length of LD block extended up to 1450 bp (Supplementary Fig. S5). The mixed linear model (MLM) based analysis detected a significant (p < 0.001) association of 65 loci (3.59% out of 1810 polymorphic loci) from 15 AFLP markers with bioclimatic variables and altitude (Supplementary Table S1). Among these 65 loci, the highest proportion (32.30%) was found associated with T_RANGE and the lowest proportion (6.15%) was with MAP and T_MAX (Table 1). Among 1651 possible combinations of these 65 loci, only 4.54% were found in significant (p < 0.001) LD.

Discussion

The occurrence of S. emarginatus was found at the ecological niches exhibiting varied altitudinal ranges (CV = 74.40%), and MAP distribution (CV = 22.51%) but with a narrow temperature gradient (CV = 3.90%; Table 4). Sun et al. studied the natural distribution of another species of the same genus (S. mukorossi) which showed comparatively wider range of MAP (CV = 26.84%) and mean annual temperature (CV = 17.84%)²⁷. The comparison of the CV and MAP data revealed that S. emarginatus might have been facing intrinsic incompetency to shift beyond the specified range of temperature gradient due to bionomic characteristics. Since the species distribution thrives on the fragmented populations in varied biogeographical regions, it may be significantly influenced by the extrinsic barriers like local conditions and selection pressures as well. Hence the adaptive fitness of the species led by the population genetic characteristics was determined to ensure the sustainability and genomic resilience of the population through dominant AFLP markers. The dominant markers have already been applied to confirm the adaptive fitness of trees^28,29 and other plant species^30,31,32. In the present study also, a high MAF (0.65 ± 0.02) of the AFLP markers and a high LD decay (1.99%) in loci-combinations supported their suitability for high-resolution GWAS³³ confirming their frequent coverage of the species genome.

Influence of future climatic regimes on the species population distribution

In both; BioClim and MaxEnt models, no change in AUC on projected climatic regimes confirmed a stable distribution pattern of the species population. However, our projection has confirmed the increased suitability of the niches of the species in Deccan peninsula and none of the locations of species occurrence is going to be vulnerable by the future changes, except one (JP) from the Semi-arid biogeographic region (Fig. 1), which has been found to be comparatively less suitable. This might be possible because the northern regions of India have been predicted to be more influenced by the warmer climate than other regions³⁴. It can be inferred also from the results of ENM establishing the T_MAX as one of the limiting factors to the species distribution with the highest (43.10%) relative contribution to the model prediction (Table 1).

The adaptive fitness of the species population

In the present investigation, a higher value (0.40–0.43) for the measures of heterozygosity (Table 3) and higher within-population variability (91.63 ± 0.31%) have been resulted. This could be possible that the species prefers xenogamy over geitonogamy due to asynchrony in flowering³⁵. Moreover, the sweet incense of its flowers attracts the pollinators, like Indian honeybees³⁶ that may help to admix the population through pollination between the trees scattered away from each other³⁵ maintaining within population variability. The Bayesian algorithm-based analysis of genetic structure has found the admixture model with independent allele frequency as the fittest and assumed four cryptic populations for the samples. Nevertheless, a weak or negligible genetic structure (Delta K < 300, Fig. S1) confirmed high admixture of the S. emarginatus populations exchanging genes to limited distance. The same can be inferred from the Mantel’s test resulting no significant effect of geographical distance on a number of migrants (Nm) exchanged among the sampled genotypes (Fig. S4) leading to higher within-population variability.

The high heterozygosity of S. emarginatus populations in light of high admixing within-population strongly supports its adaptive fitness to maintain reproductive success. Only one locus from the AFLP markers has been found with signature of substantial selection, which confirms no possibilities of local adaptation by the populations. Nevertheless, the Bayesian model-based program HICKORY suggested the ‘full-model’ as the fittest for the data with a high value for the inbreeding coefficient (f = 0.98) supporting the sign of inbreeding depression on the species population. High heterozygosity with inbreeding depression in sub-divided populations represents the state of overdominance caused by heterozygote superiority³⁷. The current status of the S. emarginatus populations in severe fragmentation along with the absence of natural regeneration through seeds, act as barrier for inter-population or long-distance out-crossing and seems to be the reason for inbreeding³⁸. Inbreeding depression, causing loss of genetic variability in small or fragmented populations, may result in the reduced fitness of the offspring^39,40. Despite inbreeding, the higher heterozygosity of the populations indicated that the population might not have crossed many generations with the inbreeding depression⁴¹. Therefore, being a long-lived forest tree, the reproductive success of the highly heterozygous population of S. emarginatus through mass multiplication leading to restoration may mask the deleterious impact of inbreeding⁴².

Limiting factors of the species population distribution

A multi-model climate change projection for the Intergovernmental Panel on Climate Change (IPCC, Assessment Report-V) has predicted a 4% increase in warming and 14% increase in precipitation by the year 2080⁴³. This increment will have some influence on the climatic envelope of fundamental ecological niches leading their shifting to suitable conditions. Temperature and rainfall have been referred to as prime contributing factors in most of the projections for future climatic regimes⁴⁴. In our investigation also, we found that T_MAX and MAP with maximum contribution to the model prediction. The ENM establishes T_MAX and MAP as extrinsic limiting factors of the species population distribution. Since the population distribution of the species has been observed within a narrow range of temperature gradient, it may have an influence of the predicted warm climate⁴⁵. The GWAS approach found the most of the genomic proportion covered by the AFLP markers significantly associated with T_RANGE and T_MIN. It revealed the genomic resilience of the species to survive within varied T_RANGE and T_MIN. It is obvious, as the species is known to be frost hardy with occurrence in tropical deciduous forests in India⁴⁶. The least proportion of the AFLP markers was found associated with T_MAX and MAP establishing them as limiting factors for the genomic resilience of the species. In MLM, 3.45% of the species genome was found in significant association with the bioclimatic variables. The least proportion (6.15%) of the loci linked with T_MAX and MAP has confirmed the severe response of the species genome toward these variables. Nevertheless, these associations are highly dependent on high LD to transfer to future generations. We observed only 4.54% LD among the loci-combinations found in significant association with the bioclimatic variables with high LD decay (within 100 bp). Since S. emarginatus maintains high heterozygosity preferring xenogamy, the current status of susceptibility of the species genome responding to T_MAX and MAP is subject to change during recombination. In addition, the mating system preferred by S. emarginatus may help to avoid the deleterious effect of inbreeding on the species population. Therefore, it can be assumed that the S. emarginatus populations may have substantial genomic resilience toward the bioclimatic variables.

Conclusions

To the best of our knowledge, the present study seems to be a maiden attempt to study the adaptive fitness of S. emarginatus populations using molecular marker technique with SDM/ENM approach. The ENM-based predictions have supported a stable distribution pattern of the species population towards the projected future climatic regime and confirmed maximum temperature and annual precipitation as the major limiting factors. However, the AFLP markers have detected the signature of adaptation on the genome of S. emarginatus, which supports that the species is resilient enough to survive on the threat of rising temperature and precipitation. The Deccan peninsular biogeographic region was found more suitable for the species, therefore the niches in the region can be preferred to conserve as strict nature reserves to maintain the existing genetic variability of the species populations. In light of the threat and vulnerability of ecological niches of Semi-arid region, sincere efforts can be made for screening of resilient genotypes of the species. Further, initiatives for the conservation and mass multiplication of the species are recommended to avoid the future deleterious impact of inbreeding depression on the fragmented populations. There is a scarcity of the genomic information related to S. emarginatus. Therefore, the 15 markers found significantly associated with the bioclimatic variables and altitudinal gradients, which can be further employed to find out their association with the functional adaptive traits.

Methods

Population occurrence data and sampling

The information on occurrence of the species was obtained from the global plant database like GBIF (https://www.gbif.org/occurrence/search?taxon_key=8086757), European herbaria; KEW (https://www.kew.org/search?textsearch=sapindus) and the herbarium records available at CSIR- National Botanical Research Institute, Lucknow (LWG), Central National Herbarium, Kolkata (CAL), Botanical Survey of India (Central Circle), Allahabad (BSA), Botanical Survey of India, Hyderabad (BSID) and Forest Research Institute, Dehradun (DD). The 157 GPS coordinates obtained were subjected to spatial filtration to avoid the sampling biases and equal richness based grid sampling applied through the program DIVA-GIS v7.5⁴⁷. Remaining GPS coordinates were selected for ground-truthing to obtain the real occurrence data of S. emarginatus populations. The locations were visited and finally 88 coordinates were recorded at the locations with verified abundance of the species in eight Indian states representing three biogeographical regions viz. Gangetic plain, Semi-arid and Deccan peninsula (Table 4, Fig. 1) for ENM. For DNA profiling leaf samples were collected from 41 genotypes of five localities (Table 4, Fig. 1) representing the above biogeographical regions.

Bioclimatic data

The data of 35 bioclimatic variables⁴⁸ for baseline (year 2000–2030) and projected (the year 2070–2100) regimes was obtained from the website http://www.climond.org (Climond model CSIRO- MK 3.0) and extracted for the GPS coordinates of the locations using DIVA-GIS v7.5⁴⁷ (available at http://www.diva-gis.org/) in the resolution of 2.5 m.

Prediction of the distribution pattern

The GPS coordinates were employed for ENM to predict the potential distribution of populations determining suitable niches for the species. For confirmation of the result both, BioClim and MaxEnt based models were applied as the assembling and comparing the results from different models can give robust projection^49,50. The BioClim algorithm was applied using the envelope method as implemented in program DIVA-GIS v7.5⁴⁷ for baseline and predicted bioclimatic variables. MaxEnt model was applied through program MaxEnt v3.4.1^50,51,52 to evaluate the response of niches to the bioclimatic variables for baseline and predicted climatic regimes. For training with threshold-depended sensitivity model (threshold = 0.5), default parameters in program setting were preferred with 50 replications and 500 iterations doing jackknife and to measure variable importance. The results obtained for two different models for two climatic regimes were correlated and the proportion of overlapping niches was determined through program ENMtools v1.8⁵³ applying relative rank statistics⁵⁴. Based on the significant correlation among the bioclimatic variables obtained through program ENMtools v1.8, finally climatic variables viz. minimum temperature (T_MIN), maximum temperature (T_MAX), temperature range (T_RANGE), isothermality (I), and mean annual precipitation (MAP) were tested to find out the limiting factor for the species distribution.

DNA isolation and AFLP amplification

Genomic DNA was isolated from the collected leaf samples following a modified CTAB method²². The conventional AFLP protocol¹⁹ was used with minor modifications⁵⁵ to develop an amplification profile for the 41 genotypes on 19 AFLP primer combinations (Table 2).

Assessment of genetic diversity

DNA profiling of 41 genotypes with 19 combinations of AFLP primers was developed based on scoring the presence (1) and absence (0) of the bands with help of program Genemapper^® v4.0 (Applied Biosystems, Foster City, CA, USA). The program eliminates the band profiles with short sizer peaks (<50) to avoid the error on scoring. Further, the AFLP markers with <50 amplified loci were excluded for further analysis. The profile was analyzed applying both band-based and allele frequency-based approach²⁶. The mean allele frequency (MAF), polymorphism percentage (P %) and polymorphic information content (PIC) of the markers were calculated by program POWERMARKER v3.25⁵⁶. The genetic diversity of the sampled populations was estimated using Bayesian models. We preferred the θ-statistics (viz. θ-I, θ-II, θ-III and G_ST-B) implemented in program HICKORY v1.1²⁴ that allows direct estimates of genetic differentiation measure (F_ST) from dominant markers without prior assumption of inbreeding and Hardy -Weinberg Equilibrium (HWE) within population even with small sample size. Location-wise panmictic heterozygosity (Hs) and total panmictic heterozygosity (Ht) were also estimated. All these estimations were performed with 50,000 steps of burn-in, 500,000 replicates, ‘thinning’ = 20 following the best-suited model for the data amongst all the four models (‘full’, ‘f = 0’, ‘θ = 0’ and ‘f-free’ models) implemented in the program based on their corresponding DIC (deviance information criterion) value²⁵. The inbreeding coefficient (f = F_IS) resulted from the best-fitted model in HICKORY was incorporated in another Bayesian program AFLP-Surv v1.0⁵⁷ with the non-uniform prior distribution of allele frequencies at 10000 permutations to estimate the observed heterozygosity (Ho). Rare (private) and common alleles (<25%) among the populations were calculated through program GenAlEx v6.5⁵⁸. Jaccard’s genetic similarity coefficient among the genotypes was calculated by program DARWIN v5.0⁵⁹. A principal coordinate analysis (PCoA) was performed to cluster the genotypes in different axes applying program GenAlEx v6.5. To determine the genetic structure of the sampled population with an appropriate number of cryptic population (K) assigned by the genotypes, Bayesian statistics-based program STRUCTURE v2.3.1⁶⁰ was used. The project was run applying four prior-assumption with combinations of admixture/no-admixture models with correlated/independent allele frequencies on 100000 burn-in and 1000000 MCMC repeats with three-run of K from 1 to 9 for 2 ≤ K ≤ 8. The most suitable model and the best-suited ‘K’ for the data set were determined based on the highest Delta-K value resulted from an online program STRUCTURE HARVESTER⁶¹. The ancestry coefficient (Q) values of the genotypes were further used for mixed linear modeling (MLM) based association mapping. Pair-wise F_ST among populations were also calculated by program AFLP-Surv. The gene flow among populations was determined by calculating the number of migrants (Nm) among the populations⁶². To determine the effect of geographical distance and altitude on gene flow among locations, the Mantel’s tests were performed among pair-wise Nm, pair-wise geographical distance and pair-wise Euclidean distance based on altitude through program GenAlEx. Analysis of molecular variance (AMOVA) was also calculated.

Detection of signature of local adaptation

To detect the signature of local adaptation on AFLP loci, program BAYESCAN v2.1⁶³ was applied that identifies outlier loci with a Bayesian test based on a logistic regression model that decomposes F_ST values into a locus-specific component (α, selection effect) and a population-specific component (β, demographic effect). Twenty pilot runs of 5,000 iterations were used to estimate the distribution of α-parameters, followed by 50,000 iterations for sampling. Outlier loci with substantial signature of adaptation were identified using a 5% false discovery rate with posterior odds of >10 (log10 (PO) > 0.5). Linkage disequilibrium among loci-combinations was performed by program TASSEL v3.0⁶⁴ to confirm the influence of demographic variables on the population genetic structure inferring LD decay. A genome-wide association study (GWAS) was performed between climatic variables (i.e. T_MIN, T_MAX, T_RANGE, I, and MAP), altitude and AFLP marker profile by program TASSEL v3.0 applying MLM to control the false-discovery due to the structured population^65,66. The MLM considers the markers applied to the study as a fixed–effect factor and the population structure information of the sampled genotypes are considered as random effect factors. The required kinship (K) information was obtained from program TASSEL and the ancestry coefficient (Q) value was obtained from the program STRUCTURE output to implement the MLM + K + Q method. The loci significantly (p ≤ 0.01) associated with one or more climatic variables were taken into consideration. To find out the limiting factor, the proportions of AFLP markers linked with these climatic variables and altitude were calculated based on their number of loci in significant association with the climatic variables.

Data availability

The datasets generated and analyzed during the present study will be available from the corresponding author on reasonable request.

References

Champion, H. G. & Seth, S. K. A Revised Survey of Forest Types of India (Government of India Press New Delhi, 1968).
Anonymous. Global saponin market by manufacturers, regions, type and application, forecast to 2023, https://www.planetmarketreports.com/reports/global-saponin-market-5797. (2019).
Anonymous. Saponin export data of India, https://www.exportgenius.in/export-data/india/saponin.php. (2019).
Anonymous. State of Forest Genetic Resources in India: A Country Report, http://www.fao.org/3/i3825e/i3825e32.pdf (Institute of Forest Genetics and Tree Breeding Coimbatotre, Indian Council of Forestry Research and Education, 2012).
Murali, K. S., Shankar, U., Shaanker, R. U., Ganeshaiah, K. N. & Bawa, K. S. Extraction of non‐timber forest products of Biligiri Rangan Hills, India. 2. Impact of NFTP extraction on regeneration, population structure, and species composition. Econ. Bot. 50, 252–269 (1996).
Article Google Scholar
Murthy, K., Bhat, P. R., Ravindranath, N. H. & Sukumar, R. Financial valuation of non-timber forest product flows in Uttara Kannada district, Western Ghats, Karnataka. Curr. Sci. 88, 1573–1579 (2005).
Google Scholar
Pereira, I. M. & Groppo, M. Ecological Niche Modeling: Using Satellite Imagery and New Field Data to Support Ecological Theory and its Applicability in the Brazilian Cerrado. J. Ecosyst. Ecography. 2, https://doi.org/10.4172/2157-7625.1000111 (2012).
Nagaraju, S. K. et al. Do ecological niche model predictions reflect the adaptive landscape of species?: a test using Myristica malabarica Lam., an endemic tree in the Western Ghats, India. PLoS One 8, https://doi.org/10.1371/journal.pone.0082066 (2013).
Article ADS PubMed PubMed Central CAS Google Scholar
Guisan, A. et al. Predicting species distributions for conservation decisions. Ecol. Lett. 16, 1424–35 (2013).
Article PubMed PubMed Central Google Scholar
Hughes, L., Cawsey, E. M. & Westoby, M. Climatic Range Sizes of Eucalyptus Species in Relation to Future Climate Change. Glob. Ecol. Biogeogr. 5(1), 23–29 (1996).
Article Google Scholar
Booth, T. H. Assessing species climatic requirements beyond the realized niche: some lessons mainly from tree species distribution modelling. Clim. Change. 145, 259–271 (2017).
Article ADS Google Scholar
Kremer, A., Potts, B. M., Delzon, S. & Bailey, J. Genetic divergence in forest trees: understanding the consequences of climate change. Funct. Ecol. 28, 22–36 (2014).
Article Google Scholar
Fischer, M., Hock, M. & Paschke, M. Low genetic variation reduces cross-compatibility and offspring fitness in populations of a narrow endemic plant with a self-incompatibility system. Conserv. Genet. 4, 325–336 (2003).
Article CAS Google Scholar
Szulkin, M., Bierne, N. & David, P. Heterozygosity-fitness correlations: a time for reappraisal. Evolution 64, 1202–1217 (2010).
PubMed Google Scholar
Reed, D. H. & Frankham, R. Correlation between fitness and genetic diversity. Conserv. Biol. 17, 230–237 (2003).
Article Google Scholar
Spielman, D., Brook, B. W. & Frankham, R. Most species are not driven to extinction before genetic factors impact them. Proc. Natl. Acad. Sci. USA 101(42), 15261–15264 (2004).
Article ADS CAS PubMed Google Scholar
West, S. A. & Gardner, A. Adaptation and Inclusive Fitness. Curr. Biol. 23, 577–584 (2013).
Article CAS Google Scholar
Thode, V. A. et al. Genetic diversity and ecological niche modelling of the restricted Recordia reitzii (Verbenaceae) from southern Brazilian Atlantic forest. Bot. J. Linn. Soc. 176, 332–348 (2014).
Article Google Scholar
Vos, P. et al. AFLP- a new technique for DNA fingerprinting. Nucleic. Acids. Res. 23, 4407–4414 (1995).
Article CAS PubMed PubMed Central Google Scholar
Lerceteau, E. & Szmidt, A. E. Properties of AFLP markers in inheritance and genetic diversity studies of Pinus sylvestris L. Heredity 82, 252–260 (1999).
Article CAS PubMed Google Scholar
Meudt, H. M. & Clarke, A. C. Almost forgotten or latest practice? AFLP applications, analyses, and advances. Trends. Plant. Sci. 12(3), 106–117 (2007).
Article CAS PubMed Google Scholar
Mahar, K. S., Rana, T. S., Ranade, S. A. & Meena, B. Genetic variability and population structure in Sapindus emarginatus Vahl from India. Gene 485, 32–39 (2011).
Article CAS PubMed Google Scholar
Holsinger, K. E., Lewis, P. O. & Dey, D. K. A Bayesian approach to inferring population structure from dominant markers. Mol. Ecol. 11, 1157–1164 (2002).
Article CAS PubMed Google Scholar
Holsinger, K. E. & Lewis, P. O. Hickory: A package for analysis of population genetic data, version 1.1. Computer program and documentation, Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut, USA (2007)
Spiegelhalter, D. J., Best, N. G., Carlin, B. P. & Vanderlinde, A. Bayesian measures of model complexity and fit. J. R. Stat. Soc. Series B Stat. Methodol. 64, 583–689 (2002).
Article MathSciNet MATH Google Scholar
Bonin, A., Ehrich, D. & Manel, S. Statistical analysis of amplified fragment length polymorphism data: a toolbox for molecular ecologists and economists. Mol. Ecol. 16, 3737–3758 (2007).
Article CAS PubMed Google Scholar
Sun, C. et al. Genetic structure and biogeographic divergence among Sapindus species: An inter-simple sequence repeat-based study of germplasms in China. Ind. Crops. Prod. 118, 1–10 (2018).
Article CAS Google Scholar
Wang, T., Wang, Z., Xia, F. & Su, Y. Local adaptation to temperature and precipitation in naturally fragmented populations of Cephalotaxus oliveri, an endangered conifer endemic to China. Sci. Rep. 6, 25–31 (2016).
Article ADS CAS Google Scholar
Vaishnav, V. & Ansari, S. A. Genetic differentiation and adaptation in teak (Tectona grandis L.f.) metapopulations. Plant Mol. Biol. Report. https://doi.org/10.1007/s11105-018-1101-3 (2018).
Article CAS Google Scholar
Wang, T., Chen, G., Zan, Q., Wang, C. & Su, Y. J. AFLP genome scan to detect genetic structure and candidate loci under selection for local adaptation of the invasive weed Mikania micrantha. PLoS One. 7, e41310 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Krauss, S. L., Sinclair, E. A., Bussell, J. D. & Hobbs, R. J. An ecological genetic delineation of local seed-source provenance for ecological restoration. Ecol. Evol. 321, 38–49 (2013).
Google Scholar
Leempoel, K., Parisod, C., Geiser, C. & Joost, S. Multiscale landscape genomic models to detect signatures of selection in the alpine plant Biscutella laevigata. Ecol. Evol. 8, 1794–1806 (2018).
Article PubMed PubMed Central Google Scholar
Kim, S. Y. et al. Estimation of allele frequency and association mapping using next-generation sequencing data. BMC Bioinformatics 12, 231 (2011).
Article PubMed PubMed Central Google Scholar
Revadekar, J. V., Kothawale, D. R., Patwardhan, S. K., Pant, G. B. & Rupa, K. K. About the observed and future changes in temperature extremes over India. Natural Hazards 60, 1133–1155 (2012).
Article Google Scholar
Reddi, C. S., Reddi, E. U. B., Reddi, N. S. & Reddi, P. S. Reproductive ecology of Sapindus emarginatus Vahl (Sapindaceae). Proc. Ind. Nat. Sci. Acad. 49B(1), 57–72 (1983).
MATH Google Scholar
Rao, A. S. Preliminary studies on the seasonal occurrence of insect pests a soap nut (Sapindus sp). Ind. Forester. 118(6), 432–437 (1992).
ADS Google Scholar
Charlesworth, D. & Charlesworth, B. Inbreeding depression and its evolutionary consequences. Annu. Rev. Ecol. Syst. 18, 237–268 (1987).
Article Google Scholar
Severns, P. M., Liston, A. & Wilson, M. V. Habitat fragmentation, genetic diversity, and inbreeding depression in a threatened grassland legume: is genetic rescue necessary? Conserv. Genet. 12, 881–893 (2011).
Article Google Scholar
Saccheri, I. et al. Inbreeding and extinction in a butterfly metapopulation. Nature 392, 491–494 (1998).
Article ADS CAS Google Scholar
Walisch, T. J., Matthies, D., Hermant, S. & Colling, G. Genetic structure of Saxifraga rosacea subsp. sponhemica, a rare endemic rock plant of Central Europe. Plant. Syst. Evol. 301, 251–263 (2015).
Article Google Scholar
Charlesworth, B. & Charlesworth, D. The genetic basis of inbreeding depression. Genet. Res. 74, 329–340 (1999).
Article CAS PubMed MATH Google Scholar
Taylor, H. R. et al. Cryptic inbreeding depression in a growing population of a long-lived species. Mol. Ecol. 26, 799–813 (2017).
Article PubMed Google Scholar
Chaturvedi, R. K., Joshi, J., Jayaraman, M., Bala, G. & Ravindranath, N. H. Multi-model climate change projections for India under representative concentration pathways. Curr. Sci. 103(7), 791–802 (2012).
Google Scholar
Song, X. & Zeng, X. Evaluating the responses of forest ecosystems to climate change and CO₂ using dynamic global vegetation models. Ecol. Evol. 7, 997–1008 (2017).
Article PubMed PubMed Central Google Scholar
Gao, Y., Gao, X. & Zhang, X. The 2 °C Global Temperature Target and the Evolution of the Long-Term Goal of Addressing Climate Change-From the United Nations Framework Convention on Climate Change to the Paris Agreement. Engineering 3, 272–278 (2017).
Article Google Scholar
Troup, R. S. The Silviculture of Indian Trees: Volume I. 239–240 (Oxford University Press 1921).
Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G. & Jarvis, A. Very high resolution interpolated climate surfaces for global land areas. Int. J. Climatol. 25, 1965–1978 (2005).
Article Google Scholar
Kriticos, D. J. et al. CliMond: global high-resolution historical and future scenario climate surfaces for bioclimatic modelling. Methods Ecol. Evol. 3(1), 53–64 (2012).
Article Google Scholar
Shabani, F., Kumar, L. & Ahmadi, M. A comparison of absolute performance of different correlative and mechanistic species distribution models in an independent area. Ecol. Evol. 6(16), 5973–5986 (2016).
Article PubMed PubMed Central Google Scholar
Phillips, S. J., Anderson, R. P. & Schapire, R. E. Maximum entropy modeling of species geographic distributions. Ecol. Modell. 190(3–4), 231–259 (2006).
Article Google Scholar
Elith, J. et al. A statistical explanation of MaxEnt for ecologists. Divers. Distrib. 17, 43–57 (2011).
Article Google Scholar
Phillips, S. J., Dudik, M. & Schapire, R. E. [Internet] Maxent software for modeling species niche and distribution (Version 3.4.1). Available from url: http://biodiversityinformatics.amnh.org/open_source/maxent/.
Warren, D. L., Glor, R. E. & Turelli, M. Environmental niche equivalency versus conservatism: quantitative approaches to niche evolution. Evol. 62, 2868–2883 (2008).
Article Google Scholar
Warren, D. L. & Seifert, N. Ecological niche modeling in MaxEnt: the importance of model complexity and the performance of model selection criteria. Ecol. Application. 21(2), 335–342 (2011).
Article Google Scholar
Tripathi, P. K., Jena, S. N., Rana, T. S. & Sathyanarayana, N. High levels of gene flow constraints population structure in Mucuna pruriens L. (DC.) of northeast India. Plant. Gene. 15, 6–14 (2018).
Article Google Scholar
Liu, K. & Muse, S. V. Powermarker: Integrated analysis environment for genetic marker data. Bioinformatics. 21(9), 2128–2129 (2005).
Article CAS PubMed Google Scholar
Zhivotovsky, L. A. Estimating population structure in diploids with multilocus dominant DNA markers. Mol. Ecol. 8, 907–913 (1999).
Article CAS PubMed Google Scholar
Peakall, R. & Smouse, P. E. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research-an update. Bioinformatics. 28, 2537–2539 (2012).
Article CAS PubMed PubMed Central Google Scholar
Perrier, X. & Jacquemoud-Collet, J. P. DARwin software: Dissimilarity Analysis and Representation for Windows http://darwin.cirad.fr/darwin (2006).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genet. 155, 945–959 (2000).
CAS Google Scholar
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 14, 2611–2621 (2005).
Article CAS Google Scholar
McDermott, J. M. & McDonald, B. A. Gene flow in plant pathosystems. Annu. Rev. Phytopathol. 31, 353–373 (1993).
Article Google Scholar
Foll, M. & Gaggiotti, O. E. A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: A Bayesian perspective. Genet. 180(2), 977–993 (2008).
Article Google Scholar
Bradbury, P. J. et al. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics. 23(19), 2633–2635 (2007).
Article CAS PubMed Google Scholar
Pritchard, J. K., Stephens, M., Rosenberg, N. A. & Donnelly, P. Association mapping in structured populations. Am. J. Hum. Genet. 67, 170–181 (2000).
Article CAS PubMed PubMed Central Google Scholar
Yu, J. et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38, 203–208 (2006).
Article CAS Google Scholar

Download references

Acknowledgements

We acknowledge the Department of Biotechnology, New Delhi (GAP-3398) for partial financial support. The Curators of the different herbaria (LWG, CAL, BSA, BSID and DD) and authorities of Forest Departments of Andhra Pradesh, Gujarat, Karnataka, Madhya Pradesh, Rajasthan, Telangana and Uttar Pradesh are gratefully acknowledged for providing access to collect information and plant samples for the study. The fellowship grant to VV from CSIR, New Delhi is also acknowledged.

Author information

Authors and Affiliations

Plant Diversity, Systematics and Herbarium Division, CSIR-National Botanical Research Institute, Rana Pratap Marg, Lucknow, 226001, India
Ashish Kumar Pal, Vivek Vaishnav, Baleshwar Meena & Tikam Singh Rana
Plant Nutrition and Stress Physiology Laboratory, Department of Botany, University of Lucknow, Lucknow, 226007, India
Ashish Kumar Pal & Nalini Pandey

Authors

Ashish Kumar Pal
View author publications
You can also search for this author in PubMed Google Scholar
Vivek Vaishnav
View author publications
You can also search for this author in PubMed Google Scholar
Baleshwar Meena
View author publications
You can also search for this author in PubMed Google Scholar
Nalini Pandey
View author publications
You can also search for this author in PubMed Google Scholar
Tikam Singh Rana
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.K.P., N.P. and T.S.R. conceived the research hypothesis and experiment design; A.K.P., B.M., sampled the species from different localities and recorded the data. A.K.P. performed the laboratory works; V.V. & A.K.P. analyzed the data and drafted the manuscript; all authors contributed equally in the final version after critical reviews.

Corresponding author

Correspondence to Tikam Singh Rana.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Appendix S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kumar Pal, A., Vaishnav, V., Meena, B. et al. Adaptive fitness of Sapindus emarginatus Vahl populations towards future climatic regimes and the limiting factors of its distribution. Sci Rep 10, 3803 (2020). https://doi.org/10.1038/s41598-020-60219-8

Download citation

Received: 20 May 2019
Accepted: 30 January 2020
Published: 02 March 2020
DOI: https://doi.org/10.1038/s41598-020-60219-8

This article is cited by

Predicting the potential global distribution of Sapindus mukorossi under climate change based on MaxEnt modelling
- Yongxiang Li
- Wenhao Shao
- Jingmin Jiang
Environmental Science and Pollution Research (2022)
Modeling habitat suitability of bats to identify high priority areas for field monitoring and conservation
- Anooshe Kafash
- Sohrab Ashrafi
- Masoud Yousefi
Environmental Science and Pollution Research (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Seed germination schedule and environmental context shaped the population genetic structure of subtropical evergreen oaks on the Yun-Gui Plateau, Southwest China

Assessing signals of selection and historical demography to develop conservation strategies in the Chilean emblematic Araucaria araucana

Genetic diversity and population structure in Nothofagus pumilio, a foundation species of Patagonian forests: defining priority conservation areas and management

Introduction

Results

Discussion

Influence of future climatic regimes on the species population distribution

The adaptive fitness of the species population

Limiting factors of the species population distribution

Conclusions

Methods

Population occurrence data and sampling

Bioclimatic data

Prediction of the distribution pattern

DNA isolation and AFLP amplification

Assessment of genetic diversity

Detection of signature of local adaptation

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Appendix S1.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Predicting the potential global distribution of Sapindus mukorossi under climate change based on MaxEnt modelling

Modeling habitat suitability of bats to identify high priority areas for field monitoring and conservation

Comments

Search

Quick links