Introduction

The previous studies of ecological-genetic interface using allozyme (protein-coding region) (Nevo et al, 1982, 1988; Nevo and Beiles, 1989) and randomly amplified polymorphic DNAs (Li et al, 1999) suggested that ecological stresses might direct genetic diversity at macro- and microgeographic scales in Triticum dicoccoides (reviewed in Nevo, 1988; 1998; Nevo and Beiles, 1989). However, in contrast to the nuclear coding allozymes, microsatellites (SSRs) are primarily non-coding, regarded by many as neutral markers. Therefore, their ecological and genetic macro- and microgeographic correlates need critical testing to establish their nature and relevance. Here we show effects of microclimatic stresses on adaptive dinucleotide SSR divergence in natural subpopulations of wild emmer wheat at a microscale.

Materials and methods

Plant materials

Triticum dicoccoides is the tetraploid and predominantly self-pollinated progenitor of cultivated wheats (Zohary, 1970). Its genome constitution is AABB, with 28 chromosomes. Annual wild emmer grows over a wide range of altitudes and ecological amplitudes in several steppe-like herbaceous formations in the Tabor oak, Quercus ithaburensis, or in the Q. brantii open park forest belts (Zohary, 1973) across the Fertile Crescent (Harlan and Zohary, 1966). The Yehudiyya site is located in an open oak park forest of Q. ithaburensis at the lower western foothills of the Golan Heights, northeast of the Sea of Galilee, Israel. Sampling T. dicoccoides was conducted in 1985 in an area smaller than 1000 m2 and involved 12 repeated sampling plots (trees and their immediate circumference). Trees were mosaically distributed and separated on average 10 m from each other, and each experimental tree involved two microclimatic niches separated by only a few meters: (1) shade, mostly shaded during daytime under the canopies of oak trees (trees 10–20 m in height, with canopy diameters up to 20 m); and (2) sun, exposed in daytime to continuous sun radiation and drying in the circumference around each tree and between trees (Nevo et al, 1988). The soil temperature in the sun niche was almost 10°C higher than in the shade niche. The relative humidity in soil was similar between the shade and sun niches. During the growing season (Oct–May) of T. dicoccoides the shade niche is under stresses of lower temperature and lower intensity of radiation in contrast to the sun niche. The two microniches vary significantly in plant formations (Nevo et al, 1988). Wild emmer is sparse in the shade under the oak canopy and abundant in the sun. In each pair of niches for each tree, several plants (4–5) were sampled from the shade under the tree canopy, and nearby from the sun niche immediately surrounding the tree circumference. Thus, the 12 trees sampled represent 12 repeated plots, each involving sun-shade paired comparison, separated by 2 to 4 meters and extracted from bush and extensive stands of wild emmer involving thousands of individual plants. In total, we examined SSR diversity of 105 individual plants (50 from the shade and 55 from the sun microniches, 4–5 individuals on the average in the shade and 4–5 plants from the sun around each tree).

Analysis of SSR diversity

Genomic DNA was extracted from seedlings using the method of Junghans and Metzlaff (1990). Plaschke et al (1995) and Röder et al (1995, 1998) described the SSR primers used in this study. Twenty-eight dinucleotide SSR DNA markers (one for each chromosomal arm, see the accompanying paper Li et al, 2002) were chosen for the analysis. The procedure used to detect SSR polymorphism followed Plaschke et al (1995) and Fahima et al (1998). Fragment sizes were calculated using a Fragment Manager (Pharmacia) computer program by comparing with internal size standards, which were added to each lane in the loading buffer. The repeat number of alleles was calculated according to the fragment sizes and number of repeat units at the corresponding locus in Chinese Spring (see the accompanying paper Li et al, 2002).

Data analysis

An estimate of gene diversity was calculated for each locus and sub-population according to Nei (1973). The χ2-test was used to test significance of differences in allele frequency and linkage disequilibrium. Niche-specific alleles are defined as those that significantly predominate in one of the two niches (either shade or sun). Niche-unique alleles are defined as those that are present in only one (either shade or sun) of the two niches. The sign test for each pair of shade and sun niches over all 12 trees was used to analyze the differences in repeat number between the two sub-populations; the Hartley F-ratio tested the significance of differences between the two sub-populations for the variance in repeat number at each locus. The POPGENE (Yeh et al, 1997) and STATISTICA (Statsoft, 1996) programs were used to perform the statistical analyses. A permutation test was also conducted (by 5000 times of random shuffling of the samples) to estimate the significance of niche-specificity of alleles and linkage disequilibria. Deviation or skewness from symmetry (skewness = 0) was tested using t-test.

Genetic distance between the shade and sun sub- populations was estimated according to Nei (1972). The original sample set was randomly shuffled 1000 times, maintaining sample sizes for each niche. The genetic distances were also estimated according to Nei (1972) for the 1000 randomly shuffled sample sets.

Results

Twenty-seven of the tested 28 SSR DNA markers (96.4%) produced fragments; locus GWM637 did not amplify any product in all individuals. The GWM332 produced two fragments in all individuals, which were considered two loci named GWM332a and GWM332b. The two loci were completely linked and mapped at the same position in the long arm of chromosome 7A in the mapping population derived from ‘T. dicoccoides × T. durum’ (Peng et al, 2000). Each of the remaining 26 primer pairs produced one fragment in each individual regarded as an allele of one locus. Out of the total 28 loci, two loci (GWM601 and GWM415) were monomorphic in both microclimatic niches, and the remaining 26 loci (92.9%) were polymorphic.

The distribution of alleles at SSR loci

Alleles were defined by number of repeats. Table 1 presents the summary of statistical analyses for allele distribution at 28 SSR loci. One to 13 alleles were amplified per locus, but the number of alleles (NA) per locus was different at the 28 SSR loci between the two niches. The total NA was 172 and 179 in the shade and sun sub- populations, respectively. The shared NA was 159 in the two sub-populations, and allele distributions were different between the two sub-populations. The χ2-test was used to test for deviation of each allele distribution between the two sub-populations from the null hypothesis Ho: Psun(i) = Pshade(i). After excluding rare alleles (defined as ‘observed in ≤7 individuals’ in the entire sample of 105 plants), 20 and six niche-specific alleles were found at P < 0.05 and P < 0.01 levels, respectively; more shade-specific (n = 18) than the sun-specific alleles (n = 8) were observed (Table 2). Among these niche- specific alleles, three shade-unique alleles were found: Two shade-unique alleles at loci GWM218 and GWM459 were found under trees no. 1 and 2. Another shade-unique allele at locus GWM368 was found under trees no. 6 and 7. Two sun-unique alleles were also found at loci GWM136 under trees no. 1, 2 and 3, and GWM251 under trees no. 11 and 12. Genotypes within each pair of niches (microniche and tree) were randomly shuffled maintaining the initial sample sizes of each niche. A permutation test based on 5000 randomly shuffled sets revealed that the significance levels of shade and sun specificity were high (P < 0.005, 0.003, respectively, Table 2). This result suggests with high confidence the non-randomness of the revealed sub-population differentiation. Moreover, niche-unique alleles under neighbouring trees reflect migration among close trees, but, without strong microclimatic selection, such niche-uniqueness of some alleles cannot be maintained in a mosaic manner under the tree canopy and separated by sun niche between trees.

Table 1 Statistics of allele distributions at 27 microsatellite loci in the shade and sun sub-populations of Triticum dicoccoidies at Yehudiyya
Table 2 Summary of permutation test for niche-specificity of alleles (excluding rare alleles observed in ≤7 individuals) at microsatellite loci of Triticum dicoccoides in the sunny and shady microclimatic niches in Yehudiyya, Israel

Gene diversity, He (Nei, 1973), based on allele frequencies, also showed obvious differences between the two climatic divergent sub-populations at some loci. The differences in He were as high as 0.10–0.19 at some loci, such as GWM99, GWM219, GWM251, GWM332a, GWM361, GWM408, and GWM429 (Table 1). These results indicated that allele distributions appeared to be nonrandom in the two subpopulations separated by only 2–4 meters but associated with sun vs shade climatic microniches.

To test how much the distributions of repeat number (allele size) deviate from symmetry in the two microclimatic niches, skewness was calculated for each locus (Table 1). Allele size distributions significantly deviated from symmetry (skewness = zero) at 12 loci (P < 0.05–0.00001). In the shade and sun sub-populations, allele distributions were significantly skewed to the right (skewness > 0) at seven and nine loci, and to the left (skewness < 0) at one and two loci, respectively (P < 0.05–0.00001, Table 1). Such patterns indicated that frequencies of alleles with relatively short repeats were significantly higher than those with long repeats. Such distribution appears not to meet the null hypothesis of pure random drift. Out of 100 cases, the skewness may be significant by chance in 5% of cases. According to the binomial distribution, out of our 54 (2 × 27, two-tailed test) cases, the probability of obtaining significance in 12 cases by chance was very low (0.00006). Opposite direction of skewness was also observed at three to four loci between the shade and sun sub-populations (Table 1).

Sign test over each pair of shade and sun niches around all 12 trees showed that average repeat numbers (ARN) were significantly larger at nine loci in the shade (P < 0.05–0.01, Table 1). Significantly, larger variances in repeat number were found in the sun than in the shade sub-population at six loci (GWM18, GWM99, GWM294, GWM124, GWM332a, and GWM408), and an opposite pattern was observed at four other loci (GWM162, GWM219, GWM361, GWM537). These results also indicated significant differences in allele (repeat number) distributions between the two subpopulations.

Genetic distance

The estimate of genetic distance (Nei, 1972) was 0.1057 between the shade and sun subpopulations. To test the significance of the genetic divergence represented by this estimate, the original sample set was randomly shuffled 1000 times maintaining sample size for each tree and niche. Genetic distance between the two subpopulations was calculated for each randomized set. Among the 1000 sets, no case showed larger genetic distance than the observed value, and most of the cases showed about half of the observed genetic distance. Thus, the significance of the observed genetic distance (0.1057) was at least P < 0.001. This permutation test indicates that genetic divergence at SSR loci is substantial between the neighbouring shade and sun subpopulations, only a few meters apart.

Discriminant analysis

Forward stepwise discriminant analysis was performed to distinguish individuals from the two microclimatic niches according to multilocus analysis including the 25 polymorphic loci (excluding GWM136 because of missing data). The result indicated that five (GWM18, GWM251, GWM332a, GWM368, GWM459) out of 25 SSR loci were sufficient to correctly classify 82% of 105 individuals into their original microclimatic sun and shade niches. The difference between the distribution center of the two subpopulations was highly significant (F(5,99) = 15.30, P < 0.00005). Even with only two (GWM169, GWM332a) of the 25 loci, 70% of 105 individuals could be correctly assigned to their original microniches. Here too, the difference between the centroids of the two subpopulations was also highly significant (F(2,102) = 14.28, P < 0.00005). These results clearly suggest that the observed genetic differentiation at SSR loci is substantial, niche-associated, and therefore, ecologically determined.

Linkage disequilibria (LD) between SSR loci

Triticum dicoccoides is predominantly a self-pollinating plant (about 99%), hence its effective population size is expected to be smaller than those of outcrossing plants and animals. Even without any selection, a small effective size would result in linkage disequilibria, LD (Ohta and Kimura, 1969). However, this is a minor effect. The major effect is that in an outcrosser LDs are reduced in each generation by crossing over between linked loci. The effect of recombination is very low in an inbreeder. It is of interest from the viewpoint of population genetics to investigate how LDs exist between SSR loci in this largely (but not completely) selfing species across the two microclimatic niches. In the present study, 16847, 13530, and 14541 possible pairwise associations were found in the total population, the shade and sun subpopulations, respectively. Of these, 18.0, 21.6, and 14.2 percent were significant (χ2-test, P < 0.05) LDs in the total population, shade and sun subpopulations, respectively. Because rare alleles may cause biased estimation of LD (Lewontin, 1995), further analyses were performed excluding the rare alleles (observed in ≤7 individuals in the whole population). In the shade niche, 86%, 90%, and 94% of 1089, 594, and 267 LDs at P < 0.05, 0.01, and 0.001 significant levels were niche-specific (Table 3). Permutation test, based on 5000 randomly shuffled sets of the initial data, showed high significance of the LD shade- specificity (P < 0.003–0.009, Table 3). In the sun subpopulation, after excluding rare alleles, 608, 296, and 131 LDs were also found to be significant at the levels of 0.05, 0.01, and 0.001, respectively. Among these significant LDs, 74–88% were sun-specific. However, the permutation test did not show significance of this LD sun-specificity (Table 3). These results suggest that the possibility of observing shade-specific LDs by chance was very low, the shade-specific environment may be responsible for the LD interniche specificity.

Table 3 Permutation test for niche-specificitya of linkage disequilibria (LD) at microsatellite loci of Triticum dicoccoides in the sunny and shady niches in the Yehudiyya, Israel

The log-linear model test showed that niche effect was very significant (χ2df=1 = 34.37, P < 0.000005), whereas, genome effect was negligible (χ2df=1 = 0.25, P > 0.50). Linkage itself between loci within chromosomes also seemed not to affect percentages of significant LDs in both subpopulations, reflecting the fact of the higher importance of selfing as compared to linkage.

Discussion

Microclimatic allelic divergence at SSRs in the shade and sun

This study demonstrated that interniche allele distributions at several SSR loci were significantly associated with microclimatic niches of T. dicoccoides. Microsatellite diversity in ARN, variance in repeat number (σ2) and numbers of alleles per locus (A), were significantly affected by microclimatic niche. Excluding rare alleles, niche-specific and niche-unique alleles were still identified in both sub-populations. In particular, more niche-specific and niche-unique alleles were observed in the shade sub-population. The observed niche-unique alleles clustered in neighbouring trees. This pattern most probably reflects migration. In such a small area, as suggested by Golenberg (1986) and Li et al (1999), seed dispersal by ants, rodents and human activities cannot be ignored. However, if there were not strong microclimatic selection in such a small area of only a few meters between the sun and the shade, the observed SSR differentiation might not have been maintained. Indeed, one would expect that without microclimatic selection, these niche-unique alleles should be spread over the neighbouring areas, because a small amount of migration or gene flow is sufficient to swamp the differentiation that arises from the random drift of non-selective genes (Lewontin, 1974). Therefore, we can assume that the revealed patterns may reflect an adaptation to microclimatic specificity of the two very close microclimatic niches. The lower temperature and intensity of radiation in the shade niche, and low moisture stress in the sun niche, may select specific or unique alleles of the subpopulations of T. dicoccoides as earlier demonstrated for nuclear allozyme alleles (Nevo et al, 1988). The importance of the present study is the demonstration of possible selection on the non-coding genomic regions.

Microclimatic differentiation of SSR linkage disequilibria in the shade and sun

LD can arise from natural selection, or by chance from population sub-division, hitchhiking (selfing and/or linkage), or founder effects. This study showed that the majority of the observed significant LDs were niche- specific. Permutation test demonstrated that the niche-specificity was highly significant in the shade, but not in the sun, subpopulations. These results suggest that the shade-specific linkage disequilibria do not arise from stochastic events, but from shade unique microclimate presumably owing to lower temperature and solar radiation (Table 1 in Nevo et al, 1988).

Niche-specificity was found at single- and two-locus levels in the shade subpopulation, suggesting that natural selection may act upon SSR variation, exactly as it does on allozymic diversity where we earlier demonstrated overall (sun + shade), sun and shade linkage disequilibria, suggesting niche-specific selection (Nevo et al, 1988). Nevertheless, even the small amount of outcrossing in wild cereals, especially in humid regions (Brown et al, 1978), can provide raw material for recombination and clonal selection, particularly in a widespread population such as that of Yehudiyya involving thousands of wild emmer individual plants organized in a sun/shade mosaic pattern.

It is important to avoid the pitfall of neighbouring effect in the attempt to separate random (founder effect) from non-random (niche structure) factors. Our present data at least partly overcome this problem, since it contains a series of repetitive plots, each with an oak tree canopy (shade) and its surrounding intertree space (sun). The mosaic oak tree structure of our experimental design involving 12 repetitions safeguard, at least partly, from genetic neighborhood effects. The genetic divergence of sun vs shade found here is, therefore, unlikely to be the result of stochastic processes. Edaphic and topographic microgeographic SSR divergences were also found in our other two populations of T. dicoccoides from two microsites, Tabigha (Li et al, 2000a) and Ammiad (Li et al, 2000b).

SSR variation and adaptation

The observed microclimatic SSR diversity may imply an adaptation to the shade or sun niche. There is evidence that SSR sequences can serve functional roles as regulatory elements (Karlin et al, 1998; Kashi and Soller 1999; King and Soller, 1999) and this is discussed in the accompanying paper (Li et al, 2002).

Viewed as functional elements of the genome, the special characteristics of SSRs as mutational hotspots have led to the proposal that SSRs may be a major source of genetic diversity and evolutionary adaptation (Kashi et al, 1997; Kashi and Soller, 1999; Trifonov, 2000). Therefore, the niche-unique alleles or linkage disequilibria may result from an adaptation of wild emmer plant to the unique sun or shade niche.

If so, this study complements the one on allozyme diversity (Nevo et al, 1988) in demonstrating that both genomic coding and non-coding regions are subjected to natural selection. This hypothesis is testable by transplant experiment between the sun and shade genotypes and by genetic mapping.