Introduction

Ecological specialization with adaptation, which is a genetic and evolutionary process increasing the fitness of a population to its environment, is commonly found in nature (Futuyma and Moreno 1988; Brown 1995). Studies on the evolutionary process and genome-wide differentiation associated with ecological differences contribute to understanding processes and mechanisms of ecological specialization (Hoekstra et al. 2006). From dwarf shrubs to canopy trees, the tree form is an important ecological specialization, such as competing with other species or individuals, reproducing effectively, and using limited resources (King 1990; Petit and Hampe 2006). Although this morphology is also influenced by the local environment, such as drought and wind stresses, the tree form is determined by hormone concentration of auxin and genetically predetermined for each species (Hollender and Dardick 2015); therefore, understanding tree form differences and its evolution contributes to forestry and other agriculture. Tree form variation is found in various taxa and within genera such as Echium (Böhle et al. 1996), Lupinus (Hughes and Eastwood 2006), and Salix (Wagner et al. 2018). Phylogenetic analyses revealed that tree form evolution occurred multiple times within a genus (Wagner et al. 2018). However, it is difficult to detect the genetic mechanism of tree form evolution from inter-specific relationships because of excessive genetic divergence. The variation within species is limited, such as Eucalyptus globulus (Foster et al. 2007), and the evolutionary process within species can provide important knowledge for the ecological specialization of trees. In the case of E. globulus, shrub populations evolved in parallel in disjunct areas, suggesting that these populations were formed by environmental pressure or genetic mechanisms.

Generally, the growth of evergreen broad-leaved trees is restricted in cold areas because of the less cold hardiness of evergreen leaves (Sakai and Larcher 1987; Kume and Ino 1993). However, evergreen broad-leaved shrubs are characteristically distributed in a heavy snowfall region along the Sea of Japan of the Japanese archipelago (Maekawa 1949; Hara 2023), and it is considered that the distribution of the shrubs is enabled by protection in snow cover (Kume and Ino 1993). For example, evergreen broad-leaved shrubs such as Camellia rusticana (Theaceae), Aucuba japonica var. borealis (Garryaceae), Ilex crenata var. paludosa, I. leucoclada (Aquifoliaceae), and Daphniphyllum macropodum var. humile (Daphniphyllaceae) are distributed in this heavy snowfall region. Interestingly, their tree or tall shrub congeners are distributed in a region along the Pacific Ocean with light snowfall, such as C. japonica, A. japonica var. japonica, I. crenata var. crenata, I. integra, and D. macropodum var. macropodum (Maekawa 1949; Hara 2023). The maximum snow depth in the Japanese archipelago along the Sea of Japan is sometimes over than 4 m (Fig. 1). It is caused by three factors: the cold winter monsoon from the northwest, the warm Tsushima current in the Sea of Japan, and continuous mountain ranges along the Sea of Japan (Kawase et al. 2018). Clouds are formed by contact between the cold air and warm current and move to the Japanese mountains during strong winter monsoons, resulting in heavy snowfall in the mountains. Global cooling started in the mid-Miocene and climatic oscillations started in the Pleistocene (Zachos et al. 2001); the eolian deposits in north China suggest that the winter monsoon became incrementally strong during the Pleistocene (Sun et al. 2010), and the uplift of mountains in Honshu, Japan, started in the late Pliocene (Yonekura et al. 2001).

Fig. 1: Distributions of analyzed populations and chloroplast DNA haplotypes, genetic clusters elucidated by sNMF, and approximate maximum tree height of the population.
figure 1

a Small letters indicate population codes corresponding to those in Table 1. Pie charts indicate frequencies of chloroplast DNA haplotypes, and the green and blue outer circles of the pie charts indicate Daphniphyllum macropodum var. macropodum and var. humile, respectively. A haplotype network is superimposed on the map. b Geographical distributions of ancestry for the two varieties when K = 4, the optimal number of clusters supported by the cross-entropy criterion. Snow depth is shown only for Japan, which is modified data from Japan Meteorological Agency (https://www.data.jma.go.jp/obd/stats/etrn/view/atlas.html). c Approximate maximum tree height of the population. d Distributions of ancestry when the number of clusters ranged from K = 2–5.

Daphniphyllum, the sole genus of Daphniphyllaceae, is the evergreen broad-leaved woody plant, and most of the 16 species are distributed in East and Southeast Asia, the Indian Subcontinent, and Australasia (Huang 1996; Tang et al. 2012). Daphniphyllum macropodum Miq. is distributed in southeast China, Taiwan, South Korea, and Japan. In Japan, the tree variety, D. macropodum var. macropodum, exceeding 10 m in height, is distributed in a warm temperate climate with light snowfall along the Pacific Ocean of Honshu, Shikoku, Kyushu, and the Ryukyu Islands. In addition, a shrub variety, D. macropodum var. humile (Maxim. ex Franch. et Sav.) K. Rosenthal (syn. D. humile Maxim. ex Franch. et Sav.) is distributed in cool temperate climate with heavy snowfall in Hokkaido and along the Sea of Japan of Honshu (Fig. 1). In contrast to the wide distribution of var. macropodum across East Asia, var. humile is endemic in Japan. The distribution of var. humile is concordant with the range of heavy snowfall over 50 cm (Fig. 1) and the duration of snow cover of around 30–120 days (Roessler and Dietz 2022). In contrast to the tree form of var. macropodum with a single straight trunk owing to apical dominance, var. humile grows horizontally owing to lateral dominance and clonal reproduction by layering due to being pressed to the ground by snow weight. In addition, short individuals of var. humile with a diameter below 1 cm at ground height can bloom (Yoichi, field observations), indicating that the reproductive size of var. humile is notably shorter than that of var. macropodum. Because individuals of the two varieties experimentally planted at the same place in the Pacific Ocean side of Honshu (35.7753N, 139.9023E) exhibited tree and shrub, respectively (Yoichi, field observations), it suggests that this tree form variation of the two varieties can be determined genetically. Such wide tree height variation is notable between shrubs and these congeners in Japan. The two varieties do not show differences in flower morphology; therefore, we adopt taxonomic treatment as two varieties. Pollination of D. macropodum occurs via wind (Kato 2000), and seed dispersal occurs via birds (Kominami et al. 1997).

In the present study, we performed a population genetic analysis of the two D. macropodum varieties, with D. teijsmannii as an outgroup, using chloroplast DNA (cpDNA) sequences and double digest restriction-site associated DNA (ddRAD) sequencing. To elucidate the evolutionary process of the shrub variety, which is considered related to the development in the heavy snowfall climate, we aimed to: (i) reveal genome-wide divergence between the two varieties, (ii) elucidate relationships between genetic and climatic differentiations between the two varieties, and (iii) discuss whether the evolution of var. humile is associated with the development of snowfall along the Sea of Japan during the Quaternary. Our results will provide new knowledge regarding the evolutionary process with ecological specialization and help us to understand tree form evolution and its genetic mechanisms.

Materials and methods

Plant sampling

We collected leaf materials from mature individuals from 16 populations of D. macropodum var. macropodum from Japan and South Korea and 23 of D. macropodum var. humile from Japan for covering the entire range (Table 1 and Fig. 1). The varieties of populations were identified based on measurements of the maximum tree height of each population because there were mature and immature individuals with various ages. In addition, we sampled three populations of D. teijsmannii, which is the other species of the genus distributed in Japan and South Korea, as an outgroup. The collected leaf tissues were immediately dried using silica gel.

Table 1 Sampling locality and genetic diversity estimates within populations collected from Japan and South Korea.

DNA experiment

Genomic DNA was extracted from dried leaves using a modified cetyltrimethylammonium bromide method (Murray and Thompson 1980) after treatment with sorbitol buffer (Wagner et al. 1987). On the basis of a screening of polymorphic loci using previously reported universal primers (Kress et al. 2005; Shaw et al. 2005), three noncoding cpDNA regions [trnK-matK, trnD-trnE, and psbH-psbB (Shaw et al. 2005)], which were polymorphic within and between D. macropodum and D. teijsmannii, were used in this study. Four individuals from each population of D. macropodum and two individuals from each population of D. teijsmannii were amplified and sequenced using the protocols described in Yoichi et al. (2017). Genome-wide single-nucleotide polymorphisms (SNPs) were detected using the ddRAD sequencing method, as described in Yoichi et al. (2019). Briefly, the genomic DNA of each sample was digested with BglII and EcoRI, and digested fragments of 350–400 bp were sequenced with 150 bp paired-end reads using a HiSeqX platform (Illumina, San Diego, California, USA). Raw reads were processed using fastp v.0.20.1 (Chen et al. 2018) to cut adaptor sequences and filter out sequences with a Phred quality score, Q < 30.

SNP detection

Before SNP calling, samples with less than 150 K single-end reads were removed; therefore, 165 samples (an average of 3.9 samples per population) were used for SNP detection. SNPs from single-end reads were called using Stacks v. 2.3 (Catchen et al. 2013). To create stacks, we used the following settings with the outgroup species: minimum depth option for creating a stack, -m 3; maximum distance between stacks within an individual, -M 3; number of mismatches between loci, -n 2; gapped option for creating stacks, –max_gaps 5; and default settings for other options. To maximize the number of used SNPs in each data analysis, we defined and made the following three types of datasets that were tailored to the objective of each analysis: 42 populations of D. macropodum and D. teijsmannii for analyses with the outgroup (Set 1), 39 populations of D. macropodum for population genetic analyses of the species (Set 2), and 3 groups of D. macropodum for demographic analysis of the species (Set 3). The definitions of the three groups, var. macropodum (M) and southern and northern var. humile (HS and HN), were populations M1–M16, H11–H23, and H1–H10, respectively, which were identified by population genetic analyses (see Results). For Set 1 and Set 2, we extracted SNPs under Hardy–Weinberg equilibrium, with independent loci, removing minor alleles and allowing missing sites, by applying the following restrictions using the Populations program in Stacks: minimum minor allele frequency was 1%, –min_maf 0.01; observed heterozygosity less than 0.5, –max_obs_het 0.5; genotyping rate more than 20% of individuals within populations, -r 0.2; and present in all of the populations, -p 42 or 39. Pairwise R2 values for each SNP pair were calculated using PLINK 1.90b (Purcell et al. 2007), and loci with values higher than 0.8 were removed using the whitelist option in the populations to exclude SNPs with high linkage equilibrium. SNP genotypes for the three groups in D. macropodum (Set 3) were called with the following restrictions: include all SNPs without minor allele filter, –min_maf 0.00; observed heterozygosity less than 0.5, –max_obs_het 0.5; genotyping rate more than 80% of individuals within regions, -r 0.8; and present in all of the regions, -p 3.

Population genetic analysis

cpDNA sequences were edited and assembled using DNA Baser 4 (Heracle BioSoft, PiteÅŸti, Romania) and aligned using MUSCLE implemented in MEGA 5 (Edgar 2004; Tamura et al. 2011). A haplotype network was constructed by the median-joining method using NETWORK v. 10 (Fluxus Technology Ltd, Colchester, UK).

Genetic relationships among populations, with and without the outgroup species, were constructed independently by the neighbor-joining method based on DA distances (Nei et al. 1983) using POPTREE2 (Takezaki et al. 2010). The significance of nodes was evaluated from bootstrap values based on 1000 replicates. The individual-based genetic structure with and without the outgroup species was inferred independently by the sparse non-negative matrix factorization method using sNMF (Frichot et al. 2014). Different numbers of ancestries (K) in the 1–10 range were tested and ranked using the cross-entropy criterion. Principal component analysis (PCA) of individuals was performed using the R package adegenet of R v. 4.1.1 (Dray and Dufour 2007).

Demographic analysis

To estimate past population demography, such as population size change, population divergence, or migration between populations, approximate Bayesian computation (ABC) was performed. On the basis of the results of population genetic analyses (see Results), we classified populations into three different groups: var. macropodum (M, populations M1–M16), southern var. humile (HS, populations H11–23), and northern var. humile (HN, populations H1–H10). We then created four three-population divergence scenarios: no migration (NM), isolation with migration (IM), ancient migration (AM), and secondary contact (SC) (Fig. 2). NM assumes no migration between the populations. IM assumes that migration between populations always occurs from its divergence to the current. AM assumes that migration occurred only in the early period of its divergence. SC assumes that migration occurs after a certain period of isolation. First, we tested two migration patterns, the island and stepping-stone models, for IM from T0 to T2 and SC from T0 to T1. The island model assumes that migration occurs between all pairs of groups. The stepping-stone model assumes that migration occurs only between adjacent pairs of groups. In our preliminary analyses, as the stepping-stone model was more strongly supported than the island model in both IM and SC (details not shown), we used the stepping-stone model for IM and SC in the scenario comparison. Prior distributions of the effective population size (N), time (T), and migration (Nmij) parameters were log-uniform (103, 106), log-uniform (1, 106), and log-uniform (0.1, 10), respectively (Supplementary Table 1). The units were the number of diploid individuals generations ago and the number of migrants per generation from j to i, respectively. To aid the reader’s understanding, the direction of migration is shown as the movement of individuals from j to i (time in forward).

Fig. 2: Comparison of the three group divergence scenarios.
figure 2

The four scenarios are no migration (NM), isolation with migration (IM), ancient migration (AM), and secondary contact (SC). M, Daphniphyllum macropodum var. macropodum; HS and HN, southern and northern D. macropodum var. humile, respectively. N, effective population size; T, event time; Nmij, number of migrants per generation from j to i (1, M; 2, HS; 3 HN). Direction of migration is forward in time. The period shown in light gray indicates that a migration is occurring.

We calculated two-dimensional minor allele site frequency spectra (2D-minor allele SFSs) between two group pairs using our own R scripts using the dataset for ABC analyses (Set 3). Missing data were compensated for by bootstrapping within a group. We then calculated the number of polymorphic sites (S), nucleotide diversity (PI), and Tajima’s D (D) for each group, and FST between group pairs (Wright 1951) from 2D-minor allele SFSs.

The four scenarios were simulated 104 times each with fastsimcoal2 v. 2705, which enables us to analyze geographically structured populations (Excoffier et al. 2021). The 12 summary statistics were calculated from the output 2D-minor allele SFSs. To simulate the data as realistically as possible, the same number of short leads (2279 loci) and average length (151.21 bp) as the observed data were used. The mutation rate estimated in the woody species Populus tremula (1.74 × 10−8 per site per generation) was used (Gossmann et al. 2012). Because we simulated very short DNA sequences, the recombination rate was set to 0. Simulated scenarios were compared using ABC-random forest (RF) implemented in the abcrf package of R v. 1.8.1 with 1000 decision trees (Pudlo et al. 2016). The confusion matrix of the scenarios, classification error rates, and posterior probability were calculated. Using the best scenario selected by ABC-RF, an additional 4 × 106 simulations were performed for parameter estimation. With the 1000 simulations closest to the observed data, posterior distributions of the parameters were estimated by a neural net regression method implemented in the abc package of R v. 2.1 (Blum and François 2010; Csilléry et al. 2012). The number of neural networks was set to 20. The logit transformation of the parameters was used to maintain the estimated value within the prior range. The posterior mode was estimated by the density function of R. The 95% highest posterior density (HPD) was estimated using the coda package of R v. 0.19.4 (Plummer et al. 2006). Event times were converted from generations ago to years ago using the value of 20 years per generation, which was determined by considering the species life history, although the two varieties have different life forms associated with different maturation times (Petit and Hampe 2006).

To confirm the fit of the scenario to the observed data, posterior predictive simulation with 1000 randomly drawn posterior samples was conducted, and summary statistics were compared to the observed data (Gelman et al. 2014).

Climatic differentiation

To evaluate the differentiation of habitats, the maximum snow depth at 30 arc-sec resolution from the Japan Meteorological Agency (https://www.data.jma.go.jp/obd/stats/etrn/view/atlas.html) was obtained, and the values of the analyzed populations were retrieved. Because it is difficult to identify the difference in tree height between the two varieties from herbarium specimens, we used only the locality of the analyzed populations for evaluating climatic differentiation. We applied analysis of variance with Tukey’s multiple comparisons of means, using R v.3.5.2, to determine whether snow depth differed among the three groups. Because of a lack of a value of the maximum snow depth from Jeju Island in South Korea, this population was removed from this analysis.

Results

Distribution of chloroplast DNA haplotypes for the two species

The sequences of three cpDNA regions were obtained from 64 individuals of Daphniphyllum macropodum var. macropodum, 92 of D. macropodum var. humile, and 11 of D. teijsmannii. The lengths of the aligned sequences for the three regions were 739 bp for trnK-matK, 545 bp for trnD-trnE, and 560 bp for psbH-psbB. These sequences identified four haplotypes from four segregating sites across the two species. A haplotype network revealed that the haplotypes could be distinguished with one or two substitution steps. Daphniphyllum macropodum var. macropodum had two haplotypes (1 and 2) and var. humile had two haplotypes (2 and 3); thus, haplotype 2 was shared between the two varieties in western Japan (Fig. 1a).

Phylogeographic patterns of SNPs for the two species and two varieties

The RAD-seq library yielded 368,840,797 single-end reads with an average of 3,226,768 per sample. We detected 2194 SNPs with a genotyping rate of 85.15% with the outgroup (Set 1) and 3053 SNPs with a genotyping rate of 82.44% without the outgroup (Set 2). The number of polymorphic SNPs and expected heterozygosity within each population of D. macropodum (Set 2) ranged from 644 to 1716 and 0.085 to 0.187, respectively (Table 1). Genetic relationships among populations by the neighbor-joining method revealed that D. teijsmannii and the two varieties of D. macropodum were distinguished with the highest probability (100%), and the two varieties were distinguished in both the SNP sets with and without the outgroup (Fig. 3). Furthermore, genetic relationships among populations identified two groups, populations H1–H10 and H11–H23, within var. humile with the highest probability (Fig. 3b). The individual-based genetic structure with the outgroup, inferred by sNMF, showed that the optimal number of clusters (K) was K = 4, based on the cross-entropy criterion. For all K = 2–5, D. macropodum and D. teijsmannii were clearly distinguished by different clusters (Supplementary Fig. 1). The individual-based genetic structure without the outgroup showed that the optimal number of clusters was K = 4. For all K = 2–5, clusters were admixed between the two varieties (Fig. 1d), and this pattern was the same when including the outgroup. PCA showed that the first component (85.1%) had a high contribution to the variance explained. Three groups, that is, var. macropodum (M, populations M1–M16), southern var. humile (HS, populations H11–23), and northern var. humile (HN, populations H1–H10), could be clearly identified by the first two PCA axes (Fig. 4).

Fig. 3: Genetic relationships among populations, constructed based on DA distance using the neighbor-joining method.
figure 3

Relationships a with and b without the outgroup species, Daphniphyllum teijsmannii. Bootstrap probabilities that were a 100% and b exceeded 80% based on 1000 replicates are shown above nodes. Populations codes are shown in Table 1. Red, green and blue are Daphniphyllum macropodum var. macropodum (M), southern (HS) and northern (HN) D. macropodum var. humile, respectively.

Fig. 4: Genetic relationships among individuals detected by principal component analysis (PCA).
figure 4

M, Daphniphyllum macropodum var. macropodum; HS and HN, southern and northern D. macropodum var. humile, respectively. Proportions of variance explained for the two principal components were evaluated from the top 50 components.

Demographic analysis

We detected 5858 SNPs with a genotyping rate of 91.58% for the demographic analysis dataset (Set 3). Pairwise genetic differentiation (FST) between groups was highest between M and HN (0.071) and lowest between M and HS (0.022; Supplementary Fig. 2). The isolation with migration (IM) was selected as the best scenario by ABC-RF with a high posterior probability of 0.808 (Table 2). Although the classification error rate was relatively high (0.501) as votes by an RF were concentrated into the IM (0.844), the posterior probability of the best scenario was high (0.808) and the error rate when the IM was the true scenario was moderate (0.328; Supplementary Table 2). Therefore, we confirmed that our ABC-RF scenario choice was reliable. All posterior distributions of parameters in the IM showed a different shape from their prior distributions and showed a clear peak (Supplementary Fig. 3). Posterior predictive simulation of the IM showed that all simulated parameters were located around the observed value (Supplementary Fig. 2). This indicates that the goodness-of-fit of the IM is high.

Table 2 Summary table of ABC-RF scenario choices.

The posterior modes (95% HPDs) of the effective population sizes at present of M, HS and HN were 62,538 (2399–976,341), 2649 (1014–594,108) and 28,699 (2223–873,117), respectively (Fig. 5 and Supplementary Table 3). Posterior distributions of the ratio of effective population size at present to that at T1 (N0/N1) in M and HN were biased to larger than 1.0 (0.0 on the log-scale) with posterior probabilities (N0 > N1) 0.665 and 0.744, respectively (Supplementary Fig. 4c). However, that in HS was distributed around 1.0. This indicated that M and HN expanded in size from T1 to the present, whereas HS did not. Although we also examined population size changes from T1 to T2 and T2 to T3, there were no clear expansion or reduction signals in population size (Supplementary Fig. 4a, b). The posterior modes (95% HPDs) of T1, T2 and T3 were 20 (0–393), 198 (14–4464), and 1316 (282–19,984) Kya, respectively. Migration was more frequent from the present to T2 than from T2 to T3 (Fig. 5 and Supplementary Table 3). Especially in the direction toward HS, the values of the number of migrants per generation (Nm) were larger than those of the others [from M, 4.24 (0.68–9.66); from HN, 3.29 (0.36–8.18)].

Fig. 5: The effective population size and number of migrants per generation in the three island groups are denoted by circles and arrows, respectively, in the best scenario (isolation with migration, IM).
figure 5

The sizes of the circles and arrows are proportional to the value of the posterior mode of the parameter. The effective population sizes were at T2 and T0. Numbers indicate posterior mode with 95% highest posterior density. Direction of migration is forward in time. Photographs were obtained for each representative population. M, Daphniphyllum macropodum var. macropodum; HS and HN, southern and northern D. macropodum var. humile, respectively.

Tree height variation and habitat differentiation

The approximate maximum tree height of the population for M, HS and HN ranged from 5 to 15 m, 0.5 to 1 m, and 0.5 to 5 m, respectively (Table 1 and Fig. 1c). The maximum snow depth of the analyzed population of var. humile ranged from 0.30 to 2.95 m (Table 1). The approximate maximum tree height and maximum snow depth of the population significantly differed between groups of M and HN, and M and HS (P < 0.001), but not between HN and HS, respectively.

Discussion

Genetic relationships of Daphniphyllum species in Japan and South Korea

When considering evolution and population genetic structure, introgression can have non-negligible effects on evolution and species relationships (Barton 2001; Abbott et al. 2013). Daphniphyllum species, distributed in Japan and South Korea, consist of only D. macropodum and D. teijsmannii. The genetic divergence of the two species, which are rarely found in the same area, is clear, suggesting that the influence of introgression on the two species, especially in D. macropodum, is negligible in this study (Fig. 3a).

Genetic differentiation between the two varieties of D. macropodum

Daphniphyllum macropodum var. macropodum and var. humile have also diverged with the highest support (Fig. 3). It suggests that the morphological difference between the two varieties, which are characterized by tree height, may be environmentally controlled and also genetically predetermined. In Japan, plant distributions have changed during climatic oscillations of the late Quaternary and such history is reflected in genetic structure. However, widespread trees without morphological differences between heavy and light snowfall climates exhibit weak genetic structures corresponding to snowfall in western Honshu (Sakaguchi et al. 2011; Iwasaki et al. 2012), while the genetic structure of the two varieties seems to maintain divergence between the varieties (Fig. 2b). Difference in habitats between the two varieties based on climate, including snowfall, suggests that heavy snowfall is considered to be a significant selective pressure for var. humile to maintain the shrub form as same as other shrubs (Kume et al. 1998).

The distribution of plants with morphological differences, that is, distribution in a light snowfall region along the Pacific Ocean and the heavy snowfall region along the Sea of Japan in the Japanese archipelago, is known for many species. Ecological and morphological differences, such as leaf size, leaf shape, snow weight tolerance, and a loss of drought tolerance, between regions are considered to be a result of adaptations to heavy snowfall climate (Hagiwara 1977; Kume and Ino 2000; Tamaki et al. 2018). As a result, a deciduous broad-leaved tree, Magnolia salicifolia exhibited that morphological and genetic differentiations were associated (Tamaki et al. 2018). The evergreen broad-leaved shrub, Aucuba japonica var. borealis, which has morphological features similar to those of D. macropodum var. humile in the heavy snowfall region (Kume and Ino 2000), also exhibited that morphological and genetic differentiations were not associated with geographic distribution in diploid and tetraploid individuals (Ohi et al. 2003). Furthermore, Fagus crenata (Hiraoka and Tomaru 2009) and an evergreen conifer, Cryptomeria japonica (Kimura et al. 2014), which have morphological differences between heavy and light snowfall climates, exhibited genetic divergences between populations in the Pacific Ocean and the Sea of Japan sides but admixed population genetic structures, which may suggest the presence of migration between the regions. These evidence indicate that the evolution in the heavy snowfall region and the maintenance of genetic differentiation vary within the same landscape.

In contrast to the genetic divergence of the two varieties, genetic differentiation between var. macropodum (M) and the southern group of var. humile (HS; FST = 0.022) was lower than that between the two groups of var. humile (FST = 0.053; Supplementary Fig. 2). These relationships between groups can be explained by the strength of migration between groups, which was larger in M-HS pair than in HS-HN pair (Fig. 5). This result seems to be reliable for considering a tendency that tree height is tall in the northern and short in the southern groups of var. humile (Table 1). Especially the existence of taller populations in southern var. humile (HS) suggests migration between the two varieties. The migration is also supported by the shared cpDNA haplotype (haplotype 2) of the two varieties in western Honshu (Fig. 1a), and it indicates that migration occurs not only by pollen flow via wind but also by seed dispersal via birds (Kominami et al. 1997; Kato 2000). In addition, migration seems to be predominant from var. macropodum into southern var. humile, as shown in the demographic analysis (Fig. 5). In contrast, the unshared haplotype (haplotype 1) of the two varieties in eastern Honshu suggests that the distribution of southern var. humile may be formed by range expansion from the south. Demographic analyses using user-friendly programs such as DIY-ABC enable the modeling of admixture at a timepoint but cannot consider continuous migrations in scenarios (Cornuet et al. 2014). Our demographic analysis using isolation with migrations is considered to reflect more actual demography of migration. It may be difficult to evaluate whether the definition of groups, based on the among-population relationships and individual relationships (Figs. 3 and 4), represents the actual population dynamics because of the gradual population genetic structure in var. humile (Fig. 1d).

The short tree height of var. humile is considered an adaptive trait against cold climate by being buried in snow cover (Kume and Ino 1993); therefore, short tree height would be essential for survival in a heavy snowfall climate. Another shrub species Camellia rusticana, which is endemic in the heavy snowfall region, and C. japonica in the light snowfall region have different physiological tolerance to heavy snowfall climate (Kume et al. 1998); therefore, the shrub variety of Daphniphyllum may also evolve such physiological tolerance (Katahata et al. 2014). In var. humile, some populations in western Honshu tend to be taller than northern populations (Table 1); thus, tall populations in this region may indicate that the snow cover in these populations is sufficient to bury individuals or temperatures not low enough to damage leaves (Kume and Ino 1993). Furthermore, in southern var. humile, mixing proportions of the genetic clusters, which dominated in the two varieties (Fig. 1b), in addition, tree heights tended to be variable among populations, especially in western Honshu (H18–H23; Table 1). These lines of evidence suggest that the tree height variation with migration is considered to be maintained by both the geographic proximity of the two varieties and climatic selection in habitats. However, the approximate maximum tree height described in this study is insufficient and should be improved by measuring the maximum and minimum range of tree height of mature individuals with sex in populations.

The timing of adaptation of D. macropodum var. humile to heavy snowfall environment

On the basis of a phylogenetic analysis of Daphniphyllum (Tang et al. 2022), vars. macropodum and humile were closely related within the genus, and other closely related species to var. humile are also taller than the shrub variety. In addition, the distribution of var. macropodum is wide across southeast China, Taiwan, South Korea, and Japan (Tang et al. 2012). These pieces of evidence support the hypothesis that the shrub variety evolved from the tree variety. However, the shrub variety of evergreen conifer Torreya nucifera var. radicans, in the heavy snowfall region was not the most closely related to the tree variety Torreya nucifera var. nucifera in the light snowfall region of Japan (Aizawa and Worth 2021). Therefore, more reliable phylogenetic trees of Daphniphyllum will help us to understand the evolutionary history of var. humile. The IM scenario in this study was robust, whereas the divergence time of the two varieties was controversial because it is highly dependent on the applied mutation rate and generation time. In this simulation, the divergence time of the two varieties was estimated to be 1316 Kya (282–19,984, 95% HPD). Global cooling since the mid-Miocene and climatic oscillations since the Pleistocene (Zachos et al. 2001) influenced snowfall in Japan, the eolian deposits in north China suggest that the winter monsoon became incrementally stronger during the Pleistocene (Sun et al. 2010), and the uplift of mountains in Honshu has been activated since the late Pliocene (Yonekura et al. 2001). In addition, discoveries of Stewartia monadelpha fossils, which is currently growing in light snowfall region of the Pacific Ocean side of Japan, a living fossil conifer, Metasequoia glyptostroboides, from layers during 1.5–1.8 Mya in central Niigata (37.40–37.80N, 148.46–148.50S) along the Sea of Japan, which is currently a heavy snowfall area (Momohara et al. 2017), and an increase of Cryptomeria japonica pollens from sediments from Sea of Japan off at 2.5 and 1.7 Mya (Igarashi et al. 2018) suggests that this area had become cold and started to experience heavy snow since the period. Therefore, the divergence of the two varieties and range expansion of var. humile might have occurred since the Middle Quaternary.

Conclusion

The short tree height of var. humile is considered to be an adaptive trait to the heavy snowfall climate. The migration between the shrub and tree varieties since the divergence suggests that selective pressure by snow cover and cold hardiness is geographically shifted. This provides important knowledge for studies on ecological speciation with migration and on evolution associated with adaptation to climate, which is sometimes gradually changing. The evolution of the shrub variety could be associated with a lack of apical dominance, and thinness and plasticity of stems, resource allocation, and reproductive size.