Introduction

Species divergence and local adaptation typically involve changes in suites of functionally interrelated traits such flowers and leaves. When floral structure diverges, as can occur when species or populations adapt to different sets of pollinators, many component traits such as corolla length and width, anther and stigma positions, and amount and concentration of nectar produced evolve simultaneously to produce an altered structure in which all of these component traits have the appropriate spatial relationships for facilitating efficient pollination by a specific set of pollinators (Berg 1960; Armbruster et al. 2009; Goodwillie et al. 2010). Similarly, adaptive divergence in leaves can involve modification of a suite of traits including leaf size, shape, age-span, cuticular properties, and physiological characteristics, such as chlorophyll, water, and nutrient content (Arntz and Delph 2001; Reich et al. 2003; Milla and Reich 2011). However, it is still poorly understood how the individual components of complex trait suites such as flowers and leaves evolve in a coordinated fashion to retain functionality.

Three scenarios may explain how coordinated evolution can occur. One extreme possibility is that genetic variation in each component trait is independent, and direct selection on each component trait, possibly coupled with correlational selection on groups of component traits, molds the final phenotype (Pigliucci 2003; Klingenberg 2008; Pélabon et al. 2014). A second possibility, at the other extreme, is that proper functional relationships among component traits may be assured by the genetic correlation structure, allowing direct selection on one or a small number of the component traits to result in indirect selection on the other component traits, essentially dragging them along in a manner that maintains their functional relationship (Armbruster et al. 2014). Studies of floral integration often report strong genetic correlations among floral component structure (Juenger et al. 2005; Bouck et al. 2007; Wessinger et al. 2014; Smith 2016), suggesting the possibility of this mechanism. However, it is not clear why it should be expected that the directionality of the correlation structure will be aligned to the direction in which selection attempts to move the population, especially if the structure of genetic correlations among component traits is largely determined by the mutational correlation structure (Lande 1980). A third possibility is that strong selection favoring change in one character leads to slightly maladaptive correlated changes in other characters, which are subsequently compensated for only slowly as appropriate mutations arise. Under this scenario, in recently diverged taxa, change in complex traits is the result of direct selection on a small number of component traits with the others diverging due to indirect selection. One way of distinguishing among these possibilities is to determine the extent to which mutations responsible for the evolution of component traits are pleiotropic.

In particular, if mutations generally affect only one or a small number of component traits, each trait or small group of traits will evolve independently; and if the changes are adaptive, they will require independent direct selection. By contrast, if mutations are highly pleiotropic with respect to the component traits, they will evolve in a coordinated fashion that could be driven by direct selection on one or a small number of the component traits. Previous studies on natural plant populations have revealed that phenotypic divergence can result from either a few quantitative trait loci (QTL) with large effect (Bradshaw et al. 1998; Bouck et al. 2007; Klahre et al. 2011) or many QTL of small effect (Goodwillie et al. 2006; Fishman et al. 2015; Wessinger et al. 2014). However, only a few studies have examined formally the extent to which the QTL for different characters overlap (Juenger et al. 2005; Hall et al. 2006; Lowry et al. 2015).

Others have used quantitative genetic breeding designs to examine the extent to which floral traits are genetically correlated (Edwards and Weinig 2011). These have generally found that floral traits tend to be correlated inter se, as do vegetative characters, but with little correlation between floral and vegetative characters. These results suggest that plant phenotypes may consist of more or less separated developmental modules, within which characters are highly correlated genetically, but between which correlations are weaker. However, genetic correlations arising due to segregating variation within individual species do not necessarily reflect pleiotropy of mutations that contribute to fixed differences between species, for two reasons: (1) within-species genetic correlations may be caused by linkage disequilibrium (Berg 1960; Armbruster and Schwaegerle 1996), and (2) fixed substitutions may reflect novel mutations that may have different pleiotropy characteristics than standing variation. Consequently, examining whether the same between-species QTL affects different traits is a more appropriate approach than examining within-species QTL for understanding the extent to which different traits have evolved independently.

When many correlated traits are examined, principal component analysis (PCA) can be used to reduce the dimensionality of the data, while retaining as much as possible of the variation in the original data (Jolliffe 1986; Upadyayula et al. 2006). This is achieved by identifying uncorrelated linear combinations of traits, the principal components (PCs), which are the eigenvectors of the phenotypic covariance matrix. Typically, a few leading PCs explain a large proportion of variance and can be considered new uncorrelated traits amenable to QTL detection (Jiang and Zeng 1995; Upadyayula et al. 2006). Reducing highly correlated variables into smaller sets of independent PCs through PCA may increase power of detecting QTL by reducing genome-wide false-positive detection rates.

In this study, we identify QTL contributing to differences in floral morphology and in leaf morphology and physiology between two closely related species of Primulina (Gesneriaceae) in southern China. This system is ideal for examining the genetic underpinnings of adaptive phenotypic divergence because of their sympatric distribution in different microhabitats. We quantify the degree to which QTL for different component traits overlap and test whether this overlap deviates from random expectations. In addition, we address the issue of the extent to which phenotypic divergence between these two species was actually caused by selection by applying a QTL sign test (QTLST; Orr 1998) to QTL for different traits. We use an F2 population generated by a cross between one individual of each of these species from the sympatric site to identify QTL for both individual traits and PCs for a number of floral and leaf traits. With this information, we ask two questions: (1) to what extent do QTL for different traits overlap in genomic location? And (2) is there evidence indicating that divergent selection was responsible for differentiation of any of the traits?

Materials and methods

Study system

Primulina is a speciose genus of about 180 described species (Xu et al. 2017). This genus displays remarkable interspecific variation in flower and leaf traits and species occur in diverse light regimes, from steep outcrops exposed to full sunlight to dark caves with low light levels. Here we focus on a pair of sister species, Primulina danxiaensis and Primulina depressa, with marked differences in flower and leaf traits. P. danxiaensis was initially described as a member of the genus Chiritopsis and has small, light-yellow flowers and round leaves, while P. depressa has large and blue–purple flowers and narrow-long leaves (Fig. S1). Both species are endemic to northern Guangdong, South China. The distinct habitats in which they grow differ in a number of abiotic and biotic characteristics that could serve as selective agents to cause divergence, including differences in light availability, soil moisture and nutrient content, and the community of pollinators available (Zhou et al. 2010; Hao et al. 2015). The two species occur in sympatry (<100 m between populations) at a location on Danxia Mountain, a National Park and UNESCO World Heritage site, Guangdong, South China. At this location, the two species grow in different nearby habitats: P. danxiaensis tends to occupy caves with dim light conditions, whereas P. depressa tends to grow on outcrops with more sun (Fig. S1). Despite the contrasting morphologies, multiple nuclear DNA sequence data revealed that these two species are phylogenetically sister species (Kong et al. 2017). Individuals of the P. danxiaensis population sympatric with P. depressa are genetically more similar to P. depressa than they are to other conspecific populations (Kong et al. 2017), suggesting that there may be gene flow between the two species. Nevertheless, phenotypic differentiation in sympatry is maintained, presumably by selection favoring local adaptation to different microhabitat niches.

Crossing and cultivation

In 2011, we collected several adult individuals of P. depressa and P. danxiaensis from the sympatric populations in the Danxia Mountain National Park. The parent lines were maintained in a greenhouse at South China Botanical Garden (SCBG). In 2012, an F1 family including 21 progeny was generated from a cross between an individual of P. depressa (♂, DXS02-17) and an individual of P. danxiaensis (♀, DXS04-5). In 2013, we obtained an F2 mapping population consisting of 201 individuals by selfing a single F1 plant. After germination, all plants were transplanted into 8-cm pots filled with about 200 ml Klasmann substrate (Klasmann Deilmann, Geeste, Germany) and were grown on benches in a fully randomized design at the SCBG greenhouse under natural light. All F2 seeds were collected from September to October and sowed in next February. Plants were watered as needed and fertilized with Miracle-Gro Tep6 compound fertilizer (Scotts Miracle-Gro Company, OH, USA) weekly.

Trait measurement

Twelve traits were measured, including seven flower morphological traits (upper petal width, lower petal width, corolla width, corolla height, corolla length, pistil length, and stamen length; Fig. S2), two leaf size traits (leaf length and leaf width), and three leaf physiology traits (concentrations of chlorophyll a, chlorophyll b, and carotenoids). These traits were measured on 10 P. depressa individuals, 9 P. danxiaensis individuals, 3 F1 hybrids, and 201 F2 individuals.

We measured flower morphological traits on the first three to five flowers per individual on the day the flower opened. First, we measured all the corolla traits using digital calipers. We then carefully dissected the corolla and measured the pistil and stamen length.

We scanned three mature and fully expanded leaves collected from each individual. ImageJ v.1.49 software (Abràmoff et al. 2004) was used to quantify leaf length and leaf width. The largest mature leaf of each individual was sampled to measure the maximum leaf length and width.

To measure the chlorophyll and carotenoid content, we extracted leaf pigments from the mid portion of fully expanded leaves with acetone and quantified content photometrically, using a multiscan spectrophotometer (Epoch Take3, Biotek Instruments, Winooski, VT, USA) at 440, 645, and 662 nm. Concentrations per leaf area were calculated with the formula of Fadeel (1962): Ca = 9.78 × E662–0.99 × E645; Cb = 21.4 × E645–4.65 × E662; and Cc = 4.69 × E440–0.268 × C(a+b), where Ca, Cb, and Cc indicate concentrations of chlorophyll a, chlorophyll b, and carotenoids, respectively.

Assessing trait independence

To assess trait independence, we used PCA to identify major axes of trait variation and covariation (Jaing and Zeng 1995). In the F2 population, seven individuals did not flower, probably due to hybrid breakdown, and their floral traits were thus missing data. We performed PCA with Bayesian PCA (the “bpca” function of the R package pcaMethods; Stacklies et al. 2007), which can deal with missing values via imputation (i.e. mean substitution). The phenotypic data were standardized with the “scale” algorithm to eliminate the noise from magnitude of different phenotypic composition. We calculated correlations among PC scores and phenotypic traits to determine the relative contributions of different traits to each PC variable. PC scores were regarded as quantitative traits and used to identify QTL associated with these PCs.

Restriction-site associated DNA sequencing method (RAD-seq) genotyping by sequencing

Genomic DNA was isolated from the fresh leaves of male and female parents, the F1 hybrid that generated the F2 population, as well as 201 F2 individuals using a modified CTAB-chloroform extraction protocol (Doyle and Doyle 1987) and was later quantified using Qubit fluorometer (Invitrogen, Carlsbad, CA, USA). To obtain markers for mapping, we prepared our DNA for a RAD-seq (Baird et al. 2008). In brief, DNA was digested with the restriction endonuclease EcoRI, which recognizes a 6-bp sequence (G’AATTC). Fragments with lengths about 350 bp were separated on an agrose gel, selected for PCR amplification as sequencing templates, and sequenced on Illumina XTenTM platform under the PE 150 protocol. The library construction and next-generation sequencing were performed by the staff of Novogene Bioinformatics Institute (Beijing, China).

Reads were quality-filtered by removing adapter sequences and reads containing >10% low-quality bases with Q20 value ≤20%. The quality filtration pipeline was provided by Feng et al. (2017). The quality-filtered RAD reads of each individual were mapped onto the Primulina huaijiensis genome (C. Feng, unpublished data) with the software BWA version 0.7.5 (Li and Durbin 2009). Based on the alignment result, Stacks software system version 1.44 (Catchen et al. 2011) was used to group homologous Illumina short reads into loci and determine individual genotypes. Putative haplotypes were built for each individual using a minimum stack depth parameter of five identical reads (m = 5). Identical loci from the two parents were then merged to build a catalog, with a maximum number of mismatches of three nucleotides (n = 3). Markers that were heterozygous in F1 but homozygous in both parents were suitable for linkage mapping.

Genetic map construction

The Primulina RAD linkage map was constructed with the JoinMap v.4.0 program (van Ooijen 2006), using markers with a maximum of 20 missing genotypes out of 201 progeny. The segregation of each marker was analyzed by a chi-square (χ2) test for goodness-of-fit to the expected Mendelian ratio 1:2:1 for biallelic codominant markers in an F2 population (P < 0.005). Markers were assigned to linkage groups (LGs) with an independence logarithm of odds (LOD) score of 7.0. The marker order within a LG was evaluated using default JoinMap v.4.0 parameters. Recombination rate was converted into genetic map distance (cM) by Kosambi’s mapping function (Kosambi 1944).

QTL mapping and analysis

We adopted two approaches to assessing the extent to which QTL for different traits overlap. The first approach was to detect QTL for individual traits, while the second was to first conduct a PC analysis for all traits and then identify QTL for PC scores (PCA QTL). The latter approach may have the potential to be more powerful for detecting QTL’s affecting multiple traits (Jiang and Zeng 1995). For both approaches, we used the interval mapping algorithm implemented in the MapQTL v.6.0 software using 1 cM increments (van Ooijen 2004) to map both individual trait QTL and PCA QTL. A significance LOD threshold was determined for each LG using 10000 permutations to test for segregating QTL with a 5% significance level. Markers with LOD scores exceeding the empirical threshold were first selected as cofactors before automatic cofactor selection was performed for each LG with a QTL. A multiple QTL-mapping analysis was finally performed using selected cofactors, and QTL were identified when LOD peaks were higher than the 5% chromosome-wide LOD threshold determined by permutations. For each significant QTL, we obtained 1.5-LOD confidence intervals (CIs), as well as estimates of additive allelic effects. QTL effect sizes were reported as the proportion of phenotypic variation explained (PVE) and the proportion of the parental difference explained by the additive effects. Final QTL maps were drawn with the help of graphical package MapChart v.2.2 (Voorrips 2002).

In order to evaluate the significance of correspondence between QTL for different traits, we calculated the probability of obtaining the observed number of matching QTL by chance alone (Lin et al. 1995). This evaluation was determined by the given total length of genetic map and average interval length of all QTL, and the probability of overlap was estimated for different QTL intervals. To correct for multiple testing, we applied the Benjamini and Hochberg false discovery rate correction (Benjamini and Hochberg 1995) to determine which individual tests were significant at the overall P = 0.05 level.

QTL sign test

We performed a QTLST (Orr 1998) to test for directional natural selection on both the individual trait and PCA QTL. This test examines the deviation from the null hypothesis that traits diverging via neutral processes (i.e. genetic drift) will accumulate alleles of both positive and negative effects, whereas traits evolving under directional selection will accumulate alleles with consistent effects. We preferred to implement QTLST rather than the more commonly used QTLEE (QTL equal effects) method because QTLST is less susceptible to ascertainment bias than QTLEE (Anderson and Slatkin 2003; Muir et al. 2014). For comparison, we also reported QTLEE test results, since many studies use this method. The log-normal distribution fitted the observed distribution of effect sizes in our study (data not shown), indicating that QTLST method was suitable to our QTL data (Orr 1998; Muir et al. 2014). We numerically integrated probability densities using the distr package (Ruckdeschel et al. 2006) and integrate function in R version 3.3.1 (R Development Core Team 2012), according to the method described by Muir et al. (2014). The code is available at Dryad.

Results

Trait variation and correlations

All 12 examined traits differed significantly between the two species (Table S1). PCA extracted four major PCs that accounted for a total of 89.4% of the variation among F2 individuals (Table 1). All measured traits and the four PCs exhibited continuous variation with unimodal distributions (Fig. 1 and Fig. S3). The first PC (PC1) explained 48.4% of the total trait variation with substantial positive loadings for all 12 variables, though loadings for flower morphological traits were somewhat stronger than for leaf traits (Table 1). We interpret this PC to represent variation in overall size. Because the other PCs are orthogonal to PC1, they can be interpreted as explaining variance and covariance in subsets of traits independent of variation in overall size.

Table 1 Correlations between the first four principal component (PC) scores and each trait
Fig. 1
figure 1

Histograms of principal component scores for the four PCs in the F2 population. Mean values of Primulina depressa, P. danxiaensis, and F1 hybrids are indicated with blue, red, and orange arrows, respectively

PC2 explained 20.5% of total variation with primary loadings (0.78 to 0.83) for leaf physiological traits being positive, plus minor negative loadings (−0.10 to −0.40) for flower size (Table 1). We interpret this PC as representing primarily variation in leaf physiological traits. PC3 explained 13.3% of total variation with substantial positive loadings (0.83 to 0.86) for leaf size, and thus represents leaf-size variation: large PC scores correspond to individuals that have large leaves relative to overall plant and flower size. Finally, PC4 explained 7.2% of total variation and displayed substantial loadings (0.08 to 0.54) for floral morphology, with petal width being positively and corolla length, pistil length, and stamen length being negatively correlated with PC score (Table 1). We consider this PC to represent primarily variation in floral morphology, indicating that individuals with high PC scores have shorter but wider flowers relative to overall plant size.

Linkage mapping

After quality filtering, a total of 14,313,717 clean reads were obtained for the male parent (DXS02-17), 10,400,725 for the female parent (DXS04-5), and 21,566,255 for the F1 hybrid. The number of reads for the 201 F2 progeny ranged from 3,053,584 to 9,145,815 with an average of 4,680,666 (Supplementary Table S2). STACKS reconstructed 41,425 loci. Six individuals with genotypes for less than two thirds of loci were removed. We chose markers identified in at least 178 F2 individuals (90% of 195 mappable individuals) and removed markers displaying significant segregation distortion at P < 0.005 with JoinMap v.4.0. This led to a final 2534 mappable markers. Initial linkage mapping revealed that the inclusion of distorted markers (at P < 0.01) did not affect the linkage arrangement compared with the map without distorted markers, and they were randomly distributed along the LGs with non-skewed markers. Therefore, these 831 segregation-distorted markers were included in the final linkage map, which led to a total of 2484 markers mapped on 195 F2 individuals. The final linkage map was assembled into 18 different LGs (Table S3 and Fig. 2), corresponding to the haploid number of chromosomes in P. depressa and P. danxiaensis (n = 18, Kang et al. 2014).

Fig. 2
figure 2

Genome location of significant floral QTL. Confidence intervals of QTL are illustrated on the right side of the linkage groups. Arrows indicate co-localized PCA QTL

The length of LGs ranged from 88.943 to 209.951 cM, for a total map length of 2601.510 cM and an average distance between adjacent markers of 0.955 cM (Supplementary Table S4). The number of markers on LGs ranged from 24 to 277 with an average of 138 markers. The minimum distance between adjacent markers was 0.017 cM (LG08), and the maximum distance was 13.909 cM (LG15; Supplementary Table S3).

Analyses of individual trait QTL

We identified a total of 114 QTL for the 12 individual phenotypic traits examined in this study. The number of QTL detected per trait ranged from 7 for leaf length to 11 for upper petal width, corolla height, and concentration of carotenoids (Table S5). PVE for the identified QTL ranged from 3.4% to 13.0%, with the majority (89.5%) of the QTL having small effects (PVE < 10%; Table S5). These QTL accounted for 42.5–76.2% of the trait variation.

A total of 67 significant QTL was identified for flower morphological traits. The proportion of the parental difference explained by all QTL for these traits ranged from −0.18 to 0.84, with a mean of 0.35 (Table S5). Generally, individual QTL for floral traits had small effects: none explained >0.15 of the parental difference, indicating that species differences in floral traits are highly polygenic.

In the 47 QTL for leaf traits, 30 were responsible for leaf physiological traits and 17 for leaf morphology. The proportion of the parental difference for leaf physiological and morphological traits explained by all QTL were 0.59 (range 0.49 to 0.74) and 0.005 (range −0.12 to 0.13), respectively (Table S5). Differences in leaf physiological traits were due primarily to QTL with moderate-to-major effect: 70% of the identified QTL explained >15% of the parental difference, while 53% explained >30% (Table S5). QTL effect sizes for leaf size traits were intermediate between those for floral and leaf physiological traits: 41% explained >15% of the parental difference, while 18% explained >30%.

Of the 114 identified QTL, 37 (32%) had overlapping 1.5-LOD confidence intervals with one to two other QTL from the same trait group while only nine QTL (8%) overlapped with one other QTL from a different trait group (Table S5 and Figs. 2 and 3). Of the 21 trait pairs that exhibited QTL overlap, 11 pairs showed significantly more overlap than expected by chance (8 remain significant at the overall P< 0.05 level after correction for false discovery rate, Table 2). Most of these (9 out of 11, including the 8 that remain significant after false discovery rate correction) trait pairs were within groups, suggesting strong genetic integration due to pleiotropy within trait modules. In contrast, most trait pairs between categories (45 out of 53) showed no QTL overlap, and only 2 exhibited significantly more overlap than expected by chance. Neither of these remained significant after correction for false discovery rate.

Fig. 3
figure 3

Genome location of significant leaf QTL. Confidence intervals of QTL are illustrated on the right side of the linkage groups. Arrows indicate co-localized PCA QTL

Table 2 Test for significance of QTL overlap between traits

Analysis of PCA QTL

We identified a total of 27 PCA QTL (Fig. 4, Table S5). Of these, eight corresponded PC1, six to PC2 (leaf physiology), six to PC3 (leaf size), and seven to PC4 (floral morphology). Nearly half (12) of the PCA QTL overlapped with individual trait QTL (Figs. 2, 3, and 4 and Table S5). Of the PC1 (overall size variation) QTL, three co-localized only with floral QTL (PC1_01, _06 and _07), while two co-localized with both floral traits and leaf physiology traits (PC1_02 and _04). Of the PC2 (leaf physiology) QTL, two co-localized with only leaf physiology QTL (PC2_01 and _02) and one with leaf physiology QTL and the stamen length QTL (PC2_06). For PC3 (leaf size) QTL, one overlapped with a leaf morphology QTL (PC3_01), while the other overlapped with floral morphology traits (PC3_03). Finally, two PC4 (floral morphology) QTL overlapped with floral QTL (PC4_03 and _06). Of the seven QTL that can be assigned to a specific module (e.g. PC2, PC3, PC4), six overlapped with trait QTL from the same module, while two overlapped with traits from a different module. Although the number of these PCA QTL is small, this pattern is similar to that seen in the individual trait QTL, in which substantially more co-localization occurs within modules than between modules.

Fig. 4
figure 4

Genome location of significant QTL for the first four PCs of phenotypic variation. Confidence intervals of QTL are illustrated on the right side of the linkage groups. Black arrows indicate regions that individual trait QTL co-localized

Such a type of comparison is not possible for PC1, the overall size component, because it does not represent a single module. However, five PC1 QTL co-localize with individual trait QTL representing floral and leaf physiology traits, which suggests that these individual trait QTL reflect overall size variation rather than module-specific size variation. In addition, 15 PCA QTL did not overlap with individual trait QTL, and 97 individual trait QTL did not overlap with any PCA QTL (Table S5). These results suggest that two QTL approaches are complementary to each other.

In contrast with individual trait QTL, none of the PCA QTL overlap with each other (Fig. 4). This is perhaps not surprising given that the PCA components are orthogonal to each other, which would tend to prevent overlap among QTL from different PCs.

The effect sizes and total explained variation of the PCA QTL mirror those of individual trait QTL (Table S5). As for individual floral trait QTL, those associated with PC4 (floral morphology component) all explain <10% of phenotypic variance in the trait, though 43% explain >30% of the parental difference, suggesting that there are some QTL with moderate-to-large effect on these traits. In addition, all the PCA QTL together completely explain the parental difference for PC4. No QTL for PC2 (leaf physiology component) explain >12% of variation, but many explain substantially larger portions of the parental difference (17% <0.15, 50% >0.30), paralleling the genetic architecture of individual leaf physiological traits. In addition, together all these QTL completely explain the parental difference. Finally, the PC associated with leaf size (PC3) appears intermediate, with all QTL explaining <10% of the phenotypic variance, but 83% explaining >0.15 of the parental difference and 33% explaining >0.3. As with the individual leaf size traits, these PCA QTL together explained little of the parental difference in PC scores, presumably because alleles increasing the PC score are just about as common as alleles decreasing it.

All eight QTL associated with PC1 (overall size variation) individually were small, explaining <10% of the parental difference. In addition, in sum they explained little of the parental difference because they differed in sign: five were individually negative, while three were individually positive. We note that, because general size does not represent one of the modules, we cannot compare these results with QTL from the modules. Nevertheless, this analysis indicates that the architecture of QTL affecting general size corresponds to many small-effect QTL. Given that floral traits load most strongly on PC1, this pattern may reflect our finding that individual floral QTL generally have small effect sizes.

QTL sign test

We found that most QTL (56 out of 67) of the individual flower traits were in the direction of divergence between the parent species (Supplementary Table S5), suggesting that directional selection for flower size might have played a role in trait divergence. This hypothesis was supported by Orr’s QTLST implemented with QTLST and QTLEE, both of which showed that five flower traits (upper petal width, corolla width, corolla length, pistil length, stamen length) were likely to have diverged owing to natural selection (Table 3). In contrast, a substantial proportion of antagonistic QTL were detected for leaf traits, with average number of antagonistic QTL of 3.5 per trait (Supplementary Table S5). In addition, Orr’s QTLST revealed no significant directional selection for these traits (Table 3).

Table 3 QTLST and QTLEE results calculated from the number of QTL (n), the number of QTL that altered a trait in the direction of Primulina depressa (i), the phenotypic difference between species (R), and the average absolute additive effect size of QTL (\(\left| {\bar a} \right|\))

We did not find significant directional selection for PCA QTL by either QTLST or QTLEE (Table 3). Results for PC2 (leaf physiology) and PC3 (leaf size) were consistent in this manner with tests performed on individual traits of the corresponding modules. However, in contrast with the individual trait analysis, for PC1 (overall size) and PC4 (floral morphology), QTL were inconsistent in sign and did not exhibit a signal of directional selection, even though they were highly or moderately correlated with the flower traits subjected to natural selection. We can offer no reason for this inconsistency, and thus this result should be interpreted with caution.

Discussion

Trait modularity and divergence

Our PCA of F2 phenotypes suggested that traits can be grouped into three modules, within which correlations between traits are high but between which correlations between traits are low. This pattern suggests that, during divergent evolution, these three modules evolve more or less independently, while traits within modules may evolve in a constrained, coordinated fashion. These inferences were largely confirmed by our QTL analyses. Analyses of both individual trait QTL and PCA QTL indicate that QTL overlap is substantially higher within modules than between modules.

Our tests for selection indicated only floral traits exhibited signatures of selection. Because of the low QTL overlap between floral traits and leaf traits, it is likely that selection on floral traits caused at most a small correlated response in leaf traits. The observed divergence in leaf traits was thus likely due either to independent selection on those traits or to genetic drift. Current data does not permit distinguishing between these possibilities. However, the apparent genetic independence of the three modules would suggest that, if divergent selection acted on either leaf physiology or leaf size, it would have done so largely independently on these two modules.

Genetic architecture of flower and leaf trait divergence

Our study suggests a highly polygenic genetic architecture involved in flower trait divergence between P. depressa and P. danxiaensis. QTL for individual traits generally explained small-to-moderate proportions (<13%) of the phenotypic variance and a small proportion of the difference between parent means (<0.15). A similar pattern was revealed by QTL for PC4 (floral morphology), although some of these had moderate-to-large effect (>0.3 of the parental difference explained). By contrast, for leaf physiological traits, individual QTL again explained small proportions of the phenotypic variance. However, in terms of proportion of parental difference explained, there were many individual QTL with moderate-to-large effect. QTL for PC2 (leaf physiology) exhibited a similar pattern. It would thus seem that species differences in these traits may be due primarily to QTL with large effect. Finally, the architecture of leaf size QTL fall somewhat between those of floral and leaf physiological traits: a mixture of QTL with small, intermediate, and large effects.

Our PCA allowed us to identify subsets of traits that corresponded to modules (PC2—leaf physiology, PC3—leaf size, and PC4—flower morphology) and one axis that represented overall size (PC1). We detected eight QTL associated with PC1 that were generally of small effect size and had inconsistent directions. This suggests that divergence in overall size may have been largely due to genetic drift.

The effect sizes estimated and the number of QTL detected are likely to be influenced by a number of methodological factors, such as sample size and marker density (Beavis 1998). Despite the modest size of the mapping population (N = 195), identified QTL generally fell into narrow confidence intervals (Supplementary Table S5). Nevertheless, owing to modest sample sizes, the number of QTL resolved in this study is likely to be an underestimate of the actual number of genetic factors affecting trait divergence (Beavis 1998; Doerge 2002), thus resulting in an overestimation of the effect size of individual QTL. In addition, the identified QTL for each trait only accounted for ca. 43–76% of variance (Supplementary Table S5), suggesting that a large number of QTL could have remained undetected. Despite these caveats, the possibility that trait divergence between the two species was affected by a few QTL of large effects can be ruled out definitively for floral traits. For leaf traits, QTL of large effect appear to be involved in addition to numerous QTL of small effect.

The highly polygenic genetic architecture of flower morphology in Primulina is comparable to that seen in many other plants. Similar results have been described for floral morphology in Mimulus (Fishman et al. 2002, 2015), Leptosiphon (Goodwillie et al. 2006), and Penstemon (Wessinger et al. 2014). However, our finding that many QTL of small effect underlie the flower trait differences between two related species that occur in sympatry contrasts with that observed by Ferris et al. (2017), who found that a few QTL of large effect underlie interspecific divergence in flower traits between two sympatric species of the Mimulus guttatus complex. They argue that their result reflects divergence in sympatry in the face of substantial gene flow, a situation in which divergence is expected theoretically to involve few loci with large effects, while mutations with small effect would be homogenized by gene flow (Yeaman and Whitlock 2011). In our case, gene flow between P. depressa and P. danxiaensis is likely minimal, because no morphologically intermediate individuals can be found in the field. This suggests that, given strong reproductive isolation, species can diverge in sympatry without large-effect mutations.

Aspects of leaf morphology such as leaf size are often reported to be complex traits. Numerous studies have identified multiple QTL for leaf morphological traits for crop plants (Jiang et al. 2000; Frary et al. 2004; Welter et al. 2007; Ku et al. 2012); however, QTL underlying leaf morphological traits in natural populations have been less studied. In accordance with our results, a recent study in Mimulus found multiple small-to-moderate effect QTL for leaf morphology (Ferris et al. 2015). Leaf pigments play important roles in response to biotic and abiotic stresses, such as drought, wounding, cold temperatures, and pathogen infection (Mur et al. 2010; Guanter et al. 2014). Although the ecological significance of leaf pigments has been widely studied in diverse crops, considerably less is known about their genetic architecture in natural systems. Our results indicate that difference in pigment contents between P. danxiaensis and P. depressa were dominated by substantial QTL with large effect, consistent with the observation in crops (Takai et al. 2010; Yin et al. 2015).

Trait vs. PCA QTL

Our study shows that standard QTL analysis on individual traits and analysis on PC scores provide complementary information. Each reveals QTL that the other does not, although some PCA QTL co-localize with some trait QTL. If one is interested in maximizing the number of QTL discovered, these results suggest that both types of QTL analysis should be conducted. In our case, we identified more trait QTL that did not co-localize with PCA QTL than PCA QTL that did not localize with trait QTL, but it seems likely that this pattern may be system specific and that in other species the opposite might be true. In any case, the patterns of variation revealed by the two approaches are very similar. In particular, both approaches indicate that QTL overlap is more common within modules than between modules. And both yield similar estimates of effect sizes and proportion of parental difference explained for the different modules.

Selection, drift, and divergence

QTLSTs showed that five flower traits (upper petal width, corolla width, corolla length, pistil length, and stamen length) were likely to have diverged owing to natural selection, suggesting that divergence in floral morphology was at least in part driven by selection. However, this test cannot distinguish between directional selection on individual traits and indirect selection due to genetic correlations with directly selected traits. The relatively high QTL overlap among these traits makes such correlated responses possible but does not rule out the possibility that selection acted on all five significant traits. However, for each floral trait, there is a substantial fraction of QTL that do not overlap with QTL of any other trait. If these diverged owing to selection, it follows that selection should have acted directly on those loci. Overall divergence in flower size thus may have involved a combination of response to direct selection and correlated responses to selection on other traits.

It is perhaps not surprising that our results indicate that floral morphology diverged owing to selection. Floral morphology influences mating system evolution and reproductive success, and different pollinators or different mating systems are likely to favor different morphologies. The evolution of selfing from outcrossing ancestors is often associated with changes that reduce flower size (Sicard and Lenhard 2011). In small isolated habitats where pollinators are rare, small-flowered genotypes capable of selfing can have a fitness advantage over larger outcrossing genotypes. Alternatively, divergent selection driven by adaptation to different pollinators may explain floral divergence. We observed that P. depressa is mainly visited by bees (Amegilla spp., Bombus spp., and Nomia spp.), while P. danxiaensis was visited by both bees (Nomia spp.) and hoverflies (Syrphidae) (L Yang, personal observations). It would therefore be of interest for future studies to assess selfing rates in the two sympatric Primulina species and to test the effectiveness of flower visitors as pollinators.

In contrast with floral trait QTL, we found no evidence for selection on QTL associated with PC4, which reflects floral trait variation. This result suggests that some of the QTL causing floral differences between the species have diverged owing to genetic drift rather than selection. Similarly, the prevalence of antagonistic effects in QTL for leaf traits, together with the nonsignificant QTLST for directional selection, suggest that these traits may really have diverged under weak selection or neutrality.

Conclusions

In this study, we have conducted QTL mapping of flower and leaf traits that differ between a pair of sympatric sister species of P. depressa and P. danxiaensis. We identified multiple QTL for each trait examined, suggesting a polygenic basis for trait divergence in this system. While individual QTL for floral traits have small effects, pigments and leaf size QTL tend to have large effect. The observed floral integration, manifested by a high degree overlap in floral trait QTL, may have been critical for evolutionary divergence of floral morphology in relation to their pollinators. This overlap suggests that direct selection on only one or a few of the component traits could have caused substantial divergence in other floral traits due to genetic correlations, while the lack of QTL overlap between floral and vegetative traits suggests that these trait suites are genetically unlinked and can evolve independently in response to different selective pressures corresponding to their distinct functions.

Data archiving

Raw Illumina sequences of RAD sequences have been deposited in NCBI sequence read archive (SRP148984). Genotypes and phenotype data for QTL analysis are available at the DRYAD repository: https://doi.org/10.5061/dryad.49t7030.