Introduction

Knowledge of the genetic architecture of complex traits can offer insight into several facets of evolution and adaptation, including adaptive differentiation, signatures of selection, speciation genetics, epistasis, mating system evolution and sexual selection (Erickson et al., 2004). In both natural and domesticated populations, this is classically obtained by inferring quantitative trait loci (QTLs) in segregating progenies of a cross between two differentiated populations (Slate, 2005). Although the biases and inaccuracies of QTL mapping are well known, QTL analysis can provide fundamental information about the size, location and effects of individual QTL that differ between the two parents of the cross (Broman, 2001; Price, 2006).

QTL mapping techniques have been used to dissect the genetic architecture of complex traits in many applications, such as for human diseases (Cardon and Bell, 2001), adaptation in natural populations (Slate, 2005) and breeding of animals (Mott et al., 2000). In plants, a classic example of QTL mapping for adaptive traits has involved the comparison between bumblebee pollinated M. lewsii and hummingbird pollinated M. cardinalis (Bradshaw et al., 1995; Schemske and Bradshaw, 1999). However, all these studies are based upon pairwise comparisons, which allow estimates of those differences only along a single lineage.

In this paper, we propose a phylogenetic approach for QTL mapping, in which the genetic effect of QTL along phylogenetic lineages is inferred, using several pairwise crosses among the taxa examined. At the simplest, by bringing in a third taxon, one can infer the QTL changes that have occurred along the two lineages that lead to the two most closely related taxa. The third lineage traces from the common ancestor of these two taxa, back to the common ancestor of all three taxa and forwards again to the third taxon. After mapping QTL onto lineages, we can determine if changes of QTL effects at the same apparent map position are orthologous (arising singly in an ancestral lineage leading to derived taxa) or paralogous (arising twice independently in derived lineages, due to different mutations). Also, akin to Orr’s test (Orr, 1998), the distribution of QTL in a species network can test for drift-mutation balance versus directional selection for quantitative traits.

To illustrate our approach, we use three species from the yellow monkey-flower species complex (Vickery, 1978). In the yellow monkey-flower (M. guttatus) species complex, several daughter species with varying degrees of inbreeding and narrow distributions occur, derived from the outcrossing and widespread M. guttatus (Vickery, 1978). We expect QTL changes in the two inbreeding lineages should mostly be promoting inbreeding, and that in the outbreeding lineage should have fewer or no QTL changes, as outbreeding is ancestral. In exploring this method, we find several challenges that should be addressed in future studies.

Materials and methods

Model species and traits

M. guttatus is extensively distributed in western North America, with a relatively low inbreeding coefficient of 0.38 (Ritland and Ritland, 1989). M. platycalyx occurs in the coast ranges north of San Francisco (Dole, 1992) and is a moderately inbred species with an inbreeding coefficient of 0.54 (Ritland and Ritland, 1989). M. micranthus is endemic to the Coast Range foothills of California, and is a highly selfing species with an inbreeding coefficient of 0.73 (Ritland and Ritland, 1989). The shape and size of the flowers of these species is illustrated by Figure 1. The evolution of inbreeding is accompanied by changes of an entire syndrome of floral traits, including male allocation, reduced size of floral characters, reduced attraction to pollinators, and reduction of the spatial and temporal separation of male and female reproductive organs within the flower (Jain, 1976; Ritland and Ritland, 1989). Figure 2 gives the five metric floral characters we measured as representative of inbreeding for Mimulus.

Figure 1
figure 1

The three intercrossable Mimulus taxa used in this study. M. guttatus is predominately outcrossing, whereas M. platycalyx and M. micranthus are inbreeders. Although M. platycalyx has a relatively large flower, it is highly autofertile.

Figure 2
figure 2

The four quantitative traits measured in this study; the fifth, stigma-anther separation, is the difference between pistil length and stamen length.

Lineage-specific QTL mapping for three taxa: models

QTL mapping has traditionally involved a single cross between differentiated taxa. In a two taxa QTL map, an inferred QTL has a lineage-specific effect ‘U1’ relative to the common ancestor, and another lineage-specific effect ‘U2’ relative to the common ancestor. The lineage-specific values are U1 and U2; only the difference U1U2 can be estimated.

Between three taxa, there are three QTL effects. With regard to the three Mimulus species, these are denoted UG for the M. guttatus lineage, UP for the M. platycalyx lineage and UM for the M. micranthus lineage. As in the single-cross data, the three possible pairwise crosses yield estimates of (UGUP), (UGUM) and (UPUM). The solution for the individual branch effects UG, UP and UM involves the insolvable relation , as the matrix is singular. Individual branch effects are not estimable without some kind of constraint. If we introduce the constraint UG+UP+UM=0, for example, the sum of the QTL effects across lineages is zero, then there are effectively two unknowns instead of three. This introduced constraint makes the lineage-specific estimates relative, in that changes are relative to an arbitrary mean of zero. The resulting relation can now be solved for lineage-specific effects UG, UP and UM, as the matrix is non-singular (determinant=3).

An alternative approach is likelihood, as commonly used for phylogenies of DNA sequences. As in the three-sequence model, the likelihood would be L=Pr(U0)Pr(U1|U0) Pr(U2|U0) Pr(U3|U0), where Pr(U0) is the probability of ancestor state U0 and Pr(Ui|U0) are the probabilities of mean Ui given ancestral state U0, for i=1, 2, 3 (Ritland and Clegg, 1987). For DNA sequences, there are base substitution models that provide explicit Pr() functions; for quantitative traits, Pr() would involve a Gaussian function. However, the major problem is that informative DNA phylogenies require data across many sites, while a QTL is equivalent to one nucleotide site (for example, one observation with one degree of freedom). Informative likelihood lineage-specific QTL mapping would thus require assuming QTL sites all evolve identically, preventing inferences about locus-specific patterns of evolution.

Lineage-specific QTL mapping for three taxa: parameter space

For this three taxon comparison, Figure 3 depicts the ‘space’ of QTL effects. If we constrain the joint values of UG, UP and UM to sum to zero, we can plot it in a manner similar to a ternary plot (de Finetti diagram), except that each axis ranges from −1 to +1 (QTL effects conceivably might lie outside this interval, but in our study we normalized traits by dividing by the variance, and indeed our results found no QTL of effect larger than |0.3|). From a given point within the triangle, the intersection of a line perpendicular to an axis gives the QTL effect observed for that axis. Figure 3a shows how a given triplet of estimates plots in this space; in this example, the M. guttatus lineage has a QTL of +0.23, the M. platycalyx lineage has a QTL of +0.08 and M. micranthus has a QTL of −0.31.

Figure 3
figure 3

The space of lineage-specific QTL effects in a three taxa comparison. (a) Lines emanating from a given point, representing a triplet of estimates, perpendicularly intercept axes at their respective lineage-specific QTL estimates (see text). (b) There are six fundamental divisions of this space, and for each, the most parsimonious phylogenies of QTL evolution is shown (hashes denote changes of QTL; −denotes a negative QTL effect, +denotes a positive QTL effect; m=M. micranthus, g=M. guttatus, p=M. platycalyx).

Figure 3b shows six fundamental divisions of this space, and for each, the most parsimonious phylogenies of QTL evolution, when the ancestral QTL effect is positive. Ancestry is most likely positive as a large flowered outbreeding species is most likely ancestral (Vickery, 1978). These partitions are indicated by (a), (b), (c), (d), (e) and (f), corresponding joint QTL effects of ++−, +−−, +−+, −−+, −++ and −+− for the M. micranthus, M. guttatus and M. platycalyx lineages, respectively. Within each of the six partitions is a three-taxon phylogeny. The slashes indicate the branch that most parsimoniously explains the lineage-specific QTL change observed among the current taxa.

Homology is the common ancestry of a trait or gene, as opposed to functional similarity. It is mainly applied to gene sequences and gene products (Fitch, 2000; Petsko, 2001). Homology of QTL has been classically defined as QTL that map to the same position in different populations or species (for example, see Freeman et al., 2009). In lineage-specific QTL mapping, we can subdivided homology into ‘orthology of QTL’, as QTLs that share common ancestry (share the same mutational event of an ancestor), and ‘paralogy of QTL’, as QTLs that have different mutations in the same region of the genetic map, but which arise independently. Note that paralogous QTL are not true homologues but ‘apparent’ homologues in the classical QTL definition as mapping to the same position. In Figure 3b, for positive ancestral QTL, cases (b) and (f) illustrate paralogous QTL, whereas the other four cases illustrate orthologous QTL. We expect orthologous QTL to be more common than paralogous QTL, as two changes are needed for paralogy but just one for orthology. If the ancestral state is negative (for example, inbreeding state), the space of paralogy is rotated 180°, with cases (c) and (e) indicating paralogy.

Lineage-specific QTL mapping for three taxa: estimation

To statistically identify QTL from genetic data in segregating crosses, we initially assume that species are fixed for QTL, but that QTL effects differ between species (this is a common assumption for species comparisons). Hence, the F1 can be heterozygous for a QTL, whereas the parent species is homozygous. Under this assumption, the expected means of progeny, conditioned upon the marker genotype, are given in Table 1 where the F1 derived from taxon i and j is backcrossed to taxon i. In this table, there are two crossing configurations, depending upon whether markers are in a backcross configuration (Aa × aa) or an intercross configuration (Aa × A a); this includes coupling versus repulsion linkage phases. The third column gives the expected means of progeny genotypes, conditioned upon parent genotypes and linkage phase.

Table 1 Expected mean phenotype for a quantitative trait, conditioned upon progeny genotype and crossing configuration of parents

The joint likelihood of quantitative traits and markers is

where Qck is the observed quantitative trait value (normalized to zero mean and unit variance), and is the expected mean (given in rightmost column of Table 1) of a progeny of cross c involving an F1 between taxa i and j, backcrossed to taxa i, with parent cross type t(c) at the marker locus (1=backcross, 2=intercross), linkage phase p(c) (1=coupling, 2=repulsion) and progeny genotype g(c,k) (1=dominant, 2=recessive). This assumes a normal distribution of quantitative traits. The most likely phase is chosen (‘max’). If a given F1 is a parent of more than one family, then the expression is more complicated; the most likely phase is chosen over all involved crosses. In principle, just six crosses, three to generate F1’s between all three pairs of taxa, and three backcrosses to just one of the two parents, provide enough information to jointly estimate uG, uP and uM.. (the u’s in Table 1, across the tree lineages). Below we used a more intensive crossing design that involved all six possible backcrosses.

Mimulus crosses and genotyping

All three inter-taxon crosses were performed (M. guttatus × M. platycalyx, M. guttatus × M. micranthus and M. platycalyx × M. micranthus). The F1 progeny were then backcrossed to each parent species (six reciprocal backcrosses). For the purpose of data analysis, this design can be written as a loop of six crosses: PP × PG × GG × GM × MM × MP × PP (the last ‘PP’ is the same as the first), where PP, GG and MM are the parent species, and PG, GM and MP are the F1s. Note that to map a QTL, at least two of the three F1 (PG, GM, MP) must be heterozygous for a marker (this is where more polymorphic markers, such as microsatellites, provide more power for lineage-specific mapping). One individual plant was used for each species, to ensure that there is just one possible parental genotypic configuration in these crosses. These species are quite distinct such that one individual captures most QTL differences between species.

All plants, including parental, F1 and backcrosses, were grown in Pro-Mix soil in growth chambers at 14 °C/8 °C day/night and 18 h of daylight. To avoid effects caused by the heterogeneous environment among growth chambers, trays of seedlings were periodically rotated among growth chambers. To avoid pollen contamination, flowers were bagged immediately after crossing. Five floral traits were measured (Figure 3): corolla width, corolla length, pistil length, stamen length (there are two sets of anthers that differ in length, the average length was measured) and stigma-anther separation (the difference between the previous two traits). In a combined-cross analysis, crosses with greater variability in the phenotype will have a greater influence (Li et al., 2005). Hence, before QTL analysis, we standardized traits by division by standard deviations across all crosses; this was done regardless of cross as such variance stabilizing within crosses can introduce bias into our lineage-specific analysis.

Genomic DNA was isolated from leaf tissue with the CTAB method (Doyle and Doyle, 1990). Assays for amplified fragment length polymorphisms (AFLP) were performed following Vos et al. (1995) and Remington et al. (1999) with modifications for the LiCor 4200 DNA sequencer (LI-COR, Inc., Lincoln, NE, USA). Five hundred nanogram of DNA was digested with EcoRI and MseI. Preamplification was carried out using standard AFLP EcoRI and MseI primers containing the selective nucleotides Eco+C and Mse+CC. Selective final amplifications were conducted by eight combinations of Eco primers with three nucleotides and Mse primers with three nucleotides (Table 2). After running the LiCor gels, RFLPscan Version 3.0 (Scanalytics Inc., Fairfax, VA, USA) scored segregating loci. Markers showing significant segregation distortion (P<0.05) were excluded. A total of 368 polymorphic loci were obtained, and 675 individuals from the six backcrosses were genotyped (PP × PG=112, PG × GG=135, GG × GM=124, GM × MM=121, MM × MP=135 and MP × PP=68).

Table 2 AFLP primers and numbers of polymorphic fragments (N)

Inference of parent genotypes and linkage

As the dominance of AFLP markers prevents detection of heterozygous parents, we inferred parent genotypes using the progeny of all six backcrosses, as follows. Table 3 lists the segregation probabilities for bandless versus banded progeny, conditioned on parent genotypes. Let these probabilities be Pu(i,j) for the bandless phenotype and Pb(i,j) for the banded phenotype. Across all six parents, there 36=729 possible configurations of parent genotypes. For any particular genotypic configuration, the likelihood of the observed data across the six crosses is

Table 3 Probabilities of bandless (first number) versus banded progeny (second number), conditioned on the genotypes of the two parents of a cross (bandless is the recessive condition)

where Nu,k is the number of bandless progeny and Nb,k is the number of banded progeny in cross k; and k1kand k2k are the putative male and female parent genotypes of cross k. A computer program was written that enumerated all 36=729 possible parental genotype configurations (values of k1kand k2k) and chose the most likely configuration of the six parents for each AFLP locus. The likelihood of the second most likely parental genotypes for each locus is also given in the program, which incorporates uncertainty of inferred parental genotypes. To account for genotyping error, at each cross, the less frequent phenotype was truncated to zero and the likelihood of the data evaluated against the original data. If the increase in likelihood was greater than expected by a 5% genotyping error, the less frequent phenotype was truncated to zero. Parentage inference was quite reliable via Equation (2), as relative to the most likely set of six parent genotypes, we found the next most likely set of six-parent genotypes were 1000 times less likely 90% of the time, and 10 times less likely 97% of the time.

Using the most likely parental genotypes, Equation (1) was applied to each of the 368 markers. To avoid problems with numerical estimation, we simply evaluated the likelihood surface across all possible values of UG, UP and UM, each ranging from −1 to +1 in increments of 0.02. The joint estimate was chosen as that combination of the three U’s that gave the highest likelihood. Variance explained was calculated as where and . This formula assumes that the magnitude of QTL effect between any two species is the sum of the lineage-specific QTLs, for example, QTLs evolve additively. It also assumes that QTL effects are fixed between taxa, and none are segregating within taxa.

Statistical significance of QTL was ascertained in two ways. First, we permuted quantitative traits and markers 1000 times (traits randomized among genotypes). The likelihood of these permutated data were compared with the original un-permuted data; if 50 or less of the permuted data were more likely than the original data, the estimates are deemed significant. In the second way, we use the bootstrap to estimate standard errors of individual branches, wherein progeny were re-sampled with replacement within crosses 1000 times.

Also, we estimated recombination rates using a joint likelihood function (Hu et al., 2004), which combines information across the six crosses. For those pairs of markers, which were linked closer than 10 map units, one marker was omitted. This and the above calculations were implemented in a FORTRAN computer program written for this data (code available upon request from KR).

Results

Among the eight primer pairs (Table 2), 368 polymorphic AFLP loci were genotyped and scored (markers with distorted segregation ratios were excluded). Of these, 190 were informative about lineage-specific QTL (where all three pairwise crosses were heterozygous F1s). Nine markers were excluded from further analysis due to close linkage to another marker. The estimated genetic distances between M. guttatus and M. micranthus was 0.08 (s.e.=0.01), between M. guttatus and M. platycalyx was 0.19 (s.e.=0.02) and between M. platycalyx and M. micranthus was 0.20 (s.e.=0.02). A dendrogram based upon these distances is shown in Figure 4, with the ‘A’ indicating the approximate location of the common ancestor of these three taxa (roughly the point within the tree that is equidistant to all three taxa). This dendrogram shows that M. micranthus is the more recently derived inbreeder. Although M. platycalyx has the largest branch length, it is likely that M. guttatus was the common ancestor. A band-sharing index (Nei and Li, 1979) gave the same result as this dendrogram.

Figure 4
figure 4

Dendrogram of genetic distances among M. guttatus, M. platycalyx and M. micranthus based upon 368 AFLP loci (standard errors in parenthesis); the true location of the most common in on the branch leading out to M. platycalyx.

Lineage-specific QTL effects

Table 4 lists the markers that gave significant lineage-specific QTL effects for at least one of three lineages. Among the 190 informative markers, 40 QTLs were detected across the five traits, for an average of eight QTLs per trait. The same markers often had QTLs for several of these five traits, indicating high pleiotropy. For example, for corolla width and length in M. guttatus, QTL b2_109 is pleiotropic, generally controlling floral size. Asterisks denote estimates that are either above zero, or below zero, in 95% or more of the bootstraps. Note that estimates deemed significant by this measure may not have permutation probabilities of less than 5%. This is due to the complex structure of data, wherein one lineage may not have significant QTL but another lineage will. In addition, the percentage variance explained by markers was very low, on the order of 1%, despite the estimates showing statistical significance. This is extremely low for classical pairwise QTL experiments.

Table 4 Estimates of lineage-specific QTL genetic effects (standard errors in parenthesis)

QTL of positive effect increases the size of the trait (promoting outcrossing), whereas negative QTL effect decreases trait size (promoting selfing). We expect the M. guttatus lineage to show positive QTL effects, and the M. platycalyx and M. micranthus lineages to show negative effects. By and large this was true; for corolla width 9 of 12 estimates were positive for the M. guttatus lineage, but only one significantly so. The M. platycalyx also had largely positive corolla width/length QTL effect, except some were significantly negative. M. micranthus had the most negative and most significantly negative QTL effects for corolla width and length, in accord with its smallest flower size. Stamen length showed the least number of QTL (n=4) followed by stigma-anther separation (n=6). M. platycalyx gave the greatest number of significant QTLs (n=21), followed by M. micranthus (n=12) then M. guttatus (n=5), which accords with the greater length of the M. platycalyx lineage (Figure 4).

To obtain a broader picture of the pattern of lineage-specific QTLs, Figures 5 and 6 show the ‘space’ of QTL effects. To increase the sample size at the expense of adding some false positives, we included all markers with QTL permutation probabilities of 0.10 or less (thus the numbers of markers is greater than in Table 4), with those of probability of 0.05–0.10 indicated by the smaller dots.

Figure 5
figure 5

The space of lineage-specific QTL evolution among three monkey-flower species, for the floral size measures of corella width and correla length (larger dots are QTLs with less than 5% permutation probabilities and smaller dots are QTLs with 5–10% permutation probabilities).

Figure 6
figure 6

The space of lineage-specific QTL evolution among three monkey-flower species, for the floral outcrossing measures of anther length, stamen length and stigma-anther separation (larger dots are QTLs with less than 5% permutation probabilities and smaller dots are QTLs with 5–10% permutation probabilities).

Comparing Figure 5 against the hypothetical evolutionary scenarios given in Figure 3b, corolla width showed predominately scenarios (a), (d) and (e). Scenario (a) and (e) corresponds to QTL change in an inbreeder lineage. A notable exception are four QTL for corolla width in scenario (d), in which M. platycalyx lineage has a QTL effect for larger corollas. Also these three modes are all orthologous. Corolla length showed a similar pattern with the exception that two QTL showed significant paralogy (scenario f). Figure 6 shows the scenarios for floral organs. All six quadrants were occupied by pistil length QTL with roughly equal numbers of QTL in each quadrant, except for scenario (c). However, the more significant QTL mapped mainly to (a), (d) and (e), as in the floral size traits. Stamen length also mapped to these three quadrants. Finally, stigma-anther separation exclusively occupied quadrant (a), indicating that reduced stigma-anther separation is unique to the derived M. platycalyx lineage. Overall, very few QTL mapped to the paralogous modes (b, f).

Discussion

Patterns of lineage-specific QTL effects

‘Lineage-specific QTL mapping’ goes beyond the traditional inferences about number of QTLs and magnitudes of QTL effects. With pairwise crosses between several related taxa, one can now infer phylogenetic lineages along which QTL changes occur, but with some difficulties. These are discussed below in the prospects.

Assuming outbreeding being ancestral, the M. guttatus lineage showed the fewest (6) significant QTL across the five traits, as compared with 29 for M. platycalyx lineage and 17 for the M. micranthus lineage. The evolutionary distance of these two lineages were strongly associated with QTL number: that for M. platycalyx is considerably longer than that for M. micranthus (Figure 4). The average QTL effect was also greater in the M. platycalyx lineage (0.197) than in either the M. micranthus lineage (0.157) or the M. guttatus lineage (0.173).

Although we expected that the two inbreeding lineages would have QTL effects that cause smaller flowers and less stigma-anther separation, not all QTLs involving with the evolution of inbreeding were of such effect. Although almost all significant QTLs for the M. micranthus lineage were negative (the exception being a positive QTL for stigma-anther separation), the M. platycalyx lineage had a mixture of approximate equal proportions of positive and negative QTL effects for all traits except stigma-anther separation. These accords with the flower size of M. platycalyx, which is not much smaller than M. guttatus. It also may indicate a complex evolutionary history in the lineage of M. platycalyx, where QTL alleles for smaller flowers and QTL allele for larger flowers were either selectively favored at different times, or underwent considerable genetic drift. Interestingly, all QTL effects for stigma-anther separation were negative for the M. platycalyx lineage. This accords with the observation that M. platycalyx is highly autofertile in the greenhouse, and that the major feature distinguishing the evolution of this species is the fixation of several QTL for reduced stigma-anther separation.

Mapping QTL to the ‘space’ of QTL effects (Figure 3) allows one to distinguish alternative phylogenetic patterns of QTL evolution, including distinguishing orthology from paralogy, when QTL are deemed ‘homologous’. The most parsimonious explanations of where QTL changes occur along with lineages are shown in Figure 3b, for the cases in positive ancestral QTL. When we compare this figure to the observed QTL space in Mimulus (Figures 5 and 6), we see some dramatic differences between traits for their pattern of evolution. Corolla width showed predominately scenario (a) and (e) in Figure 3b; these both correspond to QTL changes for smaller corollas appearing in an inbreeder lineages, as would be expected. A notable exception is four QTL for corolla width in scenario (d), in which M. platycalyx lineage has a QTL effect for larger corollas. Corolla length showed predominant scenario (a) in Figure 3b (smaller in the M. platycalyx lineage) but also some (e) and (d) scenarios, involving changes toward shorter corollas and paralogy in the M. guttatus lineage, respectively. However, (f) is a rare case, as it requires two mutational steps. Pistil length showed almost all six alternative modes of evolution in roughly equal proportion, with M. guttatus lineage showing a rare case of negative QTL (Table 4, Figure 6).

Holsinger (1991) and Uyenoyama and Waller (1991) worked with models for the evolution of selfing, where inbreeding depression must be purged before genes favoring self-fertilization can spread. They found that initial evolution toward selfing is more likely to occur with few loci of major effect, because associations easily develop between loci affecting inbreeding depression and loci controlling selfing. In a study of the genetic architecture of floral differences between M. guttatus and M. micranthus, Lin and Ritland (1997) suggested that genes with small to intermediate effects were considered responsible to the evolution of mating system. They speculated that the evolution of self-fertilization in Mimulus involves the initiation of selfing by a few genes with relatively larger effects and followed by subsequent minor changes of minor modifier loci. However, we cannot verify these two hypotheses with our current study, as we cannot infer where in our three lineages the QTL actually arise.

The large evolutionary distance for the M. platycalyx lineage occurs because the common ancestor of all three taxa lies a significant distance along this lineage (indicated by the ‘A’ in Figure 4). Unlike Figure 4, the isozyme phylogeny of yellow monkey-flowers presented in Ritland and Ritland (1989) is approximately star-like for these taxa (all branch lengths the same). This is likely due to the low resolution of isozyme data (the standard errors of branch lengths were up to half the total branch length).

The variance explained by QTL is very low, on the order of 1%. In normal pairwise comparisons, QTL explain at least 5% of the variance, if not up to 20%. Although low, our lineage-specific QTL were still statistically significant in most cases. Because we simultaneously map QTL in three lineages, it seems likely that the non-significant QTL in one lineage probably dilute the variance explained by a significant QTL in another lineage. In no case were QTL significant in all three lineages, supporting this dilution effect. Computer simulation of QTL evolution and lineage-specific QTL mapping are needed to verify this assertion.

Our model and analyses assumed that QTLs are fixed between taxa, and none are segregating within taxa. This is justified because we are examining QTL differences that distinguish phylogenetic lineages; therefore QTLs are effectively homozygous compared with QTLs segregating within populations. However, if there is heterozygosity for QTL, various problems arise (Slate, 2005). F1s may not necessarily be heterozygous for QTL, thus QTL are not detected. Linkage phase between marker and QTL will also not necessarily consistent across families. These problems arise when outbred populations are surveyed for QTL.

Natural selection versus neutrality in the context of QTL lineage mapping

The sign of QTL changes can be used to indicate whether traits have been under selection, as opposed to neutrality. Under random genetic drift, there should be roughly equal numbers of ‘+’ and ‘–’ QTL between taxa (Orr, 1998). This was observed for the M. platycalyx lineage (roughly equal + and − changes for each trait). It was not observed in the M. micranthus lineage, as nearly all QTL in this lineage were negative, indicating that natural selection drove the evolution of floral traits in this lineage.

Lineage-specific QTL mapping allows for a second type of Orr-type neutrality test. If the genetic basis on the evolution of mating system was solely based on drift-mutation balance, the lineages with larger branch length (for example, M. platycalyx, Figure 4) should have more QTL. Indeed, this lineage does have more significant QTL. In contrast, natural selection would introduce a fewer, more major QTL, taking into account the distance of lineages. This further supports the role of drift in the M. platycalyx lineage.

Orthology versus paralogy of QTL

The term homology was introduced by Richard Owen as the similarity of characters due to shared ancestry (Owen, 1843). With regard to sequences, Walter Fitch first developed the distinction between orthologs and paralogs (Fitch, 1970), although Fitch was referring to gene duplication and not mutation. Homology of QTL is normally defined as QTL that map to the same position in different populations or species (for example, see Freeman et al. (2009)). The two QTL may or may not be the same actual coding genes, but reside within a distance of several map units. In some cases, there may be other evidence of true orthology. For example, in the Poaceae, the orthology between rice (Oryza longistaminata) and sorghum (Sorghum propinguum) may be due to common key genetic regulators of morphological development in the Poaceae (Pereira and Lee, 1995; Hu et al., 2003).

In lineage-specific QTL mapping, we can more strictly classify homologous QTL as orthologous (QTLs that share common ancestry due to same mutational event of an ancestor) versus paralogous (QTLs that have different mutations in the same region of DNA, which arise independently). Lineage-specific QTL mapping approach more directly infers homology, not by comparing separate maps, but by conducting a joint analysis of three taxa simultaneously. QTL orthology is identified by the presence of a QTL in a shared lineage between two taxa. Figures 3a represent true orthology of QTL. With our Mimulus data, we found most QTL to be orthologous, as opposed to paralogous (Figures 5 and 6).

Prospects

We regard our model and analyses as mainly conceptual in nature, showing that inferences about QTL can be made on specific lineages in phylogeny, which provides novel insights into the evolution of QTL. Several improvements or extensions should be possible.

Our inferences were limited the use of AFLP markers, which was the marker available at the time of this study. The major limitation is that AFLPs are dominant and diallelic, and parents often homozygous. Microsatellite markers would overcome these problems, and would make the analysis more straightforward and powerful. Specifically, parents can be genotyped and need not be inferred by their progeny, and loci will be more informative as parents are normally heterozygous. Investigations are needed into alternative crossing designs for lineage-specific mapping of QTL. As well, more elaborate models involving dominance and epistasis would be worthwhile to pursue, and it would be useful to develop interval mapping techniques.

We have assumed the sum of the three lineage-specific QTL effects to be zero. Thus, if one lineage is significantly positive, the other two lineages should normally be negative. But because the other two lineages divide this negative effect, there is a lack of power for statistical significance for the latter two lineages. It should also be noted that Figure 3b depicts single QTL changes along a lineage; other lineages may have incurred no QTL changes but because the sum of all three lineages is zero, a ‘positive’ mutation in one lineage would cause ‘negative’ changes to occur in the other lineages even though no actual mutations occurred. This emphasizes the relativity of our three taxon comparison.

In a related vein, we have assumed the ancestral state was positive (for example, large flowers, greater stigma-anther separation) as outcrossing is generally regarded as an ancestral state, whereas inbreeding is derived. To be more generally applicable, inference of ancestral state should be incorporated. Ancestral states might be inferred by including more species into the phylogeny. However, a graphical representation of the space of QTL effects (Figures 3, 5 and 6) would be difficult to devise with more than three taxa lineage-specific analysis.

Data archiving

Data deposited in the Dryad repository: doi:10.5061/dryad.23r11.