Introduction

A central tenet of evolutionary ecology is the expectation that environmental conditions affect evolutionary processes. Evolutionary responses to selection on a trait require the trait to have a genetic basis, so an understanding of the genetic components of phenotypic variation is required to predict evolutionary dynamics (McAdam et al. 2002; Mousseau and Fox 1998; Noble et al. 2014). Quantitative traits are likely to be determined by a large number of genes each with a small effect (Falconer and Mackay 1996; Lynch and Walsh 1998), and the genetic basis of phenotypic variation—or heritability—of a trait can be quantified indirectly from similarities in trait values between relatives (Falconer and Mackay 1996; Lynch and Walsh 1998). However, variation in the environmental conditions individuals experience can play an important role in the process of identifying the genetic components of trait variation. First, similarities between relatives might be due to shared environmental effects, such as maternal effects, which therefore need to be accounted for to generate accurate estimates of heritability (Kruuk and Hadfield 2007; Wolf and Wade 2016). Second, variation in environmental conditions can affect the expression of genetic variance (Rowiński and Rogell 2017; Sgrò and Hoffmann 2004). Third, variation in environmental conditions is also likely to affect the expression of other components of variance, including maternal effects (Mousseau and Fox 1998; Uller et al. 2013).

Changes in the observed genetic variance underlying phenotypic traits in different environmental conditions are known as genotype-by-environment interactions (G × E; Charmantier and Garant 2005; Hoffmann and Merila 1999; Rowiński and Rogell 2017; Wood and Brodie 2015). There is abundant evidence from laboratory studies on animals of G × E for a range of traits, based on phenotypic responses to manipulation of environmental conditions such as food availability, temperature, and pathogen levels (e.g. Evans et al. 2015; Ferguson and Read 2002; Vieira et al. 2000). Plant studies also frequently report evidence for G × E (Des Marais et al. 2013): plants of different genotypes or from different populations show marked variation in their phenotypic responses to key environmental variables (e.g. de Leon et al. 2016; Donohue et al. 2000). Understanding whether the performance of genotypes is correlated across environments is critical to determine the extent to which environmental variation might maintain genetic variance (Barton and Turelli 1989; Johnson and Barton 2005): do genotypes that are successful in one environment also do well in another, or are there ‘trade-offs’ across environments (Kruuk et al. 2008)? Here, we consider how substantial these aspects of G × E might be relative to other determinants of phenotypic variation in key life history and related traits.

An individual’s phenotype is shaped by multiple factors in addition to its genotype, one of which is the effect its mother has on it (Pick et al. 2016; Wolf and Wade 2009). ‘Maternal effects’ occur when a mother’s phenotype affects that of her offspring over and above that attributable to the genes it inherits from her (Mousseau and Fox 1998; Räsänen and Kruuk 2007). This might involve pre-natal and/or post-natal effects (Lock et al. 2007; Pick et al. 2016; Wolf et al. 2011). The influence of maternal effects on offspring phenotype is often highly dependent on the mother’s own environment. For example, mothers experiencing good environmental conditions may produce larger offspring, breed sooner, or provide more food and greater parental care (Marshall and Uller 2007; Mousseau and Fox 1998; Reznick and Yang 1993). In general, the most obvious mechanisms driving variation in maternal effects point towards environmental factors that alter the mother’s phenotype (e.g. mothers in poor condition provide less milk; Trivers 1974). If mothers differ in how they respond to environmental variation, this plasticity can be thought of as “maternal-by-environment interactions” or M × E, akin to genotype-by-environment interactions. Maternal-by-environment (M × E) interactions have been less thoroughly investigated than G × E in the context of variance component analyses (though see for example Chirgwin et al. 2017; Laugen et al. 2005), so in general we have little idea whether ‘good’ environments generally amplify or depress any differences among mothers, nor of the potential for trade-offs across environments whereby mothers that are superior in one environment are inferior in another (i.e. negative cross-environment maternal effect correlations).

The timing of environmental variation may also drive biologically important effects on trait expression. Maternal effects plasticity is typically investigated in the context of variation in the mother’s environment. M × E could, however, equally plausibly be generated by maternal effects differing in interactions with the offspring’s environment. In many cases, the two sources of environmental variation impacting on maternal effects are indistinguishable. For example, more stressful environmental conditions reduce variation among mothers in maternal effects for offspring birth weight in wild Soay sheep (Wilson et al. 2005), but it is not possible to determine to what extent this is due to effects of the environment on mothers or on their lambs. We therefore know less about how maternal effects vary ‘downstream’ due to variation in the offspring’s environment. More specifically, are there predictable differences between mothers in how their offspring respond to changes in environmental conditions? In particular, do mothers vary in how much their offspring are able to withstand environmental stress? Addressing this aspect of M × E requires focusing on changes in the offspring’s environment once maternal care has ceased.

Analysis of maternal-by-environment interactions may also be important for methodological reasons. Maternal effects typically make relatives (i.e. siblings) look more similar than they would otherwise. They can therefore be difficult to disentangle from additive genetic effects, which are typically estimated from the degree of similarity between relatives. Maternal effects have the potential to inflate estimates of genetic variance unless properly modelled, and there is general acceptance of the need to control for maternal effects when estimating the heritability of a trait (Kruuk and Hadfield 2007; McAdam et al. 2014). However, less attention has been paid to the fact that the same issue applies to tests for G × E interactions: just as the occurrence of maternal effects can inflate estimates of additive genetic variance and heritability, the occurrence of M × E should presumably inflate estimates of G × E. To our knowledge, this possibility remains untested. It implies that, in addition to the fundamental biological question of whether offspring of different mothers are differentially affected by environmental stress due to maternal effects, quantifying M × E may be a critical component of analysis of G × E.

General conclusions about the prevalence of G × E and M × E in a system also require assessment of different traits. Different types of traits typically show different patterns of heritability, of maternal effects and, presumably, of respective interactions with the environment. Traits that are closely associated with fitness often show lower heritability than more weakly-selected traits (Houle 1992; Postma 2014; Roff and Mousseau 1987). For instance, life history traits, such as fecundity and viability, are under strong directional selection and show lower heritability than morphological and physiological traits (Kruuk et al. 2000; Roff and Mousseau 1987). Sexually selected traits may also show different patterns of variation, alongside differences between the sexes in their genetic architecture (Jia et al. 2000; Parker and Garant 2004). However, comparatively little is known about the relative magnitude of G × E, let alone M × E, for different types of traits.

Here, we experimentally manipulated a critical aspect of the environment experienced by offspring during their early development (food availability), to assess the relative contribution of additive genetic versus maternal effects on phenotypic variance, as well as the extent to which each contribution was influenced by the environmental stress of food restriction. We used a multigenerational breeding design of a laboratory population of mosquitofish (Gambusia holbrooki) to test for G × E and M × E interactions in seven adult phenotypic traits: size and age at maturity for both males and females, and three sexually selected male traits, namely, relative genital size, sperm number, and sperm velocity. We considered this range of phenotypic traits in order to investigate the importance of different sources of variance for multiple aspects of adult phenotypes. In many taxa, size and age at maturity are key life history traits often linked to fitness (Roff 1992). Likewise, some sperm traits are strongly positively associated with fitness (Parker and Pizzari 2010), although this is not always the case (e.g. Simmons et al. 2003). We already have clear evidence for maternal effects on growth and development rates that persist until sexual maturity in G. holbrooki (Kruuk et al. 2015). Here, we assessed the potential for environmental stress (food restriction) during offspring development to generate both G × E and M × E. Gambusia holbrooki is a live-bearing fish lacking post-natal parental care, so all maternal effects must be mediated by events prior to birth. Our experiment therefore constitutes a test for interactions between pre-natal maternal effects (M) and post-natal (i.e. offspring alone) environmental conditions (E). We asked (1) What is the relative importance of additive genetic vs maternal effects in contributing to phenotypic trait variance in each sex? (2) Do these effects interact with the environmental conditions experienced by the offspring? And if so, are there (i) consistent differences in the levels of additive genetic and maternal effects variance between good and poor environments and (ii) trade-offs across environments in either genetic or maternal effects?

Methods

Study species

Gambusia holbrooki, a species of viviparous poeciliid fish, is endemic to North America but now introduced worldwide (Pyke 2005). G. holbrooki have internal fertilisation, are sexually dimorphic, and males transfer sperm by a modified anal fin (’gonopodium’) that acts as an intromittent organ (Pyke 2005). There is substantial variation in female adult size, which is strongly positively correlated with fecundity (Bisazza et al. 1989; Callander et al. 2012). Male size also varies considerably, despite their growth ceasing at maturation. Small males have greater manoeuvrability, which seems to increase their propensity to sneak copulations (Pilastro et al. 1997), while large males are socially dominant and might transfer more sperm per copulation because they have greater sperm reserves (Bisazza and Marin 1991; O’Dea et al. 2014). Selection on male size in this species is, in summary, unclear. Recent studies in our lab of free-swimming fish have found negative selection (favouring smaller males, Head et al. 2017), no selection (Vega-Trejo et al. 2017), and positive selection (favouring larger males, Booksmythe et al. 2016). Age at maturation in both sexes is highly variable (see Livingston et al. 2014; Pyke 2005; Vega-Trejo et al. 2016a). Greater relative gonopodium size is also linked to increased reproductive success (Head et al. 2017; Vega-Trejo et al. 2017; but see Booksmythe et al. 2016). Finally, sperm velocity declines with age (Vega-Trejo et al. 2016b), and sperm number has been shown to be condition-dependent and positively related to body length (O’Dea et al. 2014).

Experimental design

Our analyses are based on measurements of seven phenotypic traits from laboratory reared G. holbrooki in which we varied the level of food an individual received during development. Our multi-generation breeding design also involved the comparison of fish with different levels of inbreeding, as part of a separate investigation into the effects of inbreeding (Vega-Trejo et al. 2016a; Vega-Trejo et al. 2017). The base stock (F0) population consisted of offspring from 151 gravid wild-caught females collected in Canberra, Australia. F 0 fish were kept in single-sex tanks under a 14:10 photoperiod at 28 °C, and fed ad libitum with Artemia nauplii and commercial flakes (Fig. 1). Once they were mature, we randomly paired fish from this base stock to create full-sib families (F1). Fish from these full-sib families were then used to create an F2 generation consisting of 58 outbred families (with unrelated parents) and 58 inbred families (with full-sib parents; f = 0.25; Fig. 1). To do this, we used 29 pairs of full-sib families (e.g. A and B), which we refer to as “blocks”. Within each block, one male from family A and one from family B respectively were paired to females (between 1 and 4) from the other family, to create outbred full-sibs/half-sibs (AB and BA); and one male from each family was paired to his full-sib sisters (again, 1–4 females per male) to create inbred full-sibs/half-sibs (AA and BB). The same number of females contributed to each of the four cross-types within a block to generate a mix of inbred and outbred half-sib and full-sib families. We then reared a maximum of 10 offspring per female (i.e. full-sib family). See Vega-Trejo et al. (2015) and Vega-Trejo et al. (2016a) for a fuller methodological description.

Fig. 1
figure 1

Schematic of the experimental design. S = stock fish. F0 stock males and females were paired to create F1 full-sib families (e.g. A and B). We set up 1–4 females per cross-type to create F2 outbred (AB, BA—Out) and inbred (AA, BB—Inb) fish. These fish were reared on either a control or a low food diet early in life. F2 females from each treatment were paired with a stock (outbred) male to create F3 offspring. F2 males from each treatment artificially inseminated stock (outbred) females to create F3 offspring. F3 offspring were classified as outbred and control diet

The food manipulation experiment was conducted on the F2 generation (described below). In the F2 generation, half of the offspring in each family were raised on a ‘control’ diet, whereas the other half experienced a ‘low food’ diet early in life (Fig. 1). Fish on the control diet were fed ad libitum with Artemia nauplii twice a day from birth until the end of the experiment. Fish on the low diet were fed the control diet until they were one week old, and were then fed 3 mg of Artemia nauplii once every other day (i.e. <25% of the control food intake) for 21 days, after which they were returned to the control diet. Fish almost totally suppressed growth while on the low food diet (Livingston et al. 2014; Vega-Trejo et al. 2016a). For completeness, we also included in our analyses measurements on the F3 generation. The F3 generation was created by pairing each F2 female with a stock male, and by using sperm from F2 males to artificially inseminate stock females (Fig. 1). Thus, all F1 and F3 fish were outbred and raised on a control diet, whereas F2 fish were both outbred or inbred, and raised on either a control or restricted diet. Offspring of all generations were transferred to individual tanks at birth to eliminate the potential for post-natal shared environment effects.

Measurements of phenotypic traits

For individuals of generations F2 and F3, we measured seven adult traits. Table 1 lists the traits measured, with sample sizes and summary statistics. To determine the timing of sexual maturity, we inspected all tanks three times a week. Females were considered to be mature when yolked eggs were evident in the abdomen (Stearns 1983). Males were considered to be mature when their gonopodium was translucent, with a spine visible at the tip (Stearns 1983; Zulian et al. 1993).

Table 1 Means ± SD (N) for the traits measured, and parameter estimates ± SE for the fixed effects are from analyses of standardised values from our basic model V A + V M + V B + V R

To measure morphological traits, we anaesthetized fish by submersion in ice-cold water for a few seconds to reduce movement. The fish were then photographed alongside a microscopic ruler (0.1 mm gradation). We used Image J software (Abramoff et al. 2004) to measure: body length at maturity (snout tip to base of caudal fin, in mm) for both males and females, and male gonopodium length (apical tip to base, in mm). We then calculated relative gonopodium size for males as the residuals from a linear regression of (log) gonopodium length on (log) standard length (Horth et al. 2010; Vega-Trejo et al. 2017).

We measured two sperm traits in the F2 generation: sperm number and sperm velocity. Details on the extraction of ejaculates and the samples are given in the Supplementary Information. In brief, males were anaesthetized in ice-cold water, placed on a glass slide and their gonopodium was swung forward. We then applied gentle pressure to the abdomen to eject all of the available sperm. We counted the sperm and measured sperm velocity. Afterwards, each male was returned to his individual tank. All inspections for maturity and measurements of traits were made blind to food treatment, inbreeding status, and family identity.

Statistical analyses

We quantified components of variance in the phenotypic traits using an ‘animal model’, a form of mixed model that uses pedigree information to assess covariance between relatives (Wilson et al. 2010), fitted using ASReml-R (Butler et al. 2009). All models contained random effects of an additive genetic effect (with covariance structure defined by relatedness between individuals, as determined by the breeding design’s pedigree, and associated additive genetic variance component V A), a maternal effect (grouping individuals by mother, with associated maternal effects variance component V M; Kruuk and Hadfield 2007), a block component (with variance component V B, defined above), and residual effect (with associated variance component V R).

The animal model estimates the additive genetic variance underlying a trait’s variance based on the similarity between multiple types of relatives; here relatedness, was defined by our four-generation pedigree (F0–F3). The estimate of the maternal effects variance V M was determined by the increased similarity between offspring of the same mother, beyond that due to additive (direct) genetic effects (Kruuk and Hadfield 2007). This value will necessarily encompass both maternal environment effects and maternal genetic effects (i.e. effects of the mother’s genotype on her offspring, over and above the direct effects of the genes they inherit from her); we do not attempt to separate them. Offspring were reared separately from birth onward (see above), so there is little potential for post-natal common environment effects to inflate estimates of maternal effects. Offspring of the same mother were, however, always full-siblings, so there is the potential for the estimate of V M to be inflated by dominance genetic variance (Falconer and Mackay 1996). We make the standard assumption that dominance variance is small compared to additive genetic variance (e.g. Hill et al. 2008; Zhu et al. 2015), and hence that any impact is also small, but we return to this point in the Discussion.

All traits were standardised to unit variance and zero-centred prior to analysis, and we analysed the data separately for males and females. In all models, we fitted food treatment (control vs low food diet), inbreeding (inbred vs outbred), and generation (two levels, as phenotypic data were only available for F2 and F3) as fixed factors. For sperm number and velocity, generation was not fitted because we only measured F2 males. However, for the sperm traits we included fixed effects of male age (range 19–125 days post maturity) as age influences sperm number and velocity (Vega-Trejo et al. 2016b). The parameter estimates for the fixed effects in a model with only V A, V M, V B, and V R (see details below) are in Table 1. Parameter estimates for the effects of food treatment are shown in Fig. S1.

Univariate models of interactions with environmental conditions

We first fitted univariate models for each of the seven traits to explore the extent of interactions of both the additive genetic variance and maternal effects variance with environmental conditions (i.e. food treatment). We started with Model 1, a ‘null’ model with only fixed effects (as described above: food, inbreeding, and generation), block as a random effect, and a residual random effect. We then fitted: Model 2, which included variance due to additive genetic effects (V A); Model 3, a model containing just the maternal effects variance (V M); and Model 4, a model that included both components of variance (V A + V M). We call this the ‘basic model’. We report the trait’s heritability (V A / (V A + V M + V B + V R)) and maternal effects proportion (V M /(V A + V M + V B + V R)) from the basic model in Table 1.

We then investigated whether the variance components changed between the two food treatments (i.e. Genotype × Environment or Maternal effect × Environment) by testing for either a V A × Food or V M × Food interaction. To do so, we first included the interactions between each variance component and the food treatment in models with only one main variance component (i.e. Model 5: V A + V A × Food; and Model 6: V M + V M × Food), and then in models which included both main terms (i.e. Model 7: V A + V M + V A × Food; and Model 8: V A + V M + V M × Food). Finally, we ran a model with all main terms and interactions (Model 9: V A + V M + V A × Food + V M × Food).

Model comparison

Because we wished to compare several models, many of which were not nested, we used a model comparison approach based on the Akaike Information Criterion AIC (Burnham and Anderson 2002; see Saastamoinen et al. 2013 for a similar model comparison of different animal models). We calculated AIC as -2log(L) + 2 K, where log(L) was the model’s log likelihood and K the number of parameters estimated. We only considered the number of parameters associated with estimating variance in the different random effects (other than the residual and block variance estimates), given that the fixed effects were the same in all models for a given trait. Thus, K ranged from 0 (null model: model with only fixed effects and one random effect of block) to 4 (final model with V A, V M, V A × Food and V M × Food). Akaike weights w for each model i were calculated as w i  = exp(∆AIC i )/∑exp(∆AIC), where ∆AIC is the difference in AIC between model i and the model with the lowest AIC (the top model for that trait). Models ranked within two AIC units of the top model were considered to be ‘reasonable candidate’ models providing indistinguishable levels of support (Burnham and Anderson 2002). The number of reasonable models ranged from one to four across the different traits.

The informational approach outlined above allowed us to compare differences between models for each trait. We then determined the significance of the interaction parameter estimates in the top model. To do this, we ran likelihood ratio tests (LRT) comparing the top model to one without that parameter. The test statistics and P-values are given in the text of the Results. We also tested for the significance of V A and V M by LRTs by respectively dropping those terms from the basic model (V A + V M + V B + V R). The P-values from these LRTs are shown in Table 1 adjacent to the heritability (i.e. for V A) and to the proportion of maternal effects variance (i.e. for V M).

For each trait, we report the variance components for the reasonable candidate models in the main Results, and for all nine models in the Supplementary Information. However, given that there might be some level of support for multiple models, we also provide model-averaged parameter estimates (Burnham and Anderson 2002). For each trait, we model averaged the estimates of the different variance components across our 9 models, where the model-averaged variance component V* was calculated by weighting V i , the variance estimate of model i with the weights w i as calculated above, such that V* =  (V i  × w i ). We also estimated model-averaged standard error as SE* =  [w i  × √(SE i 2 + (V i V*)2)] (Burnham and Anderson 2002).

Bivariate models across environments

We found evidence for interactions between food treatment and variance components for both female age at maturity and male size at maturity (see Results). We therefore fitted bivariate models to estimate the variance due to additive genetic effects and maternal effects within each food treatment, by splitting each trait by treatment to define two new “sub-traits”. These models contained inbreeding level and generation as fixed effects as above, and maternal and additive genetic effects as random effects in order to generate environment-specific estimates of the variance components in each trait. Block was also included as a random effect for female age at maturity, but was not fitted for male size at maturity given a lack of convergence (note: for other traits, fitting block made little difference to the parameter estimates). Each model also contained estimates of the covariance across environments in additive genetic, maternal effect, and (for female age) block effects. We also ran a model with additive genetic effects only to compare our estimates of V A when maternal effects were ignored.

Results

Almost all traits were affected by the food treatment during development, but there was no effect of inbreeding on trait means (Table 1). However, there were marked differences among traits as to which of the different combinations of variance components produced the best model. Table 2 shows the reasonable candidate models (<2 AIC units from the top model) for each trait with their corresponding variance estimates, and also the model-averaged variance estimates averaged across all models. For the corresponding details of all 9 models for each trait, see Table S1.

Table 2 The reasonable candidate models for each trait

We found evidence of additive genetic and maternal effects variance for almost all traits. However, the interaction with food treatment varied substantially between traits. Below we present results for each trait in turn.

Female size at maturity

Females on the low food treatment were slightly smaller at maturity than those on control diets (Table 1; Fig. S1). For the ‘basic’ model with just V A + V M, we found substantial heritability and maternal effects variance (Table 1). Comparing all models, both of the ‘reasonable candidate’ models contained variance due to additive genetic effects and maternal effects, with the top model containing only V A + V M (i.e. the ‘basic’ model as defined above; Table 1). Although V M × Food appeared in the second-ranked model, overall there was little indication of support for this interaction (LRT: X 2 (1) = 0.546, P = 0.761; Table 2); this model fits the characteristics of a ‘hitch-hiking’ model, where an additional term generates a very slight improvement in model fit and so falls within 2 AIC units of the top model (Arnold 2010; Symonds and Moussalli 2011). Similarly, there was no support for a V A × Food interaction (LRT: X 2 (1) < 0.001, P = 1; Table S1).

Male size at maturity

Males on the low food treatment were also smaller at maturity (Table 1; Fig. S1). The estimates of heritability and maternal effects proportions from the basic V A + V M model showed little heritability but substantial maternal effects variance in male size (Table 1). Model comparison indicated strong support for a V M × Food interaction as the term appeared in both of the two reasonable candidate models (Table 2), and the top model contained both V M and a significant V M × Food interaction (LRT: X 2 (1) = 18.205, P < 0.001). We found little support for a V A × Food interaction (Table S1). We fitted a bivariate model of environment-specific traits to estimate the variance components for each food treatment. There was significant variance in maternal effects in the control environment but not in the low food environment (Fig. 2a). There was also a negligible covariance (negative) of the maternal effects across environments, most likely because of the lack of variation among mothers in the low food environment (Fig. 2a, Table 3).

Fig. 2
figure 2

Effect of food treatment on variance components for a male size at maturity; b female age at maturity: additive genetic effects (V A) and maternal effects variance (V M) ± SE for a bivariate model with V A + V M + V B + V R, and V A only (+V B + V R; see Methods for details). Dark symbols represent values for fish in the control food treatment, light symbols represent values for fish in the low food treatment, and white symbols represent the covariance between the traits in the two treatments

Table 3 Variance–covariance matrices from the bivariate models of traits expressed in each environment for female age at maturity and male size at maturity

Female age at maturity

Females in the low food treatment took on average 21% longer to mature (Table 1; Fig. S1). Similar to female size at maturity, the basic V A + V M model indicated substantial heritability and maternal effects for female age at maturity. In the model comparison, the only reasonable candidate model was that containing V A and V M and a V M × Food interaction (LRT for V M × Food: X 2 (1) = 6.449, P = 0.039; Table 2). When we considered female age at maturity in the two environments separately, we found much higher variance in maternal effects in the low food environment than in the control environment, and also a positive covariance of maternal effects across the two environments (Fig. 2b, Table 3).

Male age at maturity

Males in the low food treatment took on average 38% longer to mature than those on the control diet (Table 1, Fig. S1). As with male size at maturity, the basic model with only V A + V M showed little heritability but substantial maternal effects variance (Table 1). The reasonable candidate model set contained a top model with V M only and another with V M and a V M × Food interaction. However, similar to female size, the second model was consistent with a hitch-hiking model because there was little support for the V M × Food interaction (LRT: X 2 (1) = 0.987, P = 0.611; Table 2).

Relative gonopodium size

Males in the low food environment had relatively longer gonopodia on average (Table 1, Fig. S1). In the basic model with just V A + V M, the heritability and the strength of maternal effects were both relatively low, though the latter accounted for nearly 10% of the phenotypic variation and was statistically significant (Table 1). The top model contained only V M. Although models containing both V A × Food and V M × Food interactions were included in the reasonable candidate model set, neither parameter estimates differed significantly from zero (LRT: V A × Food: X 2 (1) = 2.019, P = 0.364; V M × Food: X 2 (1) = 0.097, P = 0.953; Table 2).

Sperm traits

Males in the control treatment had sperm with a faster mean velocity, but there was no difference between food treatments in mean sperm number (Table 1; Fig. S1). For sperm number, the top model contained only maternal effects variance V M, and heritability was negligible (Table 2). However, the estimate of V M in the top model had a large SE and was not significant (LRT: X 2 (1) = 2.736, P = 0.098) and the null model with only fixed and block effects was also a ‘reasonable candidate’ model (Table 2). For sperm velocity, the top model contained only V A, with a corresponding significant heritability of 0.215 (0.092 SE), (LRT: X 2 (1) = 4.715, P = 0.029), but if we considered the ‘basic’ model with V A + V M, the estimate of heritability was reduced to 0.197 (0.125 SE; Table 1) and was non-significant (LRT: X 2 (1) = 2.073, P = 0.149).

Discussion

Understanding the causes of phenotypic variation in traits is fundamental to predicting evolutionary responses. Our study of life history and sexual traits in the mosquitofish Gambusia holbrooki revealed several patterns. First, we found sex differences in heritabilities: there were significant heritability estimates for female age and size at maturity, but not for the male traits examined. Second, in both sexes there were maternal effects that persisted until sexual maturity for all traits except the sperm traits (i.e. for five of seven traits). Third, there were interactions between maternal effects variance and food treatment for female age at maturity and male size at maturity. Failure to account for maternal by environmental interactions (M × E) led to overestimates of genotype-by-environment interactions (G × E). We discuss each of these points below.

The relative importance of heritable genetic effects in our study differed between males and females: in general, female traits had higher heritability. Because of the lack of genetic variance in males, we did not attempt to estimate cross-sex genetic correlations, but our results indicate a very different underlying genetic architecture shaping female and male phenotypes. This is in contrast to estimates in many other taxa of strong cross-sex correlations in morphology (Kruuk et al. 2008; Poissant et al. 2010), and maturation time (e.g. Guntrip et al. et al. 1997). The differences between the sexes we observed here could potentially be due to the biology of mosquitofish. Females have indeterminate growth and their fecundity increases with body size (Bisazza et al. 1989; Callander et al. 2012), while males stop growing upon maturation. Selection pressures on growth and maturation rates are thus likely to be highly sex-specific. Sex differences in heritability of traits might be due to sexual selection acting more strongly on males, thereby depleting the amount of additive genetic variation expressed in males (Van Homrigh et al. 2007). Lower heritability of female traits has also been found in other species, for example for morphological traits in house sparrows (Jensen et al. 2003). However, the implications for evolutionary dynamics of differences in heritability between the sexes still remain relatively underexplored.

Within males, the relative importance of heritable genetic effects also differed among sexual traits. There was a low, non-significant heritability for relative gonopodium size (0.039 ± 0.083 SE; Table 1). Interestingly, this is similar to an estimate of realised heritability based on artificial selection on relative gonopodium length in the same study population (0.028 ± 0.006 SE, Booksmythe et al. 2016)—a reminder that a response to selection is possible even when heritability is low. Sperm number showed no evidence of additive genetic variance, but sperm velocity did. The low heritability of sperm number may indicate that sperm quantity is highly condition-dependent (e.g. influenced by diet; see O’Dea et al. 2014). Sperm velocity showed significant heritability in the ‘top’ model (Table 2), which could potentially fit with Y-linked effects as suggested in other poecilids (e.g. Evans 2011). We note however that the estimate was lower and not significant in the basic model with maternal effects (Table 1), so there was no strong support for significant heritability of sperm velocity across all models.

Maternal effects contributed to trait variance for both males and females. Female mosquitofish invest in their offspring prior to fertilisation by provisioning eggs (lecithotrophy; Fernández-Delgado and Rossomanno 1997; Pollux et al. 2014) and also possibly via subsequent nutrient transfer to embryos (see Marsh-Matthews et al. 2005; Marsh-Matthews et al. 2010). Our results suggest that differences between mothers in their prenatal allocation of resources to either eggs or embryos have important implications for their offspring’s subsequent development: maternal effects were still apparent in traits measured at sexual maturity (see also Kruuk et al. 2015). Further, in addition to the overall presence of maternal effects, we observed significant maternal-by-environmental variance (M × E) interactions for female age at maturity and male size at maturity. These interactions were apparent even though the food treatment was applied after maternal provisioning ended. They indicate that studies of maternal effects need to consider the potential impact of environmental heterogeneity: here, we would have reached a very different conclusion as to the importance of maternal effects had we only considered offspring reared under ‘control’ rather than ‘low food’ conditions. We found a significant positive covariance of the maternal effects across environments for female age at maturity. That is, mothers with maternal effects that caused their daughters to take longer to mature in the control environment also had daughters that took longer to mature in the low food environment (Fig. 2, Table 3). For males, however, there was no support for covariance across the food treatments for maternal effects on male size – probably due to the very low maternal effects variance expressed in the low food environment. We therefore found no evidence for trade-offs in maternal effects across environments. Similarly, Charmantier and Garant (2005) also find little evidence of genetic trade-offs via negative cross-environment genetic correlations, also suggesting that the role of environmental heterogeneity in generating life-history trade-offs remains unclear (see also discussion in Kruuk et al. 2008). As a final point, as noted in the Methods, our estimates of maternal effects variance could be inflated by dominance genetic variance and the separation of the two is a challenging issue for many studies (Wolak and Keller 2014). Data from different systems indicate that dominance genetic variance itself is typically small relative to additive genetic variance (Hill et al. 2008; Wolak and Keller 2014; Zhu et al. 2015). There is nevertheless the possibility that our estimates of M × E are inflated by dominance genetic-by-environment (D × E) interactions. There are examples of changes in non-additive genetic variance across environments in some taxa (e.g. Blows and Sokolowski 1995; Chirgwin et al. 2017; Kumar et al. 2015), but in general evidence for D × E appears to be markedly less prevalent than for G × E. We note also that, in turn, estimates of dominance variance in other systems may be inflated by maternal or shared environment effects if these are not properly modelled.

We found striking differences in our results from models incorporating maternal effects and maternal-by-environmental variances (M × E) compared to those without. Estimates of the variance due to additive genetic effects (V A) were always higher in models where V M was not estimated, indicating that ignoring maternal effects inflated estimates of V A (Table S1). Similarly, when we compared estimates of V A × Food (i.e. G × E) from models without and with V M × Food (i.e. M × E), we found larger estimates if V M × Food was not accounted for. For instance, for female age at maturity, V A × Food was estimated at 0.054 ± 0.065 in the model with V A + V M + V A × Food, but as 4 × 10−8 ± 6 × 10−9 SE in the model with V A + V M + V A × Food + V M × Food. Similarly, for male size at maturity, the model of V A + V M + V A × Food returned a G × E interaction estimate of 0.172 ± 0.078 SE, but the model V A + V M + V A × Food + V M × Food provided an estimate of 4 × 10−7 ± 4 × 10−8 SE. The right-hand panels in Fig. 2 also show the much greater change in estimates of V A across environments in models fitted without maternal effects. Although this is unsurprising, studies of genotype-by-environment interactions rarely also account for maternal-by-environment interactions. It is well established that the presence of maternal effects (or other non-additive causes of covariance between relatives, such as dominance variance) can inflate heritability estimates if not accounted for properly (Falconer and Mackay 1996; Kruuk and Hadfield 2007). In the same vein, our results indicate that estimates of G × E can be inflated by the existence of unaccounted-for M × E interactions. It is thus possible that previous studies of other populations have overestimated the role of G × E in driving phenotypic variation in systems where maternal effects (or other causes of increased similarity between relatives such as non-additive genetic effects) are important.

In sum, our study found sex differences in the genetic architecture underlying important phenotypic traits. Additionally, it suggests that maternal effects can shape phenotypic traits even when there is no postnatal investment and, furthermore, that differences between offspring of different mothers can interact with the offspring’s rearing environment to influence their adult phenotypes. Our findings also illustrate the need to consider maternal-by-environmental interactions in quantitative genetic studies. G × Es have been well studied and there is increasing appreciation of their potential importance in evolutionary ecology (e.g. reviews by Des Marais et al. 2013; Hunt and Hosken 2014, respectively). Our analysis here illustrates the need to also consider other potential contributors to environmental interactions when assessing G × E interactions, as failure to do so could result in an overestimation of the importance of G × E. Including M × E interactions may thus improve our understanding of the factors that contribute to phenotypic variance in different components of individuals’ life histories.

Data archiving

Data available from Dryad: https://doi.org/10.5061/dryad.9s1gm