Introduction

A condition for adaptive evolution in a natural population is the presence of additive genetic variation for the trait under consideration. In the case of Drosophila, regular latitudinal clines have been described for various traits, and they are generally assumed to be a consequence of climatic selection and adaptation (David et al., 1983; Parsons, 1983; David & Capy, 1988; Hoffman & Parsons, 1991; Capy et al., 1993; Partridge et al., 1994). A recurring observation in several species is a genetic increase in body size with latitude. Wild-living flies, however, exhibit a huge phenotypic variance in body size (David et al., 1980; Coyne & Beecham, 1987; Imasheva et al., 1994; Moreteau et al., 1995), which is caused by environmental heterogeneity during development. For that reason, the genetic characteristics of natural populations, including geographical clines and heritability, are generally investigated in the laboratory under controlled conditions, in order to minimize the environmental variance and provide better genetic information. This procedure raises several problems that are generally not addressed, such as the choice of laboratory conditions and the generation to be investigated. For keeping a natural population in the laboratory, two procedures can be used: either a large population is maintained in a single cage (e.g. Partridge et al., 1994) or isofemale lines are established and kept separately (Parsons, 1983; Capy et al., 1993).

Two opposite processes are, however, likely to occur in successive laboratory generations. Genetic drift in a neutral environment should progressively increase the divergence between lines and thus the genetic heterogeneity. If a mass population is kept, drift will be slower but will occur anyway after a few years. In contrast, if the laboratory conditions are not neutral for the trait investigated, some kind of laboratory adaptation will be observed. This possible adaptation will shift the overall mean from the initial value to some new equilibrium. In the case of parallel lines, adaptation should induce a convergence towards the selected optimum and thus reduce their genetic heterogeneity.

Numerous examples of both drift and adaptation have been described for discrete genetic markers, including phenotypic mutants, chromosome rearrangements or allozymes (Dobzhansky, 1970; Lewontin, 1974; Merrel, 1981; Van Delden, 1982; Delpuech et al., 1993). Data on quantitative traits are, on the other hand, less well documented. An increased divergence of ovariole number among laboratory strains was interpreted as a consequence of genetic drift (Bocquet et al., 1973). A classical example of laboratory temperature adaptation refers to strains kept at various temperatures, which, after a few years, exhibited a systematic divergence, i.e. a smaller size at higher temperature and a larger size at lower temperature (David et al., 1983; Cavicchi et al., 1985; Partridge et al., 1994).

In the present study, the genetic variability of wild-living flies and their laboratory progeny was investigated over several generations. Two kinds of quantitative traits were measured: wing length (related to body size) and body pigmentation (thorax and abdomen) in females. In nature, wing length exhibited a huge phenotypic variance, as expected from previous results. Pigmentation traits were less variable and highly heritable. Genetic variability among isofemale lines in the laboratory was high for all traits and stable over generations. The genetic stability of the lines was also demonstrated by a high positive correlation (genetic repeatability) between family means over successive generations. The techniques implemented here should be a model for analysing experimental adaptations to different environments.

Materials and methods

A wild-living D. melanogaster population was collected with banana traps in Cotonou (Republic of Benin, Africa). Adults were put in sucrose–agar vials and sent by air mail to France. On arrival, females were analysed for morphometrical traits and then isolated at 25°C in culture vials containing a killed yeast medium to establish isofemale lines. Among 60 females that were isolated, only 37 produced sufficient progeny (at least 10 females) to be analysed further.

From each isofemale line, 10 laboratory-grown females were taken randomly and measured. Total wing length from thorax articulation to the tip of the third longitudinal vein was measured with a micrometer under a binocular microscope. The dark pigmentation on the last three abdomen segments (5, 6 and 7) was estimated visually, and their sum was also calculated (see David et al., 1990). For each segment, 11 classes ranging from 0 (no dark pigment) to 10 (segment completely dark) were used. The intensity of pigmentation of a trident pattern on the mesonotum was also estimated. In that case, four phenotypic classes from 0 (no visible trident) to 3 (dark trident) were used (see David et al., 1985). The 10 measured females were then associated with 10 randomly taken males and used for producing the next generation. Egg laying of the 10 pairs was limited to a few hours, and crowding effects were avoided by using a killed yeast high-nutrient medium.

Ten females of the next generation were again taken at random, measured and used as parents of the following generation. The first three laboratory generations (G1, G2 and G3) were investigated in this way. Females of the fourth generation were not investigated but the method used for propagating each line (i.e. 10 randomly chosen parental pairs) was kept the same. The G5 generation was again investigated and, finally, the G9 generation.

Results

Mean values of parental flies and laboratory progeny

Basic data concerning the six traits investigated are given in Table 1, and the variability among lines is illustrated (Fig. 1) for two traits, the abdomen pigmentation (sum) and wing length.

Table 1 Mean values (±standard error) of morphometrical traits in wild-collected female Drosophila melanogaster (P1 and P2) and laboratory generations of 37 isofemale lines
Fig. 1
figure 1

Mean values of parental Drosophila melanogaster (P) and laboratory generations (G1–G9) for the sum of abdomen pigmentation and wing length (37 lines).

The two parental samples (P1, the whole sample, and P2, the mothers of isofemale lines) were similar for all traits and not statistically different. The P2 females, which were chosen because of their higher progeny number, did not differ from the whole natural sample. Compared with laboratory-grown flies, these wild-living females were different for all traits. The major difference concerned size, i.e. wing length. Wild-living females were on average smaller and also much more variable (see Fig. 1). The phenotypic variance in wild-living females was 273.8 against 31.3 in laboratory-grown flies, i.e. 8.8 times higher. The thoracic trident, poorly marked in this tropical population, was darker in wild-living flies than in the laboratory progeny anyway. The reverse was true for abdomen pigmentation, which was darker in laboratory flies than in wild-living ones, especially for segments 6 and 7. Interestingly, the phenotypic variability of females in nature was close to the variability observed in the laboratory (see Fig. 1). For the sum of the three segments, the phenotypic variance in nature was 12.8 against 14.2 in the laboratory.

The wild population of Cotonou investigated here was living at a temperature of more than 25°C (range 27–29°C). The average wing length, which is shorter than in laboratory generations, may be explained by both poor feeding conditions and higher developmental temperature. This higher temperature also explains the overall lighter pigmentation of the abdomen segments (David et al., 1990).

Variability among lines and generations in the laboratory

Morphometrical data of generations G1–G9 were submitted to a nested ANOVA (not shown). A highly significant line effect was observed for all traits, suggesting their genetic heterogeneity. For pigmentation characters, no significant variation was found between generations except a slight effect for segment 5. A significant heterogeneity between generations was, however, shown for wing length. As visualized in Fig. 1, this effect is accounted for by a general decrease in wing length of all lines in G2. Excluding the G2 data led to homogeneity among the four other generations (G1, G3, G5 and G9). The reasons why G2 values were consistently smaller probably correspond to some uncontrolled rearing accident.

Genetic variation between lines was investigated further by calculating, for each generation and each trait, the coefficient of intraclass correlation, t (Table 2). These values were then submitted to ANOVA (not shown), which showed a highly significant trait effect (F5,20=12.09; P<0.001) and a slight generation effect (F4,20=3.02; P<0.05). The trait effect is obvious from Table 2: the coefficient of intraclass correlation is minimum for the trident (0.22±0.02), intermediate for wing length (0.34±0.03) and maximum for abdomen segments 6 and 7 and for the sum (range 0.41–0.48).

Table 2 Values of intraclass correlations calculated for six different morphometrical traits in five laboratory generations (G1–G9) of Drosophila melanogaster

The significant generation effect suggests a slight increase of t with time: the mean value in G9 is 0.44, whereas it is 0.34 for the previous generations. Such a phenomenon might result from genetic drift occurring among lines in the laboratory. The problem was further investigated by calculating the between-line variance, which estimates the genetic variance, and the within-line variance, which is mainly attributable to environmental effects (Table 3). For each trait, variances were submitted to a Bartlett's homogeneity test, and no significant differences among generations were observed. The slight increase in genetic variability over time, which is evident in the whole set of intraclass correlations, is too small to be demonstrated when considering the variances of each trait separately.

Table 3 Variances among lines and within lines (laboratory generations) of Drosophila melanogaster

Correlation and regression between wild-living females and laboratory progeny

Regression coefficients between wild parents and laboratory offspring are a classical means of estimating heritability (Falconer, 1989). In the present case, because one sex only (mother–daughters regression) was considered, heritability should be twice the regression coefficient. To be applied correctly, this formula implies that parents and offspring developed in the same environment and exhibit similar phenotypic variances, a condition which obviously does not apply for wing length. In this case, it may be suspected that additive genetic variance in nature is different from the same parameter in the laboratory, and various techniques can be used to approach this problem (Riska et al., 1989). The classical mother–daughter regression, b (Table 4), was calculated, as well as two other quantities suggested by Riska et al. (1989) — γ2h2 and h2v — both of which could provide estimates of heritability in nature. As several laboratory generations were available, it was possible to correlate wild-living females with their F1 progeny and also with successive generations (Table 4). The parent–F1 regression was very low (0.02) and not significantly different from zero. The same coefficient, calculated with more distant generations, tended to increase, becoming significantly greater than zero. Such variations are difficult to explain, except by random fluctuations between samplings. The best estimate of the regression is thus the mean value, 0.07±0.01, significantly greater than zero and suggesting a heritability of 0.14±0.02.

Table 4 Relationship between wild-living parental female Drosophila melanogaster and laboratory progeny in successive generations

The quantity γ2h2=4b2(VPN/VAL) (where VPN is the phenotypic variance in nature and VAL is the additive genetic variance in the laboratory) proved to be useless, with very high values, one of them greater to 1 (Table 4). The quantity h2v=VAL/VPN was always very low, ranging between 0.025 and 0.042. As indicated by Riska et al. (1989), it is a lower bound of heritability in nature.

The difficulty of heteroscedasticity did not exist for pigmentation traits, and the mother–daughter regression appears as a convenient estimate of heritability. In several cases (segments 6, 7 and the sum), a tendency existed for an increased regression coefficient when more distant progeny generations were considered. We suggest again that the average value should be considered as the best estimate of heritability. Surprisingly, heritability for segment 5 seems to be very low and not different from zero. For other traits, values range between 0.20 and 0.31, corresponding to heritabilities from 0.40 to 0.62.

Correlation between laboratory generations: genetic repeatability

A highly significant line effect in ANOVA (not shown) as well as the graphs in Fig. 1 show that the mean value of each line remained quite stable, or repeatable, over successive generations. A way to estimate this genetic repeatability is to calculate the correlation between the mean values of the lines in different generations (Table 5). All coefficients were significant, demonstrating a good repeatability of all traits. Significant differences were, however, found between traits (ANOVA, not shown). The least repeatable, or least stable, are wing length (r=0.55), thoracic trident and abdomen segment 5 (r=0.67), whereas segments 6, 7 and the sum exhibit correlations greater than 0.8 (Fig. 2).

Table 5 Coefficients of correlation (genetic repeatability) between mean values of the 37 isofemale lines of Drosophila melanogaster in different generations
Fig. 2
figure 2

Relationship between mean values of isofemale lines of Drosophila melanogaster over different laboratory generations. Sum, sum of last abdominal segments (correlation G1–G2). Wing, wing length in mm×100 (correlation G1–G3). For each graph, the continuous line is the major axis. r=correlation between generations.

It was expected that, because of drift, the repeatability or predictability of the isofemale line characteristics would decrease with the generation interval. A significant but slight effect was detected with ANOVA: the average correlation for all traits was 0.76 when the generation interval was one or two and decreased to 0.69 for longer intervals of seven or eight generations. But the decrease was quite slow: values observed in the various lines at G1 are still good predictors of values found in G9 (r2=0.48).

It might be assumed that the parent–offspring regression technique could also be used with isofemale mean values for estimating heritabilities. In fact, this method does not apply, because we are dealing with a group of parents and a group of offspring. If two successive generations are considered, the expectation of the mean in the second generation is identical to that of the first, and variations will be caused by sampling and by uncontrolled environmental effects. This was verified by calculating mean square regressions for the first three generations, i.e. G2=f(G1) and G3=f(G2). Regression coefficients for the various traits varied between 0.47 and 1.01, with an average value of 0.80±0.05 (n=12). To give equal weight to each generation, independently of their order, the bivariate distributions were characterized by their major axes. Values were calculated for G1–G2 and G2–G3: they ranged between 0.85 and 1.34 with a mean of 1.027±0.044 (n=12). Examples of bivariate distributions are given in Fig. 2.

If the repeatability between generations depends mainly on sampling effects among flies of the same line, it should be influenced by the number of flies measured in each line. In the present case, 10 females were taken for calculating average line values. How the correlation varied according to the number of flies taken from each line by a bootstrap technique was analysed. Results for two characters (sum and wing length) and four pairs of generations are shown in Fig. 3. In all cases, the correlation between family means increased with the sample size. For a highly heritable trait (sum of abdomen pigmentation), a fairly high correlation (r=0.45± 0.03) is already observed by taking only one female from each line. Increasing the sample size increases the correlation. With five females per line, the average r becomes 0.79±0.02, not much less than the final value obtained with 10 flies (r=0.88±0.01). The curves in Fig. 3 suggest an asymptotic value of r for bigger sample sizes that would remain lower than 1. This is because the mean values of successive generations are not expected to be identical, owing to the sampling effect in the choice of parents.

Fig. 3
figure 3

Variation of the correlation coefficient between family means of laboratory generations of Drosophila melanogaster when increasing the number of flies taken from each family. Four sets of generation pairs were used, as indicated. For each set, 30 repeats were made and values averaged.

For a less heritable trait (wing length), correlations between family means were much lower: average values with one, five and 10 flies per line were 0.22, 0.53 and 0.63, respectively.

In practice, two or three flies measured from each line would be sufficient to demonstrate a convenient genetic repeatability for pigmentation. At least five flies would be necessary to achieve the same goal for wing length.

Discussion

In agreement with all previous investigations (David et al., 1980; Coyne & Beecham, 1987; Imasheva et al., 1994; Moreteau et al., 1995), our data confirmed that size variations (i.e. wing length) are much greater in nature than in laboratory flies. Because the mean values of the investigated populations are not the same and, more importantly, because the way wing length is measured varies according to investigations, comparisons of variabilities can be made only with a relative estimate, the coefficient of variation (CV). Average values of wing length CVs from three papers in which several populations of D. melanogaster were investigated both in nature and in the laboratory are given in Table 6. In the present study, CV was found to be 7.36 per cent in nature and 2.22 per cent in the laboratory, i.e. a variance about 11 times less in the laboratory. In Coyne & Beecham's (1987) work, the average ratio was only 3.2; and in Imasheva et al.'s (1994) study it was 5. From Table 6, it appears that average phenotypic variability in natural D. melanogaster populations from various parts of the world is quite stable, ranging between 6 per cent and 7 per cent. Laboratory variability, on the other hand, seems to be more heterogeneous among investigators and is more likely to reflect rearing conditions than genetic differences. In several works using a killed yeast, a highly nutritive food, mean CVs ranged from 1.72 (Moreteau et al., 1995) to 2.46 (Capy et al., 1994). In contrast, values obtained by Imasheva et al. (1994) and especially by Coyne & Beecham (1987) appear significantly higher. This may result, in the latter case, from the use of an instant Drosophila food medium, which must be seeded with live yeast and in which crowding effects are important.

Table 6 Average values of wing length CVs from three papers in which several populations of Drosophila melanogaster were investigated both in nature and in the laboratory

Abdomen pigmentation in wild flies is less documented (Moreteau et al., 1995; present study). Because variability is related to mean value (David et al., 1990), it is better to compare the variances. In the present study, almost identical values were found in nature and in the laboratory for the sum (14.2 vs. 12.8). In Moreteau et al. (1995), variance in nature was twice that found in the laboratory (20.8 against 9.9). These observations suggest that pigmentation is far less influenced by the environmental heterogeneity of natural populations than body size. This is also the case for abdominal bristle number (Coyne & Beecham, 1987).

A major aim of this paper was to analyse the heritability of quantitative traits, and two methods were used: comparison of wild-living flies with their laboratory progeny and analysis of isofemale lines. For each trait, it was possible to compare successive laboratory generations and to calculate either correlations or regressions.

When individual parents are compared with their progeny, heritability can be estimated by the parent–offspring coefficient of regression. For wing length, a very low and non-significant coefficient (0.02±0.03) between P and G1 was found. Prout (1958) found a negative correlation between wild fathers and laboratory offspring. Coyne & Beecham (1987) obtained a low positive value for the mother–daughter regression (0.09±0.07). This reflects the fact that, in nature, the major part of the phenotypic variability is attributable to an environmental, non-heritable component, whereas the additive genetic variance is not significantly increased. In our case, the mother–daughter regression could be extended to the mother–granddaughter, mother–greatgranddaughter, etc. up to the ninth generation. Surprisingly, the regression, which was not significant with G1, became significant with later laboratory generations, providing a highly significant mean value over generations (0.07±0.01). This value, corresponding to an average heritability of 0.14, shows that size variations in nature are indeed slightly heritable. Also, Coyne & Beecham (1987), by investigating both sexes, came to a mean value of 0.22.

Under laboratory conditions, heritability can be estimated by the coefficient of intraclass correlation (Hoffmann & Parsons, 1988; Falconer, 1989). For wing length, the average t-value was found to be 0.34±0.03, close to the average value of 0.40 calculated from many geographical populations (Capy et al., 1994). Under classical theory, t is assumed to correspond to 0.5h2 (Falconer, 1989). In practice, t is usually too high, so that h2<2t (see Capy et al., 1994 for detailed discussion). This may occur from the possible role of dominance and epistatic components in the genetic variance. The founder effect that occurs in each line may initiate some new genetic variation, which will persist in further generations (Capy et al., 1994; Ritchie & Kyriacou, 1994).

For pigmentation traits, phenotypic variances in nature and in the laboratory were similar, so that no bias was expected in calculating parent–offspring regression. Heritability estimates were higher than for wing length, generally between 0.40 and 0.62. High values were also found for the intraclass correlations, ranging between 0.22 and 0.47, in agreement with previous observations (David et al., 1990; Moreteau et al., 1995; Gibert et al., 1996).

Pigmentation is certainly a highly heritable trait, both in nature and in the laboratory. A similar conclusion seems also to be valid for abdominal bristle number (Coyne & Beecham, 1987). In a recent literature survey, Weigensberg & Roff (1996) argued that, in most organisms, there were no significant differences between laboratory and field estimates of heritability. Our data agree with that conclusion for pigmentation but not for size. Traits that are very plastic, i.e. very sensitive to environmental fluctuations, are likely to show a lower heritability in nature.

Once isofemale lines are established, it may be interesting to demonstrate that variation observed among them is really heritable and not caused by experimental accidents, such as common environment effects. The demonstration is provided by considering the correlation between line mean values of different generations. This correlation, which may be called genetic repeatability, does not provide an estimate of heritability. The regression method between two generations cannot be applied, and it was verified that the major axes of the ellipses of different traits had, on average, a slope close to unity. It was found that the average repeatability (r=0.55±0.04) for wing length, which has a lower heritability, was significantly less than the repeatability of highly heritable traits, such as abdomen segments 6, 7 and the sum (repeatabilities above 0.80, see Table 3). Also, the optimum family size for demonstrating a genetic repeatability among generations is higher for less heritable traits.

A last and practical conclusion of the present study is that the genetic properties of the isofemale lines remained quite stable over nine generations for all the traits investigated. This stability is best shown by the stability of the overall mean in successive generations. Some evidence exists that the variance between lines and the intraclass correlations increased over time, presumably because of genetic drift, but the effect remained small. Of course, this conclusion applies only to the traits investigated here and to a given method of propagating the lines (killed yeast food and 25°C). Isofemale lines appear to be a convenient way of keeping the quantitative characteristics of a natural population in the laboratory, at least over several months. The possible effects of different environments (e.g. different temperatures) could thus be analysed with excellent precision by keeping subsets of the same lines in different conditions and measuring their characteristics in the same environment periodically.