Introduction

Daily, seasonal, and spatial variation in environmental temperatures can have profound effects on the patterns of daily activity for ectotherms (Huey & Pianka, 1977; Clarke, 1996). Large ectotherms have well-developed capacities for behavioural temperature regulation and can thus minimize the impact of these environmental effects (Stevenson, 1985). However, small insects such as Drosophila melanogaster are essentially isothermal with their microenvironment (Stevenson, 1985) and can control body temperatures only by moving to favourable habitats (Casey & Hegel, 1981). Selection on these animals may favour evolutionary shifts in thermal sensitivity of physiology along environmental thermal gradients. The common observation that temperature sensitivity in Drosophila varies geographically (Levins, 1969; Coyne et al., 1983; David, 1988; Hoffmann & Parsons, 1991) suggests that selection on thermal sensitivity is important in nature and that genetic variation is available for at least some components of thermal physiology.

One way to examine the standing genetic potential for rapid evolution of temperature sensitivity is through selection experiments. Previous studies with Drosophila and other ectotherms have used either laboratory natural selection in different thermal environments (Stephanou & Alahiotis, 1983; Cavicchi et al., 1985; Bennett et al., 1992; Partridge et al., 1995) or artificial selection on either heat or cold tolerance (Kindred, 1965; White et al., 1970; Morrison & Milkman, 1978; Pennycuik, 1979; Stephanou & Alahiotis, 1983; Quintana & Prevosti, 1990; McColl et al., 1996; Hoffmann et al., 1997). Artificial selection for resistance to high temperatures in Drosophila is usually carried out through family selection because high temperature exposure often sterilizes the surviving flies (Morrison & Milkman, 1978; Quintana & Prevosti, 1990).

Here we present the results of artificial selection on knockdown temperature (Tkd), a convenient index of high-temperature sensitivity (Huey et al., 1992). Tkd is that temperature at which the fly loses its ability to grip a textured, inclined surface inside a Weber column (Weber, 1988; Weber & Diggins, 1990). The measurement is similar to the ‘critical thermal maximum’ (loss of righting response) commonly used to study heat sensitivity in diverse ectotherms. Knockdown temperature is correlated with the ability to locomote at high temperatures (Gilchrist, unpubl. data). It can be quickly measured on hundreds of individuals with no apparent increase in mortality or infertility, so that family selection is unnecessary. A preliminary experiment (Huey et al., 1992) suggested that Tkd would exhibit a rapid response to selection.

We selected on a recently collected population of Drosophila melanogaster to explore the magnitude of genetic variation in Tkd. We initially set up six lines selected for high Tkd and six control lines; we selected the top 25% of the flies in the Up-selected lines, and a random 25% from the Control lines. We initially chose not to select for low Tkd (e.g. the bottom 25%) because we were concerned that down selection could select for inferior flies or for flies that simply ‘let go’ when dropped into the column. For example, D. melanogaster with low fitness because of mutation accumulation often fell through the Weber column without ever getting a grip (Huey, unpublished observation).

Within the first few generations of selection, it became apparent that Tkd was bimodally distributed. Although the small selection of Tkd histograms in Fig. 1 show some lines that might be interpreted as simply skewed to the left, in generation after generation most of the control lines showed two peaks separated by a valley. Furthermore, the estimates of the means and standard deviations of the peaks were remarkably consistent (Fig. 2; see Results for details). Beginning at generation five, we selected flies from within the lower mode (‘Low-selected’) as a surrogate for down-selecting by truncation in a similar manner to the Up-selected lines. We contrast our results with a recent study of selection on knockdown time (McColl et al., 1996; Hoffmann et al., 1997).

Figure 1
figure 1

Histograms (proportional) of Tkd for generations 0 and 46 for three Up-selected and Control lines, and the three Low-selected lines of Drosophila melanogaster. The lines show the maximum likelihood fit for a bimodal distribution (fitted parameters in Apppendix 1 and 2). Female distributions are shown; the distributions of males are similar.

Figure 2
figure 2

Distribution of parameter estimates for the bimodal model (Controls only). (a) The estimated means Mlow (black) and Mhigh (grey) over all generations. (b) The estimated standard deviations Slow (black) and Shigh (grey) over all generations.

Materials and methods

Establishment and maintenance of lines

Drosophila melanogaster used in this experiment originated from a large population (≈1000 isofemale lines, courtesy of L. Harshmann & M. Turelli) collected from Escalon, CA in May 1991. This stock was maintained at the University of California, Davis for almost 1 year (≈22°C, 13 L:11D cycle). In April 1992, we received a sample (≈1000 flies) and began rearing the flies at a controlled density (≈50 eggs/vial; cornmeal, molasses, yeast, agar, tegosept) before transfer to population cages of 2000–3000 adults with discrete two-week generations at 25°C (12 L:12D).

The six Control lines and the six Up-selected lines were each established with six half-pint bottles, with each bottle stocked with ≈100 flies haphazardly selected from the population cages on 15 November 1993. The eggs for the initial generation of selection were collected on the following day. For each line, ≈50 eggs were placed in each of 28 vials and reared to adulthood at 25°C (12 L:12D). Eleven days later, the eclosed flies were transferred to fresh vials with a pinch of dry yeast.

Measuring and selecting on knockdown temperature

The first generation of selection occurred on 29 and 30 November 1993. Six haphazardly chosen lines (≈1000 flies, estimated by volume) were run on each day. Basic procedures for measuring knockdown temperature are detailed in Huey et al. (1992). In brief, adult flies are tossed into the top of a Weber column (120 × 7.5 cm) with internal baffles (Weber, 1988), that is surrounded by a temperature-controlled water jacket. Water is continuously circulated through the jacket from a five-gallon water bath with a Haake E30 heater/pump and a Haake W40 immersion cooler. The temperature of the water is heated from 30 to 50°C over about 20 min. The air temperature within the column lags about 2–3°C behind the water temperature, but is relatively uniform (±0.2°C) between any two points within the column. As the column heats, the flies eventually lose their grip on the walls or baffles and fall through an opening in the bottom. Column temperatures are monitored, and the flies are fractionated in 0.5°C intervals, and later sexed and counted.

We selected for high knockdown temperature (Up-selected lines) by retaining the top 25% of the flies (≈250 individuals) from a particular run. Males and females were selected separately. In each Control line, 25% of all males and females were chosen haphazardly. The ‘selected’ flies were then divided among six half-pint bottles containing 30 mL of medium and a sprinkle of yeast; they then were incubated at 25°C.

Selection on the Low lines (selected from the lower mode of the Tkd) began in generation 5 using a targeted selection protocol rather than the truncation selection used for the Up-selected lines. In generation 3, we set up a base stock by pooling ≈300 eggs from each of the six Control lines. At generation 4 of selection, we set up three lines each with ≈125 males and 125 females from this base stock divided among six bottles per line. These animals were reared and eggs collected as for the Up-selected and Control lines. Beginning at generation 5, the three Low-selected lines were run through the column; we selected the 25% of the population centred around 37°C (in early generations, the selected flies were collected at 36.0–38.0°C; in later generations, this window was narrowed to 36.5–37.5°C). For generations 6 and 7, we selected the Low-selected flies but did not count and record every fraction because of time constraints. By convention, we will use the same numbering scheme to refer to the generations in all three selection treatments.

In all lines, the selected males and females were held together for one week before collecting eggs for the next generation. This lag for remating meant that we could run fewer generations per unit time, but had the advantage of increasing the strength of selection. Drosophila melanogaster shows a high degree of last-male sperm precedence, so that at least 85% of the offspring produced by a doubly mated female are sired by the second male (Gromko et al., 1984). Test crosses between our stocks and flies homozygous for a dominant eye colour mutant (brown) showed that over 90% of females remated within one week, with high levels of last-male sperm precedence. [Note: the alternative of collecting and selecting virgins was not practicable.]

Eggs were collected and transferred to vials (≈50 eggs per vial) that were then held at 25°C (12 L:12D) until hatching. After eclosion, adults were transferred to fresh media with yeast 2–3 days prior to the next round of selection.

Here we report on the response to selection to generation 46. Selection was skipped in generations 2, 12, 17, 28, 30, 32, 34, 35, 37, 38, 40, 41, 43, 44, and 45. In these generations, ≈125 adults of each sex (haphazardly selected) were used to found the next generation.

Statistics

The parameters of the unimodal and bimodal (mixture of two normal) distributions were estimated through maximum likelihood (Lynch & Walsh, 1998). We programmed S-PLUS (StatSci, 1993) to estimate the two parameters of a unimodal normal distribution (N(Mu, Su), where N(x¯,σ) indicates the normal distribution probability function, Mu=mean x¯, and Su = standard deviation σ) and the five parameters of a bimodal normal mixture distribution (Plow× N(Mlow, Slow) + (1 – Plow) × N(Mhigh, Shigh), where Mlow and Mhigh = means of the low and high modes, Slow and Shigh = standard deviations of low and high modes, and Plow = proportion of the population falling under the low mode). The hypothesis that the bimodal distribution provided a better description of the data was tested using a log-likelihood ratio test for each line in each generation (see Results). The G-statistic follows a chi-squared distribution, with the appropriate degrees of freedom being the difference in the number of parameters in each model (d.f.=3 in all tests). Similar results are obtained with other tests, such as the AIC; however, the log-likelihood test is more conservative.

Occasionally several flies fell out of the Weber column at an unusually low temperature, creating a spurious peak in the maximum likelihood estimation of bimodal distribution parameters. In these cases, the parameter estimates for Mhigh and Shigh were actually the estimates of Mlow and Slow. In the Low-selected lines where this was most commonly observed, the erroneous parameter estimates were identified by inspection and re-estimated prior to final analysis. A second potential problem is that a five-parameter model will give a better fit than a two-parameter model to almost any data set that departs from normality. One could argue that the distribution of Tkd is not really bimodal, but simply skewed or platykurtic Visual inspection of the data (e.g. Fig. 1) does not support this contention; we consistently find two peaks with predictable means and variances in generation after generation. Transformation fails to remove the bimodality. Additional statistical procedures will be described in the Results.

Heritability estimates

The realized heritability of knockdown temperature was estimated by regressing the response to selection against the selection differential over the first 10 generations of selection (Falconer, 1989). The replicated lines were treated as fixed factors in the analysis, with heterogeneity within a selection treatment determined by a comparison of slopes. Because some small unknown portion of the Up- and Low-selected lines in each generation are sired by unselected males (see above), the selection differentials represent maximum rather than realized values, resulting in an underestimate of the true heritability (Falconer, 1989). For the heritabilities of the bimodal parameters, the selection differential for the low mean was computed as the difference between the mean Tkd of the selected flies and Mlow of the total population. The selection differential for the high mean was computed as the difference between the mean Tkd of the selected flies and Mhigh of the total population.

To see how the heritability changed over the course of the experiment, we regressed the response to selection against the selection differential for each line over a sliding window of 10 selection episodes (regressions ending with generations 11–46 for the Up-selected lines and generations 16–46 for the Low-selected lines).

Relaxed selection lines

Relaxation of selection is often accompanied by a regression of the selected lines towards the controls, reflecting reduced fitness of selected lines (Clayton et al., 1957). In generation 23, we collected ≈250 flies from three Up-selected lines and removed them from selection. In each generation, 25% of 1000 haphazardly chosen flies were collected for oviposition. Tkd was measured on these relaxed selection lines in generation 46.

Results

The bimodality of Tkd

The distributions of knockdown temperature for males (the distributions for females are similar) at the start of selection (generation 0 for the Up and Control lines, generation 5 for the Low lines) and at generation 46 (after 32 generations of selection) are shown in Fig. 1. The bimodality of the Tkd distributions was visually apparent in most of the lines at the start of selection, and was still evident in the Controls at generation 46. This bimodality is not an artifact, as the two modes appeared consistently in the Control lines in nearly every generation.

A summary of the mean parameter estimates for the unimodal and bimodal distributions is provided in Table 1. Parameter estimates for all lines at the start of the experiment and at generation 46 are tabulated in Appendices 1 and 2. We used likelihood ratio tests to compare the goodness-of-fit to the unimodal and bimodal distributions (see Materials and methods). A bimodal normal mixture distribution gave a better fit (likelihood ratio test, P < 0.05) than a unimodal distribution in 87.8% of the Control runs, 98.2% of the Up runs and 64.1% of the Low runs. The two modes at the start of selection were at about 38.5°C and 40.5°C, with about 60% of the flies falling under the lower mode (Table 1).

Table 1 Parameter estimates for unimodal and bimodal distributions of knockdown temperature in Drosophila mealnogaster. The values shown are the mean (with SD in parentheses) of the replicate lines at the start of selection (generation 0 for the Controls and Up-selected lines, generation 5 for the Low-selected lines). Trt, selection treatment; Gen, generation; Con, control

The means and standard deviations of the two modes were consistent throughout the course of the experiment in the Controls, as shown by the histograms in Fig. 2. The low mean was usually near 38.0–38.5°C, and the high mean near 40.5°C. Note that in several cases, the mean of the upper mode was estimated at ≈38.5°C (Fig. 2a); these correspond to cases where the maximum likelihood estimate of Mhigh actually corresponds to the expectation for Mlow. These are the same observations in which the estimates of Shigh are large and similar to the expectation for Slow (Fig. 2b), indicating that the maximum likelihood routine has identified a spurious lower mode.

No evidence suggests that the bimodality is simply an artifact of mating or developmental status of the flies. First, both males and females have nearly identical bimodal distributions of Tkd, suggesting that sex or mating status does not determine the bimodality. Males and females are held together 3–5 days prior to testing, so the proportion of virgin females must be extremely small; individual females collected after a Tkd assay virtually all lay viable eggs (Huey et al., 1992).

Knockdown temperature drops slowly as the flies age, but the differences are small. Because flies for each selection episode are collected on a single day but selected over two days, the effect of age differences can be assayed by comparing the Tkd parameters for flies selected on the first day and those selected on the second day of each generation (Table 2). Repeated measures ANOVAS for the lines over the entire experiment show a significant decrease between day 1 and day 2 in Mu for the Up-selected (F1,372=13.49, P < 0.001) and Control lines (F1,372=14.07, P < 0.001), in Mhigh for the Up-selected (F1,372=14.07, P < 0.001) and Control lines (F1,372=16.14, P < 0.001) and for Mlow in the Controls (F1,372=10.58, P < 0.002). No significant differences were found for the Low-selected lines. The proportion of individuals falling under the lower mode (Plow) was not significantly different between the two days for any selection treatment. Thus, although Tkd does drop with age, it is the means of the modes along the thermal gradient, not the bimodality itself, that are changing.

Table 2 Mean (±SD) difference between various distribution parameters of 5-day-old VS. 6-day-old Drosophila melanogaster across all generations. The numbers in parentheses are the numbers of generations summarized in the statistics

Bimodality in other populations

The lines used in the present study were collected in North America. We also measured Tkd on D. melanogaster from two other continents to see if this trait exhibits a bimodal distribution in other populations. European flies were represented by Linda Partridge’s long-term selection lines (flies reared at 16.5/18°, 25° or 29°C under laboratory natural selection) which were originally collected in England over 10 years ago (Huey et al., 1991; Partridge et al., 1995). The Australian flies were from 13 populations collected along a latitudinal transect by Avis James and Linda Partridge (James et al., 1995). The parameter estimates for the distributions are summarised in Table 3. The distribution of Tkd is significantly better explained by a bimodal model in nine out of nine male and six out of nine female lines in the British flies, and in 12 out of 13 male and nine out of 13 female lines in the Australian flies.

Table 3 Parameter estimates for unimodal and bimodal distributions of knockdown temperature for Drosophila melanogaster collected in the UK and Australia. The values shown are the mean and SD of the various lines

In contrast, the Tkds of a distantly related drosophilid, Drosophila subobscura, sampled from six populations along the west coast of North America, all exhibit a unimodal distribution (mean ± SD; females: Mu = 38.20 ± 0.370, Su=1.12 ± 0.267; males: Mu= 38.51 ± 0.383, Su=1.05 ± 0.130), similar in position and shape to the lower mode of D. melanogaster.

Response trajectories

The response trajectories for selection on knockdown temperature of females are shown in Fig. 3. The coefficients of the cubic polynomial regression lines depicted in Fig. 3 are listed in Table 4 (statistics included for both sexes). The Up-selected lines showed a linear increase in Mu until approximately generation 20, at which point the selection response began to plateau (Fig. 3a, Table 4). The response in Mu over the first 20 generations reflected both a reduction in the proportion of the population under the lower mode (Plow, Fig. 3c) and a small increase in the value of the mean of Mhigh (Fig. 3d). After generation 25, virtually no flies were left under the low mode. Thus Up-selection led to a loss of the lower mode and a slight shift (Table 1) in the mean of the upper mode. The variance of Tkd within the Up-selected lines dropped steadily during selection (Fig. 3b, Table 4).

Fig. 3a–d
figure 3

Response trajectories for (a) mean Tkd, (b) the variance of Tkd (c) the proportion of the knocked-down flies under the lower mode (Plow) and (d) Mlow for the Low-selected and Mhigh for the Up-selected flies over the 46 generations of rearing. The lines are third-order polynomial regressions (estimated parameters in Table 4). Female response trajectories are shown; the trajectories for males are similar.

Table 4 Coefficients of the cubic polynomial regression lines of the mean (Mx) and the variance (Sx2) of knockdown temperature on generation for the trajectories of the Up, Low and Control selection treatment in Drosophila melanogaster. The proportion of individuals under the lower mode (Plow) was arcsine square root transformed prior to analysis. Trt, selection treatment; Stat, statistic; Con, control

The evolution of Mu in the Low-selected lines (Fig. 3a, Table 4) involved both a rapid shifting of flies from the upper to the lower mode of the distribution (Fig. 3c) and a downward shift in Mlow (Fig. 3d). Because the Low lines were selected for a targeted range of temperatures rather than an extreme, the directional component of selection diminished as Mu converged on the targeted temperature (approximately generation 26). The variance for Mu dropped rapidly in the first few generations of selection (Fig. 3b), but showed no significant linear decrease over the course of the experiment (Table 4). In spite of the stabilizing selection on Mlow, no significant linear change was observed for Slow over the course of the experiment (Table 4).

Females in the Control lines showed a slight decrease in Mu, Mlow and Mhigh throughout the course of the experiment (Fig. 3a, Table 4), whereas males showed no significant trends. No linear change in the proportion of individuals under the two modes was detected for either sex.

Although the increase in Mu for the Up-selected lines was dramatic, the maximum Tkd within those lines did not evolve in response to selection. We tested this by comparing the highest temperature at which any individual fell out of the Weber column in the ‘early’ generations (0, 1 and 3) vs. the ‘late’ generations (39, 42 and 46) for each of the six Up-selected lines. No significant difference in the maximum Tkd was observed for females (early x¯ ± SD: 43.14 ± 0.427°C; late: 43.28 ± 0.354°C; paired t-test: t5=−1.39, P=0.22) or for males (early: 42.83 ± 0.486°C; late: 43.19 ± 0.359°C; paired t-test: t5=−1.57, P=0.18).

Heritability estimates

The response to selection, as a function of cumulative selection differential, is shown in Fig. 4 (males are shown; the plots for females are similar). Fig. 4(a) is the plot for Mu; Fig. 4(b) shows the response of Mhigh in the Up-selected lines and Mlow in the Low-selected lines. The slopes of the regression over the first 10 generations of selection (solid bold lines, Fig. 4) provide the estimate of realized heritability; heritabilities of the unimodal mean and of the two means from the bimodal distribution are reported in Table 5.

Figure 4
figure 4

The response to selection as a function of the cumulative selection differential. The solid regression lines represent the mean heritability over the first 10 selection episodes. (a) Mu for the Up-selected lines and the Low-selected lines. (b) Mhigh for the Up-selected lines and Mlow for the Low-selected lines.

Table 5 Realized heritability estimates of the unimodal and bimodal means of knockdown temperature in Drosophila melanogaster over the first 10 generations. The t-statistic tests the null hypothesis that the regression coefficient of the mean trait value on the cumulative selection differential is not significantly different from zero. The asterisk by the heritability estimate indicates that there was significant heterogeneity (P < 0.05) among the lines within the selection treatment (by ANOVA). Trt, selection treatment; Stat, statistic

All of the heritability estimates for Mu were significantly different from zero. Males and females scored similarly. The heritability of Mu in the Up-selected lines was around 0.12, whereas that for the Low lines was ≈0.19. Within each selection treatment, the heritabilities of the positively selected modes (i.e. Mlow in the Low-selected lines and Mhigh in the Up-selected lines) were higher than the heritabilities of the modes being selected against (Table 3). The heritability estimates were significantly heterogeneous among the replicates within a selection treatment for three of the 12 values estimated (Table 5).

The change in heritability over the course of the experiment is depicted in Fig. 5. The mean heritability ± 1 SD, computed over a sliding window of 10 selection episodes, is plotted as a function of generation. Fig. 5(a) depicts the change in heritability of Mu; Fig. 5(b) depicts the change in Mlow for the Low-selected and Mhigh for the up-selected lines. The Up-selected lines show little decline in heritability until after generation 15; thereafter the heritability of both Mu and Mhigh declines fairly steadily.

Figure 5
figure 5

The change in heritability over the course of the experiment. Plotted is the mean ± 1 SD for a sliding window of 10 selection episodes ending at a given generation. (a) The change in Mu. The regression lines are for heritabilities over all generations for the Up-selected lines (h2 = 0.1409 − 0.0021 Generation, R2 = 0.1582, F1,130 =24.43, P < 0.001) and for generations 16 to 26 for the Low-selected lines (h2 = −0.1844 + 0.0225 Generation, R2 = 0.6923, F1,28 = 62.99, P < 0.001). (b) The change in Mhigh for Up-selected lines (over all generations: h2 = 0.1614 − 0.0027 Generation, R2 = 0.2208, F1,130 = 36.84, P < 0.001) and the change in Mlow for the Low-selected lines (over generations 16 through 26; Mlow: h2 = −0.0607 + 0.0203 Generation, R2 = 0.3955, F1,28 = 18.32, P < 0.001).

In contrast, the heritabilities of both Mu and Mlow in the Low-selected lines show a strong and rapid increase until generation 26–27 (Fig. 5a, b). Recall that the Low-selection regime focuses on a target temperature of 37°C; at generation 26, both Mu and Mlow had reached 37°C, so most directional selection ceased. The plummet in the heritability estimates at that point occurred because the directional response to selection approached zero, as shown by the wandering trajectory in Fig. 4.

Response to relaxation of selection

In generation 23, samples from three Up-selected lines were removed from selection to see if their Tkd would regress towards the controls. These populations were maintained in the same manner as the selection lines during generations when we did not select on knockdown temperature (see Materials and methods). We compared Mu, Mlow, Mhigh and Plow of these Relaxed selection lines in generation 46 with those of their paired source populations (Up-selected in generation 23) and also with those of the selected populations in generation 46 (after 12 intervening selection episodes) using paired t-tests with sequential Bonferroni tablewise significance corrections (Rice, 1989). None of the unimodal or bimodal traits in the generation 23 source populations differed significantly from those in the generation 46 Relaxed selection populations (Table 4). Mu and Mhigh were greater for females in the generation 46 selection lines than for females in the generation 46 Relaxed selection lines, reflecting the small increase in these values resulting from continued selection. From generation 23 to generation 46, Mu in the control lines increased very slightly; none of the changes in the bimodal parameters, however, was significant (Table 6).

Table 6 Mean knockdown temperature in Drosophila melanogaster for samples of Up lines removed form selection (“Relaxed selection lines”) in generation 23 and samples retained in the Up-selected regime. Differences between generations and lines are tested using paired t-tests, which are shown beneath the pair of statistics being tested. Trt, selection treatment; Con, control; Gen, generation

Discussion

Knockdown temperature is bimodally distributed in nearly every population of D. melanogaster we have surveyed from England, Australia and North America (Fig. 1, Tables 1 and 3). The distribution of Tkd of flies from North America responded dramatically to selection. The lower mode largely disappeared from the Up-selected lines within 20 generations of selection (mean ± SD selection intensity: 0.92 ± 0.119, pooled across replicates and sexes). Similarly, the upper mode largely disappeared from the Low-selected lines within 20 generations of selection (selection intensity: 0.49 ± 0.348). Furthermore, although selection dramatically increased mean knockdown temperatures in the Up-selected lines, we observed no significant increase in the maximum knockdown temperature over the course of the experiment. Thus knockdown temperature of D. melanogaster seemingly has an upper limit of ≈44°C.

Three previous studies have selected on increased knockdown time (McColl et al., 1966; Hoffmann et al., 1997; Bubli et al., 1998), the time flies exposed to a stressful temperature can maintain a grip on the knockdown column. The two measures are certainly related, because flies selected for a high knockdown temperature also have an increased knockdown time and vice versa (Huey & Gilchrist, unpubl. data), but details of the responses differ. First, knockdown time has a unimodal distribution rather than the bimodal distribution of knockdown temperature. Even populations of D. melanogaster that have been subjected to several generations of selection for long knockdown time retain a bimodal distribution for Tkd. Secondly, knockdown time measures resistance to a chronic, constant high temperature stress, whereas Tkd measures resistance to an increasing temperature stress. Although time under heat stress is unavoidably correlated with the temperature increase, our results indicate that our selection lines have diverged in sensitivity to temperature, not just in knockdown time. Finally, the previous studies have selected for increased knockdown resistance only; ours is the first also to examine both up and down selection on a knockdown trait.

The realized heritability provides an estimate of the additive genetic variation present in the population. The values for Mu in the Up-selected lines, measured over the first 10 generations (Table 5), are similar to those measured for increased knockdown time by McColl et al. (1996; estimate of 0.08–0.11) and Bubli et al. (1998; estimate of 0.05–0.10), but higher than the mean realized heritability (≈0.07) of a wide range of physiological traits (Mousseau & Roff, 1987). Huey et al. (1992) reported a realized heritability for increased knockdown temperature of 0.28; but that estimate was made over only four generations of selection.

Selection seems to have exhausted much of the genetic variation in the Up-selected lines, particularly for Mhigh. The variance both among (Fig. 3a) and within (Fig. 3b) the Up-selected lines greatly decreased over the course of selection. The heritability of Mhigh declined from 0.1 to 0.15 in the first 10 generations to near zero in the last 10 generations (Fig. 5b), suggesting that the genes responsible for high Tkd were near fixation.

Selection had a remarkable effect on the realized heritability of Tkd. Between generations 5 and 26, the heritability of both Mu and Mlow in each of the Low-selected lines approximately doubled (Fig. 5). Although the selection differential decreased in each generation as the population moved towards the target Tkd of 37°C, the response to selection remained nearly constant. This rapid rise in heritability suggests the involvement of one or more major genes influencing Tkd (Latter, 1965; Frankham & Nurthen, 1981). If an allele were initially at low frequency in the source population, the additive genetic variance would initially increase, and then decline, as selection carried that allele towards fixation. The rapid decline in heritability after generation 26 (Fig. 5) is at least partly caused by the selection differential dropping to near zero as the population reached the target Tkd, but may also reflect the decline in additive genetic variance as the allele of major effect approached fixation. Stabilizing selection on the Low-selected flies did not reduce the variation in Mlow over the course of the experiment (Fig. 3, Table 4).

The behaviour of the flies during heating may provide clues as to the functional basis of knockdown temperatures. Knockdown occurs when a fly loses its ability to grip the baffles or walls of a Weber column. The Up-selected flies retain control of their limbs as the temperature rises; they remain clustered near the top of the column until a sudden and catastrophic loss of locomotory function sends the fly tumbling to the bottom of the column. In contrast, flies from the Low-selected lines gradually work their way towards the bottom of the tube as the temperature rises; they lose control of their limbs momentarily but recover; however, each fall carries them lower and lower in the column. Hoffmann et al. (1997) noted a similar behavioural response to selection on knockdown time. In any case, the flies are temporarily immobilized when they come out of the column, usually lying on their backs with legs twitching. This pattern of behaviour is similar to that produced by some temperature sensitive mutations of channel proteins affecting neurological activity (e.g. Grigliatti et al. 1972; Jackson et al. 1985). Complementation tests showed that the two modes were not caused by allele variation at comatose, the only X-linked channel mutant known to have this phenotype (A. G. Clark, pers. comm.).

Several lines of evidence suggest that one or two loci of large effect, plus some additive modifiers, largely control knockdown temperature. First, the clear and consistent bimodality of the trait (Figs. 1 and 2) indicates something other than simple quantitative inheritance. Secondly, much of the selection in both the Up- and Low-selected lines over the first 25 generations consisted in moving individuals from one mode to the other, rather than shifting the position of the modes. Thirdly, Up-selected lines removed from selection at generation 23 showed virtually no regression towards the control means (Table 4), suggesting that the genes controlling this trait were either near fixation or conveyed no fitness disadvantage. Fourthly, the rise in heritability of Mu and Mlow in the Low-selected lines from generations 10–26 (Fig. 5) suggests an allele of large effect (Latter, 1965; Frankham & Nurthen, 1981). Finally, exploratory crosses between Up- and Low-selected (Gilchrist & Huey, unpubl. data) lines suggest the presence of at least one major Mendelian factor. Of course, more complete genetic analyses now need to be carried out to clarify the genetic architecture underlying knockdown temperature. Validation of the hypothesized major-gene effect would raise an interesting question: what selective factors maintain a genetic polymorphism that appears to be worldwide and that persists even in a long-term laboratory stock?