Introduction

Brassica oleracea is an agriculturally important species including many vegetables such as cabbage, broccoli and cauliflower. It is closely related to the model dicotyledonous plant Arabidopsis thaliana and so it is expected that information concerning control of basic biological processes in A. thaliana will be transferable to Brassica crops.

Information should also be transferable between different species within the Brassica genus, as the three diploid genomes, A, B and C, of B. rapa, B. nigra and B. oleracea, respectively, are thought to have derived from the same ancestor. These genomes reveal striking conservation of content, although chromosome duplication and translocation have occurred during divergence (Lagercrantz & Lydiate, 1996). The amphidiploid B. napus (oilseed rape) genome is made up of the A and C genomes probably from close relatives of B. rapa and B. oleracea. (U, 1935; Parkin et al., 1995).

Flowering time is not only of scientific interest because it facilitates the understanding of plant development, it is also important in agriculture, because its modification may enable the geographical range of the Brassica crop to be extended. For example, cultivation of B. napus is usually restricted to temperate latitudes, but the development of early flowering cultivars would allow the crop to be grown in low-rainfall regions such as the Western Australian wheat belts (Thurling & Depittayanan, 1992), and more northern regions of Canada (Murphy & Scarth, 1994).

Genes affecting flowering time have been identified in A. thaliana by mutagenesis (Koornneef et al., 1991). For example, mutations of the CONSTANS gene cause delayed flowering under long days, but not short, and the gene has been cloned by chromosome walking (Putterill et al., 1993, 1995).

Two regions influencing flowering time in B. nigra (on LG2 and LG8) have been found to be homologous to the CONSTANS gene region (Lagercrantz et al., 1996). These regions have also been shown to carry quantitative trait loci (QTL) in the A genome of B. napus, and show large-scale colinearity between regions on chromosomes O2, O3 and O9 of B. oleracea.

Previous studies mapping QTL for flowering time in B. oleracea have been reported. For example, Kennard et al. (1994) found significant QTL using single-factor ANOVA on an F2 from a cabbage × broccoli cross, whereas Camargo & Osborn (1996) used F3 families from a different cross between cabbage and broccoli. Because of the lack of both a standardized nomenclature for linkage groups in Brassica species and of common probes, it is not possible to compare the QTL locations found in these studies. Another drawback to these studies is the limited life-span of the populations used. The heterozygosity of individuals prevents their maintenance by selfing, so trials can only be carried out once in a single environment. Bohuon et al. (1998) overcame this by using doubled haploid lines derived from the F1 of a cross between a Chinese kale and a hybrid of calabrese, resulting in six QTL for flowering time being mapped to chromosomes O2, O3, O5 and O9.

The use of doubled haploid plants still has the limitation that segregating populations can map QTL only to a relatively large region of a chromosome. This present paper reports work which reinforces the results of Bohuon et al. (1998) by using the same cross to engineer near-isogenic lines. Repeated backcrossing and selfing of the progeny from this cross have enabled the production of lines that contain small segments of donor parent DNA within a recurrent parent background (Ramsay et al., 1996). Differences in phenotype between the lines and the recurrent parent should be caused by the substituted region. The substitution lines should be homozygous so that the lines can be maintained and multiplied by selfing allowing the same lines to be grown in different environments so that experiments can be repeated over a number of years and conditions. The precision to which a QTL can be mapped depends upon the size of the introgressed region, but comparisons between different substitution lines can narrow these regions further.

Materials and methods

Production of experimental lines

The parent plants were descended from microspore-derived doubled haploid lines. The recurrent parent is A12DHd, derived from Brassica oleracea var. alboglabra, and the donor is GDDH33, which is Brassica oleracea var. italica, derived from a commercial calabrese F1 hybrid variety, Green Duke (Bohuon et al., 1996). For simplicity, the doubled haploid lines used as parents will be referred to as A12 and GD, respectively.

The substitution lines were produced (as described in Ramsay et al., 1996) by using a single F1(A12 × GD) to pollinate three A12 plants to produce the first backcross generation (BC1). A12 was used as the female to ensure that all cytoplasm from the first backcross generation and subsequent generations was A12 in origin. Lines were selected for a high number of purely recurrent linkage groups, wide coverage of the genome by the donor introgression and duplications of introgressed regions. Lines were backcrossed again and then selfed two or three times to produce lines with homozygous regions of introgressed GD DNA in an A12 background.

Seventy-eight RFLP probes selected from those previously described by Bohuon et al. (1996) to construct a linkage map from doubled haploid lines produced from the F1 of this cross, were used to map the substituted regions in the lines together with an additional 119 AFLP markers (Sebastian et al. 1999). This gave an average spacing between markers of 4.5 cM. The positions of the introgressed GD DNA in the 79 substitution lines are shown in Appendix 1. The minimum substitution indicates the region known to be GD DNA, whereas the maximum substitution includes the region between A12 and GD markers in which recombination has occurred. It is not yet possible to define the proportion of this region which is GD, so the whole section must be taken into account.

Trial plan

Eight field trials were carried out over a period of five years; one each in 1994 and 1995, two each in 1996, 1997 and 1998. Lines were used in trials as they became available from the breeding programme. Over the five years, 79 substitution lines, with varying numbers of replicates, were grown alongside the recurrent and the donor parents. Table 1 shows the sowing dates and the number of lines and replicates for each trial.

Table 1 The sowing dates for the eight Brassica oleracea trials over the five years, with the number of lines and the number of replicates. A12 and GD are the parent lines and SL denotes the substitution lines

Seeds were sown in 5 cm plastic pots, using John Innes compost no. 3, in the glasshouse and placed in a single block with complete individual plant randomization, surrounded by guard plants. This design was used to maximize the power to detect QTL (Kearsey & Pooni, 1996). The glasshouse was unheated and unlit so that temperature and daylength depended on natural conditions.

After nine days the number of germinated seeds was noted, and plants were thinned from two to one per pot. Any pots in which no seed germinated were replaced with guard plants from the same line. The pots were moved to an open-ended polythene tunnel after 15 days for acclimatization. After 23 days the seedlings were planted in the field in the same random position. The field was set out in rows of 100 plants, spaced 76 cm between rows and 31 cm between plants in the rows. Guard plants were grown around the perimeter and in the spaces left by dead plants. The whole trial area was protected by nylon netting to prevent bird damage. Plants were irrigated during dry periods, and chemical treatments were used to control pests.

Data collection

Flowering was checked daily, and scored as the number of days from sowing to the first flower opening on each plant. Data from plants that were badly affected by cabbage root fly or grey aphid were removed. Other traits were also recorded, but their analysis will be published elsewhere.

Data analysis

1994, 1995 and 1996 trials Four one-way analyses of variance (ANOVAs) were carried out to test for variation between lines within each trial in 1994, 1995 and 1996. The within-lines mean squares from these ANOVAs were used to carry out Tukey–Kramer multiple comparison tests (Dunnett, 1980), to determine which substitution lines flowered significantly earlier or later than the A12 recurrent parent.

1997 trials Seeds for each substitution line in both trials in 1997 were taken from two different parent plants which had been grown in two completely randomized blocks in glasshouses. This was to allow maternal effects in the progeny to be estimated. The two field trials contained the same number of plants from the same parent plants so that variation between trials could also be examined.

The data for both trials were standardized to mean zero and variance one, to minimize genotype-by- environment effects arising from different ranges in flowering time, so allowing the trials to be analysed together. A cross-classified hierarchical ANOVA was carried out for the substitution lines on the standardized data to test for interactions between genotypes and environments. This was repeated for the A12 parents in the two trials. The within-lines mean square for these ANOVAs were combined for use in a Tukey–Kramer multiple comparison test.

1998 trials The lines used in the 1998 field trials were selected as being earlier or later flowering than A12 based on the analysis of lines in previous years. This allowed the use of one-tailed tests to compare each line against the recurrent A12 parent. The data from each trial were standardized permitting the trials to be analysed together.

Results

1994, 1995 and 1996 trials

The ANOVAs for the four trials in 1994, 1995 and 1996 indicated significant variation between lines within trials, implying that the different substitutions of GD affected the flowering time of the lines.

The Tukey–Kramer test showed the lines SL112, SL113, SL140, SL175 in 1994; SL113, SL129, SL140, SL172 in 1995; SL102 in 1996 trial 1; and SL102, SL128, SL137, SL157, SL172, SL175 and SL178 in 1996 trial 2 were significantly later flowering (at the 5% level) than the A12 parent. Line SL108 was significantly earlier flowering than A12 in 1995.

1997 data

The mean and standard deviation for flowering time for trial 1 and trial 2 (=64.889, s=2.527 and = 65.476, s=4.041, respectively) were used to standardize the data.

The cross-classified hierarchical ANOVA (Table 2a) showed that the variation between lines was highly significant (P < 0.001), and that replicate families within lines also differed (0.01 > P > 0.05). The latter indicates the importance of maternal effects. The interaction of lines with trials, and the families within lines interaction with trials, were not significantly different from each other (1.67/1.12=1.49, P > 0.05), although families within lines did interact with trials significantly when compared to the error (P < 0.01). As expected, the variation between trials was very small (P > 0.05), any difference from zero being attributed to unequal numbers of observations in each trial.

Table 2 (a). The cross-classified hierarchical ANOVA for the Brassica oleracea substitution lines in 1997 trials 1 and 2. The F-value is taken to be nonsignificant (NS) when P > 0.05; *indicates 0.05 > P > 0.01, **indicates 0.01 > P > 0.001; ***indicates P < 0.001 (b). The cross-classified ANOVA for the A12 recurrent parent in 1997 trials 1 and 2

A similar ANOVA was carried out for A12 recurrent parent plants (Table 2b), showing that the variation between different families within the A12 line and the variation between trials were not significant. However, interaction between trials and families was just significant despite the scalar transformation.

In order to carry out the Tukey–Kramer multiple comparison, the mean square ‘between families within substitution lines’ combined with the mean square ‘between families’ for the A12 was used as the error variation. The Tukey–Kramer test showed that lines SL115, SL134, SL141 and SL175 were significantly later flowering than the A12 recurrent parent.

1998 data

Flowering data for the two trials in 1998 were standardized, using the mean and standard deviation for the two trials (=68.417, s=3.742 and =64.257, s=3.551, respectively). ANOVAs were carried out using the combined data from the two trials. Lines SL115, SL121, SL127, SL133, SL134, SL138, SL141, SL142, SL158, SL172, SL175 and SL177 were shown to be significantly later flowering than the A12 parent, whereas SL108 and SL122 were significantly earlier flowering.

Significant substitution lines

Table 3 shows the mean of the A12 parent in each year, together with the means for each line that differed significantly in flowering time from the A12. Over the eight trials it can be seen that only two substitution lines (SL108 and SL122) were significantly earlier flowering, whereas 21 were significantly later flowering than the A12 parent (SL102, SL112, SL113, SL115, SL121, SL127, SL128, SL129, SL133, SL134, SL137, SL138, SL140, SL141, SL142, SL157, SL158, SL172, SL175, SL177, SL178).

Table 3 The mean flowering date for the Brassica oleracea recurrent parent, A12, and the substitution lines that showed significantly different flowering behaviour from A12 over the five years. Bold type denotes those lines that were significant and ‘—’ indicates that the line was not raised in that year

Discussion

Seventy-nine lines were grown over the five years. Of these, only two lines showed evidence for earlier flowering than A12, whereas 21 lines were significantly later flowering. This bias was to be expected because sections of chromosomes from the late flowering GD parent had been substituted into the early flowering A12 recurrent parent.

Of the 23 lines that showed significant variation in flowering behaviour from the A12 parent, 10 have a single substitution, four have two substitutions, eight have three regions and one has four regions substituted with GD. The following interpretation should be read in conjunction with Fig. 1Fig. 2Fig. 3

Figure 1
figure 1

The positions of introgressed regions in the Brassica oleracea substitution lines on chromosomes O1, O2, O3, O5 and O9. Minimum GD regions are shown as a box, whereas the maximum region, including where recombination occurred, is shown as a single line. The number of substitutions contained by each line is noted. The significantly late flowering lines are shaded in black, whereas those that were significantly early flowering are shaded in grey. The regions to which the substitution lines best map QTL are shown by vertical, dotted lines. For comparison, the regions to which the doubled haploid data mapped QTL are also shown.

Figure 1
figure 2

(Continued)

Figure 1
figure 3

(Continued)

Lines with one substitution

Any variation between the A12 and the 10 lines containing just one substitution must result from GD DNA in this region, therefore such lines enable the GD DNA region to be treated as a single segregating factor. Because it is not possible to define the proportion of GD DNA in the region in which recombination has occurred, it was assumed that any QTL present may be anywhere within the maximum substitution.

The late flowering line SL102 had a maximum GD substitution in the region between 0.0 and 35.7 cM on chromosome O1, but the early flowering line SL108 had a minimum substitution between 30.3 and 35.7 cM, so it is unlikely that a late flowering QTL was present in this region (Fig. 1, O1). It is therefore most likely that there was a late flowering QTL between 0.0 and 30.3 cM and an early flowering QTL between 30.3 and 38.1 cM on chromosome O1.

Lines SL128 and SL133 both showed late flowering behaviour. Assuming the presence of just one QTL on chromosome O3, it is likely to be located in the region in which the substitutions in these lines overlap, i.e. between 30.5 and 43.3 cM, although the presence of a second QTL in this region is discussed later (Fig. 3, O3).

Line SL157 suggested the presence of a late flowering QTL between 0.0 and 34.2 cM on chromosome O5 in GD (Fig. 3, O5).

The five late flowering lines with single substitutions on chromosome O9 suggest three separate regions to which QTL may be mapped. Lines SL142, SL175 and SL178 have substitutions which overlap between 70.8 and 106.8 cM; line SL172 has a substitution between 0.0 and 43.0 cM; and line SL177 has a substitution between 43.0 and 64.4 cM (Fig. 1, O9).

If these seven QTL are assumed to be present in the regions described, inferences can now be made about those lines which contain more than one QTL.

Lines with more than one substitution

Of the 13 significant lines with more than one GD substitution, the behaviour of 11 can be explained by QTL found in lines with single substitutions. Inferences for QTL in the two lines SL122 and SL127 cannot be made using substitution line data alone, but are discussed below.

Comparison to previous work

Previous work carried out on this A12 × GD cross by Bohuon et al. (1998) analysed data from field trials using a mapping method based on marker regression (Kearsey & Hyne, 1994). This method showed evidence for six QTL over the two trials analysed in 1994. Single QTL were found on chromosomes O2 and O3, whereas chromosomes O5 and O9 both showed evidence for two QTL.

Fig. 1Fig. 2Fig. 3 provides a comparison of the doubled haploid and substitution line data. The QTL are ‘named’ according to their chromosome and position in the two doubled haploid trials simply to facilitate comparisons (see Bohuon et al., 1998).

Chromosome O1

The doubled haploid data showed no evidence for QTL on chromosome O1, whereas the substitution line data showed the presence of two QTL; one late flowering between 0.0 and 30.3 cM and one early flowering between 30.3 and 38.1 cM. As the regions to which these two putative QTLs map are adjacent to each other, they are likely to cancel each other in the doubled haploid lines.

Chromosome O2

The QTL FTO2.1, which had a late flowering allele from GD, was located at 78 ± 9 cM on chromosome O2. Eight of the substitution lines have introgressed GD DNA at this position. Of these, four flowered significantly late, one flowered early and three did not show significant variation from A12. The five that showed significant flowering variation all have substitutions on other chromosomes which may be affecting flowering time. SL112 and SL113 are thought to contain a region coding for late flowering on chromosome O3; SL121 contains late flowering regions on chromosomes O1 and O9; and the early flowering of SL122 may be caused by a QTL mapped in the region of FTO9.1 in the doubled haploid lines. It is possible that SL127 is reflecting the late flowering effect of FTO2.1. It may also be possible that lines SL112 and SL113 show this effect as well as the late flowering QTL on chromosome O3, because the mean flowering dates for lines SL112 and SL113 were 17 and 16 days later than A12, respectively, in 1994, whereas the other significantly late lines in this year flowered just 12–13 days later than A12.

Chromosome O3

The late flowering lines SL128 and SL133 have single substitutions which overlap, suggesting that FTO3.1 is in the overlapping region, but the maximum substitution in the late flowering line SL115 only just overlaps the maximum substitution in SL133. There is no evidence that the GD substitutions on chromosome O1 or O6 in SL115 affect the flowering behaviour of this line, therefore there may be two QTL affecting flowering time on this chromosome. This view is supported by the size of the effects of the QTL on this chromosome. The lines SL115 and SL133 flowered 2 and 1.5 days later than the A12 parent, respectively, in 1998, whereas line SL134, which overlaps both regions, flowered 6 days later than the A12 parent, implying the presence of two QTL in this line. The presence of a second QTL on chromosome O3 may explain the large confidence interval of FTO3.1 in the doubled haploid data.

Chromosome O5

FTO5.1 maps to 16 ± 13 cM, whereas FTO5.2 maps to 40 ± 10 cM. The substitution line data suggest that there is a QTL between 0.0 and 34.2 cM. This region overlaps with the confidence intervals for both FTO5.1 and FTO5.2, so the substitution lines neither confirm nor reject the two-QTL hypothesis.

Chromosome O9

The doubled haploid data indicated the presence of two QTL on chromosome O9; an early QTL (FTO9.1) at 46 cM in trial 1 and 36 cM in trial 2, and a late QTL (FTO9.2) at 74 cM in trial 1 and 94 cM in trial 2. The marker regression approach used on these doubled haploid data does not give confidence intervals when more than one QTL is mapped to a chromosome. The substitution-line data map a late flowering QTL between 70.8 and 106.8 cM, which coincides with the doubled haploid QTL FTO9.2.

The doubled haploid data did not reveal the two late QTL found in the substitution lines in the regions of 0.0–43.0 cM and 43.0–64.4 cM, but did detect an early effect at 46 cM in trial 1 and 36 cM in trial 2. This may be the cause of early flowering in SL122, which has a GD substitution between 23.1 and 47.0 cM on chromosome O9. The late flowering line SL129 completely overlaps the region to which this early flowering QTL is mapped, but it is possible that this line contains the two late flowering QTLs at this end of chromosome O9, which mask the effect of early flowering.

There is some evidence that the doubled haploid lines show segregation distortion in favour of GD in the region of the early QTL, which may explain why the early QTL alone is detected in the doubled haploid data.

Precision of mapping

The doubled haploid data mapped QTL with 95% confidence intervals that varied between 18 and 52 cM. Usually a confidence interval of 30 cM is thought to be reasonable for analysis of segregating populations. In the substitution lines the interval to which a QTL can be mapped relies on the size of the substitution and also the distance between markers on the linkage map. Substitution lines that showed significant effects for flowering time contained introgressed GD DNA that ranged in length from 8 cM to 43 cM, showing that this method of mapping QTL does enable effects to be mapped more precisely. The substitution line approach identified at least 11 QTL compared to the six detected by the doubled haploids. Substitution lines identified more QTL, because they detected effects that had been masked in the doubled haploid population through dispersion and close linkage.

It is possible that there are still more QTL for flowering time segregating in this cross. Some of the substitution lines contained introgressed regions as long as 84 cM, so it is quite feasible that such large regions may contain more than one QTL, which mask each other’s effect. The substitution lines grown over the five years represent 62% of the entire GD genome, although the actual proportion may be larger because this calculation does not include the region within which recombination has occurred. Only 9.0% of the GD genome is definitely not present in the substitution lines. This implies that there may well be QTL in the regions that are not represented in the substitution lines.

Comparison to other genomes

The use of the linkage map produced by Bohuon et al. (1996) in this study has enabled a comparison of results to be made. This map has also been used to compare the C genomes of B. oleracea and B. napus, and colinearity of the large segments of the A and C genomes in B. napus (Parkin et al., 1995). Field experiments involving Brassica napus populations have indicated that regions on linkage groups in the A genome which show homology to the regions in O2, O3 and O9 also carry QTL for flowering time (Keith, 1996; Salinas-Garcia, 1996; Osborn et al., 1997). These regions have also been shown to be homologous with regions on LG2 and LG8 of B. nigra, to which a flowering time QTL has been mapped (Lagercrantz et al., 1996) and to the region around the CONSTANS gene and several other flowering candidate genes on chromosome 5 in A. thaliana. These regions of homology have been found to map to the regions of B. oleracea chromosomes O2, O3 and O9, and to show similar effects of late flowering in GD (Bohuon et al., 1998). These appear to be the same as the substituted regions in the present study that also showed evidence of QTL for late flowering in GD.

The substitution line data showed similar results to the doubled haploid data, but were not as easily interpreted. To maximize the benefits of the use of substitution lines, it would be advantageous to backcross lines further to reduce the size and number of introgressed GD regions in each line, and to map more markers to the linkage map to enable reduction of the regions in which recombination is presumed to have occurred.

Other measurements

Measurements for height and number of nodes at flowering were also taken in the eight substitution line trials, as these have been found to have a positive correlation with flowering time (Bohuon, 1996). Late flowering tends to be a result of extended vegetative growth rather than stunted or delayed growth (Kowalski et al., 1994). Future work will involve the analysis of these traits and a comparison to flowering time data.