Introduction

Shull (1914) coined the term heterosis to describe the improved vigour of F1 hybrids in comparison to their parental homozygous lines. Since its discovery in the last century (East, 1908; Shull, 1908), it is a widely used yet little understood phenomenon in plant breeding (Schnell, 1982). In general, heterosis is largest in allogamous plants, which are, however, also the most sensitive to inbreeding depression, and heterosis is smallest in strictly autogamous species (Becker, 1993). Furthermore, the relative amount of heterosis increases with the complexity of a trait. A better understanding of heterosis is crucial for agronomically important characters such as biomass and seed yield.

The molecular basis of heterosis is still unknown. Genetic explanations include dominance, overdominance, and epistasis (for review, see Lamkey and Edwards, 1999). With two alleles per locus and no epistasis, heterosis is theoretically a quadratic function of the parental genetic distance (GD) at the underlying quantitative trait loci (QTL) for the trait considered (Falconer and Mackay, 1996; Melchinger, 1999). Experiments with maize showed an increase in heterosis with increasing parental GD (Moll et al, 1962; Melchinger, 1999), but they also suggested an optimum level of parental GD, after which heterosis and hybrid performance decline (Moll et al, 1965).

A better understanding of the biological mechanisms underlying heterosis should lead to more systematic approaches in the identification of suitable parent matches for superior hybrids. Recent improvements in tools for plant functional genomic analysis, for example, expression profiling of parental lines and their hybrids for mRNA, proteins, or metabolites, in conjunction with the use of the well-characterized and fully sequenced genetic model plant Arabidopsis thaliana have opened new avenues for investigating the basis of heterosis (Somerville and Somerville, 1999). The Arabidopsis functional genomics project (The Multinational Arabidopsis Steering Committee, 2002) defined, as a long-term goal for 2010, the study of complex genetic networks, which includes the systematic investigation of heterosis.

In Arabidopsis, only a few traits of agronomic interest have been investigated so far with respect to heterosis, among them rosette diameter (El Asmi, 1975), stem length and biomass yield (Griffing and Langridge, 1963; Li and Rédei, 1969), number of seeds per plant (Alonso-Blanco et al, 1999), as well as phosphate acquisition efficiency (Narang and Altmann, 2001). However, most of these studies were limited in their informative value in that they employed only a few hybrids, or involved mutants as parents. QTL mapping for general viability of seedlings has been used to identify a chromosomal region showing overdominance as an explanation for heterosis in this trait (Mitchell-Olds, 1995).

Since information on heterosis for agronomically relevant traits is rather scant in Arabidopsis, more extensive experiments with additional traits are urgently needed before initiating genomics research programs on the causes of heterosis. The objectives of the present study were to

  1. 1)

    determine the general level of mid-parent (MPH) and best-parent heterosis (BPH) in A. thaliana for traits of agronomic interest in order to answer the question of whether this autogamous species represents a suitable model for investigating the molecular basis of heterosis,

  2. 2)

    measure the variation in the amount of MPH and BPH among different hybrids of Arabidopsis,

  3. 3)

    monitor for reciprocal effects in the F1 and F2 generations, and

  4. 4)

    assess the association between genetic distance and MPH.

Materials and methods

Plant materials

The ecotypes used in this study (Col-0, Ws-2, Wei-0, Aa-0, C24) and their origins were described in detail by Barth et al (2002). Ecotypes were sown on 21 March, 2000. Seeds were grown as described by Barth et al (2000). At the four-leaf stage, approximately 10–11 days after sowing, each seedling was transferred to an individual jiffy pot (7 × 7 × 8 cm3) filled with sterilized soil (Euflor GmbH: 90% peat, 7% perlite and 3 vol % sand at pH of 5–6, salt content <1.5 g/l, nitrogen availability <300 mg/l N, phosphate availability <300 mg/l P2O5, and calciumoxide availability <400 mg/l K2O). Plants in soil were irrigated with tap water.

Material development

In a previous study in our lab (Barth et al, 2002), GD values calculated according to Nei and Li (1979) were determined among 37 ecotypes from molecular data based on 54 polymorphic cleaved amplified polymorphic sequences (CAPS), markers with good coverage of the entire genome. Five F1 hybrids including reciprocals were produced based on small, medium and large GD, and synchronous flowering date of the parents: C24 × Aa-0 (GD=0.32), C24 × Ws-2 (GD=0.46), Ws-2 × Col-0 (GD=0.50), Col-0 × C24 (GD=0.53) and Col-0 × Wei-0 (GD=0.57).

F1 hybrids were produced in April and May 2000. Crossing procedures were as described by Barth et al (2000). The F1 hybrids were sown for selfing on 28 May, 2000. Siliques were harvested at maturity from five individual F1 plants per hybrid.

The materials for measurement of heterosis were grown during the winter 2000/2001 in a greenhouse at the University of Hohenheim, Stuttgart. Sowing of the parents (P1 and P2), as well as the F1 and F2 generations of each hybrid in both reciprocal forms, was carried out on 3 October 2000.

Traits measured

Flowering date (in days) was determined as the number of days from sowing to the appearance of inflorescence. Rosette diameter (in cm) was recorded 48–50 days after sowing on a single-plant basis by measuring the longest leaves in the rosette with an accuracy of ±1 mm. After recording the rosette diameter, each plant was covered with a seed collector (ARACONS, Beta tech bvba). All plants of a subplot, including the siliques but without the root system, were bulked and harvested after senescence into glass jars. Biomass yield (in g) was recorded after drying in an oven to practically zero per cent moisture content. Seed yield (in g) was determined from dried plants, which were hand threshed and passed through a 400 μm sieve (Retsch GmbH) for a minimum of three times for seed purification. 1000-seed weight (in mg) was determined from three samples of 100 seeds from each entry counted under a microscope. Number of seeds per plant (103 seeds) was calculated by dividing total seed yield by 1000-seed weight.

Experimental design and statistical analyses

The experimental design was a split plot (Snedecor and Cochran, 1980) with main plots and sub plots treated as fixed factors. Main plots comprised the five hybrids and were arranged in a randomized complete block design with four replications. The generations (P1 and P2 as duplicate entries; F1, F2 in both reciprocal forms each as single entries) were randomly assigned to the subplots within main plots. Each subplot consisted of five plants.

Analyses of variance (ANOVA) were performed with the parental and hybrid generations (F1, F2) separately, owing to significant heterogeneity of errors for most traits. Statistical analyses were performed by PLABSTAT (Utz, 1999). The following contrasts were tested with appropriate two-tailed t-tests (Snedecor and Cochran, 1980):

absolute MPH: AMPH=\(\overline{F1}\)\(\overline{P}\)

absolute BPH: ABPH=\(\overline{F1}\)Pmax

nonlinearity of inbreeding: NLID=\(\overline{F1}\)+\(\overline{P}\)−2\(\overline{F2}\),

reciprocal effect in F1: Rec (F1)=(P1 × P2)−(P2 × P1)

reciprocal effect in F2: Rec (F2)=(P1 × P2)s− (P2 × P1)s with \(\overline{P}\)=(P1+P2)/2; \(\overline{F1}\)=[(P1 × P2)+(P2 × P1)]/2; \(\overline{F2}\)=[(P1 × P2)s+(P2 × P1)s]/2 and the superior s indicating selfing. In addition, we calculated the relative MPH as MPH=100 × (\(\overline{F1}\)\(\overline{P}\)/\(\overline{P}\) and relative BPH as BPH=100 × (\(\overline{F1}\)Pmax)/Pmax where Pmax refers to the higher performing parent.

Results

In general, the parents of each hybrid differed significantly (P<0.05) from each other for all traits (data not shown). Exceptions were found for parents in C24 × Aa-0 for rosette diameter, biomass yield, seed yield, and 1000-seed weight, in Ws-2 × Col-0 for seed yield and in Col-0 × C24 for all traits related to seed yield. MPH averaged across hybrids was highly significant for all five traits (P<0.01) (Table 1). BPH, however, was not significant for seed yield and number of seeds per plant. Mean MPH was surprisingly high for biomass yield (60.3%) and rosette diameter (49.4%), but below 30% for the other traits. However, a large amount of variation for MPH existed among the five hybrids for all traits. Averaged across hybrids, the test for nonlinearity of inbreeding depression (NLID) in the three generations (\(\overline{P}\), \(\overline{F1}\), \(\overline{F2}\)) was not significant (P<0.05) for all traits (data not shown).

Table 1 Mid-parent value (\(\overline{P}\)=(P1+P2)/2), and performance of generations F1 and F2 averaged over both reciprocal forms (\(\overline{F1}\), \(\overline{F2}\)) as well as relative mid-parent heterosis (MPH) and best-parent heterosis (BPH) for six traits in five hybrids of A. thaliana

For flowering date, the overall mean of the parental generation differed significantly from both the F1 and F2 generations. MPH amounted to 27.5% and BPH to 18.5% (Table 1). In all crosses involving C24 as a parent, the F1 and F2 generations flowered 10–15 days later than \(\overline{P}\) with MPH ranging between 33.8 and 58.9%. In the other two hybrids (Ws-2 × Col-0, Col-0 × Wei-0), MPH was not significant.

For rosette diameter, \(\overline{F1}\) significantly (P<0.05) exceeded \(\overline{P}\) with intermediate values for \(\overline{F2}\) except for hybrid Ws-2 × Col-0 (Table 1). Four hybrids showed highly significant (P<0.01) MPH, ranging from 14.5% in Col-0 × Wei-0 to 84.2% in C24 × Ws-2. Three of these hybrids showed also highly significant BPH values. For biomass yield, the means of all three generations (\(\overline{P}\), \(\overline{F1}\), \(\overline{F2}\)) differed significantly (P<0.05). Apart from hybrid Ws-2 × Col-0, MPH was significantly (P<0.05) greater than zero and ranged between 41.3 and 140.1%. BPH was significant for hybrids C24 × Aa-0 (121.6%) and Col-0 × C24 (41.4%).

For seed yield, \(\overline{F1}\) exceeded \(\overline{P}\) in hybrids Ws-2 × Col-0, Col-0 × C24, and Col-0 × Wei-0. MPH values averaged 18.9% except for a higher value (65.7%) in hybrid Col-0 × Wei-0. For 1000-seed weight, all hybrids involving Col-0 as parent showed significantly (P<0.05) smaller values for \(\overline{F1}\) than \(\overline{P}\) with \(\overline{F2}\) being intermediate. However, the other hybrids deviated from this pattern as reflected by the varying size of MPH, which averaged −4.7%. For number of seeds per plant, MPH was highly significant (P<0.01) only in hybrids Col-0 × C24 and Ws-2 × Col-0 and averaged 29.4%. For seed-yield-related traits, only hybrid Col-0 × Wei-0 showed a significant positive BPH value.

The linear regression of MPH on GD values of individual hybrids was not significant (P<0.05) for all traits with R2 values below 62.7% (Figure 1). Similar results were obtained for the regression of BPH on GD values (data not shown). Significant (P<0.05) reciprocal differences were found only for biomass and seed yield (Table 2). Reciprocal effects were generally not consistent across generations except for hybrid C24 × Ws-2, where significant reciprocal differences with identical sign were found in the F1 and F2 generations.

Figure 1
figure 1

Graphical representation of linear regression of AMPH on GD (based on CAPS marker data) for six traits in five hybrids of A. thaliana: C24 × Aa-0 (GD=0.32), C24 × Ws-2 (GD=0.46), Ws-2 × Col-0 (GD=0.50), Col-0 × C24 (GD=0.53), and Col-0 × Wei-0 (GD=0.57).

Table 2 Comparison between the two reciprocal forms in generations F1 and F2 for six traits in five hybrids of A. thaliana

Discussion

Heterosis for different traits

In a breeding context, heterosis is often measured as BPH, which directly reflects the superiority of hybrids over comparable line cultivars. However, with regard to quantitative-genetic theory (Falconer and Mackay, 1996), MPH has several advantages over BPH, for example its expected quadratic relationship to the parental GD under a simple genetic model. This was the main reason for focussing our discussion on MPH.

In general, the amount of heterosis is much smaller in autogamous than in allogamous crops (Becker, 1993; Melchinger and Gumber, 1998). In autogamous crops, a maximum MPH for seed yield of 17% for rice (Virmani, 1996) and 10% for wheat (Martin et al, 1995) was reported. Since Arabidopsis is an autogamous species with an average outcrossing rate of about 0.3% (Abbott and Gomes, 1989), we expected only a low amount of heterosis in contrast to the results observed in our study. One explanation for the high MPH in Arabidopsis could be the testing of individual plants under wide spacing, whereas field trials with crops are usually grown under high planting density and considerable interplant competition.

For flowering date, we observed a moderate positive MPH (27.5%) while Westerman (1971) found no MPH (0.3%). This is in contrast to rice and maize, where hybrids flower earlier than their parents. Although flowering date itself does not display a high MPH, it should be recorded in heterosis studies because it can affect other traits such as biomass and seed yield by altering the time span available for vegetative and generative growth. Pigliucci and Hayden (2001) found such a correlation between duration of vegetative phase and rosette diameter in crosses between Ler-0 and four Arabidopsis ecotypes. However, delay of flowering date in hybrids cannot exclusively explain biomass heterosis in Arabidopsis, because Col-0 × Wei-0 displayed substantial biomass heterosis in the absence of significant differences for flowering date among generations P, F1, and F2. In addition, heterosis for flowering date in Col-0 × C24 and C24 × Aa-0 were of similar size, even though the later hybrid showed about twice as much MPH for biomass yield than the former. A more precise estimation of the influence of flowering delay on total biomass accumulation could be obtained either by monitoring growth rates from video-imaging (Leister et al, 1999) or by determining biomass yield before the start of the generative phase.

From an agronomic point of view, biomass and seed yield are the most important traits with regard to heterotic response. Rosette diameter reflects biomass accumulation at an early stage and could serve as a supple-mentary trait for the vegetative part of the life cycle. The heterosis estimates reported in literature were rather low compared to the results in Table 1. For rosette diameter, El Asmi (1975) found an MPH of 18% compared to our 49%. Pigliucci and Hayden (2001) even found smaller rosettes in their hybrids compared to the parents. Westerman (1971) reported 4% MPH for leaf number as a measure for biomass heterosis. Griffing and Langridge (1963) found 23% MPH for fresh weight, compared to our 60% increase for biomass yield. In particular, the hybrids Col-0 × C24 (60%) and C24 × Aa-0 (140%) can be recommended for investigations into the underlying causes of heterosis for biomass yield. In this context, it should also be of interest to include measurements of the root system, which might explain a substantial proportion of heterosis in biomass and seed yield by an increased efficiency of nutrient uptake. More detailed information on the physiological causes of heterosis for biomass yield could be gathered by comparing F1 hybrids and their parents for the rate of cell division, cell size, photosynthesis rate, and the metabolic flux of sugars, proteins, or other metabolites, which have been suggested as an explanation for heterosis (de Vienne et al, 2000).

For seed yield, we found unexpectedly high positive MPH values for two hybrids (20% for hybrid Ws-2 × Col-0 and 65% for hybrid Col-0 × Wei-0). In agreement with our results, Alonso-Blanco et al (1999) reported 24% MPH in the number of seeds per plant in hybrid Ler × Cvi. However, heterosis for this trait can be much higher in suitable parent matches such as hybrid Col-0 × Wei-0. The positive MPH for seed yield is attributable to an increased 1000-seed weight (C24 × Aa-0) or an increased number of seeds per plant (Col-0 × Wei-0 and Ws-2 × Col-0). While factors for yield components together with multiplication effects can explain heterosis for seed yield (Schnell and Cockerham, 1992; Melchinger et al, 1994), experimental procedures for measuring yield components are considerably more cumbersome and prone to errors in Arabidopsis than in small grains or maize.

Hybrid Col-0 × C24 was unusual because of its markedly lower seed yield in combination with a substantial increase in biomass, indicating that increased vigour because of heterosis can be manifested in different ways depending on the particular genomic constitution. Viewed under the microscope, the F1 seeds from this hybrid already had rather heterogeneous size with numerous shrunken or oversized seeds suggesting reduced fertility (S Barth, unpublished results). Thus, unintentional selection cannot be ruled out when investigating the viable plants from this stock. Similar observations were reported for wide crosses between A. thaliana and A. lyrata, where numerous malformed pollen grains were found (Nasrallah et al, 2000). Comparable to hybrid Col-0 × C24 in our study, these researchers found heterosis of biomass for both rosette diameter and root system in combination with reduced fertility.

Our study reveals little information about the genetic effects underlying the observed heterosis. We were not able to separate additive and dominance effects from various types of epistatic effects by a generation means analysis (Mather and Jinks, 1982), because, in addition to the parents, only the F1 and F2 generations were evaluated. However, since the test for NLID was nonsignificant in all instances, we conclude that epistasis was either absent or additive and any dominance type of epistatic effects were of opposite sign and cancelled each other out in the sum.

Reciprocal effects

We found significant reciprocal differences for biomass and seed yield (Table 2), but no consistent pattern was found across hybrids and parents. This inconsistency of our results could be attributable to seed production via emasculated flowers, which deteriorates the viability of the resulting seed stock in comparison to natural self-fertilization. Another cause could be maternal effects (Crane and Nyquist, 1967; Alonso-Blanco et al, 1999; Nasrallah et al, 2000). Since growth of F1 seeds depends on the genetic or nutritional condition of the seed parent, reciprocal effects from this source are plausible, but should differ between F1 and F2 seeds. Reciprocal effects disappearing in F2 were detected in C24 × Aa-0 and Col-0 × C24 in agreement with earlier reports on other hybrids (Corey et al, 1976; Alonso-Blanco et al, 1999). Only for hybrid C24 × Ws-2 did the reciprocal effect persist across both generations and this cannot be attributed to unintentional selection for vigour during sowing and planting. Since reciprocal effects cannot be ruled out entirely in crosses with Arabidopsis, it seems prudent to standardize the growing conditions to the utmost extent and use the same seed parent and production scheme for the materials to be compared. Potential effects from seed production in emasculated flowers could be avoided by use of male sterile seed parents for both parents and hybrids, or by seed multiplication of lines from emasculated flowers.

Genetic distance and heterosis

If positive dominance effects are the primary cause of heterosis, a quadratic relation between heterosis and genetic distance is expected (Falconer and Mackay, 1996). An optimum distance has been postulated, beyond which hybrid performance declines due to a lack of adaptation between divergent genomes (Moll et al, 1965). The negative association between AMPH and GD for flowering date, rosette diameter, biomass yield, and 1000-seed weight was mostly attributable to atypical results for hybrid C24 × Aa-0, which showed a high performance level despite its low parental GD (Figure 1). For a more comprehensive analysis of the correlation between GD and AMPH, a larger number of hybrids with diverse GD values should be evaluated in order to examine whether our results are representative for Arabidopsis in general. Moreover, a tighter relation between GD and AMPH is expected if GD is determined with molecular markers linked to QTLs underlying the trait rather than molecular markers covering the entire genome (Charcosset and Essioux, 1994).

Arabidopsis: a model system to study the molecular causes of heterosis

Since heterosis is a general phenomenon in the entire plant kingdom, a promising approach is to investigate its molecular basis in a model species and verify the hypotheses and results in other species of economic interest. Ideally, the model species should meet the following criteria: (1) sufficient amount of heterosis for traits of agronomic interest, (2) ease of experimental use in large-scale experiments, and (3) well-developed genomic tools. With a proper choice of hybrids and concentration on traits related to vigour, our results demonstrated that Arabidopsis is an excellent tool that matches all these requirements. Its low growth requirements and short life cycle are additional advantages. Obviously, accurate evaluation of the traits and proper statistical analysis require repeated experiments under controlled conditions. In spite of working in a greenhouse, the experimental variation between replicated entries was substantial in our study. A total of 40 plants (four replications × five plants × two reciprocal forms or duplicate entries) per experimental unit seems to be at the lower end of the scale for accurate measurement of quantitative traits. Hence, increasing the number of replications and plants per entry is highly recommended for future experiments.

Investigation of heterosis could be pursued by QTL analyses in large populations, expression studies, or by metabolite monitoring. Available high-density maps and a possibility of generating any desired number of molecular markers from the full genomic sequence of ecotype Col-0 (The Arabidopsis Genome Initiative, 2000) make Arabidopsis at this time the most suitable system for this task. Numerous single nucleotide polymorphism (SNP) markers were developed (Cho et al, 1999), which can be utilized for dissection of QTLs affecting heterosis. In crops, maize would also be very attractive for heterosis studies because of its large heterotic response and the well-established tools in genetics and genomics. However, comparable experiments with maize would have excessive demands for field trials. Therefore, the resources for a research initiative on the molecular basis of heterosis in plants are put to best use, if results gathered in experiments with Arabidopsis are corroborated for agronomically important crops such as maize.