Introduction

Pathogens are ubiquitous and often severely reduce host fitness. As a result, they can have important effects on plant ecology, such as altering community level diversity, or affecting the evolution of host morphology and life-history traits (Clay and Van der Putten, 1999). By affecting fitness-related traits, pathogens also often have important economical effects on crops by reducing growth and yield (Wolfe, 2000). The coevolutionary dynamics between host and pathogen depends on the genetic architecture of resistance and virulence, as well as the selective value of different resistance-related traits (Kover and Caicedo 2001). Thus, understanding the underlying genetic architecture of plant–pathogen interactions can provide important insights into ecological, evolutionary and agricultural systems.

Much progress in understanding the genetic basis of disease resistance in plants comes from studies on the model system Arabidopsis thaliana (Kunkel et al, 1993; Glazebrook, 1999). Studies of the genetic basis of simple forms of disease resistance in A. thaliana have identified single Mendelian genes of large effect, known as resistance (R) genes (Glazebrook, 1999; Dangl and Jones, 2001). These genes govern a plant's ability to recognize pathogens expressing specific elicitors (the product of Avr genes), resulting in the activation of plant defense responses that serve to limit pathogen growth and disease development (Whalen et al, 1991; Glazebrook et al, 1997).

Disease resistance mediated by R genes (also called ‘gene-for-gene resistance’) behaves as a qualitative trait, where the presence or absence of an R gene differentiates between a resistant and a susceptible plant. Thus, disease resistance in A. thaliana is usually considered a qualitative trait. However, closer examination of A. thaliana's response to virulent strains shows a continuous variation in disease susceptibility (Whalen et al, 1991; Kover and Schaal, 2002). For example, the accession Nossen (No-0) shows fewer symptoms and less pathogen growth than the accession Columbia (Col-0) when challenged with a virulent strain of the bacterial pathogen Pseudomonas syringae (pv. tomato strain PstDC3000) for which no R gene has been identified (Chen et al, 2000). Although the genetic basis of R gene-mediated resistance has been extensively studied at the genetic and molecular level, the genetic basis of the quantitative variation in disease susceptibility is only beginning to be investigated (Glazebrook et al, 1996; Kloek et al, 2001). In addition to furthering our understanding of the defense mechanisms of A. thaliana, studying the genetic basis of quantitative variation in susceptibility to pathogens can provide important insights into the evolutionary interaction between plants and pathogens in natural populations, where variation in susceptibility is often quantitative (Lande and Arnold, 1983; Alexander, 1992).

The genetic basis of quantitative variation is typically polygenic. Thus, analysis of the genetic basis for variation in the response of plants to pathogens can be addressed using quantitative trait loci (QTL) analysis, which allows for the simultaneous identification of multiple genomic regions responsible for quantitative variation in trait expression (Lander and Botstein, 1989). The interaction between A. thaliana and the bacterial pathogen P. syringae is an ideal system to investigate the genetic basis of quantitative variation in susceptibility using QTL analysis. Natural populations of A. thaliana in the US have been reported to be infected with P. syringae and P. viridiflava (Jakob et al, 2002), and quantitative variation in disease susceptibility has been previously observed in this system (Kover and Schaal, 2002). In addition, the genome of A. thaliana has been completely sequenced, making a large number of molecular markers available (a continuously updated list is available online at The Arabidopsis Information Resource (TAIR), www.arabidopsis.org). Furthermore, this system also presents the opportunity to measure different components of plant response to pathogens, allowing the simultaneous investigation of different components of their interaction for a more complete analysis of susceptibility.

Here, we use QTL analysis to investigate the genetic basis of quantitative variation in disease susceptibility in A. thaliana. The experimental population chosen for the analysis is the F2 generation from the intercross between No-0 and Col-0 lines. These parental lines show a significant difference in susceptibility to P. syringae (Chen et al, 2000; Kover and Schaal, 2002). We searched for QTL affecting three different susceptibility-related traits: (1) the extent of chlorosis and necrotic lesions and patches in infected leaves, which allows for the identification of genetic factors that reduce the severity of disease symptoms; (2) the number of bacterial cells per cm2 of host leaf tissue, which allows for the identification of genetic factors that may reduce pathogen entrance and/or bacterial growth and (3) the number of fruits produced by each plant, which measures variation in plant fitness (yield) under infection. Since infection by P. syringae reduces A. thaliana fitness (Kover and Schaal, 2002), it is expected that susceptibility would be negatively associated with fitness, and therefore, be under selection. However, the costs of being resistant (Bergelson and Purrington, 1996; Tian et al, 2003), as well as those of tolerance traits (Stowe et al, 2000; Kover and Schaal, 2002) can breakdown the negative relationship between susceptibility and fitness. Thus, we also investigated the relationship between different components of susceptibility and fitness using a selection analysis (Lande and Arnold 1983). Specifically, we address the following questions: (1) can we identify genetic factors (QTL) that underlie the differences in susceptibility between the two A. thaliana lines; (2) what is the genetic architecture of these factors? Does dominance and epistasis affect the expression of susceptibility?; (3) are there likely candidate genes for the identified QTL and (4) what is the relationship between susceptibility traits and fitness?

Material and methods

The F2 progeny used in this study were produced by initially crossing emasculated Col-0 rps2-201C flowers with pollen from No-0 rps2-201C flowers. F1 plants from this cross, confirmed to be heterozygous using genetic markers, were self-pollinated to produce F2 seeds. All F2 seeds used in the QTL mapping were collected from a single F1 individual. We chose to use a F2 design (instead of the commonly used sets of recombinant inbred lines) because it allows for the investigation of the role of dominance. Both parental lines used in the analysis carry mutations at the RPS2 disease resistance locus for the purpose of a different experiment. The isolation of the Col-0 rps2-201C mutant is described in Kunkel et al (1993), and the generation of the No-0 rps2-201C line is described in Chen et al (2000). RPS2 is an R gene capable of recognizing pathogens carrying the avirulence gene AvrRPS2. Plants in our experiment were challenged with the bacteria P. syringae pv. tomato strain DC3000 (PstDC3000), which does not carry AvrRPS2. Thus, it is unlikely that the presence of the rps2 mutation in these lines is important in the context of this study. For simplicity, the parental lines will be from now on referred only as ‘Col-0’ and ‘No-0’.

Seeds from the F2 sibship were planted in six trays containing 50 individual cells, for a total of 276 plants. For comparison, we also planted the original parental stocks (No-0 rps2-201C and Col-0 rps2-201C) in two cells of each tray. Seeds were cold-treated for 3 days at −4°C, and later germinated and grown under constant environment in a growth chamber kept at 25°C and 12 h of light/day. Tray position in the growth chamber was randomly assigned and rotated every week to minimize the effect of microenvironmental variation.

Pathogen inoculation and disease assays

Plants, 4-week old, were inoculated by inverting the trays and submerging entire plant rosettes in a bacterial solution containing 10 mM MgCl2, 0.02% L-77 Silwet and 108 cells/ml of PstDC3000 as described in Whalen et al (1991). Trays were covered with a plastic dome for the first 24 h to maintain high humidity. At 3 days after inoculation, four disks of leaf tissue were collected from each plant with a cork borer (0.25 cm2 area) to determine the number of bacterial cells present per cm2 of leaf tissue. Disks were collected randomly from different leaves before any symptoms were visible. The disks were ground in 10 mM MgCl2 solution and plated after appropriate dilutions were made on NYG agar plates (containing 1 mg/ml of Rifampicin). The number of colony-forming units (CFUs) per plate was counted 24 h later. To determine the severity of disease symptoms, each plant was visually inspected for disease symptoms 5 days after inoculation. Symptoms were scored on a predefined scale that takes into consideration the presence and extent of chlorosis and necrotic disease lesions. The scale ranges from 1 (no signs of disease symptoms) to 5 (extensive chlorosis and patches of necrotic lesions). To insure scoring accuracy, two different people independently scored each plant. Since the two scores were tightly correlated (r2=0.87), the mean between the two scores were used in the analysis.

Since A. thaliana is an annual plant, the effect of pathogen infection on lifetime fitness can be estimated by total seed production upon senescence. Thus, after plants were scored for disease symptoms, day length was gradually increased to 16 h to induce flowering. Plants were kept in the growth chamber until senescence, when total fitness was estimated by counting total fruit production. The number of fruits produced by a plant has been previously shown to correlate tightly with total seed production (Mauricio and Rausher, 1997). To determine the role of life-history traits on fitness, we measured rosette size one day before inoculation, and recorded flowering time. Plants were checked every day for bolting. Flowering time corresponds to the first day a flowering bud was observed.

Phenotypic analysis

For each of the five traits measured, we calculated the average and standard deviation within the parental lines and F2 progeny. To test whether parental lines significantly differ for these traits, we used univariate t-tests. Transgressive segregation was determined by evaluating whether any F2 progeny had trait values that were more extreme than the most extreme parental phenotype plus (or minus) 2 standard deviations. To test for correlation between traits in the F2 progeny, we calculated pairwise Pearson's correlations between all traits measured.

To investigate the potential selective value of reduced susceptibility, we performed a selection analysis following the regression method proposed by Lande and Arnold (1983). Since this experiment was performed in a growth chamber, the observed pattern of selection will not necessarily reflect patterns of natural selection. However, it can still provide valuable information about the relationship between susceptibility and fitness. To evaluate the total effect of each trait on fitness, we calculated the univariate regression of the standardized symptom scores, size of bacterial population, rosette size and flowering time on relative fitness. This is equivalent to calculating the standardized selection differentials (ie, the total covariance of a trait with fitness). To obtain the relative fitness of each individual, we divided the number of fruits produced by each F2 individual by the average number of fruits produced in the F2 progeny. To evaluate the direct effect of the same traits, independent of correlations between traits, we calculated the partial regression coefficients of standardized traits on relative fitness using a multiple regression (the standardized selection gradients).

QTL analysis

One large leaf from each F2 plant was collected shortly after the onset of flowering, and DNA was extracted using the Dneasyâ„¢ kit from Quiagen. DNA from the two parental lines was screened with a 100 different markers to identify markers that were polymorphic between parents. The DNA from each F2 individual was genotyped with 28 microsatellite and five CAPS (cleaved amplified polymorphic sequences) markers (see Figure 1). All primers used to genotype these markers are described at TAIR (www.arabidopsis.org). Conditions for PCR amplifications generally followed the Bell and Ecker (1994) protocol, but annealing temperature was individually optimized for each primer pair (specific annealing temperatures used are available upon request).

Figure 1
figure 1

Recombination map for the cross between No-0 and Col-0 lines showing markers used in QTL analysis. Recombination distances were determined using Mapmaker.

The genetic map for the F2 cross was generated using MAPMAKER 3.0b (Lander et al, 1987). Since the A. thaliana genome has been completely sequenced, the order of markers and their chromosome assignment (according to the information present in www.arabidopsis.org) was imposed in the analysis. Recombination distances between markers were estimated using the Kosambi function.

QTL analysis was performed using QTL Cartographer software version 2.0 (Basten et al, 1994). We initially performed an interval mapping with tests being performed every 2 cM. Statistical significance for the LOD scores obtained in the QTL analysis was determined individually for each trait by performing 10 000 permutations using the method developed by Churchill and Doerge (1994). Composite interval mapping (CIM) was used to further analyze traits for which a putative QTL was identified using interval mapping. CIM can improve QTL results by partitioning out the contribution of QTL of large effect and of linkage disequilibrium. However, the results from composite interval mapping have to be interpreted carefully since the outcome of the analysis depends on the covariate markers chosen to do the analysis (Broman, 2001). To insure that our results were not dependent on one particular model, we ran CIM models with different number of covariate markers (1–5) and window size (5–10 cM) for each trait. Potential marker covariates were identified using a stepwise forward and backward regression, which ranks the five markers that explain the highest proportion of the variation. Confidence intervals for QTL position were established as the region around the peak location where there is one order of magnitude (1 LOD) change in the probability of a false-positive (Lander and Botstein, 1989).

We searched for pairwise epistatic interactions between all genotyped markers using canonical correlations analysis in SAS as described in Cheverud and Routman (1995) and Routman and Cheverud (1997). While the QTL cartographer software does allow for tests of epistatic effects, it only performs tests on locations previously determined to have a main effect. Since genetic factors without a main effect can have significant epistatic effects, we chose to perform the complete analysis for all pairs of scored markers located on different chromosomes using SAS. Pairs of loci that showed significant interaction (P<0.05) were further analyzed with a full regression model where phenotypic values were regressed on additive, dominant, additive by additive (A × A), additive by dominant (A × D), dominant by additive (D × A) and dominant by dominant (D × D) genotypic effects. Estimates of all effects and their significance were calculated using the method described in Cheverud and Routman (1995) and Cheverud et al (1997). The significance threshold used to evaluate the significance of epistatic effects was determined by performing a Bonferroni correction for the number of independent comparisons given the covariation between markers (as described in Vaughn et al, 1999 and Cheverud, 2001).

Total phenotypic variance explained by QTL and epistatic interaction detected in our study was determined by fitting the markers closest to each significant QTL in a regression model. Calculations were performed using PROC REG in SAS.

Two approaches were used to screen candidate genes for QTL identified in this analysis. We scanned the annotated sequence of A. thaliana (available at TAIR, Garcia-Hernandez et al, 2002) within the confidence interval of each QTL for genes with known or predicted roles in interactions with pathogens. The equivalence between the available physical map for Arabidopsis and our recombination map was made using the genetic markers with known positions in the physical map. We also compared the position of genes known to be important in modulating pathogen growth or symptom development with the positions of observed QTL. Genes considered in the latter approach are ones known to participate in the salicylic acid (SA), ethylene, jasmonic acid synthesis and signalling pathways (Kunkel and Brooks, 2002). We also considered genes involved in cell death and overall disease susceptibility (Greenberg and Ausubel, 1993; Glazebrook et al, 2003).

Results

Trait distribution and correlation

All five traits measured differed significantly between the parental types (Table 1). Col-0 has a significantly smaller rosette, develops more severe disease symptoms when infected with PstDC3000, harbors a larger bacterial population in its leaves, flowers later and produces fewer fruits than the No-0 parent (Table 1). All five traits vary continuously among the F2 progeny (Figure 2). Transgressive segregation (ie progeny with phenotypes more extreme than either parent) was observed for rosette size, flowering time and fruit production (Figure 2). Transgressive segregation can be due to parental lines having a mixture of alleles that increase and decrease the traits, or the presence of epistasis. It may also be caused by genotype by environment interaction variation. However, this is less likely to explain our results considering that the transgressive F2 phenotypes are more extreme than the most extreme parental phenotype plus (or minus) 2 standard deviations.

Table 1 Mean and standard deviation of measured traits in the two parental lines, Col-0 and No-0, and the significance of a t-test comparison of parental means
Figure 2
figure 2

Frequency distribution of traits measured (for details, see text). White bars indicate distribution for No-0 parents; hatched pattern indicates distribution for Col-0 parents and black bars indicate the distribution on F2 progeny.

There were significant correlations among some of the traits in the F2 (see Table 2). Disease symptom severity and bacterial growth were positively correlated with each other (P=0.0001), but neither was significantly correlated with fruit production. Among life-history traits, we observed a significant negative correlation between rosette size and number of days to flower (P=0.00001). Both of these traits were correlated with fruit production. However, rosette size was positively correlated (P=0.0001), while flowering time was negatively correlated (P=0.0001) with fruit production.

Table 2 Pearson's correlation between all traits measured on the F2 progeny

QTL associated with symptom severity

The recombination map obtained for this cross is shown in Figure 1 (F2 genotypes are available upon request). This map spans 480 cM and encompasses 97 of the 125 MB genome of A. thaliana (by comparison to the physical map available at TAIR). The recombination distances observed between markers are comparable to the distances reported between the same markers in other mapping studies (eg Lister and Dean, 1993).

To determine the number and position of QTL that affect symptom severity, we ran three different CIM models. The same three locations were significant at the empirically determined threshold value corresponding to P=0.05 (LOD=2.12) whether we used one, two or five markers as covariates. The main difference was that the window size used in the model had a slight affect on the position, and significance of the two peaks identified at the top of chromosome 1. In Figure 3, we show the LOD distribution for the CIM model with five covariate factors and a window size of 4.9 cM. We present the model with a small window because it separates the effects of the markers Nga63 and NCC1, which are 5 cM apart. Although two independent peaks can still be identified with a window of 4.9 cM, the multiple regression analysis suggests that most of the variation associated with NCC1 is due to a QTL nearer to Nga63. Thus, we will consider that there is a single QTL in the top of chromosome I, located at 2 cM.

Figure 3
figure 3

LOD distribution for the four traits studied. The five chromosomes are presented in order (c1–c5). Marker positions in each chromosome are indicated by arrowheads in the X-axis. Horizontal line indicates the significance LOD threshold value calculated using permutations (see text).

Table 3 shows the position, effect and percentage of phenotypic variation (R2) explained by each of the two identified QTL. Negative additive effects indicate that the allele from the No-0 parent reduces the trait value. Thus, in both QTL for symptom severity, the allele from the less susceptible parent (No-0) reduces the symptom score. The negative dominance value of the putative QTL close to Nga63 indicates that the No-0 allele is partially dominant over the Col-0 allele.

Table 3 QTL positions identified by composite interval mapping (for more details, see text)

Of the 418 possible pairwise interactions tested, 13 were significant at P<0.05. Only one was significant at P<0.0002 (the appropriate threshold with the Bonferroni correction for multiple epistasis tests): the interaction between markers Nga392 located on chromosome I and Nga76 located on chromosome V (P=0.00002). This interaction is mostly due to D × D effects (P=0.00002). A full model with the two QTL with a main effect and the significant epistatic interaction explains 23.2% of the phenotypic variance in score symptoms.

No genes with known or predicted roles in pathogen response occur within the confidence interval for the QTL identified on chromosome I. However, this interval contains a putative protein with the typical characteristics of a disease-resistance protein of the CC-NBS-LRR class (AT1G10920). Other proteins of this class include disease-resistance genes such as RPS2, RPS5 and RPM1, which are involved in the recognition of pathogens (Dangl and Jones (2001), but see Meyers et al, 2003). The QTL on chromosome III coincides with the position of two genes of known function in plant/pathogen interactions: ACD1 (accelerated cell death 1) and EDS1 (enhanced disease susceptibility 1). Within the confidence interval for the QTL on chromosome III, there are also three other putative proteins with characteristics of disease-resistance proteins (AT350950, AT351560 and AT351570).

QTL associated with bacterial population size

No significant association (LOD threshold=2.16) between bacterial population size and marker genotypes was observed with single marker regression or IM (Figure 3). However, at the top of the chromosome II (21.6 cM), there is a peak that is nearly significant (LOD=2.12). This possible location for a QTL affecting bacterial growth does not match any of the putative locations for loci affecting disease symptom severity.

Although 35 pairwise interactions were significant at P<0.05, none of them were significant at P<0.00021. By chance alone, we would expect to find 21 significant interactions among the 418 tested. Thus, it is possible that some of the 35 epistatic interactions are really affecting bacterial growth, but we cannot determine which of these are true interactions and which are false positives simply due to the large number of tests performed.

QTL associated with fruit production (fitness)

The results for fruit production are qualitatively the same whether we use IM or CIM. Figure 3 shows the LOD distribution for the CIM model with five covariates and 10 cM window size, which gives a slightly sharper peak for the QTL on chromosome 3. The two QTL significant at the empirically determined threshold value (LOD= 2.15) are presented in Table 3. Positive additive effects indicate that the allele from No-0 increases fruit production. The opposite signs for the additive effects at the two putative QTL indicates that No-0 alleles increase fruit production in one QTL, but reduces in the other. This mixture of allelic effects within parental line may explain the observed transgressive segregation for fruit production. The negative value of the dominance effect indicates that the Col-0 allele is dominant over the No-0 allele in the QTL closest to ABI3. The location of these markers does not coincide with any of the locations identified for symptoms or bacterial growth.

Of the 418 pairwise interactions tested, 13 were significant at P<0.05 and two were significant at P<0.0002. Both, AthGAPAb and ABI3 on chromosome III had significant interactions with AthSO262 on chromosome V, with the interaction being mostly due to D × D effects. Since AthGAPAb and ABI3 are 4.7 cM apart, it is likely that both interactions are due to a common QTL located between the two markers. The interacting QTL is likely to be closer to AthGAPAb because the interaction between AthGAPAb has a higher F (F=6.42, P=0.00008). A full regression model with the interaction between AthGAPAb and AthSO262 plus the two QTL with main effects explains 17.2% of the total phenotypic variance in fruit production.

Selection analysis

The multiple regression analysis indicates that all four measured traits: rosette size, symptom severity, bacterial growth and flowering time have significant effects (as measured by partial regression coefficients) on relative fitness (Table 4). This means that each individual trait has a direct effect on relative fitness when all other traits are kept constant. This analysis shows that, under our experimental conditions, infected plants are under selection for larger rosette size, earlier flowering time, smaller bacterial population sizes and higher symptom scores. The R2 for this model was 0.238, indicating that there is still a large portion of the variance in fitness unexplained. However, this result is not unexpected since so many aspects of plant growth and development can affect lifetime fruit production.

Table 4 Selection analysis for susceptibility and life-history traits

The univariate regression coefficients indicate that only rosette size and flowering time has an overall correlation with relative fitness (Table 4). This means that only these two traits would be expected to evolve in this population, since the other traits show no net correlation with fitness. The discrepancy between the multivariate and univariate analyses results from the fact that bacterial population size and symptom score have partial selection gradients of similar magnitude and opposite signs while being positively correlated at the phenotypic level. Thus, the independent effect of selection on these two traits cancels out when considering the net force of selection acting on either trait in the univariate model. This may imply some sort of fitness trade-off between these traits.

Discussion

A. thaliana has been an important model for the identification of the genetic basis of disease resistance in plants. As a consequence, many genes that affect plant response to pathogens have been identified in A. thaliana (Glazebrook et al, 1996; Glazebrook, 1999). However, most of these studies have concentrated on genes and pathways that affect the qualitative response of A. thaliana to pathogens; that is, genes that determines whether a plant is resistant or susceptible. In contrast, we know very little about genes that affect quantitative variation in disease susceptibility. Quantitative variation in disease susceptibility is a common form of variation in response to pathogens, where most plants will be susceptible to infection but to different degrees (Geiger and Heun, 1989; Wyss and Muller-Scharer, 1999; Kover and Caicedo, 2001). Although QTL studies are now commonly used in A. thaliana to investigate the genetic basis of the natural variation of quantitative traits such as bolting time, tolerance to salinity and floral morphology (Alonso-Blanco et al, 1998; Ungerer et al, 2002; Quesada et al, 2002), this is the first QTL study on A. thaliana responses to infection by P. syringae. Only two studies have investigated the genetic architecture of disease resistance in A. thaliana (Schiff et al, 2001; Wilson et al, 2001), both of which investigated QTL for resistance to powdery mildew.

Our study revealed two QTL for symptom severity and two QTL for fruit production segregating between accessions No-0 and Col-0 under conditions of P. syringe infection. We also found that epistatic interactions significantly affect the expression of these traits. This result is similar to the results of other QTL studies in A. thaliana where a moderate number of QTL per trait are seen and epistasis is common (Alonso-Blanco et al, 1998; Mitchell-Olds and Pedersen, 1998; Juenger et al, 2000). Despite the fact that bacterial growth is highly variable among ecotypes (Kover and Schaal, 2002) and is significantly different among the parental lines used in this cross (which show about a 2 standard deviation difference in mean bacterial population size), we did not find any significant QTL affecting bacterial growth. This could be due to the fact that we did not have polymorphic markers in regions where QTL for bacterial growth are located (eg middle of chromosome IV), or because there are no QTL of large enough effect to be detected. It is also possible that there is a large number of QTL of small effect, which would require a denser map and a larger sample size to detect. It is important to remember that the statistical power to detect QTL is a function of sample size, number of markers and the magnitude of the QTL effects (Beavis, 1994). Thus, considering the size of our F2 progeny and the limited number of markers on chromosomes II and IV, the observed number of QTL in this study is probably an underestimate of the total number of QTL that affect susceptibility in this cross.

Several genes that affect plant response to pathogens, such as R genes that mediate specific recognition of P. syringae strains (Glazebrook et al, 1997), and genes in the SA, ethylene and jasmonic acid pathways (Glazebrook, 1999; Glazebrook et al, 2003) were considered as candidate genes for the QTL observed in our study. Some of the QTL for symptom severity map close to candidate genes in these categories. EDS1, for example, is a particularly interesting candidate gene for the QTL for symptom severity identified on chromosome III because plants carrying a mutant EDS1 gene develop more severe symptoms under infection by P. syringae than wild-type plants (Glazebrook et al, 1996). However, any positional candidate gene at this point should be regarded only as hypothetical. The average 1 LOD confidence interval in this study was 19.84 cM, which is roughly equivalent to 4600 kb (assuming 233 kb per cM) and can encompass hundreds of genes (approximately 1050 genes, considering 4.4 kb/gene). Thus, the possible role of the identified candidate genes in quantitative disease resistance should be further tested through association studies (Long et al, 1998), complementation approaches (Mackay and Fry, 1996) or using near isogenic lines (Doebley et al, 1995).

Although genes that specifically determine fruit production have not been identified, some genes that affect root architecture (AXR 4), circadian rhythm (CCA1) and photo perception (PHYB) have been shown to also affect plant fitness (Ballare and Scopel, 1997; Gilliland et al, 2002; Green et al, 2002) and can be considered potential candidate genes. In reality though, there are too many genes and pathways that can conceivably affect traits as complex as fruit production. Thus we did not identify a priori candidate genes specific for fruit production, and did not attempt to perform an exhaustive search for candidate genes for the fruit production QTL. Since infection with PstDC3000 affects fruit production (Kover and Schaal, 2002), it is possible that QTL identified in this study for fruit production are quite different from QTL that determine total number of fruits produced by an uninfected plant. It is interesting though that the QTL on chromosome V for total number of fruits is in a similar location as a QTL found in a set of uninfected recombinant inbred lines between accessions Cvi and Ler (Ungerer et al, 2002). None of the QTL locations for fruit production identified in this study were close to a gene of known function in disease resistance.

Disease susceptibility in plants is empirically evaluated in many different ways. In natural populations susceptibility is often estimated by symptom severity, while the susceptibility of crop varieties is usually evaluated by their relative yield. In many experimental systems, where it is possible to measure pathogen titer, susceptibility is measured by pathogen growth. These different estimates of plant susceptibility are equivalent if disease symptoms are a direct consequence of pathogen growth, and if fitness loss to pathogens is directly correlated with disease symptoms. While such relationship is commonly assumed in evolutionary models (eg in models of the evolution of virulence in pathogens), we still have very little information about the relationship between these different components of host response to infection at the genetic and molecular level. Previous studies have used simultaneous mapping to test for pleiotropic effects on correlated traits and possible selection trade-offs (eg Ungerer et al, 2002). Although testing for pleiotropy is notoriously difficult (Cheverud et al, 1997; Knott and Haley, 2000), all QTL identified in this study were in such distinct regions of the genome that we can rule out pleiotropy. This result suggests that symptom severity, bacterial growth and fruit production in infected plants are under independent genetic control. However, given the power limitations of QTL analysis, there are alternative interpretations for this result. For example, it is possible that two traits share many QTL in common, but that all QTL that they share have small enough effects that they are not detectable. Still, the results of the regression analysis further support the idea that the susceptibility traits studied here are not simply different measures of a single trait, since symptom severity and bacterial growth had partial regression coefficients on relative fitness with opposite signs. Thus, although our study cannot provide a final answer to whether these traits are genetically independent, it does suggest that future studies should pursue the nature of the relationship between these traits.

The consequences of pathogen infection for trait evolution in plants depend on the genetic architecture of disease-related traits and the relationship between those traits and fitness. Here, we investigated the relationship of symptom expression and bacterial growth with fitness using a regression analysis as suggested by Lande and Arnold (1983). This analysis estimates both the independent (ie, partial) effect of traits on relative fitness and their overall association with relative fitness. In the analysis of the independent effects of traits on fitness, partial regression coefficients indicate the change in relative fitness associated with a standard deviation change in the value of a trait, holding the values of all other traits in the model constant. The partial regression analysis showed that all four traits had a significant relationship with fitness. The signs of the regression coefficients indicate that, given our experimental conditions, infected plants are under selection to have larger rosettes, flower earlier, reduce bacterial growth and increase symptom severity. The direction and magnitude of selection on rosette size and flowering time is similar to what has been reported in other studies (reviewed in Pigliucci, 2003). Selection for less bacterial growth is not surprising considering that infection by P. syringae has been shown to reduce fitness in A. thaliana in other studies (Kover and Schaal, 2002; Korves and Bergelson, 2003). However, the positive relationship between symptom severity and fitness is very surprising. It indicates that all else being equal, plants with more severe symptoms produced more seeds. While such a positive association between symptom severity and fitness is nonintuitive, it could result from a real biological processes, such as ‘overcompensation’. Overcompensation is a phenomenon originally described for interactions between plant and herbivores, where plants that are damaged by herbivores show higher seed production than nondamaged plants (reviewed in Stowe et al, 2000). While the mechanism behind overcompensation is not understood, it is quite possible that it also occurs in plant–pathogen interactions. We also cannot rule out the possibility that the observed positive association between symptom and fitness is caused by selection on a trait not included in our analysis, which is correlated with both fitness and symptom severity. In any case, the relationship between disease symptoms and fitness clearly needs further investigation. Should this pattern prove to be consistent, it may provide valuable insights into the evolution of the phenotypic manifestation of disease.

Univariate regression coefficients were used to estimate overall relationship between traits and relative fitness. These overall associations result from both the direct effects that traits have on fitness and the indirect effects caused by selection acting on correlated traits. The separate analyses of direct effects and overall associations of traits with fitness is important because some traits can have a direct effect on fitness, but no overall association with relative fitness due to correlations with other traits. We find such a case in this analysis, where bacterial growth and symptom severity show a positive phenotypic correlation and are under selection of similar magnitude but in opposite directions. The net result of this antagonistic relationship is that, although we see significant partial regression coefficients for these traits, we see no overall association between either of these susceptibility traits and fitness. This result suggests that symptom severity and bacterial growth would show no evolutionary change, despite both having independent effects on fitness, because their antagonistic relationship results in no net selection on either trait. As with any other selection study, the observed patterns of selection are dependent on the environmental conditions experienced by the plants in the growth chamber and only reflect plant response to infection at a specific developmental stage. Although no phenotypic correlation was observed between either rosette size or flowering time and the two susceptibility-related traits in our study, other studies have shown that infected plants tend to flower earlier than uninfected plants (Peters, 1999; Korves and Bergelson, 2003). Since we found that selection favors earlier flowering in infected plants, it is possible that selection on tolerance-related traits (ie traits that alter the effect of infection on fitness) are more important than resistance-related traits (ie traits that affect the degree to which plants become infected).