Introduction

In genetic analysis of quantitative characters, multivariate analysis conventionally means the partitioning of the phenotypic variance into several components due to the effects of general and specific genetic and environmental factors and their linear or nonlinear interactions (Mather, 1949). Dimension is determined by the number of the factors, but, conventionally, just one character at the output is ultimately considered. Even if a number of characters are analysed in the experiment, each is treated separately.

Views on what constitutes genetic multivariate analysis have changed notably in the past 25 years. After introduction of the additive genetic variance–covariance matrix G (Lande, 1979), it has become feasible to estimate the narrow-sense heritability of any linear combination of characters, including the principal components of the matrices G and P (Atchley et al, 1981). Searches for composite characters with maximum additive heritability, based on decomposition of the matrix GP−1, have recently been proposed and performed (Ott and Rabinowitz, 1999; Klingenberg and Leamy, 2001).

We suggest an alternative method based on analysis of the relative position of the parents and the first two generations of hybrids in a multidimensional space, with the axes corresponding to the measured traits. The orientation of these three generations can be used to detect the directions of variability due to heterozygosity, additive and epistatic gene effects.

The proposed method appears promising for breeding. It can identify composite characters with high narrow-sense heritability, which should respond to selection. It may also indicate characters that might be informative in studies of natural populations.

Model

It is well known that phenotypic variability in the F1 hybrids from two true breeding lines is nonheritable. It follows that the variability due to segregation of the set of genes that differ between the parents is observed starting from F2. Let there be two parental inbred strains P1, P2, and the F1 hybrids obtained by crossing the parental strains whose M characters are measured. In the simplest additive-dominant model with no interactions between loci, the mean values for each character in the F1 are x i F 1 = m i + h i , where m i = ( x i P 1 + x i P 2 ) / 2 is the mean of the parents, and hi is the deviation due to dominance (Mather and Jinks, 1982).

As a result of segregation between the F1 and F2 generations, the means of the F2 hybrids will be (Mather and Jinks, 1982) x i F 2 = m i + h i / 2 = ( m i + x i F 1 ) / 2 and in the nth generation they will become x i F n = m i + h i f ( n ) , where f(n) is the proportion of heterozygotes at the locus, depending on the system of mating (for example, selfing or inbreeding, among others).

In a multidimensional space, a point formed by the means of characters for each generation (F=P1, P2, F1, F2, …, Fn) will be denoted as xF=xF1, xF2, …, xFM that is, xF is the centroid of F. From a simple geometric consideration, it follows that the points x P 1 , m, x P 2 and x F 1 form a triangle in which the points xFi lie on the straight line crossing the point x F 1 and the point m = ( x P 1 + x P 2 ) / 2 , which is the midpoint of the segment connecting the parental means (Figure 1).

Figure 1
figure 1

Configuration of the centroids of the parent and hybrid samples in a multidimensional space of characters in the additive-dominance model.

The point x F 2 will fall in the middle of the segment that connects the points x F 1 and m, the other points xFi will tend to the point m at a rate dependent on the scheme of mating.

In the case of deviation from the additive-dominance model, for example due to interactions between loci (broad-sense epistasis), the situation becomes complicated, and x F 2 , generally speaking, can be any point in the space, even beyond the confines of the plane formed by x P 1 , x P 2 and x F 1 . If so, one has to analyse the mutual position of samples in a three-dimensional space. The direction x F 1 - x F 2 in a multidimensional space will, nevertheless, possess the following properties. All the effects due to heterozygosity, that is, the effects of dominance, as well as all the epistatic influences due to heterozygous–homozygous and heterozygous–heterozygous interactions between loci (Mather and Jinks, 1982), will reduce with each successive generation of segregation. Thus, there is a good reason to refer to the x F 1 - x F 2 direction as to the ‘heterozygosity axis’. The additive genetic effects and the remaining epistatic effects due to interactions between the homozygous loci can be viewed as points projected onto the hyperplane orthogonal to x F 1 - x F 2 . Let x P 1 , m, x P 2 , x F 1 and x F 2 be projected upon this hyperplane. The projections of x F 1 and x F 2 will coincide, forming a single point on it. If the additive-dominance model is true, then this point will be coincident with m, the midpoint between x P 1 and x P 2 . For this reason, deviation from the point m on the hyperplane may, to a first approximation, be regarded as a manifestation of epistatic interactions and accordingly will be called the ‘epistatic direction’. The remaining orthogonal direction upon whose the projection of the x F 1 ( x F 2 ) point is already coincident with the point m will be called ‘additive’. Any additive effects will manifest as greater variance in the F2 compared to the variance in the F1. For the sake of convenience, the designations for the axes will be as follows: heterozygosity, H; epistasis, I; additivity, A. The three-dimensional space formed by the axes will be hereafter called the HIA space and the method specified below for their identification will be referred to as the HIA method.

From these considerations, the procedure of multivariate analysis of quantitative characters is as follows. The characters must initially be normalized. A normalization of all the cases for all the characters as a single sample is inappropriate because such a procedure would include the excessive variation between samples. One possible procedure would be to divide by the mean square deviation of one of the samples, for example, F2. An alternative would be to form a pooled within-group matrix by centering each sample separately by subtracting the means, then pooling them into one sample, followed by normalization of each sample (in the initial space of characters) by the pooled mean square deviation. The latter approach is advised in textbooks of statistics (Kullback, 1959; Wilks, 1962). After normalization of the P1, P2, F1, F2 samples, their means for all the characters (centroids) are calculated. The difference between the F1 and F2 centroids determines the H-axis. Let us rotate the space so that one of its axes coincides with the H-axis, and then fix this axis. In the remaining subspace orthogonal to the H-axis, the axis I is determined by x F 1 ( x F 2 ) and the m points and fixed. In the remaining subspace orthogonal to both axes, the axis A is then determined by points x P 1 and x P 2 and fixed. Both x F 1 and x F 2 lie exactly midway between the centroids of the parental lines along the A-axis. From this, it follows that, for a number of characters M≥3, the suggested HIA method automatically finds the scale (Ehrman and Parsons, 1981; Mather and Jinks, 1982), which is evidence of the adequacy of the additive model without dominance on this scale.

Let us denote the variance in each sample in a projection upon a unidimensional direction by SF. Next, let us denote the weighted within-group variance of P1, P2 by S P 1 + P 2 , and the variance in the pooled sample of P1 and P2 by S P 1 + P 2 . The broad-sense heritability is given by h 1 2 = ( S P 1 + P 2 - S P 1 , P 2 ) / S P 1 + P 2 and the narrow-sense heritability by h 2 2 = ( S F 2 - S F 1 ) / S F 2 (Ayala, 1982). The significance of the two heritability estimates is assessed using Student's t-test and Fisher's F-test. The broad-sense heritability is calculated along the I and A axes and the narrow-sense heritability along the A-axis, because precisely along this axis the F1 and F2 centroids lie exactly midway between the parent sample centroids. It is simplest to determine the contribution of the characters to the identified axes by calculation of the correlation coefficients between the axes and the characters of the pooled within-group matrix.

Materials

Samples of the set of characters obtained from studies on laboratory-inbred rat strains ISIAH, WAG and their first two hybrid generations, F1 and F2, were analysed by this newly formulated HIA method. In all, 21 craniometric measures were taken in the first experiment (Table 1) and 20 morphological, physiological and behavioural characters were measured in the second (Table 2). The ISIAH strain was developed at the Institute of Cytology and Genetics (Novosibirsk, Russia) from outbred Wistar rats in a selection experiment for increased blood pressure provoked by emotional stress through successive, more than 25 brother–sister matings (Markel, 1992). The WAG/GSto is another inbred strain derived from outbred Wistar rats, which was obtained from The Laboratory of Experimental Animals (Stolbovaya, Moscow) (Blandova et al, 1983). All the characters, with the exception of blood pressure, were normalized by the logarithmic transformation (Ehrman and Parsons, 1981; Mather and Jinks, 1982; Thorpe and Leamy, 1983).

Table 1 Mean values (M±SE) for craniometric characters
Table 2 Mean values (M±SE) for morphological, physiological and behavioural characters

Methods

The craniometric measurements were made using dial callipers to 0.05 mm accuracy. Blood pressure measurements in rats were performed by the tail-cuff method. Corticosterone plasma level was determined by the competitive protein-binding method. Behavioural characters were evaluated in an open-field arena with automatic registration. To measure the blood pressure and plasma corticosterone stress response, rats were placed in a small wire-mesh cylinder for 0.5 h. After stress, blood was sampled from the tail vein. The basal values of blood pressure were measured in anaesthetized rats to exclude the stress associated with the tail-cuff procedure. Basal corticosterone plasma level was measured in blood collected from promptly decapitated rats.

Results

The ISIAH strain did not differ in the estimates for the skull characters from the WAG strain, with the exception of interorbital width, which was significantly greater in ISIAH rats. The F1 showed heterosis for all the skull characters and, consequently, the unidimensional heritability estimates for each character inevitably contained a dominant component.

Figure 2a presents the distribution pattern of the F1 and F2 individuals on the plane formed by the I and A axes (the IA-plane) obtained after treating the means of ISIAH, WAG rats and their hybrids by the proposed HIA method. There is a marked widening in the variation along the A-axis in F2 compared to F1. The narrow-sense heritability estimates along the A-axis is h22=0.92 (F31,11=12.7; P<0.001). Interorbital width and braincase width (in the direction of the ISIAH strain), superior diastema length and nasal length (in the direction of the WAG strain) contributed mainly to the A-axis (Figure 2b). In general, the facial region was shorter and the brain case was wider in the ISIAH rats. The contributions of all the characters to the H-axis had the same sign. The H-axis might be considered a component of overall size.

Figure 2
figure 2

(a) Configuration of the centroids of the parent and hybrid samples and of the F1 and F2 individuals on the IA-plane of the craniometric characters. (b) Configuration vectors corresponding to the craniometric characters (the variable numbers are listed in Table 1) on the IA-plane.

As for the morphological, physiological and behavioural measures for the ISIAH rats, their blood pressure under emotional stress was significantly higher (t=5.80; P<0.001), the kidneys were significantly heavier (t=6.17; P<0.001), basal corticosterone concentration in plasma was significantly greater (t=3.27; P<0.01) and so were locomotion during the first minute of the open field test (t=2.76; P<0.05) and rearing time at the periphery of the open field arena (t=2.49; P<0.05).

Application of the HIA method to the whole set of characters resulted in axes along which additive heritability was not observed, although the broad-sense heritability turned out to be quite high (h21=0.72; t=7.73; P<0.001). One explanation could be that differently coadapted gene complexes controlling behaviour established in ISIAH and WAG, so that the behavioural stereotypes were disturbed, even broken down, in hybrid populations. In fact, rearing in the centre of the open field was observed in 60% of the ISIAH and only in 7.1% of the WAG individuals. The respective percentages were 27.3 and 10.3% for their F1 and F2 hybrids. Another possibility is that the loss of heritability is a result of pooling of characters of a different nature in a common matrix. Support for this explanation came from the observation that all the weight characters, with the exception of adrenal weight, formed the A-axis and that all the behavioural characters, except defecation and latency for locomotion onset, formed the I-axis. This was obviously due to the higher correlation within these sets of characters. If several quantitative characters were analysed at the same time, namely those affected together by a large set of genes and with common developmental mechanisms at their basis, then the commonness of these characters would be manifest as correlative relations between them (Schmalhausen, 1982). Conversely, separation of the correlative relations into a number of statistically independent subsystems is an indication of their different pathways of developmental and genetic regulation. For this reason, we applied to the pooled within-group matrix the principal component approach (PCA), based on a correlation (not a covariance) matrix because of the normalization performed. The PCA partitions the total phenotypic variability into several orthogonal components, which are new characters and can be analysed by the standard method of univariate and multivariate genetic analysis (Thorpe and Leamy, 1983; Markel et al, 1989). The results are set out in Table 3.

Table 3 Matrix of the correlation coefficients (× 1000) for characters with the principal components of the pooled within-group matrix

The first five components accounted for 64.4% of the total variation. Behavioural characters mainly contributed to the first and third components. The weights of the body and viscera contributed to the second component and, with the opposite sign, the basal blood pressure. The first component characterized locomotion as a whole and the third specified the difference in locomotion between the centre and the periphery of the open field arena. The fourth and fifth components proved to be of particular interest. The fourth component included all the characters for the stressed state: basal blood pressure, blood pressure under stress, adrenal weight, corticosterone concentration in plasma under stress and defecation. The fifth component mainly defined grooming at the periphery of the open field, basal plasma corticosterone and latency before locomotion onset. This is, in a sense, deviant behaviour manifest with or without stress in rats with high basal plasma corticosterone.

Thus, all the characters fell into groups. Certain characters, in compliance with their dual nature, were at the same time members of distinct groups. For example, adrenal weight contributed to both the body size and stress components; grooming at the periphery of the open field and latency before locomotion onset were included in the components of both locomotion and of deviant behaviour.

Multivariate analysis of segregation in hybrids was done by the HIA method for each of the character sets identified in this way. Significant additive heritability (h22=0.84; F105,12=6.24; P<0.001) was observed for the stress-related character set referred to the fourth component (Figure 3a). ‘Epistatic’ deviation was relatively small. Blood pressure under stress (r=0.89; P<0.001) and to a lesser extent adrenal weight (r=0.51; P<0.001) and defecation (r=0.50; P<0.001) contributed to the ‘additive’ component, the A-axis (Figure 3b). According to the data in the literature, the heritability estimates for defecation in the open field range from 0.5 to 1.0 (Ehrman and Parsons, 1981). An increase in adrenal weight in response to stress has been repeatedly reported and it is beyond question that this response is partly genetically determined (Selye, 1975). The correlation between the ‘additive’ component and blood pressure under stress was high, because it is precisely this character that the ISIAH strain was selected for, and the selection was efficient, judging by the estimates (Markel, 2000).

Figure 3
figure 3

(a) Configuration of the centroids of the parent and hybrid samples and of the F1 and F2 individuals on the IA-plane of the system of stress characters. (b) Configuration vectors corresponding to the system of stress characters on the IA-plane.

Despite the significant difference in behavioural characters between the ISIAH and WAG strains, we were unable to detect any additive genetic contribution to the difference. In our view, this supports the idea that different well coadapted gene complexes controlling behaviour have formed in the ISIAH and WAG strains.

As usual, in the genetic analysis of phenotypes produced by crossing of inbred strains, the sample size of the parental strains, P1 and P2, is small, but becomes greater in the F1 and, especially, in the segregating F2 hybrids. For this reason, the H-axis will be stable because it is determined initially, by the location of the F1 and F2 centroids. Consequently, the position of the plane orthogonal to the H-axis will be quite stable, too. The direction of the A-axis and the heritability are expected to have greater error because they are determined last, and on the basis of the P1 and P2 centroids. To evaluate the reliability of the results, bootstrapping was performed with 120 replicates.

The bootstrap distribution of h22 was asymmetrical in the case of both craniometric and physiological/behavioural measures. The calculation of t-confidence intervals was therefore considered inappropriate. The use of order statistics is not recommended for fewer than 1000 bootstrap replicates (Efron, 1987; Efron and Tibshurani, 1993), which would be very labour intensive. Therefore, the quantile was estimated through an approximation of distribution of the value z=1−h22=SF1/SF2 by the exponential distribution. The narrow-sense heritabilities were significant at 5% in both cases (with smoothed 2.5-th percentiles of 0.013 and 0.052, respectively).

Bootstrapping support for the stability of the contribution of the characters to the A-axis was demonstrated for superior diastema length (7), nasal length (15), interorbital width (6) in the first experiment, and for the blood pressure under stress (2) and adrenal weight (4) in the second. The contributions of the other characters changed their signs in more than 2.5% of cases, although the outliers did not deviate far from 0. In general, the direction of the A-axis is quite stable as seen in the bootstrap distribution of deviations of the angle cosines (dot products) from the original direction. The limits, median, mean and standard deviation were equal to 0.02–0.98, 0.82, 0.76 and 0.22, respectively, in the first analysis, and 0.05–0.996, 0.97, 0.91 and 0.14 in the second.

Discussion

It is notable that, with the HIA method, all the axes are determined by the first-degree statistics. It follows that, in theory, the distribution of the objects can be absolutely arbitrary. The proposed method is based on the relative disposition of the sample centroids, disregarding the positions of the points (specimens) around the centroids. However, in two cases, the cluster of the F1 hybrids lies almost across that of the F2 (Figures 2 and 3), thereby demonstrating that the identified A-axis represents, in fact, a heritable composite character. Heritability of A-axis trait is estimated by the ratio of the variances, that is, by the second-degree statistics. We attach little importance to the very high heritability estimates obtained in this way. The sample size was small, so that sampling error could be considerable, as was apparent from the bootstrap analysis. The distinctive feature of the HIA method is that it identifies axes in a multidimensional space that concentrates the properties of additivity, heterozygosity and epistasis. In the analysis to date, the second-degree statistics have only detected high heritability on the A directions. Similarly, Ott and Rabinowitz (1999) have suggested that additive heritability can be maximized by decomposing the GP−1 matrix into its eigenvectors, where G and P are the genetic and phenotypic covariation matrices, respectively (Lande, 1979; Steppan et al, 2002). (There is sense in calling the GP−1 matrix the ‘heritability matrix’ and to express it as H2.) With this decomposition, Klingenberg and Leamy (2001) have obtained a linear combination of mandibular features with a narrow-sense heritability estimate of 0.73, not coincident with any one of principal axes of the G and P matrices. The heritability of overall mandible size in the same data set was 0.42, which is in agreement with the heritability estimates for the other craniometric characters and the principal component of the matrices G and P (Leamy, 1974; Atchley et al, 1981). Thus, targeted searches relying on the statistics of both the first and second degrees can yield composite characters that have heritability coefficients considerably greater than those of the initial characters and their principal components. However, to our knowledge, no methods for the identification of the heterozygosity and epistasis axes based on the statistics of the first and second degrees are available.

What are the practical advantages of our results? Breeding has usually been the province of classical genetics. However, multivariate analysis of segregation in hybrids opens up new prospects for the practical breeders. Any linear combination of characters, that is, a direction in a multidimensional space, is in itself a character (Zhivotovsky, 1991). Any such character (the ‘selection index’, the ‘supercharacter’) can, in principle, be brought under selection (Rokitsky, 1974; Atchley et al, 1981; Mather and Jinks, 1982). However, the displacement of the samples along the straight line H that crosses the F1 and F2 centroids is caused only by heterozygosity of individuals in the samples; the displacement will keep vanishing as segregation proceeds. In the inter se mating (mating between individuals within a group), each generation decreases the deviation from the mid-parent point with each consecutive hybrid generation, due to loss of heterozygosity and consequent loss of dominance effects. Consequently, there is no sense in selection along the H-axis. The clusters of objects composing the F1 and F2 may be conceived of as threaded along the H-axis like beads on a string. The idea is to view the objects from the end of the H-axis so that the axis itself will be seen as a single point. This perspective reveals the variability in the hyperplane orthogonal to H-axis (the heterozygosity axis) due to additive effects and it also allows the estimation of the relative size of the nongenetic (environmental) variability by comparing the variances in the F1 and F2. The direction A, upon which the projections of F1 and F2 centroids fall midway between the projections of parental centroids, is most promising for selection. The selection can be started as early as in F2 without waiting for further genetic segregation.

The ecology of natural populations is another area to which the current results are applicable. Identification of combined characters that have high heritability in laboratory experiments and comparison of variability in the same set of characters in natural populations would give an estimate of the contribution of genotypic diversity to phenotypic variability which, to our knowledge, cannot be derived by any other method. A comparison with variability due to environmental factors would provide deeper insight into the relationship between the two forms of variability.

The high additive heritability of the stress component is because the characters it included – basal blood pressure, blood pressure under stress, adrenal weight, corticosterone concentration in plasma under stress, defecation in the open-field test – all changed rapidly and in an adaptive manner under the effect of unfavourable external conditions. The same is true of the high additive heritability of the craniometric component, to which interorbital width makes the main contribution. In natural populations, rapid change is seen in the growth rate component, which is subjected to the effect of early spring temperatures mediated through a shift in reproduction onset (Kovaleva and Faleev, 1994). Thus, a population can retain heritability in composite characters that respond to fluctuations in the environment. In such a case, the population would have reserves at two levels: the individual within its reaction norm and the population within its genotypic diversity.