Introduction

There are increasing amounts of information from whole genome scans (Milan et al, 2000), from natural mutants (Reddy and Siegel, 1977) and from the relationship between gene expression and trait (Rothschild et al, 1996) on the identity of genes that control traits of economic importance in farm animals.

However, unless it is possible to select for alleles of those genes that enhance performance in commercial lines the information will not be of practical use. In this study, we have selected candidate genes derived from the knowledge of reproductive physiology and tested whether these genes showed an association with reproductive performance in commercial poultry lines. A number of successes have been claimed for the physiological candidate gene approach to explain trait variance (eg Fotouhi et al, 1993; Rothschild et al, 1996; Sourdioux et al, 1996; Urbanek et al, 1999).

We applied the candidate gene approach to reproductive performance of broiler chickens. Broiler chickens have been selected with great success to improve the yield and the efficiency of meat production (Emmerson, 1997). This has not improved the reproductive efficiency. Selection for growth has been negatively correlated with aspects of reproductive performance, notably the occurrence of abnormal ovarian hierarchies (Hocking et al, 1987). This results in increased numbers of double-yolked eggs due to the simultaneous ovulation of two or more nonviable ova. Broiler breeders also have a relatively rapid decline in egg production, accompanied by sporadic cessation of lay (Sharp et al, 1992). Reproductive traits are relatively easily measured but can only be directly assessed over a period following sexual maturity; however, in the broiler industry, selection on growth characteristics is made before sexual maturity. Marker-assisted selection for reproductive traits could usefully augment selection based only on dam performance. Reproductive traits offer, therefore, both a convenient model to appraise the application of association analysis of candidate genes in a commercial population and a potential application if markers are discovered.

The numbers of genes that are known to control reproduction are small but their individual effects are, in most cases, large. This is because the method of their discovery has often been due to the observation of the large effects that removal or mutation have on an animal's function. From such genes we have selected our candidate genes. The growth hormone (GHR)-insulin-like growth factor-I (GH-IGF1) system controls the number of follicles in the avian ovary that are recruited to the rapid growth phase (Williams et al, 1992; Roberts et al, 1994) and a natural GHR mutation alters the ovulation rate (Reddy and Siegel, 1977). It is also known that the GH-IGF1 system has been modified as a result of selection for improved growth rate (Goddard et al, 1988). Thus, GHR was chosen as a candidate gene that might be associated with laying performance, or double-yolked eggs (Hocking et al, 1994). In addition, studies have demonstrated an association between GHR and egg production (Kuhnlein et al, 1997; Nagaraja et al, 2000).

The rapid decrease in ovarian activity in broiler breeders after the start of lay is a consequence of decreased gonadotrophin secretion (Sharp et al, 1992), resulting from decreased gonadotrophin-releasing hormone (GNRH) secretion. The expression of GNRH in the hypothalamus is decreased when gonadotrophin release is decreased (Dunn et al, 1996; Dunn and Sharp, 1999). GNRH stimulates the release of gonadotrophins from the pituitary through its receptor (GNRHR) (Sun et al, 2001a). GNRH and its receptor are therefore the start of the cascade that produces the appropriate growth, maturation and maintenance of the gonads. For these reasons, both GNRH and GNRHR were selected as candidate genes for the control of reproductive activity.

Neuropeptide Y (NPY) is known to influence the release of GNRH from the median eminence (Contijoch et al, 1993) and is critical in controlling food intake in birds, possibly matching satiety to reproductive activity and the timing of puberty (Kuenzel and Fraley, 1995). The NPY gene might produce markers for the age of the onset of lay and, through its role in the control of ovulation, influence egg production rate.

Materials and methods

Experimental design and phenotypic measurement

Birds from a pedigree broiler dam line were housed at 16 weeks of age in pens containing between 12 and 14 birds, and were fed a restricted diet to ensure that the body weight did not exceed 3.8 kg at 60 weeks of age as detailed in the Cobb 500 breeder management guide (Cobb Breeding Company, East Hanningfield, UK). Prior to that age, birds were reared on ad libitum feeding to 7 weeks of age and thereafter those birds selected for their growth characteristics were fed on a modified restricted diet aimed to gain the target weight of 3.17 kg at 26 weeks of age. Diet composition and lighting were all according to the Cobb 500 breeder management guide. The study population was three flocks, each of which were produced from seven consecutive hatches. Data on egg production including total egg production, age at first egg and number of double-yolked eggs were collected daily using trap nests to identify individual birds. The data for individual hens were collated over a 7-month period and recording commenced at 22 weeks of age. As all birds in each flock were treated the same despite differences in the hatch dates, this may have contributed to variance due to hatch. DNA and trait data were available from 772 animals. Information was also available for each bird on hatch, flock and sire and dam. There were 67 sire families. Sires could contribute to a maximum of three flocks depending on their age and the numbers of their daughters surviving culling on growth performance. In reality, only six sires contributed to all three flocks. One sire contributed to only flock 1, 15 only to flock 2 and 28 only to flock 3. Two contributed to flock 1 and 2, one to flock 1 and 3 and 14 to flock 2 and 3. Not all sire families were used in the analysis, only the offspring of heterozygous sires were used. This substantially reduced the amount of data, but gave marker effects estimated within families, and was therefore less likely to be affected by other background genetic differences between sires. These may lead to false-positive associations between the candidate gene and the primary trait, which might occur, even when the candidate gene and the true trait gene are on different chromosomes, for example, when two genes are sufficiently close to genes descended from a population founder, which have both been under selection for a second, desirable trait. In addition, there may be other QTLs for the primary trait. The within-sire analysis will remove some of the effects of these QTLs when their allele frequencies are not balanced for the two candidate gene alleles across sires. At the beginning of the experiment, only some of the parental genotypes were available. It was possible, however, to infer further parental genotypes using the offspring genotypes because of the large sire families (mean=11.5). The number of sire families available was 14 for NPY, 36 for GNRHR and 33 for GHR. Only birds with complete records were included in the analyses.

DNA preparation

Red blood cells were stored at 4°C in 96-well plates. DNA was prepared using the GFX system (Amersham Pharmacia Biotech UK Ltd, Little Chalfont, UK) adapted for avian blood by using only 8–10 μl whole blood mixed in 100 μl phosphate-buffered saline taken directly into the extraction buffer. To maximise recovery, a double elution into 220 μl of TE was performed.

Determination of genomic sequence

Unlabelled oligonucleotides (Table 1) (Cruachem Ltd, Glasgow, UK) were used in the PCR amplification of genomic DNA for sequencing prior to the design of oligonucleotides (Table 2) for single-nucleotide polymorphism (SNP) detection and genotyping. The PCR conditions for amplification were: 200 μM dNTPs, 0.5 U of Taq polymerase (Roche Diagnostics Ltd, Lewes, East Sussex, UK), 100 pmol of each primer, 100 ng genomic DNA in a 20 μl reaction containing 1.5 mM MgCl2, 10 mM Tris-HCl (pH 8.3), 50 mM KCl. The PCRs were carried out at 94°C, 4 min for one cycle; 94°C, 20 s; 56°C, 20 s; 74°C, 20 s for 40 cycles on a Hybaid Omnigene thermocycler (Thermo Life Sciences, Basingstoke, UK).

Table 1 PCR primers used for determining genomic sequence
Table 2 PCR primers used for SNP detection and the genotyping assays

Detection of polymorphisms

The majority of polymorphism detection was carried out by Base Excision Sequence Scanning (T-Scan, Epicentre Technologies; Cambio Ltd, Cambridge, UK) (Hawkins and Hoffman, 1999) and the remainder by bulk amplified restriction fragment length polymorphism detection. Detection was carried out on 30 sires from the pedigree dam line. A diagnostic restriction digest of the PCR-amplified product was performed to confirm the allele frequency for all polymorphisms.

Genotyping assay

Genotyping assays for GHR, NPY and GNRHR were based on the presence or absence of a restriction site at the polymorphic sites in PCR-amplified DNA fragments (PCR-RFLP). All genotyping PCRs differed from the standard protocol by the inclusion of one-fifth volume of 5 × gel loading dye (16.5% sucrose, 0.01% bromophenol blue). Other differences from our standard PCR conditions were the inclusion for NPY of 2 mM MgCl2 and for GNRHR 5% DMSO.

For RFLP analysis, 5 μl of PCR product was digested according to the manufacturer's protocol. All digestions were performed overnight at 37°C. After digestion with the discriminating restriction enzyme, the products were separated on a 3% agarose gel and the genotypes identified according to the resolved products; either a single full-length uncut product, two products of digestion or, in the case of heterozygotes three products, full-length uncut and two digested products.

Association analysis

Numbers of double-yolked eggs were approximately distributed as an overdispersed Poisson variable, and were analysed as a generalised linear model with parameters estimated on the logarithmic scale. Other variables were shifted and rescaled using a log transformation to give approximate normality and equality of variance. The total number of eggs was negatively skewed, and was analysed as the log of total number of eggs subtracted from a hypothetical upper limit of 150. Age at first egg was positively skewed, and had a value of 150, coincidentally, subtracted before taking logarithms. The effects of hatch (h), flock (f) and their interaction together with sires (s) and the marker genotypes (m) were fitted, as fixed effects, to the expectation of the transformed responses (y), as

Linear models were fitted by regression analysis and generalised linear models by iteratively reweighted least squares, followed by Student's t-tests to assess possible marker effects. Additive effects of markers were estimated as the differences between homozygote means, and dominance effects as the difference between twice the heterozygote mean and the sum of the homozygote means. The exception was GHR, because it is sex linked and dams carry only one allele. The collective results of the 15 tests of trait and marker combinations were assessed by a Bonferoni 5% significance threshold of P=0.05/15=0.0033, conservatively assuming the 15 tests to be independent.

Results

Genomic sequence, polymorphic loci and allele frequency

The GHR intron 5 was cloned and sequenced (Genembl accession AJ506750). Two SNPs were discovered that were linked, both cytosine–thymidine transversions in intron 5. One was chosen to develop an assay (Table 3). In the 457 offspring of heterozygous sires, the genotype frequency for Nsp− was 0.63. This exceeds the anticipated figure of 0.5 for a sex-linked gene, suggesting that the GHR locus may be under some selection pressure.

Table 3 Genomic sequence, polymorphic loci and allele frequency

A 4-bp deletion/insertion about 700 bases upstream of the NPY transcription start site was characterised (Table 3). In the offspring of heterozygous sires, the frequencies of the alleles were close to that predicted if the males were crossed randomly with dams from the whole population. Genotypes did not differ from the expected Hardy–Weinberg equilibrium.

Intron 1 of the GNRHR contained an SNP that was used for genotyping (Table 3). In the offspring of heterozygous sires, the allele frequencies were identical to those in the whole population and almost identical to that predicted if the males were crossed randomly with dams from the whole population. The genotype frequency did not differ from Hardy–Weinberg equilibrium.

Despite using 19 primer pairs covering 46% of the 6.4-kb of the known GNRH gene sequence and its upstream region (Dunn et al, 1993), no polymorphic sites were detected in the 30 individuals.

Heritability

Heritability was estimated from the data of all offspring for age at first egg, 0.17; for total egg production, 0.09 (both on the log scale); and for double-yolked eggs 0.25 (on the observed scale). The low values for total egg production were similar to those observed in other studies in broiler breeders (Koerhuis and McKay, 1996) and probably reflects the number of factors that influence the total egg production.

Results of association analysis

The back-transformed means together with approximate standard errors from the model fitted on the logarithmic scale of the analyses are presented in Table 4 for the offspring of heterozygous sires. These back-transformed means allow adjustments for possible imbalances across the marker alleles in the number of observations for flock and hatch when these affected a trait. Most markedly, there was a decline in the age at first egg with hatch. Analysis for genes used different subsets of the 772 animals. Concentrating on heterozygous sires means using about 1/2 of the data for GHR and GNRHR and only about 1/4 of the data for NPY. The observed differences between GHR alleles were small for all three traits, but even with the reduced number of observations we were able to detect associations (at P<0.05) of a trait with the GNRHR and NPY markers in single analyses.

Table 4 Back-transformed means (SEs) for the offspring of heterozygous siresa

Thus, an additive effect was observed for GNRHR and the number of double-yolked eggs, and a dominance effect of NPY and age at first egg (Table 5). However, when considered collectively as 15 tests, these combinations did not achieve significance at 5%. The GNRHR Bpu1102+/+ genotype had the greatest number of double-yolked eggs at 1.59 compared to 1.00 for Bpu1102−/− on the back-transformed scale (Table 4), with the Bpu1102+/− genotype intermediate on the back-transformed scale. The NPY DraI+/− heterozygotes had the earliest age for laying their first egg at 189.4 days compared to 198.0 days for the DraI−/− genotypes and 193.8 for the DraI+/+ genotype (Table 4).

Table 5 Association analysis fitting the offspring of heterozygous sires

Discussion

The associations detected by the analysis within the single generation of hens from the heterozygous sires suggest that the GNRHR gene and the NPY genes play a role in controlling the traits of double-yolked eggs and age at first egg, respectively. For GNRHR and number of double-yolked eggs, the sizes of the positive effects relative to the homozygous genotype Bpu1102I−/− were 0.22 for the heterozygote Bpu1102I+/− and for the homozygote Bpu1102I+/+ 0.59 (Table 4). For NPY, the reduction in age at first egg of the heterozygote DraI+/− compared to the homozygote DraI−/− was 8.6 days and for the homozygote DraI+/+ 4.2 days (Table 4).

In the case of GNRHR and double-yolked eggs, if only animals with the Bpu1102I−/− genotype were selected in this line, we would expect an improvement in the overall flock performance of 0.31 usable eggs per hen in view of the frequency of this allele (0.4). The overdominance effect observed for NPY and age at first egg would be harder to use in practice. However, the establishment of lines carrying only one allele could be used to produce heterozygous hens in the final breeder cross. This would give an overall decrease in the age at first egg of the flock of 3.2 days/hen. In practice, the benefits of earlier age at first egg must be balanced with problems of reduced egg size and possibly numbers of double-yolked eggs, but a reduction may be beneficial to enhance efficiency. There was a small, expected negative effect of age at first egg on the number of double-yolked eggs, but this did not change the association with GNRHR.

Association studies cannot determine if the gene allele markers (SNPs and RFLPs) are responsible for the variation in a trait or whether it is due to a closely linked locus. However, there is good reason to believe that these genes would influence the traits in chickens. The GNRHR is expressed in the pituitary, the gonads and the hypothalamus and has the pharmacological profile of an avian GNRH receptor (Sun et al, 2001b). Since the GNRHR occurs in the gonads as well as the pituitary, its effect might occur at the level of the ovary, possibly by affecting cell proliferation and apoptosis as suggested in mammals (Takekida et al, 2000).

NPY induces precocious puberty in chicks (Fraley and Kuenzel, 1993) and controls feed intake (Kuenzel and Fraley, 1995; Boswell et al, 1999). In mammals, the NPY neurones are targets for leptin, which may be a mechanism that metabolic factors ‘gate’ entry to puberty (Cheung et al, 1997). NPY also has an established role in controlling GNRH secretion during the preovulatary surge of gonadotrophins (Contijoch et al, 1993). Either might have an effect on age at first egg and it is tempting to speculate that each allele might have a favourable role for each function. It is hard, however, to envisage a mechanism whereby the expression of the two alleles of NPY together produces advancement in age at first egg as observed.

Feng et al (1997) observed a negative relationship in layer-type hens between the number of eggs and age at first egg, which was different between GHR genotypes. We found no such effect in the broiler line used in this study, although there was a weak negative correlation between these traits. Owing to the likely role of the GH-IGF1 system in producing polyfollicular ovaries at the onset of lay, a possible association between GHR with the number of eggs produced in the first two months was also examined, but without success. Feng et al (1997) demonstrated a significant association with age at first egg and alleles of GHR that may be worth investigating along with other components of the GH-IGFI system.

Studies seeking correlations between alleles of candidate genes and quantitative traits have frequently used single-generation comparisons (eg Feng et al, 1997, 1998; Nagaraja et al, 2000). Like these studies, when we fit a model excluding sires, we find a large number of statistically significant associations. Four significant effects (P<0.05) are observed: for GHR and number of double-yolked eggs (t=−2.40, df 596), GNRHR and number of double-yolked eggs (additive; t=2.18, df 612), GNRHR and number of eggs (dominance; t=2.36, df 612) and NPY and age at first egg (additive; t=2.05, df 636). Only the additive effect for GNRHR and number of double-yolked eggs is consistently significant between the test for association using all animals and those using only heterozygous sires. NPY and age at first egg, although significant in both tests, is significant as an additive effect in the case using all animals and as a dominance effect when only heterozygous sires offspring are used. In a population undergoing strong selection, as here, in order to avoid possible spurious associations we used comparisons between half sibs within heterozygous sires to control for some of the general genetic background differences between sires. This would also reduce false association due to cosegregation with loci under selection. Despite its potential loss of power from only using a subset of the data, we still observe two significant associations for two of the analyses of the nine combinations of candidate gene and reproductive trait, but not when the 15 tests of genetic effects are considered together in controlling experiment-wide error. Given that the genes were chosen a priori on the basis of their known effects, there may be some debate about the need for such corrections.

In this study, parental genotypes were supplemented by inference from the offspring genotypes. Sires falsely classified as heterozygous would be expected to dilute possible additive effects, while inflating possible dominance effects. This would be most likely with GNRHR, but no dominance effects are observed.

A more sophisticated allowance for a common genetic background is to fit an animal model giving correlations between individual observations based on their expected additive genetic correlations (eg Rothschild et al, 1996; Drogemuller et al, 2001). This should help to correct for a general genetic background, particularly with deep pedigrees, although there might still be some contribution to gene effects from between-family comparisons. Another approach for additive effects would be to parameterise the marker allele count to give between- and within-family estimates, and test whether they are consistent by allowing a combined estimate with more power (Hernandez-Sanchez et al, 2003). This may be carried out by creating two covariates for allele count, one for the family (sire) mean and one for the deviations from the family mean and fitting them in addition to sires in an REML analysis. In the presence of additional unbalanced fixed effects, these estimates will not be exactly independent, but may have a small correlation. For our data, this combined analysis suggested no significant additive effects for any combination of markers and traits. The most notable effect was for GNRHR on the number of double-yolked eggs (allele-substitution effect=0.37, SE=0.20). The estimate for between-sire families was 0.19 (SE=0.52) and within-sire families was 0.39 (SE=0.22) from the fitting of a generalised linear mixed model (Breslow and Clayton, 1993) allowing for the random effect of sire and the fixed effects of hatch, flock and their interaction. The correlation between the estimates was −0.07 and the standard error of their difference was 0.58. In this case, the additional information and the additive effect estimate from between-family comparisons were small, and not influential. The power to detect a discrepancy between the two estimates was poor.

Heritability values for AFE and DYE are larger than for egg production, which may explain why we detected an association with genes potentially involved in the control of these traits. There is no reason to believe that the accuracy of trait measurement and the effect of environment should not influence the estimate of association as it would for any other analysis.

It will ultimately be possible to combine in a selection index a number of molecular markers with a small effect to improve reproductive performance. This will depend on the cost of marker-assisted selection and the negative effects of including extra indices for selection on other traits alongside the net improvement in productivity.