Introduction

Major histocompatibility complex (MHC) molecules play a crucial role in activating the adaptive immune system in jawed vertebrates (Klein, 1986) and studies across taxa commonly report a direct link between MHC and fitness (reviewed in Sommer, 2005; Piertney and Oliver, 2006). Moreover, MHC genes have been shown to be involved in mate choice in different species (reviewed in Milinski, 2014). MHC-based mate choice has been suggested to provide fitness benefits in two ways. First, females could benefit from mating with males bearing resistance alleles to common or severe pathogens in a given population (‘good genes’ hypothesis), ensuring enhanced resistance to those pathogens in their progeny (Ekblom et al., 2004; Richardson et al., 2005; Eizaguirre et al., 2009; Cutrera et al., 2012). Second, as individual MHC molecules bind only a specific subset of pathogens, it should be advantageous to maximize heterozygosity in offspring (Hughes and Nei, 1988; Lenz et al., 2009). In this case, females would choose mates non-randomly to complement their own genotypes (‘complementary genes hypothesis’; see, for example, Wedekind et al., 1995; Penn and Potts, 1998; Consuegra and Garcia de Leaniz, 2008).

Despite a widespread role of MHC in mate choice, the mechanisms of recognizing a potential mating partner based on MHC genotype are not well understood. In mammals, birds and fish, MHC molecules are believed to influence body odor and hence odor preferences during mate choice (see, for example, Yamazaki et al., 1988; Potts et al., 1991; Wedekind et al., 1995; Carroll et al., 2002; Richardson et al., 2005). Studies in Atlantic salmon, Salmo salar, confirm that MHC-II diversity is linked to disease resistance (Lohm et al., 2002) and that both MHC-I and -II affect mate choice (Landry et al., 2001; Consuegra and Garcia de Leaniz, 2008; Yeates et al., 2009). Generally, MHC-I diversity is involved in T cell-mediated response and resistance to viruses, whereas MHC-II diversity affects resistance to bacteria. Interestingly, the direction of mate choice appears to differ between the two MHC classes. A study using in vitro fertilization (IVF) to induce competition between sperm of two males showed that eggs were preferentially fertilized by males with genetically similar MHC-I genotype (Yeates et al., 2009). In contrast, MHC-II showed significantly higher genetic diversity than expected under random mating in the offspring of wild salmon, suggesting premating mate choice for heterozygosity at the individual level (Landry et al., 2001). However, the inconsistencies across studies could also hint at the fact that MHC alleles may play an important role at the postmating prezygotic stages.

In externally fertilizing organisms such as fish, an MHC recognition mechanism at the postmating level is thought to be particularly important as females have reduced control over fertilization because of group spawnings or males assuming alternative mating strategies (Taborsky, 1998). In addition, external fertilization impedes the potential maternal impact on the offspring MHC genotype via abortion as it is found for example in mammals (Wedekind, 1994). The question thus arises of how eggs recognize MHC allele composition in sperm. Studies on sperm competition between sperm from different males suggest the occurrence of MHC-dependent mate choice even after mating (cryptic female choice; Schwensow et al., 2008; Yeates et al., 2009; Lovlie et al., 2013). Although these studies tested for a role of MHC in mate choice at the postmating level, they reveal no information about a possible mechanism at the level of the gametic haplotype. There is no clear evidence for the (haploid) expression of the MHC repertoire on the sperm cell surface. Although some studies on human spermatozoa show haploid expression of MHC (see, for example, Halim and Festenstein, 1975; Arnaizvillena and Festenstein, 1976), others find no MHC antigens (Haas and Nahhas, 1986) or report inconsistent results (Kurpisz et al., 1987). A second, less explored possibility for selective fertilization, which so far has only been tested in mammals, arises after membrane penetration when the egg could choose between its two remaining haplotypes to optimize complementarity with that of the penetrating sperm genotype (Agulnik et al., 1993; Wedekind et al. 1996).

To study the potential role of non-random gamete fusion in postmating mate choice, we investigated transmission ratio distortion and non-random haplotype divergence within gametes at two tightly linked functional MHC-II loci (DAA and DAB) in Atlantic salmon. The two loci represent the only copies of MHC-IIA and MHC-IIB in salmon, and their variation has been extensively described in the past (Langefors et al., 2000; Stet et al., 2002; Gomez et al., 2010). We performed IVF assays with sperm from single males thus limiting potential assortative fusion among gametes exclusively to the haploid stage. Our aim was to answer three questions: (1) Are any MHC-II alleles transmitted to offspring in a non-Mendelian manner? (2) Is there any distortion in genotype frequencies? (3) Do we observe different sequence divergence in the offspring than expected under random gamete fusion?

Materials and methods

Sampling and IVF

Fifteen males (length: 77.6 cm±11.3 s.d., weight: 4.52 kg ±1.94 s.d.) and 15 females (length: 92.2 cm±9.5 s.d., weight: 8.67 kg±2.59 s.d.) were randomly sampled during commercial stripping of wild S. salar at the Fishery Research Station in Älvkarleby, Sweden. Fish were anaesthetized with a solution of Tricaine Methanesulfonate (MS 222, Sigma-Aldrich, Stockholm, Sweden) during gamete collection. IVF was performed under laboratory conditions. Ejaculates and eggs were collected into plastic containers by carefully avoiding premature contact with water and kept cool on ice until IVFs (not more than 2 h after collection). For fertilization, on average 291±56 eggs per subclutch were placed in a dry glass beaker. Eggs remained in ovarian fluid for IVFs. We then added 100 μl of ejaculate and 200 ml of river water (~3 °C) for activation of the gametes. Each individual was mated three times independently, that is, eggs were fertilized by three different males and vice versa, resulting in a total of 45 clutches in five full factorial 3 × 3 blocks, where males and females were randomly assigned to each block. Fertilized eggs were allowed to harden for 5 min, excessive sperm were washed away and eggs were disinfected with poly vinyl pyrrolidone iodine solution (Frantsi and Withey, 1972) and incubated in the hatchery system for 4 months at river water temperature (average water temperature 3.2 °C±0.2 s.d.) until the eyes of the embryos became visible because of black pigmentation (‘eyed eggs’). The opaque colour of salmon eggs and the necessity for eggs to lay still during early development prevents the assessment of fertilization on live eggs before the eyed stage. Eggs turning white were removed throughout incubation to avoid fungal growth (7 eggs±10 s.d. per clutch; except for three clutches from one male with low fertilization success). Embryos were killed and stored in 96% ethanol at +4 °C until DNA extraction.

Genetic analysis

Genomic DNA was extracted from the ejaculate of the 15 males used for IVFs following a protocol specifically designed for DNA extraction from sperm (Qiagen: Purification of total DNA from animal sperm using the DNeasy Blood & Tissue Kit; protocol 2) and from 50 embryos per clutch (2250 embryos in total) using the DNeasy Blood & Tissue Kit (Qiagen, Sollentuna, Sweden) according to the manufacturer’s instructions. Exon 2 of DAA (303 bp) was amplified using primers for S. salar DAA published by Gomez et al. (2010) and DAB (311 bp) using the primers TVS4501 and AL1002 (Langefors et al., 2000). PCR products from all 15 males were cloned into bacterial vectors (pGEM T-easy, Promega, Stockholm, Sweden) and Sanger sequenced to estimate paternal genotypes.

Embryo genotyping

Because of the length of the DAA and DAB loci, we amplified and sequenced the two loci separately. Both forward and reverse primers were tagged using an 8 bp barcode on the 5′-end for sample identification after pooling for sequencing. The PCR consisted of 0.2 μM of each tagged forward and reverse primer, 0.2 mM dNTPs, 2 mM MgCl2, 1 × Gold PCR buffer, 1 unit of AmpliTaq Gold polymerase (Life Technologies, Stockholm, Sweden), 1 μl DNA and ddH2O to a final volume of 25 μl. Cycling conditions were: 94 °C for 5 min, followed by 30 cycles of 94 °C for 40 s, 58 °C for 30 s and 72 °C for 1 min, and a final extension step at 72 °C for 7 min. Amplification products were visualized in 1.5% agarose gels stained with GelRed (Biotium, Stockholm, Sweden). PCR products were then arbitrarily pooled by 16 samples (pre-pools) according to gel band intensity as follows: 5 μl of strong products and 10 μl of weak products, and purified using the QIAquick PCR purification kit (Qiagen) according to the manufacturer’s instruction. DNA concentration of the purified pre-pools was individually measured using Nanodrop 2000 (Thermo Scientific, Stockholm, Sweden) and then equimolarly pooled into six final samples (three for DAA and three for DAB). These final pooled samples were processed as separate paired-end libraries (2 × 300 bp) during the sequencing process using the MiSeq platform (Illumina, Eindhoven, The Netherlands) at the National Genomics Infrastructure at SciLifeLab, Uppsala, Sweden. DAA and DAB amplicons were sequenced on two different MiSeq lanes with families being randomized across the two lanes.

To estimate genotyping reliability for both DAA and DAB, 186 samples (8%) were run in duplicate, with differently tagged primers and in different library preparations. In addition, 10 samples were repeated in 5 different PCRs each, with different primer tags, but with the same library preparation. The results of the replicates fully support the reliability and robustness of the genotyping method.

Genotype extraction and sequence analysis

Paired reads were merged using FLASH (Magoč and Salzberg, 2011) and sorted by individual samples based on tagged primer combinations. The individual haplotypes and genotypes were identified using python scripts based on the coverage and frequency of each candidate haplotype. An individual was considered homozygote if the frequency of the most abundant haplotype was >0.9, and heterozygote if the frequency of the two most abundant haplotypes were between 0.45 and 0.55. The maternal genotypes were reconstructed using the information from the offspring and paternal genotypes. The identity of the haplotype sequences was determined by BLAST with referent sequences from GenBank. Relevant scripts can be downloaded from https://github.com/situssog/Promerova_et_al_2016.

Families with two homozygous parents (three cases) or with both parents heterozygous for the same alleles (one case) were excluded from the analyses, as it was not possible to distinguish between paternal or maternal origin of the alleles. The putative peptide-binding region sites were established based on Brown et al. (1993) and Dixon et al. (1996). Genetic distances between parental haplotypes were calculated as pairwise nucleotide sequence divergence using the Jukes–Cantor model, with uniform rates among sites and pairwise deletion, and pairwise amino-acid sequence divergence using Poisson distribution in Mega v.5 (Tamura et al., 2011).

Statistical analysis

Exact binomial tests were used to assess whether any of the alleles show transmission ratio distortion. For each allele, information from all families was combined to maximize statistical power. We compared the number of transmitted (number of offspring that inherited the particular allele) and non-transmitted (number of offspring not inheriting this allele) alleles across all families in which at least one of the parents was heterozygous. The expected proportion of transmitted to non-transmitted alleles was 0.5 (the predicted value under Mendelian inheritance; see Supplementary Table S1 for family values). To correct for multiple testing we used false discovery rate correction. The analysis was carried out separately for alleles of paternal and maternal origin.

According to the null hypothesis of no distortion, we expect four possible allelic combinations within a family with frequencies of 0.25 if both parents are heterozygous for a given locus. If only one of the parents is heterozygous, the two possible allelic combinations in the offspring should not differ from 0.5. Whether the observed frequencies follow expected values was tested with the χ2 test of goodness of fit and subsequent false discovery rate correction. Monte Carlo simulations with 10 000 iterations were employed to estimate expected genotype frequencies across the ‘population’ in our data set.

In addition, we tested whether eggs were preferentially fertilized by sperm carrying MHC-II haplotypes different or similar to their own. We compared the offspring’s expected mean MHC divergence and its variance under random fertilization by one of two available sperm haplotypes (obtained by Monte Carlo simulations with 10 000 iterations) to the observed mean MHC divergence and the observed variance. The same test was performed for the nucleotide and amino-acid sequences of the whole exon 2 and then for only the peptide-binding region sites. All simulations and statistical tests were performed in R version 3.1.2 (R Core Team, 2014).

Results

With 9 and 11 alleles, the genetic diversity detected at the DAA and DAB loci observed here was comparable to previous studies (Langefors et al., 2000; Stet et al., 2002; Gomez et al., 2010). The sequences of these alleles differed by 2 to 12 amino acids at DAA and 3 to 19 amino acids at DAB.

The most common allele at the DAA locus (DAA*0201) showed preferential transmission when inherited from females (P=0.021, N=15 families and 750 informative meioses, 407 transmitted and 343 non-transmitted; Table 1). However, this result was not significant after false discovery rate correction (threshold 0.006). For the same allele when inherited from males, there was a nonsignificant trend in the opposite direction (P=0.12, N=18 families and 896 informative meiosis, 424 transmitted and 472 non-transmitted). At the DAB locus, none of the allele frequencies showed significant distortion. None of the alleles of either locus showed transmission ratio distortion when tested for both parental sexes combined.

Table 1 Transmission of exon 2 DAA and DAB, MHC-II from males and from females

When testing for distortion in genotype frequencies within families, none of the 40 included Atlantic salmon families showed significant deviations from expected genotype frequencies after false discovery rate correction, even though in three families the uncorrected P-value was significant (P=0.01–0.025; Supplementary Table S2). We did not observe any deviation from expected genotype frequencies across the whole data set (Supplementary Table S3). Results were the same for both DAA and DAB.

Finally, neither nucleotide nor amino-acid sequence divergence in offspring was significantly different from expected values obtained by Monte Carlo simulations (Table 2). We found no evidence for a non-random association between similar or dissimilar MHC alleles in our data. This was also true in families where the male and the female share one allele as an ultimate test of the complementary gene hypothesis (see Supplementary Table S3).

Table 2 Mean sequence divergence in offspring measured as nucleotide and amino-acid distance and its variance for the whole exon 2 sequence and for PBR sites only

Discussion

Our results provided no evidence for any non-random association among MHC haplotypes in gametes. We observed no significant transmission distortion of the alleles across all families or any deviation from expected allele frequencies within families. MHC-sequence divergence did not deviate significantly from expected values. Although our sampling was performed at the eyed stage and hence early embryo survival was not monitored, this is unlikely to have an impact on our results. The reason for this is that if non-random gamete fusion was to occur, it would require a higher mortality rate among these offspring in order to obtain Mendelian frequencies as we found them, contradicting the idea of an adaptive scenario. Therefore, given the robustness of our data set, we can confidently exclude the existence of a mechanism ensuring complementary MHC allele combination at the haploid stage in this population of Atlantic salmon. Whether the same is true at a larger scale for example between populations remains to be tested. Our results thereby contribute to the ongoing debate on the role of MHC-based pre- and postmating mate choice. We discuss our results in the light of previous findings and their wider implications.

Only few studies have so far investigated the existence of mechanisms allowing for complementary MHC allele associations at the gametic haplotype level. An early study in house mice provided some experimental evidence for such a mechanism as MHC alleles exhibited non-random associations in blastocysts (Wedekind et al., 1996). These results in house mice were confirmed to be based on MHC-driven gamete association in a follow-up study (Rülicke et al., 1998) and a more recent study suggests that more general selection for heterozygosity could be driving the observed patterns (Firman and Simmons, 2015). However, whether this pattern occurs because of the complementary association of the gametes themselves or is influenced by the outcome of the second meiotic division shortly after fertilization is currently unclear. A later study in whitefish Coregonus sp. found no evidence for non-random gamete fusion in MHC-IIB (Wedekind et al., 2004). In this study, seven in vitro fertilized batches with 30–129 offspring were examined, and despite a strong effect of MHC genotype on disease resistance, the genotype frequencies in offspring were not different from expectation. However, this study did not test whether sequence divergence played any role in gamete fusion. A study on haplotype co-segregation at MHC-II in the Atlantic salmon reported that 1 out of 33 families showed segregation distortion after Bonferroni correction, suggesting that MHC was following Mendelian inheritance (Stet et al., 2002). These previous findings in fish are in line with our own findings, but drawing any general conclusions about the existence of a mechanism of MHC recognition at the gametic haplotype level would certainly be premature at this stage given the few empirical studies available. It would for example be interesting to investigate how mechanisms of gamete recognition differ between mammals and fish or other externally fertilizing taxa. For example, the interaction between sperm and ovarian fluid may be one important selection mechanism in externally fertilizing fish (Johnson et al., 2014; Rosengrave et al., 2016).

In salmonid fish, fertilization is external and gametes associate rapidly after they are released into water (Hoysak and Liley, 2001). As little as 2 s delay in sperm release in an in vitro competition experiment induced a significant decrease in fertilization success (Yeates et al., 2007). Yeates et al. (2009) argued that such rapid sperm–egg association makes it difficult to identify mechanisms for the recognition of the sperm MHC haplotype by the ovum. Furthermore, Wedekind et al. (2004) suggested that MHC-based non-random gamete fusion has not evolved in fish because (1) the costs of non-random gamete fusion may be higher in external fertilizers, as the gametes are released into a hostile environment with reduced time for assortative mating; (2) the selection for assortative mating is low because fish produce a much larger amount of eggs, whereas the investment per egg is lower compared with mammals; and (3) the risk of inbreeding is lower and thus there may be no selection for mechanisms to avoid inbreeding. Atlantic salmon return to their hatching grounds for spawning and hence inbreeding is likely to pose a certain risk. However, local adaptation has been shown to be under strong selection in Atlantic salmon (Dionne et al., 2008) and outbreeding might actually pose an even higher threat for the offspring survival than inbreeding (Côte et al., 2014). A topic of further investigation could therefore be to repeat the experiment using two different populations that usually do not interbreed, differing in MHC alleles, to examine whether distortion occurs with atypical fertilizations under the scope of outbreeding depression.

In conclusion, we show that MHC-II genes show no evidence of transmission ratio distortion or other signs of non-random gamete fusion in the Atlantic salmon. Our findings differ from results in studies focussing on mate choice at the premating level (reviewed in Kamiya et al., 2014; Milinski, 2014) and situations where more than two males compete for the fertilization of eggs (see, for example, Schwensow et al., 2008; Yeates et al., 2009; Lovlie et al., 2013). It appears that unlike in mammals, selection for an MHC recognition system at the haploid level may not have evolved in externally fertilizing fish. However, here we focussed on MHC-II alleles and it remains to be tested whether other MHC loci such as class I genes located on different chromosomes in fish (Sato et al., 2000) show similar inheritance patterns to those described here for class II. Furthermore, an important and currently unanswered question is whether MHC molecules are expressed at the sperm surface and whether they reflect the diploid genotype of the male or the haploid genotype of the sperm, and how these expression patterns differ across taxa.

Data archiving

Sequence data have been submitted to GenBank: DAA: KY351821–KY351823; DAB: KY351824 and KY351825.

Genotype data have been submitted to Dryad: doi:10.5061/dryad.7kq17 The AnalyseMyData package is available at: https://github.com/situssog/Promerova_et_al_2016.